The Sign Language Interchange Format: Harmonising Sign Language Datasets for Computational Processing

Link:
Author:
Publisher/Institution:
Universität Hamburg
Year of publication:
2023
Media type:
Text
Keywords:
  • sign language
  • corpus annotation
  • language data representation
  • data conversion
Description:
  • This upload is the accepted version of the paper. For the closed-access published version, follow the URL in the citation.

    Cite as

    M. Schulder, S. Bigeard, T. Hanke and M. Kopf, "The Sign Language Interchange Format: Harmonising Sign Language Datasets For Computational Processing," 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops: Sign Language Translation and Avatar Technology, Rhodes Island, Greece, 2023, doi: 10.1109/ICASSPW59220.2023.10193022. URL: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10193022&isnumber=10192577

    Abstract

    We introduce the Sign Language Interchange Format (SLIF), a new format for representing annotations and lexical inventories of sign language datasets. The format is designed as an intermediate step in data preparation for language technologies, unifying the annotation conventions of different corpora for further use. Complex gloss notations and implicit relations between tiers are made explicit through a hierarchy of machine-readable container structures. Sample implementations for converting to and from the new format are provided.

  • ©2023 IEEE
relatedIdentifier:
  • DOI 10.25592/uhhfdm.12015
  • DOI 10.25592/uhhfdm.12681
  • DOI 10.25592/uhhfdm.12050
Licenses:
  • https://creativecommons.org/licenses/by/4.0/legalcode
  • info:eu-repo/semantics/openAccess
Source system:
Research Data Repository of Universität Hamburg (UHH)

Internal metadata
Source record
oai:fdr.uni-hamburg.de:12051