The Hamburg MapTask Corpus (HAMATAC)

Link:
Author:
Publisher/Institution:
Universität Hamburg
Year of publication:
2010
Media type:
Dataset
Keywords:
  • adult L2 acquisition
  • learner corpus
  • task-oriented communication
  • successive bilingualism
  • L2 data
  • adult bilingualism
  • simultaneous bilingualism
  • map task
  • EXMARaLDA
  • linguistics
  • German
Description:
  • Audio and two video recordings of map tasks with adult L2 users of German and one L1 speaker. The speakers' L1 and their L2 proficiencies vary. The maps used for the tasks are available.

    The Hamburg MapTask Corpus (HAMATAC) is a spoken language corpus documenting the performance of 24 L2 learners of German in a map task. HAMATAC was recorded and transcribed in project Z2 at the Research Centre on Multilingualism. The current version, 0.3, adds a new communication with a video recording to the resources known from the previous version, including orthographic transcriptions of the recordings, manual annotation of disfluencies, and automatic annotation of parts of speech and lemmas.

     

    CLARIN Metadata summary for The Hamburg MapTask Corpus (HAMATAC) (CMDI-based)

    Title: The Hamburg MapTask Corpus (HAMATAC)
    Description: Audio and two video recordings of map tasks with adult L2 users of German and one L1 speaker. The speakers' L1 and their L2 proficiencies vary. The maps used for the tasks are available.
    Publication date: 2010-09-16
    Data owner: Hamburger Zentrum für Sprachkorpora, Max-Brauer-Allee 60 / D-22765 Hamburg, corpora@uni-hamburg.de
    Contributors: Hamburger Zentrum für Sprachkorpora, Max-Brauer-Allee 60 / D-22765 Hamburg, corpora@uni-hamburg.de (compiler)
    Project: Z2 "Computer Assisted Methods for the creation and analysis of multilingual data", German Research Foundation (DFG)
    Keywords: adult L2 acquisition, learner corpus, task-oriented communication, successive bilingualism, L2 data, adult bilingualism, simultaneous bilingualism, map task, EXMARaLDA
    Language: German (deu)
    Size: 28 speakers (16 female, 12 male), 26 communications, 26 recordings, 208 minutes, 26 transcriptions, 22898 words
    Annotation types: transcription (manual): orthographic transcription/simplified HIAT; pos: fine-grained part-of-speech tagging using TreeTagger and the STTS tagset; pos-sup: superordinate part of speech (manual, STTS tagset); c: indicates that the automatic pos annotation is incorrect; lemma: lemma (TreeTagger); disfluency: manual annotation of disfluency phenomena; pho: manual annotation of phonetic phenomena
    Temporal Coverage: 2009-10-28/2013-06-19
    Spatial Coverage: Hamburg, DE
    Genre: discourse
    Modality: spoken

     

Relations:
DOI 10.25592/uhhfdm.1479
License:
  • info:eu-repo/semantics/restrictedAccess
Source system:
Research Data Repository of Universität Hamburg (Forschungsdatenrepositorium der UHH)

Internal metadata
Source record
oai:fdr.uni-hamburg.de:1480