INEL Tavda Mansi Corpus
- Link:
- Autor/in:
- Beteiligte Personen:
-
- Wagner-Nagy, Beáta
- Arkhipov, Alexandre
- Brykina, Maria
- Lazarenko, Elena
- Riaposov, Aleksandr
- Verlag/Körperschaft:
- Universität Hamburg
- Erscheinungsjahr:
- 2025
- Medientyp:
- Datensatz
- Schlagworte:
-
- Uralic
- Mansi
- Tavda Mansi
- endangered language
- language contact
- language documentation
- legacy data
- INEL
- AdWHH
- text corpus
- parallel texts
- folklore
- tales
- narrative
- song
- transcription
- morphological glossing
- part-of-speech
- borrowings
- dialogue
- English translation
- Russian translation
- EXMARaLDA
- ELAN
- XML
- ISO/TEI
- German translation
- Hungarian translation
- existential predication
- locative predication
- possessive predication
- Ob-Ugric languages
- semantic role
- syntactic function
- Beschreibung:
-
Corpus Citation
Sipőcz, Katalin & Wagner-Nagy, Beáta. 2025. INEL Tavda Mansi Corpus. Version 1.0. Publication date 2025-05-15. https://hdl.handle.net/11022/0000-0007-FE69-6. Archived at Universität Hamburg. In: The INEL corpora of indigenous Northern Eurasian languages. https://hdl.handle.net/11022/0000-0007-F45A-1
Corpus Description
The present corpus of Tavda Mansi has been created as part of the long-term research project INEL (“Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages”) in the context of the Academies’ Programme, coordinated by the Union of the German Academies of Sciences and Humanities.The INEL Tavda Mansi corpus at hand fills a gap in the documentation of the indigenous languages of Northern Eurasia and makes possible further descriptions of the language. Mansi is a relatively good described language: there are numerous descriptions and a corpus is also available, however, the Tavda variety is not included in the existing corpora.
The analysis of materials from the Tavda variety has already been conducted by Norbert Szilágyi., but he did not produce a corpus that could be searched and evaluated electronically. However, he has made his materials available under the URL: https://norbertszilagyi91.wixsite.com/tawdamansi. In the material published in the INEL corpus, the analyses differ significantly from Szilágyi's analysis. For the sake of comparison, the texts analysed by Szilágyi are appended to the corpus, and the Hungarian translations he provided have been retained, but some places have been corrected.
The INEL Tavda Mansi Corpus contains texts texts from different sources:- Kannisto, Artturi and Matti Liimola 1951: Wogulische Volksdichtung gesammelt und übersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola Volume I. Texte mythischen Inhalts. [Mémoires de la Société Finno-Ougrienne 101]. Helsinki: Suomalais-Ugrilainen Seura.
- Kannisto, Artturi and Matti Liimola 1955: Wogulische Volksdichtung gesammelt und übersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola Volume II. Kriegs und Heldensagen. [Mémoires de la Société Finno-Ougrienne 109]. Helsinki: Suomalais-Ugrilainen Seura.
- Kannisto, Artturi and Matti Liimola 1956: Wogulische Volksdichtung gesammelt und übersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola Volume III. Märchen. [Mémoires de la Société Finno-Ougrienne 111]. Helsinki: Suomalais-Ugrilainen Seura.
- Kannisto, Artturi and Matti Liimola 1958: Wogulische Volksdichtung gesammelt und übersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola Volume IV. Bärenlieder. [Mémoires de la Société Finno-Ougrienne 114]. Helsinki: Suomalais-Ugrilainen Seura.
- Kannisto, Artturi and Matti Liimola 1963: Wogulische Volksdichtung gesammelt und übersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola Volume VI. Schicksalslieder, Klagelieder, Kinderreime, Rätsel, Verschiedenes. [Mémoires de la Société Finno-Ougrienne 134]. Helsinki: Suomalais-Ugrilainen Seura.
- Munkácsi, Bernát 1896: Vogul népköltési gyűjtemény IV. Életképek. Budapest: Magyar Tudományos Akadémia.
Corpus size
The corpus currently contains 29 transcripts with 2,042 utterances and 11,879 tokens.
Funding
The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies’ Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The
Academies’ Programme is coordinated by the Union of the German Academies of Sciences and Humanities.Searching the corpus
The corpus can be downloaded from the ZFDM Repository using the links provided below and browsed or searched locally using the EXMARaLDA software or, alternatively, ELAN.
Online search with Tsakorpus platform is available at https://inel.corpora.uni-hamburg.de/TavdaMansiCorpus/search.
Remote search with EXMARaLDA is also possible without downloading all the files (see https://inel.corpora.uni-hamburg.de/portal/help/en/index.php).
See the user documentation (section 3) for details on transcription, annotation tiers and annotation tags. Find further information and links on the Mansi Corpus page at the INEL Resources portal: https://inel.corpora.uni-hamburg.de/portal/corpora/mansi/.
- Lizenzen:
-
- https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode
- info:eu-repo/semantics/openAccess
- Quellsystem:
- Forschungsdatenrepositorium der UHH
Interne Metadaten
- Quelldatensatz
- oai:fdr.uni-hamburg.de:17513