Dataset Open Access
Däbritz, Chris Lasse;
Gusev, Valentin;
Stoynova, Natalia
Corpus Citation
Däbritz, Chris Lasse; Gusev, Valentin; Stoynova, Natalia. 2024. INEL Evenki Corpus. Version 2.0. Publication date 2024-12-31. Archived at Universität Hamburg. https://hdl.handle.net/11022/0000-0007-FE38-D. In: The INEL corpora of indigenous Northern Eurasian languages. https://hdl.handle.net/11022/0000-0007-F45A-1
Corpus Description
The INEL Evenki Corpus has been created within the long-term INEL project (Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages), 2016–2033.
The corpus makes possible typologically aware corpus-based grammatical research on the Evenki (< Tungusic) language and expands the documentation of the lesser described indigenous languages of Northern Eurasia.
The INEL Evenki Corpus covers Northern (Taimyr, Khantayskoe Ozero, Ilimpi, Yerbogachyon) and Southern (Sym, Barhahan, and to a smaller extent Stony Tunguska and Nepa) Evenki dialects. These are exactly the dialects which are or were in contact with other languages included in the INEL project, that is first and foremost Dolgan and Selkup. The INEL Evenki Corpus contains texts from different sources:
Each text in the corpus is provided with morphological glossing, translation into English, Russian, and German, as well as annotation of Russian borrowings. Some texts also have annotations for syntactic functions, semantic roles, information status, as well as for existential, locative, and possessive predication.
Corpus size
New in release 2.0
Funding
The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies’ Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies’ Programme is coordinated by the Union of the German Academies of Sciences and Humanities.
Contributions/Acknowledgements
Searching the corpus
The corpus can be downloaded from the ZFDM Repository using the links provided below and browsed or searched locally using the EXMARaLDA software or, alternatively, ELAN.
Online search with Tsakorpus platform is available at https://inel.corpora.uni-hamburg.de/EvenkiCorpus/search.
Remote search with EXMARaLDA is also possible without downloading all the files (see https://inel.corpora.uni-hamburg.de/portal/help/en/index.php#search).
See the user documentation (section 3) for details on transcription, annotation tiers and annotation tags. Find further information and links on the Evenki Corpus page at the INEL Resources portal: https://inel.corpora.uni-hamburg.de/portal/corpora/evenki/.
Name | Size | |
---|---|---|
evenki-2.0-documentation.pdf
md5:8c3472ec27035d8d56c70d50b57dc55d |
2.5 MB | Download |
evenki-2.0-lite.zip
md5:395717280876078cd33d54382b9717e1 |
61.5 MB | Download |
evenki-2.0-mp3.zip
md5:1476fa0e1374563b41e2c32850d2d4aa |
1.0 GB | Download |
evenki-2.0-standard.zip
md5:e578975ec4c2517a30e7aed597338e15 |
2.0 GB | Download |