Dataset Open Access

INEL Selkup Corpus

Brykina, Maria; Orlova, Svetlana; Wagner-Nagy, Beáta

Data manager(s)
Ferger, Anne; Jettka, Daniel; Lehmberg, Timm
Researcher(s)
Wagner-Nagy, Be´ata; Arkhipov, Alexandre; Brykina, Maria; Orlova, Svetlana

Corpus Citation

Brykina, Maria; Orlova, Svetlana; Wagner-Nagy, Beáta. 2018. INEL Selkup Corpus. Version 0.1. Publication date 2018-12-31. Archived in Hamburger Zentrum für Sprachkorpora. https://hdl.handle.net/11022/0000-0007-CAE5-3. In: Wagner-Nagy, Beáta; Arkhipov, Alexandre; Ferger, Anne; Jettka, Daniel; Lehmberg, Timm (eds.). 2018. The INEL corpora of indigenous Northern Eurasian languages.

Corpus Description

The INEL Selkup corpus has been created within the long-term INEL project ("Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages”), 2016–2033. The corpus makes possible typologically aware corpus-based grammatical research on the Selkup language and expands the documentation of the lesser described indigenous languages of Northern Eurasia.

The INEL Selkup corpus is composed of texts from the archive of Angelina Ivanovna Kuzmina (1924–2002), who gathered a large amount of material on Selkup in almost all regions where the Selkup people lived in 1962–1977. Most texts in the corpus originate from the handwritten part of the archive, the others come from sound recordings made by A.I. Kuzmina, transcribed and translated within the INEL project.

Each text in the corpus is provided with morphological glossing, translation into English, Russian and German, as well as annotation of Russian borrowings. Some texts also have annotations for syntactic functions, semantic roles and information status.

Funding

The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies’ Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies’ Programme is coordinated by the Union of the German Academies of Sciences and Humanities.

Contributions/Acknowledgements

Sound materials of Angelina Kuzmina were transcribed and translated by native speakers of Selkup:

  • Svetlana Nikitichna Sankevich (Kunina), oral transcription and Russian translation of texts in Northern dialects
  • Evgeniya Sergeevna Smorgunova (Irikova), oral and written transcription and Russian translation of audio texts in Northern dialects
  • Valentina Vladimirovna Tamel`kina, oral transcription and Russian translation of audio texts in Northern dialects

The web-based search interface is using the Tsakonian Corpus platform developed by Dr. Timofey Arkhangelskiy, Humboldt Research Fellow at IFUU, Hamburg University

Files (1.8 GB)
Name Size
selkup-0.1-documentation.pdf
md5:60b26d49612226939f460c7822720f53
1.6 MB Download
selkup-0.1-mp3only.zip
md5:2f43cca0d13b8abd772dd06b7309ef0f
576.7 MB Download
selkup-0.1-noaudio.zip
md5:c3161c97775b926bb2c050cb6ccff1bc
530.1 MB Download
selkup-0.1.zip
md5:d4029b839bdfc70d07f381a1bac298c8
704.2 MB Download

Cite record as