Brykina, Maria; Orlova, Svetlana; Wagner-Nagy, Beáta
<?xml version='1.0' encoding='UTF-8'?> <record xmlns="http://www.loc.gov/MARC21/slim"> <leader>00000nmm##2200000uu#4500</leader> <datafield tag="542" ind1=" " ind2=" "> <subfield code="l">open</subfield> </datafield> <datafield tag="909" ind1="C" ind2="O"> <subfield code="o">oai:fdr.uni-hamburg.de:9754</subfield> <subfield code="p">user-inel</subfield> <subfield code="p">user-adwhh</subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="a">11022/0000-0007-F4D9-1</subfield> <subfield code="i">isCitedBy</subfield> <subfield code="n">handle</subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="a">10.25592/uhhfdm.9721</subfield> <subfield code="i">isVersionOf</subfield> <subfield code="n">doi</subfield> </datafield> <controlfield tag="005">20250922124401.0</controlfield> <datafield tag="540" ind1=" " ind2=" "> <subfield code="u">https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode</subfield> <subfield code="a">Creative Commons Attribution Non Commercial Share Alike 4.0 International</subfield> </datafield> <datafield tag="260" ind1=" " ind2=" "> <subfield code="c">2021-12-31</subfield> </datafield> <datafield tag="041" ind1=" " ind2=" "> <subfield code="a">sel</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">endangered language</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">indigenous language</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">L1 data</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">language contact</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">language documentation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">INEL</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">folklore</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield 
code="a">narrative</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">monologue</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">annotated</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">morphological glossing</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">borrowings</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">code-switching</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">semantic roles</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">syntactic functions</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">information status</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">English translation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">German translation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">Russian translation</subfield> </datafield> <datafield tag="245" ind1=" " ind2=" "> <subfield code="a">INEL Selkup Corpus</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">2137037</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/9754/files/selkup-2.0-documentation.pdf</subfield> <subfield code="z">md5:89c2b2bb43ec96a964dfed3c78b8f8b4</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">2860057415</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/9754/files/selkup-2.0-mp3only.zip</subfield> <subfield code="z">md5:926ea692e12aa94e800d4048d7a72844</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">2136536614</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/9754/files/selkup-2.0-noaudio.zip</subfield> <subfield 
code="z">md5:f52eb35a19af7dff2ed15b789bbdc9ad</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">4764183965</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/9754/files/selkup-2.0.zip</subfield> <subfield code="z">md5:586223f15565e20aa3c2fd34d000b603</subfield> </datafield> <datafield tag="650" ind1="1" ind2="7"> <subfield code="a">cc-by</subfield> <subfield code="2">opendefinition.org</subfield> </datafield> <datafield tag="520" ind1=" " ind2=" "> <subfield code="a"><p><strong>Corpus Citation</strong></p> <p><em>Brykina, Maria; Orlova, Svetlana; Wagner-Nagy, Be&aacute;ta. 2021. &ldquo;INEL Selkup Corpus.&rdquo; Version 2.0. Publication date 2021-12-31. <a href="https://hdl.handle.net/11022/0000-0007-F4D9-1">https://hdl.handle.net/11022/0000-0007-F4D9-1</a>. Archived at Universit&auml;t Hamburg. In: The INEL corpora of indigenous Northern Eurasian languages. <a href="https://hdl.handle.net/11022/0000-0007-F45A-1">https://hdl.handle.net/11022/0000-0007-F45A-1</a></em></p> <p><strong>Corpus Description</strong></p> <p>The INEL Selkup corpus has been created within the long-term INEL project (&ldquo;Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&rdquo;), 2016&ndash;2033. The corpus enables typologically aware corpus-based grammatical research on the Selkup language and expands the documentation of the lesser-described indigenous languages of Northern Eurasia.</p> <p>The INEL Selkup corpus is composed of texts from the archive of Angelina Ivanovna Kuzmina (1924&ndash;2002), who, between 1962 and 1977, gathered a large amount of material on Selkup in almost all regions where the Selkup people lived. The archive was transferred by A.I. Kuzmina to Eugen Helimski and acquired by the Universit&auml;t Hamburg in 2001. Most texts in the corpus originate from the handwritten part of the archive; the others come from sound recordings made by A.I.
Kuzmina, transcribed and translated within the INEL project.</p> <p><strong>Funding</strong></p> <p>The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies&rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.</p> <p><strong>Contributions/Acknowledgements</strong></p> <p>Audio recordings made by Angelina Kuzmina were transcribed and translated by native speakers of Selkup:</p> <ul> <li>Irina Anatolyevna Korobejnikova, written transcription and Russian translation of audio in Central and Southern dialects</li> <li>Natalya Platonovna Izhenbina, written transcription and Russian translation of audio in Southern dialects</li> <li>Svetlana Nikitichna Sankevich (Kunina), oral transcription and Russian translation of audio in Northern dialects</li> <li>Evgeniya Sergeevna Smorgunova (Irikova), oral and written transcription and Russian translation of audio in Northern dialects</li> <li>Valentina Vladimirovna Tamelkina, oral transcription and Russian translation of audio in Northern dialects</li> </ul> <p>For individual contributions to the collecting, transcribing and analyzing of individual texts, please refer to the user documentation and to the corpus metadata.</p> <p>The web-based search interface uses the Tsakorpus corpus platform developed by Dr. Timofey Arkhangelskiy, Humboldt Research Fellow at IFUU, Hamburg University.</p> <p><strong>New in release 2.0</strong></p> <ul> <li>The corpus now contains 352 transcripts from 89 speakers, representing the dialects of Taz, Upper Tolka, Baikha (Northern), Narym and Tym (Central), Middle Ob, Chaya and Ket (Southern). These contain 14509 sentences and 81498 words in total.</li>
<li>Many texts have been provided with annotations for syntactic functions and semantic roles.</li> <li>Audio transcriptions, glossing and other annotations have been corrected.</li> <li>Dialectal attribution of several speakers has been revised.</li> <li>The remaining non-glossed texts from the Kuzmina archive have also been added to the corpus for completeness. These include 3 texts from the written part of the archive and 40 audio recordings, for 20 of which a preliminary transcription is provided.</li> </ul></subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">dataset</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-adwhh</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-inel</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Orlova, Svetlana</subfield> <subfield code="u">Universität Hamburg</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Wagner-Nagy, Beáta</subfield> <subfield code="u">Universität Hamburg</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Wagner-Nagy, Beáta</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Arkhipov, Alexandre</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Brykina, Maria</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Orlova, Svetlana</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield
code="a">Ferger, Anne</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Jettka, Daniel</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Lazarenko, Elena</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Lehmberg, Timm</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Riaposov, Aleksandr</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="024" ind1=" " ind2=" "> <subfield code="a">10.25592/uhhfdm.9754</subfield> <subfield code="2">doi</subfield> </datafield> <datafield tag="100" ind1=" " ind2=" "> <subfield code="a">Brykina, Maria</subfield> <subfield code="u">Universität Hamburg</subfield> </datafield> <controlfield tag="001">9754</controlfield> </record>