Dataset Open Access
<?xml version='1.0' encoding='UTF-8'?> <record xmlns="http://www.loc.gov/MARC21/slim"> <leader>00000nmm##2200000uu#4500</leader> <datafield tag="542" ind1=" " ind2=" "> <subfield code="l">open</subfield> </datafield> <datafield tag="909" ind1="C" ind2="O"> <subfield code="o">oai:fdr.uni-hamburg.de:17676</subfield> <subfield code="p">user-inel</subfield> <subfield code="p">user-adwhh</subfield> <subfield code="p">user-uhh</subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="a">10.25592/uhhfdm.17675</subfield> <subfield code="i">isVersionOf</subfield> <subfield code="n">doi</subfield> </datafield> <controlfield tag="005">20250722105913.0</controlfield> <datafield tag="540" ind1=" " ind2=" "> <subfield code="u">https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode</subfield> <subfield code="a">Creative Commons Attribution Non Commercial Share Alike 4.0 International</subfield> </datafield> <datafield tag="260" ind1=" " ind2=" "> <subfield code="c">2025-07-17</subfield> </datafield> <datafield tag="041" ind1=" " ind2=" "> <subfield code="a">xal</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">endangered language</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">indigenous language</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">language contact</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">language documentation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">INEL</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">folklore</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">narrative</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">monologue</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">morphological glossing</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">English translation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">Russian translation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">EXMARaLDA</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">ELAN</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">XML</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">ISO/TEI</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">Mongolic languages</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">annotated corpus</subfield> </datafield> <datafield tag="245" ind1=" " ind2=" "> <subfield code="a">INEL Kalmyk Corpus</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">758388</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/17676/files/kalmyk-1.0-documentation.pdf</subfield> <subfield code="z">md5:944f208fd72cd8658f479d33bc98fcd6</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">8667672</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/17676/files/kalmyk-1.0-lite.zip</subfield> <subfield code="z">md5:9ae0dfd2e3bf56af4206eacc9e9326f3</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">377964899</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/17676/files/kalmyk-1.0-mp3.zip</subfield> <subfield code="z">md5:e3fbf20d950fd90e65b52f316dafd704</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">1835703820</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/17676/files/kalmyk-1.0-standard.zip</subfield> <subfield code="z">md5:45d29e3248b62b4549e464b207c91afe</subfield> </datafield> <datafield tag="650" ind1="1" ind2="7"> <subfield code="a">cc-by</subfield> <subfield code="2">opendefinition.org</subfield> </datafield> <datafield tag="520" ind1=" " ind2=" "> <subfield code="a"><p><strong>Corpus citation</strong></p> <p><em>Baranova, Vlada</em>. 2025. INEL Kalmyk Corpus. Archived at Universit&auml;t Hamburg. Version 1.0. Publication date 2025-07-17.&nbsp;<a href="https://hdl.handle.net/11022/0000-0007-FFB1-2">https://hdl.handle.net/11022/0000-0007-FFB1-2</a>. Archived at Universit&auml;t Hamburg. In: <em>The INEL Corpora of Indigenous Northern Eurasian Languages</em>.&nbsp;<a href="https://hdl.handle.net/11022/0000-0007-F45A-1">https://hdl.handle.net/11022/0000-0007-F45A-1</a>.</p> <p><strong>Corpus Description</strong></p> <p>The INEL Kalmyk Corpus has been created within the long-term INEL project (&quot;Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&quot;), 2016&ndash;2033.</p> <p>The corpus consists of transcribed audio recordings collected in the Republic of Kalmykia between 2007 and 2018 in the Ketchenerovsky District (Derbet&nbsp; and Torgut dialect).</p> <p>All texts in the corpus are provided with interlinear morpheme-by-morpheme glosses and translation into English and Russian. All texts for which the audio recordings were accessible are time-aligned with them.&nbsp;</p> <p><strong>Corpus Size</strong></p> <p>The corpus contains <strong>55 </strong>texts, <strong>2,076 </strong>sentences, and <strong>19,742&nbsp;</strong>tokens. The total duration of the audio recordings is <strong>4 </strong>hours and <strong>23 </strong>minutes.</p> <p><strong>Funding</strong></p> <p>The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies&rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.</p> <p><strong>Contributions / Acknowledgements</strong></p> <p>Native speakers generously shared their knowledge of Kalmyk, making the creation of this corpus possible. Zamira Xejchieva and Galina Cabdy`rova assisted with oral transcription and the Russian translation of the audio materials.</p> <p>Part of the materials were recorded during joint expeditions of St. Petersburg University and the Institute for Linguistic Studies of the Russian Academy of Sciences in 2007&ndash;2008, under the direction of Elena Perekhvalskaya and Sergey Say.</p> <p>This corpus primarily follows the transcription system and partially adopts the glossing conventions developed by a research team led by Sergey Say, with input from other expedition participants.</p> <p><strong>Searching the corpus</strong></p> <p>The corpus can be downloaded from the ZFDM Repository using the links provided below and browsed or searched locally using the&nbsp;<a href="https://exmaralda.org/">EXMARaLDA</a>&nbsp;software or, alternatively,&nbsp;<a href="https://archive.mpi.nl/tla/elan">ELAN</a>.</p> <p>Online search with Tsakorpus platform is available at&nbsp;<a href="https://inel.corpora.uni-hamburg.de/KalmykCorpus/search">https://inel.corpora.uni-hamburg.de/KalmykCorpus/search</a>.</p> <p>Remote search with EXMARaLDA is also possible without downloading all the files (see&nbsp;<a href="https://inel.corpora.uni-hamburg.de/portal/help/en/index.php">https://inel.corpora.uni-hamburg.de/portal/help/en/index.php</a>).</p> <p>See the user documentation&nbsp;(section 3) for details on transcription, annotation tiers and annotation tags.<br> Find further information and links on the Kalmyk Corpus page at the INEL Resources portal:&nbsp;<a href="https://inel.corpora.uni-hamburg.de/portal/corpora/kalmyk/">https://inel.corpora.uni-hamburg.de/portal/corpora/kalmyk/</a>.</p></subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">dataset</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-adwhh</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-inel</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-uhh</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Lazarenko, Elena</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Riaposov, Aleksandr</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Arkhipov, Alexandre</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">edt</subfield> </datafield> <datafield tag="024" ind1=" " ind2=" "> <subfield code="a">10.25592/uhhfdm.17676</subfield> <subfield code="2">doi</subfield> </datafield> <datafield tag="024" ind1=" " ind2=" "> <subfield code="a">11022/0000-0007-FFB1-2</subfield> <subfield code="2">handle</subfield> <subfield code="q">alternateidentifier</subfield> </datafield> <datafield tag="100" ind1=" " ind2=" "> <subfield code="a">Baranova, Vlada</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="0">(orcid)0000-0003-1642-4003</subfield> </datafield> <controlfield tag="001">17676</controlfield> </record>