<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-3" xsi:schemaLocation="http://datacite.org/schema/kernel-3 http://schema.datacite.org/meta/kernel-3/metadata.xsd">
<identifier identifierType="DOI">10.25592/uhhfdm.17676</identifier>
<creators>
<creator>
<creatorName>Baranova, Vlada</creatorName>
<nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0003-1642-4003</nameIdentifier>
<affiliation>Universität Hamburg</affiliation>
</creator>
</creators>
<titles>
<title>INEL Kalmyk Corpus</title>
</titles>
<publisher>Universität Hamburg</publisher>
<publicationYear>2025</publicationYear>
<subjects>
<subject>endangered language</subject>
<subject>indigenous language</subject>
<subject>language contact</subject>
<subject>language documentation</subject>
<subject>INEL</subject>
<subject>folklore</subject>
<subject>narrative</subject>
<subject>monologue</subject>
<subject>morphological glossing</subject>
<subject>English translation</subject>
<subject>Russian translation</subject>
<subject>EXMARaLDA</subject>
<subject>ELAN</subject>
<subject>XML</subject>
<subject>ISO/TEI</subject>
<subject>Mongolic languages</subject>
<subject>annotated corpus</subject>
</subjects>
<contributors>
<contributor contributorType="DataManager">
<contributorName>Lazarenko, Elena</contributorName>
<affiliation>Universität Hamburg</affiliation>
</contributor>
<contributor contributorType="DataManager">
<contributorName>Riaposov, Aleksandr</contributorName>
<affiliation>Universität Hamburg</affiliation>
</contributor>
<contributor contributorType="Editor">
<contributorName>Arkhipov, Alexandre</contributorName>
<affiliation>Universität Hamburg</affiliation>
</contributor>
</contributors>
<dates>
<date dateType="Issued">2025-07-17</date>
</dates>
<resourceType resourceTypeGeneral="Dataset"/>
<alternateIdentifiers>
<alternateIdentifier alternateIdentifierType="handle">11022/0000-0007-FFB1-2</alternateIdentifier>
<alternateIdentifier alternateIdentifierType="url">https://www.fdr.uni-hamburg.de/record/17676</alternateIdentifier>
</alternateIdentifiers>
<relatedIdentifiers>
<relatedIdentifier relatedIdentifierType="DOI" relationType="IsPartOf">10.25592/uhhfdm.17675</relatedIdentifier>
</relatedIdentifiers>
<version>1.0</version>
<rightsList>
<rights rightsURI="https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode">Creative Commons Attribution Non Commercial Share Alike 4.0 International</rights>
<rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
</rightsList>
<descriptions>
<description descriptionType="Abstract"><p><strong>Corpus citation</strong></p>
<p><em>Baranova, Vlada</em>. 2025. INEL Kalmyk Corpus. Version 1.0. Publication date 2025-07-17.&nbsp;<a href="https://hdl.handle.net/11022/0000-0007-FFB1-2">https://hdl.handle.net/11022/0000-0007-FFB1-2</a>. Archived at Universit&auml;t Hamburg. In: <em>The INEL Corpora of Indigenous Northern Eurasian Languages</em>.&nbsp;<a href="https://hdl.handle.net/11022/0000-0007-F45A-1">https://hdl.handle.net/11022/0000-0007-F45A-1</a>.</p>
<p><strong>Corpus Description</strong></p>
<p>The INEL Kalmyk Corpus has been created within the long-term INEL project (&quot;Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&quot;), 2016&ndash;2033.</p>
<p>The corpus consists of transcribed audio recordings collected between 2007 and 2018 in the Ketchenerovsky District of the Republic of Kalmykia (Derbet and Torgut dialects).</p>
<p>All texts in the corpus are provided with interlinear morpheme-by-morpheme glosses and translations into English and Russian. All texts for which audio recordings were accessible are time-aligned with them.</p>
<p><strong>Corpus Size</strong></p>
<p>The corpus contains <strong>55</strong> texts, <strong>2,076</strong> sentences, and <strong>19,742</strong> tokens. The total duration of the audio recordings is <strong>4</strong> hours and <strong>23</strong> minutes.</p>
<p><strong>Funding</strong></p>
<p>The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies&rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.</p>
<p><strong>Contributions / Acknowledgements</strong></p>
<p>Native speakers generously shared their knowledge of Kalmyk, making the creation of this corpus possible. Zamira Xejchieva and Galina Cabdy`rova assisted with oral transcription and the Russian translation of the audio materials.</p>
<p>Some of the materials were recorded during joint expeditions of St. Petersburg University and the Institute for Linguistic Studies of the Russian Academy of Sciences in 2007&ndash;2008, under the direction of Elena Perekhvalskaya and Sergey Say.</p>
<p>This corpus primarily follows the transcription system and partially adopts the glossing conventions developed by a research team led by Sergey Say, with input from other expedition participants.</p>
<p><strong>Searching the corpus</strong></p>
<p>The corpus can be downloaded from the ZFDM Repository using the links provided below and browsed or searched locally using the&nbsp;<a href="https://exmaralda.org/">EXMARaLDA</a>&nbsp;software or, alternatively,&nbsp;<a href="https://archive.mpi.nl/tla/elan">ELAN</a>.</p>
<p>Online search with the Tsakorpus platform is available at&nbsp;<a href="https://inel.corpora.uni-hamburg.de/KalmykCorpus/search">https://inel.corpora.uni-hamburg.de/KalmykCorpus/search</a>.</p>
<p>Remote search with EXMARaLDA is also possible without downloading all the files (see&nbsp;<a href="https://inel.corpora.uni-hamburg.de/portal/help/en/index.php">https://inel.corpora.uni-hamburg.de/portal/help/en/index.php</a>).</p>
<p>See the user documentation (section 3) for details on transcription, annotation tiers and annotation tags.<br/>
Find further information and links on the Kalmyk Corpus page at the INEL Resources portal:&nbsp;<a href="https://inel.corpora.uni-hamburg.de/portal/corpora/kalmyk/">https://inel.corpora.uni-hamburg.de/portal/corpora/kalmyk/</a>.</p></description>
</descriptions>
</resource>