Dataset Open Access
<?xml version='1.0' encoding='utf-8'?> <resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-3" xsi:schemaLocation="http://datacite.org/schema/kernel-3 http://schema.datacite.org/meta/kernel-3/metadata.xsd"> <identifier identifierType="DOI">10.25592/uhhfdm.17676</identifier> <creators> <creator> <creatorName>Baranova, Vlada</creatorName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0003-1642-4003</nameIdentifier> <affiliation>Universität Hamburg</affiliation> </creator> </creators> <titles> <title>INEL Kalmyk Corpus</title> </titles> <publisher>Universität Hamburg</publisher> <publicationYear>2025</publicationYear> <subjects> <subject>endangered language</subject> <subject>indigenous language</subject> <subject>language contact</subject> <subject>language documentation</subject> <subject>INEL</subject> <subject>folklore</subject> <subject>narrative</subject> <subject>monologue</subject> <subject>morphological glossing</subject> <subject>English translation</subject> <subject>Russian translation</subject> <subject>EXMARaLDA</subject> <subject>ELAN</subject> <subject>XML</subject> <subject>ISO/TEI</subject> <subject>Mongolic languages</subject> <subject>annotated corpus</subject> </subjects> <contributors> <contributor contributorType="DataManager"> <contributorName>Lazarenko, Elena</contributorName> <affiliation>Universität Hamburg</affiliation> </contributor> <contributor contributorType="DataManager"> <contributorName>Riaposov, Aleksandr</contributorName> <affiliation>Universität Hamburg</affiliation> </contributor> <contributor contributorType="Editor"> <contributorName>Arkhipov, Alexandre</contributorName> <affiliation>Universität Hamburg</affiliation> </contributor> </contributors> <dates> <date dateType="Issued">2025-07-17</date> </dates> <resourceType resourceTypeGeneral="Dataset"/> <alternateIdentifiers> <alternateIdentifier alternateIdentifierType="handle">11022/0000-0007-FFB1-2</alternateIdentifier> <alternateIdentifier alternateIdentifierType="url">https://www.fdr.uni-hamburg.de/record/17676</alternateIdentifier> </alternateIdentifiers> <relatedIdentifiers> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsPartOf">10.25592/uhhfdm.17675</relatedIdentifier> </relatedIdentifiers> <version>1.0</version> <rightsList> <rights rightsURI="https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode">Creative Commons Attribution Non Commercial Share Alike 4.0 International</rights> <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights> </rightsList> <descriptions> <description descriptionType="Abstract"><p><strong>Corpus citation</strong></p> <p><em>Baranova, Vlada</em>. 2025. INEL Kalmyk Corpus. Archived at Universit&auml;t Hamburg. Version 1.0. Publication date 2025-07-17.&nbsp;<a href="https://hdl.handle.net/11022/0000-0007-FFB1-2">https://hdl.handle.net/11022/0000-0007-FFB1-2</a>. Archived at Universit&auml;t Hamburg. In: <em>The INEL Corpora of Indigenous Northern Eurasian Languages</em>.&nbsp;<a href="https://hdl.handle.net/11022/0000-0007-F45A-1">https://hdl.handle.net/11022/0000-0007-F45A-1</a>.</p> <p><strong>Corpus Description</strong></p> <p>The INEL Kalmyk Corpus has been created within the long-term INEL project (&quot;Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&quot;), 2016&ndash;2033.</p> <p>The corpus consists of transcribed audio recordings collected in the Republic of Kalmykia between 2007 and 2018 in the Ketchenerovsky District (Derbet&nbsp; and Torgut dialect).</p> <p>All texts in the corpus are provided with interlinear morpheme-by-morpheme glosses and translation into English and Russian. All texts for which the audio recordings were accessible are time-aligned with them.&nbsp;</p> <p><strong>Corpus Size</strong></p> <p>The corpus contains <strong>55 </strong>texts, <strong>2,076 </strong>sentences, and <strong>19,742&nbsp;</strong>tokens. The total duration of the audio recordings is <strong>4 </strong>hours and <strong>23 </strong>minutes.</p> <p><strong>Funding</strong></p> <p>The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies&rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.</p> <p><strong>Contributions / Acknowledgements</strong></p> <p>Native speakers generously shared their knowledge of Kalmyk, making the creation of this corpus possible. Zamira Xejchieva and Galina Cabdy`rova assisted with oral transcription and the Russian translation of the audio materials.</p> <p>Part of the materials were recorded during joint expeditions of St. Petersburg University and the Institute for Linguistic Studies of the Russian Academy of Sciences in 2007&ndash;2008, under the direction of Elena Perekhvalskaya and Sergey Say.</p> <p>This corpus primarily follows the transcription system and partially adopts the glossing conventions developed by a research team led by Sergey Say, with input from other expedition participants.</p> <p><strong>Searching the corpus</strong></p> <p>The corpus can be downloaded from the ZFDM Repository using the links provided below and browsed or searched locally using the&nbsp;<a href="https://exmaralda.org/">EXMARaLDA</a>&nbsp;software or, alternatively,&nbsp;<a href="https://archive.mpi.nl/tla/elan">ELAN</a>.</p> <p>Online search with Tsakorpus platform is available at&nbsp;<a href="https://inel.corpora.uni-hamburg.de/KalmykCorpus/search">https://inel.corpora.uni-hamburg.de/KalmykCorpus/search</a>.</p> <p>Remote search with EXMARaLDA is also possible without downloading all the files (see&nbsp;<a href="https://inel.corpora.uni-hamburg.de/portal/help/en/index.php">https://inel.corpora.uni-hamburg.de/portal/help/en/index.php</a>).</p> <p>See the user documentation&nbsp;(section 3) for details on transcription, annotation tiers and annotation tags.<br> Find further information and links on the Kalmyk Corpus page at the INEL Resources portal:&nbsp;<a href="https://inel.corpora.uni-hamburg.de/portal/corpora/kalmyk/">https://inel.corpora.uni-hamburg.de/portal/corpora/kalmyk/</a>.</p></description> </descriptions> </resource>