Dataset Open Access

INEL Kalmyk Corpus

Baranova, Vlada


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-3" xsi:schemaLocation="http://datacite.org/schema/kernel-3 http://schema.datacite.org/meta/kernel-3/metadata.xsd">
  <identifier identifierType="DOI">10.25592/uhhfdm.17676</identifier>
  <creators>
    <creator>
      <creatorName>Baranova, Vlada</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0003-1642-4003</nameIdentifier>
      <affiliation>Universität Hamburg</affiliation>
    </creator>
  </creators>
  <titles>
    <title>INEL Kalmyk Corpus</title>
  </titles>
  <publisher>Universität Hamburg</publisher>
  <publicationYear>2025</publicationYear>
  <subjects>
    <subject>endangered language</subject>
    <subject>indigenous language</subject>
    <subject>language contact</subject>
    <subject>language documentation</subject>
    <subject>INEL</subject>
    <subject>folklore</subject>
    <subject>narrative</subject>
    <subject>monologue</subject>
    <subject>morphological glossing</subject>
    <subject>English translation</subject>
    <subject>Russian translation</subject>
    <subject>EXMARaLDA</subject>
    <subject>ELAN</subject>
    <subject>XML</subject>
    <subject>ISO/TEI</subject>
    <subject>Mongolic languages</subject>
    <subject>annotated corpus</subject>
  </subjects>
  <contributors>
    <contributor contributorType="DataManager">
      <contributorName>Lazarenko, Elena</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="DataManager">
      <contributorName>Riaposov, Aleksandr</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="Editor">
      <contributorName>Arkhipov, Alexandre</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
  </contributors>
  <dates>
    <date dateType="Issued">2025-07-17</date>
  </dates>
  <resourceType resourceTypeGeneral="Dataset"/>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="handle">11022/0000-0007-FFB1-2</alternateIdentifier>
    <alternateIdentifier alternateIdentifierType="url">https://www.fdr.uni-hamburg.de/record/17676</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsPartOf">10.25592/uhhfdm.17675</relatedIdentifier>
  </relatedIdentifiers>
  <version>1.0</version>
  <rightsList>
    <rights rightsURI="https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode">Creative Commons Attribution Non Commercial Share Alike 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;&lt;strong&gt;Corpus citation&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Baranova, Vlada&lt;/em&gt;. 2025. INEL Kalmyk Corpus. Archived at Universit&amp;auml;t Hamburg. Version 1.0. Publication date 2025-07-17.&amp;nbsp;&lt;a href="https://hdl.handle.net/11022/0000-0007-FFB1-2"&gt;https://hdl.handle.net/11022/0000-0007-FFB1-2&lt;/a&gt;. Archived at Universit&amp;auml;t Hamburg. In: &lt;em&gt;The INEL Corpora of Indigenous Northern Eurasian Languages&lt;/em&gt;.&amp;nbsp;&lt;a href="https://hdl.handle.net/11022/0000-0007-F45A-1"&gt;https://hdl.handle.net/11022/0000-0007-F45A-1&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Corpus Description&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The INEL Kalmyk Corpus has been created within the long-term INEL project (&amp;quot;Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&amp;quot;), 2016&amp;ndash;2033.&lt;/p&gt;

&lt;p&gt;The corpus consists of transcribed audio recordings collected in the Republic of Kalmykia between 2007 and 2018 in the Ketchenerovsky District (Derbet&amp;nbsp; and Torgut dialect).&lt;/p&gt;

&lt;p&gt;All texts in the corpus are provided with interlinear morpheme-by-morpheme glosses and translation into English and Russian. All texts for which the audio recordings were accessible are time-aligned with them.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Corpus Size&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The corpus contains &lt;strong&gt;55 &lt;/strong&gt;texts, &lt;strong&gt;2,076 &lt;/strong&gt;sentences, and &lt;strong&gt;19,742&amp;nbsp;&lt;/strong&gt;tokens. The total duration of the audio recordings is &lt;strong&gt;4 &lt;/strong&gt;hours and &lt;strong&gt;23 &lt;/strong&gt;minutes.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Funding&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&amp;rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies&amp;rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Contributions / Acknowledgements&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Native speakers generously shared their knowledge of Kalmyk, making the creation of this corpus possible. Zamira Xejchieva and Galina Cabdy`rova assisted with oral transcription and the Russian translation of the audio materials.&lt;/p&gt;

&lt;p&gt;Part of the materials were recorded during joint expeditions of St. Petersburg University and the Institute for Linguistic Studies of the Russian Academy of Sciences in 2007&amp;ndash;2008, under the direction of Elena Perekhvalskaya and Sergey Say.&lt;/p&gt;

&lt;p&gt;This corpus primarily follows the transcription system and partially adopts the glossing conventions developed by a research team led by Sergey Say, with input from other expedition participants.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Searching the corpus&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The corpus can be downloaded from the ZFDM Repository using the links provided below and browsed or searched locally using the&amp;nbsp;&lt;a href="https://exmaralda.org/"&gt;EXMARaLDA&lt;/a&gt;&amp;nbsp;software or, alternatively,&amp;nbsp;&lt;a href="https://archive.mpi.nl/tla/elan"&gt;ELAN&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Online search with Tsakorpus platform is available at&amp;nbsp;&lt;a href="https://inel.corpora.uni-hamburg.de/KalmykCorpus/search"&gt;https://inel.corpora.uni-hamburg.de/KalmykCorpus/search&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Remote search with EXMARaLDA is also possible without downloading all the files (see&amp;nbsp;&lt;a href="https://inel.corpora.uni-hamburg.de/portal/help/en/index.php"&gt;https://inel.corpora.uni-hamburg.de/portal/help/en/index.php&lt;/a&gt;).&lt;/p&gt;

&lt;p&gt;See the user documentation&amp;nbsp;(section 3) for details on transcription, annotation tiers and annotation tags.&lt;br&gt;
Find further information and links on the Kalmyk Corpus page at the INEL Resources portal:&amp;nbsp;&lt;a href="https://inel.corpora.uni-hamburg.de/portal/corpora/kalmyk/"&gt;https://inel.corpora.uni-hamburg.de/portal/corpora/kalmyk/&lt;/a&gt;.&lt;/p&gt;</description>
  </descriptions>
</resource>

Cite record as