Dataset Open Access
Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Beáta
<?xml version='1.0' encoding='UTF-8'?> <record xmlns="http://www.loc.gov/MARC21/slim"> <leader>00000nmm##2200000uu#4500</leader> <datafield tag="542" ind1=" " ind2=" "> <subfield code="l">open</subfield> </datafield> <datafield tag="909" ind1="C" ind2="O"> <subfield code="o">oai:fdr.uni-hamburg.de:13882</subfield> <subfield code="p">user-inel</subfield> <subfield code="p">user-adwhh</subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="a">11022/0000-0007-FC25-4</subfield> <subfield code="i">isCitedBy</subfield> <subfield code="n">handle</subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="a">10.25592/uhhfdm.9740</subfield> <subfield code="i">isVersionOf</subfield> <subfield code="n">doi</subfield> </datafield> <controlfield tag="005">20250922131412.0</controlfield> <datafield tag="540" ind1=" " ind2=" "> <subfield code="u">https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode</subfield> <subfield code="a">Creative Commons Attribution Non Commercial Share Alike 4.0 International</subfield> </datafield> <datafield tag="260" ind1=" " ind2=" "> <subfield code="c">2023-12-29</subfield> </datafield> <datafield tag="041" ind1=" " ind2=" "> <subfield code="a">xas</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">endangered language</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">indigenous language</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">L1 data</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">language contact</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">language documentation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">INEL</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">folklore</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">narrative</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">monologue</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">annotated</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">morphological glossing</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">borrowings</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">code-switching</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">semantic roles</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">syntactic functions</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">information status</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">English translation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">German translation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">Russian translation</subfield> </datafield> <datafield tag="245" ind1=" " ind2=" "> <subfield code="a">INEL Kamas Corpus</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">229232</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/13882/files/kamas-2.0-documentation.pdf</subfield> <subfield code="z">md5:be551320e8e3f9f09ff95843c8da92d8</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">492709288</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/13882/files/kamas-2.0-mp3only.zip</subfield> <subfield code="z">md5:35631d0a5c5ecdb7f186829f5e87c6fd</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">84942035</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/13882/files/kamas-2.0-noaudio.zip</subfield> <subfield code="z">md5:145417dbd05f5304f9fc5a487352f95c</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">4100274255</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/13882/files/kamas-2.0.zip</subfield> <subfield code="z">md5:d09850583132ebe49983c98957c3c4cd</subfield> </datafield> <datafield tag="650" ind1="1" ind2="7"> <subfield code="a">cc-by</subfield> <subfield code="2">opendefinition.org</subfield> </datafield> <datafield tag="520" ind1=" " ind2=" "> <subfield code="a"><p><strong>Corpus Citation</strong></p> <p><em>Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Be&aacute;ta.</em> 2023. &ldquo;INEL Kamas Corpus.&rdquo; Version 2.0. Publication date 2023-12-31. <a href="http://hdl.handle.net/11022/0000-0007-FC25-4">http://hdl.handle.net/11022/0000-0007-FC25-4</a>. Archived at Universit&auml;t Hamburg. In: The INEL corpora of indigenous Northern Eurasian languages.<a href="https://hdl.handle.net/11022/0000-0007-F45A-1">https://hdl.handle.net/11022/0000-0007-F45A-1</a>.</p> <p><strong>Corpus Description</strong></p> <p>The INEL Kamas corpus has been created within the long-term INEL project (&quot;Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&quot;), 2016&ndash;2033. The corpus makes possible typologically aware corpus-based grammatical research on the Kamas language and expands the documentation of the lesser described indigenous languages of Northern Eurasia.</p> <p>The INEL Kamas corpus consists of two parts: folklore texts collected by Kai Donner in 1912&ndash;1914, and transcribed audio recordings of the last speaker of Kamas, Klavdiya Plotnikova, made between 1964 and 1970.</p> <p>Each text in the corpus is provided with morphological glossing, translation into English, Russian and German, as well as annotation of syntactic functions, semantic roles, Russian borrowings and code-switching. Some texts also have annotations for information status.</p> <p><strong>New in release 2.0</strong></p> <ul> <li>In texts from Donner&rsquo;s collection, phonetic transcription according to Klumpp|s edition of Donner&rsquo;s manuscripts has been added&nbsp;(as stl tier)</li> <li>Five texts which were originally split between different tapes have been merged, as well as respective parts of recordings. Sentences in each resulting text are numbered throughout <ul> <li>PKZ_196X_Alenushka_flk + PKZ_196X_Alenushka_continuation_flk &gt; PKZ_196X_Alenushka_flk</li> <li>End of PKZ_196X_SU0226 starting from PKZ_196X_SU0226.203 (210) + PKZ_196X_Alenushka2_continuation_flk &gt; PKZ_196X_Alenushka2_flk</li> <li>PKZ_196X_BlacksmithAndMerchant_flk + PKZ_196X_BlacksmithAndMerchant_cont_flk &gt; PKZ_196X_BlacksmithAndMerchant_flk</li> <li>PKZ_196X_Finist_flk + PKZ_196X_Finist_continuation_flk&nbsp;&gt;&nbsp;PKZ_196X_Finist_flk</li> <li>PKZ_196X_StupidWolf_flk + PKZ_196X_StupidWolf_continuation_flk &gt; PKZ_196X_StupidWolf_flk</li> </ul> </li> <li>Part of the texts are now annotated for existential, locative and possessive predication (ExLocPoss tier, by C.L.&nbsp;D&auml;britz)</li> <li>Numerous corrections in glosses, other annotations and transcriptions, including: <ul> <li>Fuller and more consistent transcription, glossing and annotations of borrowings</li> <li>Vowel length is marked in mp tier in <em>baːzoʔ</em> &lsquo;again&rsquo;, <em>b&uuml;ːzʼe</em> &lsquo;man&rsquo; and <em>saːgər</em> &lsquo;black&rsquo;</li> <li>Corrections in disambiguation of polysemous or homonymous morphemes:&nbsp;<br> -ziʔ&nbsp;&quot;INS&quot;/&quot;COM&quot;, -də &quot;LAT&quot;/&quot;3SG&quot;, mo- &quot;can/become/want | мочь/стать/хотеть&quot;</li> <li>Possessive suffix unmarked for case: &quot;NOM/GEN/ACC&quot; &gt; &quot;POSS&quot;</li> <li>Glosses for personal pronouns were changed to uniform labels: &quot;I | я&quot; &gt; &quot;PRO1SG&quot;, &quot;we | мы&quot; &gt; &quot;PRO1PL&quot;, &quot;you | ты&quot;&nbsp;&gt;&nbsp;&quot;PRO2SG&quot;, &quot;you.PL | вы&quot; &gt; &quot;PRO2PL&quot;</li> <li>Fuller annotations of code-switching and calques (CS tier)</li> </ul> </li> <li>Added ELAN *.eaf as a supplementary end-user file format for all transcripts</li> </ul> <p><strong>Funding</strong></p> <p>The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies&rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.</p> <p><strong>Contributions/Acknowledgements</strong></p> <ul> <li> <p>Recordings of Kamas speech made by Ago K&uuml;nnap in Abalakovo and by Tiit-Rein Viitso in Tartu provided by the Archive of Estonian Dialects and Kindred Languages of the University of Tartu, Estonia (AEDKL, or T&Uuml;EMSA).</p> </li> <li> <p>Recordings of Klavdiya Plotnikova made by Jaakko Yli-Paavola in Tallinn in 1970 provided by the Institute for the Languages of Finland archive, Helsinki (KOTUS).</p> </li> <li> <p>Scanned pages from the Kai Donners Kamassisches W&ouml;rterbuch (Joki 1944) containing texts collected by Kai Donner published online courtesy of the Finno-Ugrian Society.</p> </li> <li> <p>The web-based search interface is using the Tsakonian Corpus platform developed by Dr. Timofey Arkhangelskiy.</p> </li> </ul></subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">dataset</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-adwhh</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-inel</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Klooster, Tiina</subfield> <subfield code="u">Universität Hamburg</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Wagner-Nagy, Beáta</subfield> <subfield code="u">Universität Hamburg</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Wagner-Nagy, Beata</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Arkhipov, Alexandre</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Gusev, Valentin</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Klooster, Tiina</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Ferger, Anne</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Jettka, Daniel</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Lehmberg, Timm</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="024" ind1=" " ind2=" "> <subfield code="a">10.25592/uhhfdm.13882</subfield> <subfield code="2">doi</subfield> </datafield> <datafield tag="100" ind1=" " ind2=" "> <subfield code="a">Gusev, Valentin</subfield> <subfield code="u">Universität Hamburg</subfield> </datafield> <controlfield tag="001">13882</controlfield> </record>