Dataset Open Access
Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Beáta
<?xml version='1.0' encoding='utf-8'?> <resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-3" xsi:schemaLocation="http://datacite.org/schema/kernel-3 http://schema.datacite.org/meta/kernel-3/metadata.xsd"> <identifier identifierType="DOI">10.25592/uhhfdm.13882</identifier> <creators> <creator> <creatorName>Gusev, Valentin</creatorName> <affiliation>Universität Hamburg</affiliation> </creator> <creator> <creatorName>Klooster, Tiina</creatorName> <affiliation>Universität Hamburg</affiliation> </creator> <creator> <creatorName>Wagner-Nagy, Beáta</creatorName> <affiliation>Universität Hamburg</affiliation> </creator> </creators> <titles> <title>INEL Kamas Corpus</title> </titles> <publisher>Universität Hamburg</publisher> <publicationYear>2023</publicationYear> <subjects> <subject>endangered language</subject> <subject>indigenous language</subject> <subject>L1 data</subject> <subject>language contact</subject> <subject>language documentation</subject> <subject>INEL</subject> <subject>folklore</subject> <subject>narrative</subject> <subject>monologue</subject> <subject>annotated</subject> <subject>morphological glossing</subject> <subject>borrowings</subject> <subject>code-switching</subject> <subject>semantic roles</subject> <subject>syntactic functions</subject> <subject>information status</subject> <subject>English translation</subject> <subject>German translation</subject> <subject>Russian translation</subject> </subjects> <contributors> <contributor contributorType="Researcher"> <contributorName>Wagner-Nagy, Beata</contributorName> <affiliation>Universität Hamburg</affiliation> </contributor> <contributor contributorType="Researcher"> <contributorName>Arkhipov, Alexandre</contributorName> <affiliation>Universität Hamburg</affiliation> </contributor> <contributor contributorType="Researcher"> <contributorName>Gusev, Valentin</contributorName> <affiliation>Universität Hamburg</affiliation> </contributor> <contributor contributorType="Researcher"> <contributorName>Klooster, Tiina</contributorName> <affiliation>Universität Hamburg</affiliation> </contributor> <contributor contributorType="DataManager"> <contributorName>Ferger, Anne</contributorName> <affiliation>Universität Hamburg</affiliation> </contributor> <contributor contributorType="DataManager"> <contributorName>Jettka, Daniel</contributorName> <affiliation>Universität Hamburg</affiliation> </contributor> <contributor contributorType="DataManager"> <contributorName>Lehmberg, Timm</contributorName> <affiliation>Universität Hamburg</affiliation> </contributor> </contributors> <dates> <date dateType="Issued">2023-12-29</date> </dates> <resourceType resourceTypeGeneral="Dataset"/> <alternateIdentifiers> <alternateIdentifier alternateIdentifierType="url">https://www.fdr.uni-hamburg.de/record/13882</alternateIdentifier> </alternateIdentifiers> <relatedIdentifiers> <relatedIdentifier relatedIdentifierType="Handle" relationType="IsCitedBy">11022/0000-0007-FC25-4</relatedIdentifier> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsPartOf">10.25592/uhhfdm.9740</relatedIdentifier> </relatedIdentifiers> <version>2.0</version> <rightsList> <rights rightsURI="https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode">Creative Commons Attribution Non Commercial Share Alike 4.0 International</rights> <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights> </rightsList> <descriptions> <description descriptionType="Abstract"><p><strong>Corpus Citation</strong></p> <p><em>Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Be&aacute;ta.</em> 2023. &ldquo;INEL Kamas Corpus.&rdquo; Version 2.0. Publication date 2023-12-31. <a href="http://hdl.handle.net/11022/0000-0007-FC25-4">http://hdl.handle.net/11022/0000-0007-FC25-4</a>. Archived at Universit&auml;t Hamburg. In: The INEL corpora of indigenous Northern Eurasian languages.<a href="https://hdl.handle.net/11022/0000-0007-F45A-1">https://hdl.handle.net/11022/0000-0007-F45A-1</a>.</p> <p><strong>Corpus Description</strong></p> <p>The INEL Kamas corpus has been created within the long-term INEL project (&quot;Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&quot;), 2016&ndash;2033. The corpus makes possible typologically aware corpus-based grammatical research on the Kamas language and expands the documentation of the lesser described indigenous languages of Northern Eurasia.</p> <p>The INEL Kamas corpus consists of two parts: folklore texts collected by Kai Donner in 1912&ndash;1914, and transcribed audio recordings of the last speaker of Kamas, Klavdiya Plotnikova, made between 1964 and 1970.</p> <p>Each text in the corpus is provided with morphological glossing, translation into English, Russian and German, as well as annotation of syntactic functions, semantic roles, Russian borrowings and code-switching. Some texts also have annotations for information status.</p> <p><strong>New in release 2.0</strong></p> <ul> <li>In texts from Donner&rsquo;s collection, phonetic transcription according to Klumpp|s edition of Donner&rsquo;s manuscripts has been added&nbsp;(as stl tier)</li> <li>Five texts which were originally split between different tapes have been merged, as well as respective parts of recordings. Sentences in each resulting text are numbered throughout <ul> <li>PKZ_196X_Alenushka_flk + PKZ_196X_Alenushka_continuation_flk &gt; PKZ_196X_Alenushka_flk</li> <li>End of PKZ_196X_SU0226 starting from PKZ_196X_SU0226.203 (210) + PKZ_196X_Alenushka2_continuation_flk &gt; PKZ_196X_Alenushka2_flk</li> <li>PKZ_196X_BlacksmithAndMerchant_flk + PKZ_196X_BlacksmithAndMerchant_cont_flk &gt; PKZ_196X_BlacksmithAndMerchant_flk</li> <li>PKZ_196X_Finist_flk + PKZ_196X_Finist_continuation_flk&nbsp;&gt;&nbsp;PKZ_196X_Finist_flk</li> <li>PKZ_196X_StupidWolf_flk + PKZ_196X_StupidWolf_continuation_flk &gt; PKZ_196X_StupidWolf_flk</li> </ul> </li> <li>Part of the texts are now annotated for existential, locative and possessive predication (ExLocPoss tier, by C.L.&nbsp;D&auml;britz)</li> <li>Numerous corrections in glosses, other annotations and transcriptions, including: <ul> <li>Fuller and more consistent transcription, glossing and annotations of borrowings</li> <li>Vowel length is marked in mp tier in <em>baːzoʔ</em> &lsquo;again&rsquo;, <em>b&uuml;ːzʼe</em> &lsquo;man&rsquo; and <em>saːgər</em> &lsquo;black&rsquo;</li> <li>Corrections in disambiguation of polysemous or homonymous morphemes:&nbsp;<br> -ziʔ&nbsp;&quot;INS&quot;/&quot;COM&quot;, -də &quot;LAT&quot;/&quot;3SG&quot;, mo- &quot;can/become/want | мочь/стать/хотеть&quot;</li> <li>Possessive suffix unmarked for case: &quot;NOM/GEN/ACC&quot; &gt; &quot;POSS&quot;</li> <li>Glosses for personal pronouns were changed to uniform labels: &quot;I | я&quot; &gt; &quot;PRO1SG&quot;, &quot;we | мы&quot; &gt; &quot;PRO1PL&quot;, &quot;you | ты&quot;&nbsp;&gt;&nbsp;&quot;PRO2SG&quot;, &quot;you.PL | вы&quot; &gt; &quot;PRO2PL&quot;</li> <li>Fuller annotations of code-switching and calques (CS tier)</li> </ul> </li> <li>Added ELAN *.eaf as a supplementary end-user file format for all transcripts</li> </ul> <p><strong>Funding</strong></p> <p>The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies&rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.</p> <p><strong>Contributions/Acknowledgements</strong></p> <ul> <li> <p>Recordings of Kamas speech made by Ago K&uuml;nnap in Abalakovo and by Tiit-Rein Viitso in Tartu provided by the Archive of Estonian Dialects and Kindred Languages of the University of Tartu, Estonia (AEDKL, or T&Uuml;EMSA).</p> </li> <li> <p>Recordings of Klavdiya Plotnikova made by Jaakko Yli-Paavola in Tallinn in 1970 provided by the Institute for the Languages of Finland archive, Helsinki (KOTUS).</p> </li> <li> <p>Scanned pages from the Kai Donners Kamassisches W&ouml;rterbuch (Joki 1944) containing texts collected by Kai Donner published online courtesy of the Finno-Ugrian Society.</p> </li> <li> <p>The web-based search interface is using the Tsakonian Corpus platform developed by Dr. Timofey Arkhangelskiy.</p> </li> </ul></description> </descriptions> </resource>