Dataset Open Access

INEL Kamas Corpus

Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Beáta


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-3" xsi:schemaLocation="http://datacite.org/schema/kernel-3 http://schema.datacite.org/meta/kernel-3/metadata.xsd">
  <identifier identifierType="DOI">10.25592/uhhfdm.13882</identifier>
  <creators>
    <creator>
      <creatorName>Gusev, Valentin</creatorName>
      <affiliation>Universität Hamburg</affiliation>
    </creator>
    <creator>
      <creatorName>Klooster, Tiina</creatorName>
      <affiliation>Universität Hamburg</affiliation>
    </creator>
    <creator>
      <creatorName>Wagner-Nagy, Beáta</creatorName>
      <affiliation>Universität Hamburg</affiliation>
    </creator>
  </creators>
  <titles>
    <title>INEL Kamas Corpus</title>
  </titles>
  <publisher>Universität Hamburg</publisher>
  <publicationYear>2023</publicationYear>
  <subjects>
    <subject>endangered language</subject>
    <subject>indigenous language</subject>
    <subject>L1 data</subject>
    <subject>language contact</subject>
    <subject>language documentation</subject>
    <subject>INEL</subject>
    <subject>folklore</subject>
    <subject>narrative</subject>
    <subject>monologue</subject>
    <subject>annotated</subject>
    <subject>morphological glossing</subject>
    <subject>borrowings</subject>
    <subject>code-switching</subject>
    <subject>semantic roles</subject>
    <subject>syntactic functions</subject>
    <subject>information status</subject>
    <subject>English translation</subject>
    <subject>German translation</subject>
    <subject>Russian translation</subject>
  </subjects>
  <contributors>
    <contributor contributorType="Researcher">
      <contributorName>Wagner-Nagy, Beáta</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="Researcher">
      <contributorName>Arkhipov, Alexandre</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="Researcher">
      <contributorName>Gusev, Valentin</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="Researcher">
      <contributorName>Klooster, Tiina</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="DataManager">
      <contributorName>Ferger, Anne</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="DataManager">
      <contributorName>Jettka, Daniel</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="DataManager">
      <contributorName>Lehmberg, Timm</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
  </contributors>
  <dates>
    <date dateType="Issued">2023-12-29</date>
  </dates>
  <resourceType resourceTypeGeneral="Dataset"/>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://www.fdr.uni-hamburg.de/record/13882</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="Handle" relationType="IsCitedBy">11022/0000-0007-FC25-4</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsPartOf">10.25592/uhhfdm.9740</relatedIdentifier>
  </relatedIdentifiers>
  <version>2.0</version>
  <rightsList>
    <rights rightsURI="https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode">Creative Commons Attribution Non Commercial Share Alike 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;&lt;strong&gt;Corpus Citation&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Be&amp;aacute;ta.&lt;/em&gt; 2023. &amp;ldquo;INEL Kamas Corpus.&amp;rdquo; Version 2.0. Publication date 2023-12-31. &lt;a href="http://hdl.handle.net/11022/0000-0007-FC25-4"&gt;http://hdl.handle.net/11022/0000-0007-FC25-4&lt;/a&gt;. Archived at Universit&amp;auml;t Hamburg. In: The INEL corpora of indigenous Northern Eurasian languages. &lt;a href="https://hdl.handle.net/11022/0000-0007-F45A-1"&gt;https://hdl.handle.net/11022/0000-0007-F45A-1&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Corpus Description&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The INEL Kamas corpus has been created within the long-term INEL project (&amp;quot;Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&amp;quot;), 2016&amp;ndash;2033. The corpus enables typologically aware, corpus-based grammatical research on the Kamas language and expands the documentation of the lesser-described indigenous languages of Northern Eurasia.&lt;/p&gt;

&lt;p&gt;The INEL Kamas corpus consists of two parts: folklore texts collected by Kai Donner in 1912&amp;ndash;1914, and transcribed audio recordings of the last speaker of Kamas, Klavdiya Plotnikova, made between 1964 and 1970.&lt;/p&gt;

&lt;p&gt;Each text in the corpus is provided with morphological glossing, translations into English, Russian and German, and annotation of syntactic functions, semantic roles, Russian borrowings and code-switching. Some texts also have annotations for information status.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;New in release 2.0&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;In texts from Donner&amp;rsquo;s collection, phonetic transcription according to Klumpp&amp;rsquo;s edition of Donner&amp;rsquo;s manuscripts has been added&amp;nbsp;(as stl tier)&lt;/li&gt;
	&lt;li&gt;Five texts that were originally split across different tapes have been merged, along with the corresponding parts of the recordings. Sentences in each resulting text are numbered continuously:
	&lt;ul&gt;
		&lt;li&gt;PKZ_196X_Alenushka_flk + PKZ_196X_Alenushka_continuation_flk &amp;gt; PKZ_196X_Alenushka_flk&lt;/li&gt;
		&lt;li&gt;End of PKZ_196X_SU0226 starting from PKZ_196X_SU0226.203 (210) + PKZ_196X_Alenushka2_continuation_flk &amp;gt; PKZ_196X_Alenushka2_flk&lt;/li&gt;
		&lt;li&gt;PKZ_196X_BlacksmithAndMerchant_flk + PKZ_196X_BlacksmithAndMerchant_cont_flk &amp;gt; PKZ_196X_BlacksmithAndMerchant_flk&lt;/li&gt;
		&lt;li&gt;PKZ_196X_Finist_flk + PKZ_196X_Finist_continuation_flk&amp;nbsp;&amp;gt;&amp;nbsp;PKZ_196X_Finist_flk&lt;/li&gt;
		&lt;li&gt;PKZ_196X_StupidWolf_flk + PKZ_196X_StupidWolf_continuation_flk &amp;gt; PKZ_196X_StupidWolf_flk&lt;/li&gt;
	&lt;/ul&gt;
	&lt;/li&gt;
	&lt;li&gt;Some of the texts are now annotated for existential, locative and possessive predication (ExLocPoss tier, by C.L.&amp;nbsp;D&amp;auml;britz)&lt;/li&gt;
	&lt;li&gt;Numerous corrections in glosses, other annotations and transcriptions, including:
	&lt;ul&gt;
		&lt;li&gt;Fuller and more consistent transcription, glossing and annotations of borrowings&lt;/li&gt;
		&lt;li&gt;Vowel length is now marked in the mp tier in &lt;em&gt;baːzoʔ&lt;/em&gt; &amp;lsquo;again&amp;rsquo;, &lt;em&gt;b&amp;uuml;ːzʼe&lt;/em&gt; &amp;lsquo;man&amp;rsquo; and &lt;em&gt;saːgər&lt;/em&gt; &amp;lsquo;black&amp;rsquo;&lt;/li&gt;
		&lt;li&gt;Corrections in disambiguation of polysemous or homonymous morphemes:&amp;nbsp;&lt;br&gt;
		-ziʔ&amp;nbsp;&amp;quot;INS&amp;quot;/&amp;quot;COM&amp;quot;, -də &amp;quot;LAT&amp;quot;/&amp;quot;3SG&amp;quot;, mo- &amp;quot;can/become/want | мочь/стать/хотеть&amp;quot;&lt;/li&gt;
		&lt;li&gt;Possessive suffix unmarked for case: &amp;quot;NOM/GEN/ACC&amp;quot; &amp;gt; &amp;quot;POSS&amp;quot;&lt;/li&gt;
		&lt;li&gt;Glosses for personal pronouns were changed to uniform labels: &amp;quot;I | я&amp;quot; &amp;gt; &amp;quot;PRO1SG&amp;quot;, &amp;quot;we | мы&amp;quot; &amp;gt; &amp;quot;PRO1PL&amp;quot;, &amp;quot;you | ты&amp;quot;&amp;nbsp;&amp;gt;&amp;nbsp;&amp;quot;PRO2SG&amp;quot;, &amp;quot;you.PL | вы&amp;quot; &amp;gt; &amp;quot;PRO2PL&amp;quot;&lt;/li&gt;
		&lt;li&gt;Fuller annotations of code-switching and calques (CS tier)&lt;/li&gt;
	&lt;/ul&gt;
	&lt;/li&gt;
	&lt;li&gt;Added ELAN *.eaf as a supplementary end-user file format for all transcripts&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Funding&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&amp;rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies&amp;rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Contributions/Acknowledgements&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;
	&lt;p&gt;Recordings of Kamas speech made by Ago K&amp;uuml;nnap in Abalakovo and by Tiit-Rein Viitso in Tartu provided by the Archive of Estonian Dialects and Kindred Languages of the University of Tartu, Estonia (AEDKL, or T&amp;Uuml;EMSA).&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;Recordings of Klavdiya Plotnikova made by Jaakko Yli-Paavola in Tallinn in 1970 provided by the Institute for the Languages of Finland archive, Helsinki (KOTUS).&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;Scanned pages from Kai Donners Kamassisches W&amp;ouml;rterbuch (Joki 1944), containing texts collected by Kai Donner, published online courtesy of the Finno-Ugrian Society.&lt;/p&gt;
	&lt;/li&gt;
	&lt;li&gt;
	&lt;p&gt;The web-based search interface uses the Tsakonian Corpus platform developed by Dr. Timofey Arkhangelskiy.&lt;/p&gt;
	&lt;/li&gt;
&lt;/ul&gt;</description>
  </descriptions>
</resource>
