Dataset Open Access

INEL Kamas Corpus

Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Beáta

Data manager(s)
Ferger, Anne; Jettka, Daniel; Lehmberg, Timm
Researcher(s)
Wagner-Nagy, Beáta; Arkhipov, Alexandre; Gusev, Valentin; Klooster, Tiina

Corpus Citation

Gusev, Valentin; Klooster, Tiina; Wagner-Nagy, Beáta. 2019. "INEL Kamas Corpus." Version 1.0. Publication date 2019-12-15. http://hdl.handle.net/11022/0000-0007-DA6E-9. Archived in Hamburger Zentrum für Sprachkorpora. In: Wagner-Nagy, Beáta; Arkhipov, Alexandre; Ferger, Anne; Jettka, Daniel; Lehmberg, Timm (eds.). The INEL corpora of indigenous Northern Eurasian languages.

Corpus Description

The INEL Kamas corpus has been created within the long-term INEL project ("Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages"), 2016–2033. The corpus enables typologically informed, corpus-based grammatical research on the Kamas language and expands the documentation of the lesser-described indigenous languages of Northern Eurasia.

The INEL Kamas corpus consists of two parts: folklore texts collected by Kai Donner in 1912–1914, and transcribed audio recordings of the last speaker of Kamas, Klavdiya Plotnikova, made between 1964 and 1970.

Each text in the corpus is provided with morphological glossing, translation into English, Russian and German, as well as annotation of syntactic functions, semantic roles, Russian borrowings and code-switching. Some texts also have annotations for information status.

New in release 1.0

  • All of Klavdiya Plotnikova’s transcripts are now published, including all the tapes from the KOTUS archive, as well as the two recordings of Aleksandra Semyonova (21 more texts in total).
  • All the texts are now annotated for syntactic functions and semantic roles.
  • Numerous corrections in glosses and other annotations.

Funding

The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies’ Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies’ Programme is coordinated by the Union of the German Academies of Sciences and Humanities.

Contributions/Acknowledgements

  • Recordings of Kamas speech made by Ago Künnap in Abalakovo and by Tiit-Rein Viitso in Tartu provided by the Archive of Estonian Dialects and Kindred Languages of the University of Tartu, Estonia (AEDKL, or TÜEMSA).

  • Recordings of Klavdiya Plotnikova made by Jaakko Yli-Paavola in Tallinn in 1970 provided by the Institute for the Languages of Finland archive, Helsinki (KOTUS).

  • Scanned pages from Kai Donners Kamassisches Wörterbuch (Joki 1944), containing texts collected by Kai Donner, published online courtesy of the Finno-Ugrian Society.

  • The web-based search interface uses the Tsakorpus platform developed by Dr. Timofey Arkhangelskiy.

Partner Organizations
The INEL project benefited greatly from cooperation with our partner institutions:

  • Institute of World Culture, M.V. Lomonosov Moscow State University, Moscow
  • Department of Languages of the Peoples of Siberia, Tomsk State Pedagogical University, Tomsk
  • Institute of Philology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk
  • Taymyr House of Folk Art, Dudinka
  • Arctic State Institute of Culture and Arts, Yakutsk

Files (6.1 GB)

Name                          Size      MD5
kamas-1.0-documentation.pdf   624.5 kB  8badf215bab81df15abb07415b413fc6
kamas-1.0-mp3only.zip         1.3 GB    bec046e5859ddb2b8cdbab3204f5aac3
kamas-1.0-noaudio.zip         69.9 MB   36c27a3619805e2c4eb393ef213e958c
kamas-1.0.zip                 4.7 GB    180df49fc112341715cb1b3f0b416dc9
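
The MD5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch in Python, assuming the file has been saved locally under its listed name. The helper names `md5_of` and `verify` are hypothetical and are not part of the corpus distribution.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file list above.
EXPECTED_MD5 = {
    "kamas-1.0-documentation.pdf": "8badf215bab81df15abb07415b413fc6",
    "kamas-1.0-mp3only.zip": "bec046e5859ddb2b8cdbab3204f5aac3",
    "kamas-1.0-noaudio.zip": "36c27a3619805e2c4eb393ef213e958c",
    "kamas-1.0.zip": "180df49fc112341715cb1b3f0b416dc9",
}


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reading keeps memory use flat even for the 4.7 GB archive.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path):
    """Return True if the file's MD5 matches the checksum listed for its name."""
    path = Path(path)
    return md5_of(path) == EXPECTED_MD5[path.name]
```

For example, `verify("kamas-1.0-noaudio.zip")` returns `True` only if the local copy matches the listed checksum; a `False` result suggests the download should be repeated.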

Cite record as