Dataset Open Access
Däbritz, Chris Lasse; Kudryakova, Nina; Stapert, Eugénie
<?xml version='1.0' encoding='UTF-8'?> <record xmlns="http://www.loc.gov/MARC21/slim"> <leader>00000nmm##2200000uu#4500</leader> <datafield tag="542" ind1=" " ind2=" "> <subfield code="l">open</subfield> </datafield> <datafield tag="909" ind1="C" ind2="O"> <subfield code="o">oai:fdr.uni-hamburg.de:11165</subfield> <subfield code="p">user-inel</subfield> <subfield code="p">user-adwhh</subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="a">11022/0000-0007-F9A7-4</subfield> <subfield code="i">isCitedBy</subfield> <subfield code="n">handle</subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="a">10.25592/uhhfdm.9746</subfield> <subfield code="i">isVersionOf</subfield> <subfield code="n">doi</subfield> </datafield> <controlfield tag="005">20250912121005.0</controlfield> <datafield tag="540" ind1=" " ind2=" "> <subfield code="u">https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode</subfield> <subfield code="a">Creative Commons Attribution Non Commercial Share Alike 4.0 International</subfield> </datafield> <datafield tag="260" ind1=" " ind2=" "> <subfield code="c">2022-11-30</subfield> </datafield> <datafield tag="041" ind1=" " ind2=" "> <subfield code="a">dlg</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">endangered language</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">indigenous language</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">L1 data</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">language contact</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">language documentation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">INEL</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">folklore</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">narrative</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">monologue</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">annotated</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">morphological glossing</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">borrowings</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">code-switching</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">semantic roles</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">syntactic functions</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">information status</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">English translation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">German translation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">Russian translation</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">existential predication</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">locative predication</subfield> </datafield> <datafield tag="653" ind1=" " ind2=" "> <subfield code="a">non-verbal predication</subfield> </datafield> <datafield tag="245" ind1=" " ind2=" "> <subfield code="a">INEL Dolgan Corpus</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">1013434</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/11165/files/dolgan-2.0-documentation.pdf</subfield> <subfield code="z">md5:ec3647edf1b70e222e25ff62482a9ff8</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">2119555054</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/11165/files/dolgan-2.0-mp3only.zip</subfield> <subfield code="z">md5:831f74786313326775bc0b28ffbbc0f0</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">40432276</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/11165/files/dolgan-2.0-noaudio.zip</subfield> <subfield code="z">md5:6f85f1a09811f07606caf6747744eef2</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">11722255947</subfield> <subfield code="u">https://www.fdr.uni-hamburg.de/record/11165/files/dolgan-2.0.zip</subfield> <subfield code="z">md5:d5bf810b24538594f52a5899cca4e074</subfield> </datafield> <datafield tag="650" ind1="1" ind2="7"> <subfield code="a">cc-by</subfield> <subfield code="2">opendefinition.org</subfield> </datafield> <datafield tag="520" ind1=" " ind2=" "> <subfield code="a"><p><strong>Corpus Citation</strong></p> <p><em>D&auml;britz, Chris Lasse; Kudryakova, Nina; Stapert, Eug&eacute;nie. 2022. INEL Dolgan Corpus. Version 2.0. Publication date 2022-11-30. https://hdl.handle.net/11022/0000-0007-F9A7-4. Archived at Universit&auml;t Hamburg. In: The INEL corpora of indigenous Northern Eurasian languages. https://hdl.handle.net/11022/0000-0007-F45A-1. </em></p> <p><strong>Corpus Description</strong></p> <p>The INEL Dolgan corpus has been created within the long-term INEL project (&quot;Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&rdquo;), 2016&ndash;2033. The corpus makes possible typologically aware corpus-based grammatical research on the Dolgan language and expands the documentation of the lesser described indigenous languages of Northern Eurasia.</p> <p>The INEL Dolgan corpus is composed of texts from different sources: 1. Published folklore texts from an edited volume (&quot;Fol&#39;klor Dolgan&quot;, P.E. Efremov 2000), 2. Transcripts of recordings obtained from the Taymyr House of Folk Art (TDNT) in Dudinka (1970s-2000s), 3. Transcripts from the collection of Dr. Eug&eacute;nie Stapert recorded on several fieldwork trips in 2007-2010, 4. Transcripts of recordings made on a fieldwork trip in 2017. The first group as well as parts of the third group were already transcribed and translated, the rest of the recordings was transcribed and translated within the INEL project.</p> <p>Each text in the corpus is provided with morphological glossing, translation into English, Russian and German, as well as annotation of Russian borrowings. Some texts also have annotations for syntactic functions, semantic roles and information structure/information status.</p> <p><strong>New in release 2.0</strong></p> <ul> <li>20 glossed transcripts (2864 utterances, 19989 tokens) with 03:33:14 hours of corresponding sound</li> <li>37 audio files with 10:00:36 hours of sound without glossed transcripts</li> <li>Corrections of grammatical analyses and glossing according to the findings in D&auml;britz&rsquo;s (2022) grammar, as well as cross-corpora harmonizations</li> <li>Additional corpus-wide annotation of Mongolic borrowings</li> <li>Additional corpus-wide annotation of existential, locative and possessive predication</li> <li>Corrections in further annotations, translations and metadata</li> </ul> <p><strong>Funding</strong></p> <p>The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies&rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.</p></subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">dataset</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-adwhh</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-inel</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Kudryakova, Nina</subfield> <subfield code="u">Taimyr House of Folk Art (TDNT)</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Stapert, Eugénie</subfield> <subfield code="u">Universiteit Leiden</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Wagner-Nagy, Be´ata</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Arkhipov, Alexandre</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Däbritz, Chris Lasse</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Kudryakova, Nina</subfield> <subfield code="u">Taimyr House of Folk Art (TDNT)</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Stapert, Eugénie</subfield> <subfield code="u">Universiteit Leiden</subfield> <subfield code="4">res</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Ferger, Anne</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Jettka, Daniel</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Lazarenko, Elena</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Lehmberg, Timm</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="a">Riaposov, Aleksandr</subfield> <subfield code="u">Universität Hamburg</subfield> <subfield code="4">dtm</subfield> </datafield> <datafield tag="024" ind1=" " ind2=" "> <subfield code="a">10.25592/uhhfdm.11165</subfield> <subfield code="2">doi</subfield> </datafield> <datafield tag="100" ind1=" " ind2=" "> <subfield code="a">Däbritz, Chris Lasse</subfield> <subfield code="u">Universität Hamburg</subfield> </datafield> <controlfield tag="001">11165</controlfield> </record>