Dataset Open Access
Wagner-Nagy, Beáta;
Sipőcz, Katalin
<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
<leader>00000nmm##2200000uu#4500</leader>
<datafield tag="650" ind1="1" ind2="7">
<subfield code="a">cc-by</subfield>
<subfield code="2">opendefinition.org</subfield>
</datafield>
<datafield tag="700" ind1=" " ind2=" ">
<subfield code="a">Sipőcz, Katalin</subfield>
<subfield code="u">University of Szeged</subfield>
<subfield code="0">(orcid)0000-0003-1146-6562</subfield>
</datafield>
<datafield tag="700" ind1=" " ind2=" ">
<subfield code="a">Wagner-Nagy, Beáta</subfield>
<subfield code="u">Universität Hamburg</subfield>
<subfield code="0">(orcid)0000-0002-6801-1895</subfield>
<subfield code="4">edt</subfield>
</datafield>
<datafield tag="700" ind1=" " ind2=" ">
<subfield code="a">Arkhipov, Alexandre</subfield>
<subfield code="u">Universität Hamburg</subfield>
<subfield code="0">(orcid)0000-0001-5395-0921</subfield>
<subfield code="4">edt</subfield>
</datafield>
<datafield tag="700" ind1=" " ind2=" ">
<subfield code="a">Brykina, Maria</subfield>
<subfield code="u">Universität Hamburg</subfield>
<subfield code="4">edt</subfield>
</datafield>
<datafield tag="700" ind1=" " ind2=" ">
<subfield code="a">Lazarenko, Elena</subfield>
<subfield code="u">Universität Hamburg</subfield>
<subfield code="4">dtm</subfield>
</datafield>
<datafield tag="700" ind1=" " ind2=" ">
<subfield code="a">Riaposov, Aleksandr</subfield>
<subfield code="u">Universität Hamburg</subfield>
<subfield code="4">dtm</subfield>
</datafield>
<datafield tag="980" ind1=" " ind2=" ">
<subfield code="a">dataset</subfield>
</datafield>
<controlfield tag="001">17513</controlfield>
<datafield tag="909" ind1="C" ind2="O">
<subfield code="o">oai:fdr.uni-hamburg.de:17513</subfield>
<subfield code="p">user-uhh</subfield>
<subfield code="p">user-inel</subfield>
<subfield code="p">user-adwhh</subfield>
</datafield>
<datafield tag="245" ind1=" " ind2=" ">
<subfield code="a">INEL Tavda Mansi Corpus</subfield>
</datafield>
<datafield tag="520" ind1=" " ind2=" ">
<subfield code="a"><p><strong>Corpus Citation</strong></p>
<p>Sipőcz, Katalin &amp; Wagner-Nagy, Be&aacute;ta. 2025. INEL Tavda Mansi Corpus. Version 1.0. Publication date 2025-05-15. <a href="https://hdl.handle.net/11022/0000-0007-FE69-6">https://hdl.handle.net/11022/0000-0007-FE69-6</a>. Archived at Universit&auml;t Hamburg. In: <em>The INEL corpora of indigenous Northern Eurasian languages. </em><a href="https://hdl.handle.net/11022/0000-0007-F45A-1">https://hdl.handle.net/11022/0000-0007-F45A-1</a></p>
<p><strong>Corpus Description</strong><br>
The present corpus of Tavda Mansi has been created as part of the long-term research project INEL (&ldquo;<em>Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&rdquo;)&nbsp;</em>in the context of the Academies&rsquo; Programme, coordinated by the Union of the German Academies of Sciences and Humanities.</p>
<p>The INEL Tavda Mansi corpus at hand fills a gap in the documentation of the indigenous languages of Northern Eurasia and makes possible further descriptions of the language. Mansi is a relatively good described language: there are numerous descriptions and a corpus is also available, &nbsp;however, the Tavda variety is not included in the existing corpora.</p>
<p>The analysis of materials from the Tavda variety has already been conducted by Norbert Szil&aacute;gyi., but he did not produce a corpus that could be searched and evaluated electronically. However, he has made his materials available under the URL: <a href="https://norbertszilagyi91.wixsite.com/tawdamansi">https://norbertszilagyi91.wixsite.com/tawdamansi</a>. In the material published in the INEL corpus, the analyses differ significantly from Szil&aacute;gyi&#39;s analysis. For the sake of comparison, the texts analysed by Szil&aacute;gyi are appended to the corpus, and the Hungarian translations he provided have been retained, but some places have been corrected.<br>
<br>
The INEL Tavda Mansi Corpus contains texts texts from different sources:</p>
<ul>
<li>Kannisto, Artturi and Matti Liimola 1951: <em>Wogulische Volksdichtung </em>gesammelt und &uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola<em> </em>Volume I. <em>Texte mythischen Inhalts</em>. [M&eacute;moires de la Soci&eacute;t&eacute; Finno-Ougrienne 101]. Helsinki: Suomalais-Ugrilainen Seura.</li>
<li>Kannisto, Artturi and Matti Liimola 1955: <em>Wogulische Volksdichtung </em>gesammelt und &uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola<em> </em>Volume II<em>. Kriegs und Heldensagen</em>. [M&eacute;moires de la Soci&eacute;t&eacute; Finno-Ougrienne 109]. Helsinki: Suomalais-Ugrilainen Seura.</li>
<li>Kannisto, Artturi and Matti Liimola 1956: <em>Wogulische Volksdichtung</em> gesammelt und &uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola<em> </em>Volume III. <em>M&auml;rchen</em>. [M&eacute;moires de la Soci&eacute;t&eacute; Finno-Ougrienne 111]. Helsinki: Suomalais-Ugrilainen Seura.</li>
<li>Kannisto, Artturi and Matti Liimola 1958: <em>Wogulische Volksdichtung </em>gesammelt und &uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola<em> </em>Volume IV<em>. B&auml;renlieder</em>. [M&eacute;moires de la Soci&eacute;t&eacute; Finno-Ougrienne 114]. Helsinki: Suomalais-Ugrilainen Seura.</li>
<li>Kannisto, Artturi and Matti Liimola 1963: <em>Wogulische Volksdichtung </em>gesammelt und &uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola<em> </em>Volume VI.<em> Schicksalslieder, Klagelieder, Kinderreime, R&auml;tsel, Verschiedenes</em>. [M&eacute;moires de la Soci&eacute;t&eacute; Finno-Ougrienne 134]. Helsinki: Suomalais-Ugrilainen Seura.</li>
<li>Munk&aacute;csi, Bern&aacute;t 1896: <em>Vogul n&eacute;pk&ouml;lt&eacute;si gyűjtem&eacute;ny</em> IV. &Eacute;letk&eacute;pek. Budapest: Magyar Tudom&aacute;nyos Akad&eacute;mia.</li>
</ul>
<p><strong>Corpus size</strong></p>
<p>The corpus currently contains <strong>29 </strong>transcripts with <strong>2,042 </strong>utterances and <strong>11,879 </strong>tokens.</p>
<p><strong>Funding</strong></p>
<p>The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The<br>
Academies&rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.</p>
<p><strong>Searching the corpus</strong></p>
<p>The corpus can be downloaded from the ZFDM Repository using the links provided below and browsed or searched locally using the&nbsp;<a href="https://exmaralda.org/">EXMARaLDA</a>&nbsp;software or, alternatively,&nbsp;<a href="https://archive.mpi.nl/tla/elan">ELAN</a>.</p>
<p>Online search with Tsakorpus platform is available at&nbsp;<a href="https://inel.corpora.uni-hamburg.de/TavdaMansiCorpus/search">https://inel.corpora.uni-hamburg.de/TavdaMansiCorpus/search</a>.</p>
<p>Remote search with EXMARaLDA is also possible without downloading all the files (see&nbsp;<a href="https://inel.corpora.uni-hamburg.de/portal/help/en/index.php">https://inel.corpora.uni-hamburg.de/portal/help/en/index.php</a>).</p>
<p>See the user documentation (section 3) for details on transcription, annotation tiers and annotation tags. Find further information and links on the Mansi Corpus page at the INEL Resources portal:&nbsp;<a href="https://inel.corpora.uni-hamburg.de/portal/corpora/mansi/">https://inel.corpora.uni-hamburg.de/portal/corpora/mansi/</a>.</p></subfield>
</datafield>
<datafield tag="260" ind1=" " ind2=" ">
<subfield code="c">2025-05-15</subfield>
</datafield>
<datafield tag="041" ind1=" " ind2=" ">
<subfield code="a">mns</subfield>
</datafield>
<datafield tag="980" ind1=" " ind2=" ">
<subfield code="a">user-adwhh</subfield>
</datafield>
<datafield tag="980" ind1=" " ind2=" ">
<subfield code="a">user-inel</subfield>
</datafield>
<datafield tag="980" ind1=" " ind2=" ">
<subfield code="a">user-uhh</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">Uralic</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">Mansi</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">Tavda Mansi</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">endangered language</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">language contact</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">language documentation</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">legacy data</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">INEL</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">AdWHH</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">text corpus</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">parallel texts</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">folklore</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">tales</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">narrative</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">song</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">transcription</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">morphological glossing</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">part-of-speech</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">borrowings</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">dialogue</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">English translation</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">Russian translation</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">EXMARaLDA</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">ELAN</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">XML</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">ISO/TEI</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">German translation</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">Hungarian translation</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">existential predication</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">locative predication</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">possessive predication</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">Ob-Ugric languages</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">semantic role</subfield>
</datafield>
<datafield tag="653" ind1=" " ind2=" ">
<subfield code="a">syntactic function</subfield>
</datafield>
<datafield tag="773" ind1=" " ind2=" ">
<subfield code="a">10.25592/uhhfdm.17512</subfield>
<subfield code="i">isVersionOf</subfield>
<subfield code="n">doi</subfield>
</datafield>
<datafield tag="542" ind1=" " ind2=" ">
<subfield code="l">open</subfield>
</datafield>
<datafield tag="100" ind1=" " ind2=" ">
<subfield code="a">Wagner-Nagy, Beáta</subfield>
<subfield code="u">Universität Hamburg</subfield>
<subfield code="0">(orcid)0000-0002-6801-1895</subfield>
</datafield>
<datafield tag="856" ind1="4" ind2=" ">
<subfield code="s">1006988</subfield>
<subfield code="u">https://www.fdr.uni-hamburg.de/record/17513/files/tavda-mansi-1.0-documentation.pdf</subfield>
<subfield code="z">md5:ec16770132be63659c014399b870a907</subfield>
</datafield>
<datafield tag="856" ind1="4" ind2=" ">
<subfield code="s">7198065</subfield>
<subfield code="u">https://www.fdr.uni-hamburg.de/record/17513/files/tavda-mansi-1.0-lite.zip</subfield>
<subfield code="z">md5:4ca798e55b6ed24c8bde21657e3cddb2</subfield>
</datafield>
<datafield tag="856" ind1="4" ind2=" ">
<subfield code="s">74272357</subfield>
<subfield code="u">https://www.fdr.uni-hamburg.de/record/17513/files/tavda-mansi-1.0-standard.zip</subfield>
<subfield code="z">md5:78c59597ccddc1e3d9fd8ea0358d090b</subfield>
</datafield>
<controlfield tag="005">20250528095942.0</controlfield>
<datafield tag="540" ind1=" " ind2=" ">
<subfield code="u">https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode</subfield>
<subfield code="a">Creative Commons Attribution Non Commercial Share Alike 4.0 International</subfield>
</datafield>
<datafield tag="024" ind1=" " ind2=" ">
<subfield code="a">10.25592/uhhfdm.17513</subfield>
<subfield code="2">doi</subfield>
</datafield>
<datafield tag="024" ind1=" " ind2=" ">
<subfield code="a">11022/0000-0007-FE69-6</subfield>
<subfield code="2">handle</subfield>
<subfield code="q">alternateidentifier</subfield>
</datafield>
</record>