Dataset Open Access
Wagner-Nagy, Beáta;
Sipőcz, Katalin
<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-3" xsi:schemaLocation="http://datacite.org/schema/kernel-3 http://schema.datacite.org/meta/kernel-3/metadata.xsd">
<identifier identifierType="DOI">10.25592/uhhfdm.17513</identifier>
<creators>
<creator>
<creatorName>Wagner-Nagy, Beáta</creatorName>
<nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-6801-1895</nameIdentifier>
<affiliation>Universität Hamburg</affiliation>
</creator>
<creator>
<creatorName>Sipőcz, Katalin</creatorName>
<nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0003-1146-6562</nameIdentifier>
<affiliation>University of Szeged</affiliation>
</creator>
</creators>
<titles>
<title>INEL Tavda Mansi Corpus</title>
</titles>
<publisher>Universität Hamburg</publisher>
<publicationYear>2025</publicationYear>
<subjects>
<subject>Uralic</subject>
<subject>Mansi</subject>
<subject>Tavda Mansi</subject>
<subject>endangered language</subject>
<subject>language contact</subject>
<subject>language documentation</subject>
<subject>legacy data</subject>
<subject>INEL</subject>
<subject>AdWHH</subject>
<subject>text corpus</subject>
<subject>parallel texts</subject>
<subject>folklore</subject>
<subject>tales</subject>
<subject>narrative</subject>
<subject>song</subject>
<subject>transcription</subject>
<subject>morphological glossing</subject>
<subject>part-of-speech</subject>
<subject>borrowings</subject>
<subject>dialogue</subject>
<subject>English translation</subject>
<subject>Russian translation</subject>
<subject>EXMARaLDA</subject>
<subject>ELAN</subject>
<subject>XML</subject>
<subject>ISO/TEI</subject>
<subject>German translation</subject>
<subject>Hungarian translation</subject>
<subject>existential predication</subject>
<subject>locative predication</subject>
<subject>possessive predication</subject>
<subject>Ob-Ugric languages</subject>
<subject>semantic role</subject>
<subject>syntactic function</subject>
</subjects>
<contributors>
<contributor contributorType="Editor">
<contributorName>Wagner-Nagy, Beáta</contributorName>
<nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-6801-1895</nameIdentifier>
<affiliation>Universität Hamburg</affiliation>
</contributor>
<contributor contributorType="Editor">
<contributorName>Arkhipov, Alexandre</contributorName>
<nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0001-5395-0921</nameIdentifier>
<affiliation>Universität Hamburg</affiliation>
</contributor>
<contributor contributorType="Editor">
<contributorName>Brykina, Maria</contributorName>
<affiliation>Universität Hamburg</affiliation>
</contributor>
<contributor contributorType="DataManager">
<contributorName>Lazarenko, Elena</contributorName>
<affiliation>Universität Hamburg</affiliation>
</contributor>
<contributor contributorType="DataManager">
<contributorName>Riaposov, Aleksandr</contributorName>
<affiliation>Universität Hamburg</affiliation>
</contributor>
</contributors>
<dates>
<date dateType="Issued">2025-05-15</date>
</dates>
<resourceType resourceTypeGeneral="Dataset"/>
<alternateIdentifiers>
<alternateIdentifier alternateIdentifierType="handle">11022/0000-0007-FE69-6</alternateIdentifier>
<alternateIdentifier alternateIdentifierType="url">https://www.fdr.uni-hamburg.de/record/17513</alternateIdentifier>
</alternateIdentifiers>
<relatedIdentifiers>
<relatedIdentifier relatedIdentifierType="DOI" relationType="IsPartOf">10.25592/uhhfdm.17512</relatedIdentifier>
</relatedIdentifiers>
<version>1.0</version>
<rightsList>
<rights rightsURI="https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode">Creative Commons Attribution Non Commercial Share Alike 4.0 International</rights>
<rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
</rightsList>
<descriptions>
<description descriptionType="Abstract"><p><strong>Corpus Citation</strong></p>
<p>Sipőcz, Katalin &amp; Wagner-Nagy, Be&aacute;ta. 2025. INEL Tavda Mansi Corpus. Version 1.0. Publication date 2025-05-15. <a href="https://hdl.handle.net/11022/0000-0007-FE69-6">https://hdl.handle.net/11022/0000-0007-FE69-6</a>. Archived at Universit&auml;t Hamburg. In: <em>The INEL corpora of indigenous Northern Eurasian languages. </em><a href="https://hdl.handle.net/11022/0000-0007-F45A-1">https://hdl.handle.net/11022/0000-0007-F45A-1</a></p>
<p><strong>Corpus Description</strong><br>
The present corpus of Tavda Mansi has been created as part of the long-term research project INEL (&ldquo;<em>Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&rdquo;)&nbsp;</em>in the context of the Academies&rsquo; Programme, coordinated by the Union of the German Academies of Sciences and Humanities.</p>
<p>The INEL Tavda Mansi corpus at hand fills a gap in the documentation of the indigenous languages of Northern Eurasia and makes possible further descriptions of the language. Mansi is a relatively good described language: there are numerous descriptions and a corpus is also available, &nbsp;however, the Tavda variety is not included in the existing corpora.</p>
<p>The analysis of materials from the Tavda variety has already been conducted by Norbert Szil&aacute;gyi., but he did not produce a corpus that could be searched and evaluated electronically. However, he has made his materials available under the URL: <a href="https://norbertszilagyi91.wixsite.com/tawdamansi">https://norbertszilagyi91.wixsite.com/tawdamansi</a>. In the material published in the INEL corpus, the analyses differ significantly from Szil&aacute;gyi&#39;s analysis. For the sake of comparison, the texts analysed by Szil&aacute;gyi are appended to the corpus, and the Hungarian translations he provided have been retained, but some places have been corrected.<br>
<br>
The INEL Tavda Mansi Corpus contains texts texts from different sources:</p>
<ul>
<li>Kannisto, Artturi and Matti Liimola 1951: <em>Wogulische Volksdichtung </em>gesammelt und &uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola<em> </em>Volume I. <em>Texte mythischen Inhalts</em>. [M&eacute;moires de la Soci&eacute;t&eacute; Finno-Ougrienne 101]. Helsinki: Suomalais-Ugrilainen Seura.</li>
<li>Kannisto, Artturi and Matti Liimola 1955: <em>Wogulische Volksdichtung </em>gesammelt und &uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola<em> </em>Volume II<em>. Kriegs und Heldensagen</em>. [M&eacute;moires de la Soci&eacute;t&eacute; Finno-Ougrienne 109]. Helsinki: Suomalais-Ugrilainen Seura.</li>
<li>Kannisto, Artturi and Matti Liimola 1956: <em>Wogulische Volksdichtung</em> gesammelt und &uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola<em> </em>Volume III. <em>M&auml;rchen</em>. [M&eacute;moires de la Soci&eacute;t&eacute; Finno-Ougrienne 111]. Helsinki: Suomalais-Ugrilainen Seura.</li>
<li>Kannisto, Artturi and Matti Liimola 1958: <em>Wogulische Volksdichtung </em>gesammelt und &uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola<em> </em>Volume IV<em>. B&auml;renlieder</em>. [M&eacute;moires de la Soci&eacute;t&eacute; Finno-Ougrienne 114]. Helsinki: Suomalais-Ugrilainen Seura.</li>
<li>Kannisto, Artturi and Matti Liimola 1963: <em>Wogulische Volksdichtung </em>gesammelt und &uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola<em> </em>Volume VI.<em> Schicksalslieder, Klagelieder, Kinderreime, R&auml;tsel, Verschiedenes</em>. [M&eacute;moires de la Soci&eacute;t&eacute; Finno-Ougrienne 134]. Helsinki: Suomalais-Ugrilainen Seura.</li>
<li>Munk&aacute;csi, Bern&aacute;t 1896: <em>Vogul n&eacute;pk&ouml;lt&eacute;si gyűjtem&eacute;ny</em> IV. &Eacute;letk&eacute;pek. Budapest: Magyar Tudom&aacute;nyos Akad&eacute;mia.</li>
</ul>
<p><strong>Corpus size</strong></p>
<p>The corpus currently contains <strong>29 </strong>transcripts with <strong>2,042 </strong>utterances and <strong>11,879 </strong>tokens.</p>
<p><strong>Funding</strong></p>
<p>The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The<br>
Academies&rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.</p>
<p><strong>Searching the corpus</strong></p>
<p>The corpus can be downloaded from the ZFDM Repository using the links provided below and browsed or searched locally using the&nbsp;<a href="https://exmaralda.org/">EXMARaLDA</a>&nbsp;software or, alternatively,&nbsp;<a href="https://archive.mpi.nl/tla/elan">ELAN</a>.</p>
<p>Online search with Tsakorpus platform is available at&nbsp;<a href="https://inel.corpora.uni-hamburg.de/TavdaMansiCorpus/search">https://inel.corpora.uni-hamburg.de/TavdaMansiCorpus/search</a>.</p>
<p>Remote search with EXMARaLDA is also possible without downloading all the files (see&nbsp;<a href="https://inel.corpora.uni-hamburg.de/portal/help/en/index.php">https://inel.corpora.uni-hamburg.de/portal/help/en/index.php</a>).</p>
<p>See the user documentation (section 3) for details on transcription, annotation tiers and annotation tags. Find further information and links on the Mansi Corpus page at the INEL Resources portal:&nbsp;<a href="https://inel.corpora.uni-hamburg.de/portal/corpora/mansi/">https://inel.corpora.uni-hamburg.de/portal/corpora/mansi/</a>.</p></description>
</descriptions>
</resource>