Dataset Open Access

INEL Tavda Mansi Corpus

Wagner-Nagy, Beáta; Sipőcz, Katalin


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-3" xsi:schemaLocation="http://datacite.org/schema/kernel-3 http://schema.datacite.org/meta/kernel-3/metadata.xsd">
  <identifier identifierType="DOI">10.25592/uhhfdm.17513</identifier>
  <creators>
    <creator>
      <creatorName>Wagner-Nagy, Beáta</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-6801-1895</nameIdentifier>
      <affiliation>Universität Hamburg</affiliation>
    </creator>
    <creator>
      <creatorName>Sipőcz, Katalin</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0003-1146-6562</nameIdentifier>
      <affiliation>University of Szeged</affiliation>
    </creator>
  </creators>
  <titles>
    <title>INEL Tavda Mansi Corpus</title>
  </titles>
  <publisher>Universität Hamburg</publisher>
  <publicationYear>2025</publicationYear>
  <subjects>
    <subject>Uralic</subject>
    <subject>Mansi</subject>
    <subject>Tavda Mansi</subject>
    <subject>endangered language</subject>
    <subject>language contact</subject>
    <subject>language documentation</subject>
    <subject>legacy data</subject>
    <subject>INEL</subject>
    <subject>AdWHH</subject>
    <subject>text corpus</subject>
    <subject>parallel texts</subject>
    <subject>folklore</subject>
    <subject>tales</subject>
    <subject>narrative</subject>
    <subject>song</subject>
    <subject>transcription</subject>
    <subject>morphological glossing</subject>
    <subject>part-of-speech</subject>
    <subject>borrowings</subject>
    <subject>dialogue</subject>
    <subject>English translation</subject>
    <subject>Russian translation</subject>
    <subject>EXMARaLDA</subject>
    <subject>ELAN</subject>
    <subject>XML</subject>
    <subject>ISO/TEI</subject>
    <subject>German translation</subject>
    <subject>Hungarian translation</subject>
    <subject>existential predication</subject>
    <subject>locative predication</subject>
    <subject>possessive predication</subject>
    <subject>Ob-Ugric languages</subject>
    <subject>semantic role</subject>
    <subject>syntactic function</subject>
  </subjects>
  <contributors>
    <contributor contributorType="Editor">
      <contributorName>Wagner-Nagy, Beáta</contributorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-6801-1895</nameIdentifier>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="Editor">
      <contributorName>Arkhipov, Alexandre</contributorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0001-5395-0921</nameIdentifier>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="Editor">
      <contributorName>Brykina, Maria</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="DataManager">
      <contributorName>Lazarenko, Elena</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
    <contributor contributorType="DataManager">
      <contributorName>Riaposov, Aleksandr</contributorName>
      <affiliation>Universität Hamburg</affiliation>
    </contributor>
  </contributors>
  <dates>
    <date dateType="Issued">2025-05-15</date>
  </dates>
  <resourceType resourceTypeGeneral="Dataset"/>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="handle">11022/0000-0007-FE69-6</alternateIdentifier>
    <alternateIdentifier alternateIdentifierType="url">https://www.fdr.uni-hamburg.de/record/17513</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsPartOf">10.25592/uhhfdm.17512</relatedIdentifier>
  </relatedIdentifiers>
  <version>1.0</version>
  <rightsList>
    <rights rightsURI="https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode">Creative Commons Attribution Non Commercial Share Alike 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;&lt;strong&gt;Corpus Citation&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Sipőcz, Katalin &amp;amp; Wagner-Nagy, Be&amp;aacute;ta. 2025. INEL Tavda Mansi Corpus. Version 1.0. Publication date 2025-05-15. &lt;a href="https://hdl.handle.net/11022/0000-0007-FE69-6"&gt;https://hdl.handle.net/11022/0000-0007-FE69-6&lt;/a&gt;. Archived at Universit&amp;auml;t Hamburg. In: &lt;em&gt;The INEL corpora of indigenous Northern Eurasian languages. &lt;/em&gt;&lt;a href="https://hdl.handle.net/11022/0000-0007-F45A-1"&gt;https://hdl.handle.net/11022/0000-0007-F45A-1&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Corpus Description&lt;/strong&gt;&lt;br&gt;
The present corpus of Tavda Mansi has been created as part of the long-term research project INEL (&amp;ldquo;&lt;em&gt;Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages&lt;/em&gt;&amp;rdquo;) in the context of the Academies&amp;rsquo; Programme, coordinated by the Union of the German Academies of Sciences and Humanities.&lt;/p&gt;

&lt;p&gt;The INEL Tavda Mansi corpus fills a gap in the documentation of the indigenous languages of Northern Eurasia and makes further descriptions of the language possible. Mansi is a relatively well-described language: numerous descriptions exist and a corpus is available. However, the Tavda variety is not included in the existing corpora.&lt;/p&gt;

&lt;p&gt;Norbert Szil&amp;aacute;gyi has already analysed materials from the Tavda variety, but he did not produce a corpus that can be searched and evaluated electronically. He has, however, made his materials available at &lt;a href="https://norbertszilagyi91.wixsite.com/tawdamansi"&gt;https://norbertszilagyi91.wixsite.com/tawdamansi&lt;/a&gt;. The analyses in the material published in the INEL corpus differ significantly from Szil&amp;aacute;gyi&amp;#39;s. For the sake of comparison, the texts analysed by Szil&amp;aacute;gyi are appended to the corpus, and the Hungarian translations he provided have been retained, with corrections in some places.&lt;br&gt;
&lt;br&gt;
The INEL Tavda Mansi Corpus contains texts from different sources:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;Kannisto, Artturi and Matti Liimola 1951: &lt;em&gt;Wogulische Volksdichtung &lt;/em&gt;gesammelt und &amp;uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola&lt;em&gt; &lt;/em&gt;Volume I. &lt;em&gt;Texte mythischen Inhalts&lt;/em&gt;. [M&amp;eacute;moires de la Soci&amp;eacute;t&amp;eacute; Finno-Ougrienne 101]. Helsinki: Suomalais-Ugrilainen Seura.&lt;/li&gt;
	&lt;li&gt;Kannisto, Artturi and Matti Liimola 1955: &lt;em&gt;Wogulische Volksdichtung &lt;/em&gt;gesammelt und &amp;uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola&lt;em&gt; &lt;/em&gt;Volume II&lt;em&gt;. Kriegs- und Heldensagen&lt;/em&gt;. [M&amp;eacute;moires de la Soci&amp;eacute;t&amp;eacute; Finno-Ougrienne 109]. Helsinki: Suomalais-Ugrilainen Seura.&lt;/li&gt;
	&lt;li&gt;Kannisto, Artturi and Matti Liimola 1956: &lt;em&gt;Wogulische Volksdichtung&lt;/em&gt; gesammelt und &amp;uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola&lt;em&gt; &lt;/em&gt;Volume III. &lt;em&gt;M&amp;auml;rchen&lt;/em&gt;. [M&amp;eacute;moires de la Soci&amp;eacute;t&amp;eacute; Finno-Ougrienne 111]. Helsinki: Suomalais-Ugrilainen Seura.&lt;/li&gt;
	&lt;li&gt;Kannisto, Artturi and Matti Liimola 1958: &lt;em&gt;Wogulische Volksdichtung &lt;/em&gt;gesammelt und &amp;uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola&lt;em&gt; &lt;/em&gt;Volume IV&lt;em&gt;. B&amp;auml;renlieder&lt;/em&gt;. [M&amp;eacute;moires de la Soci&amp;eacute;t&amp;eacute; Finno-Ougrienne 114]. Helsinki: Suomalais-Ugrilainen Seura.&lt;/li&gt;
	&lt;li&gt;Kannisto, Artturi and Matti Liimola 1963: &lt;em&gt;Wogulische Volksdichtung &lt;/em&gt;gesammelt und &amp;uuml;bersetzt von Artturi Kannisto, bearbeitet und herausgegeben von Matti Liimola&lt;em&gt; &lt;/em&gt;Volume VI.&lt;em&gt; Schicksalslieder, Klagelieder, Kinderreime, R&amp;auml;tsel, Verschiedenes&lt;/em&gt;. [M&amp;eacute;moires de la Soci&amp;eacute;t&amp;eacute; Finno-Ougrienne 134]. Helsinki: Suomalais-Ugrilainen Seura.&lt;/li&gt;
	&lt;li&gt;Munk&amp;aacute;csi, Bern&amp;aacute;t 1896: &lt;em&gt;Vogul n&amp;eacute;pk&amp;ouml;lt&amp;eacute;si gyűjtem&amp;eacute;ny&lt;/em&gt; IV. &amp;Eacute;letk&amp;eacute;pek. Budapest: Magyar Tudom&amp;aacute;nyos Akad&amp;eacute;mia.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Corpus size&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The corpus currently contains &lt;strong&gt;29 &lt;/strong&gt;transcripts with &lt;strong&gt;2,042 &lt;/strong&gt;utterances and &lt;strong&gt;11,879 &lt;/strong&gt;tokens.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Funding&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies&amp;rsquo; Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies&amp;rsquo; Programme is coordinated by the Union of the German Academies of Sciences and Humanities.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Searching the corpus&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The corpus can be downloaded from the ZFDM Repository using the links provided below and browsed or searched locally using the&amp;nbsp;&lt;a href="https://exmaralda.org/"&gt;EXMARaLDA&lt;/a&gt;&amp;nbsp;software or, alternatively,&amp;nbsp;&lt;a href="https://archive.mpi.nl/tla/elan"&gt;ELAN&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Online search with the Tsakorpus platform is available at&amp;nbsp;&lt;a href="https://inel.corpora.uni-hamburg.de/TavdaMansiCorpus/search"&gt;https://inel.corpora.uni-hamburg.de/TavdaMansiCorpus/search&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Remote search with EXMARaLDA is also possible without downloading all the files (see&amp;nbsp;&lt;a href="https://inel.corpora.uni-hamburg.de/portal/help/en/index.php"&gt;https://inel.corpora.uni-hamburg.de/portal/help/en/index.php&lt;/a&gt;).&lt;/p&gt;

&lt;p&gt;See the user documentation (section 3) for details on transcription, annotation tiers and annotation tags. Find further information and links on the Mansi Corpus page at the INEL Resources portal:&amp;nbsp;&lt;a href="https://inel.corpora.uni-hamburg.de/portal/corpora/mansi/"&gt;https://inel.corpora.uni-hamburg.de/portal/corpora/mansi/&lt;/a&gt;.&lt;/p&gt;</description>
  </descriptions>
</resource>
