Dataset Open Access

B4 Tatian Corpus of Deviating Examples 2.1

Petrova, Svetlana

Data curator(s)
Petrova, Svetlana; Chen, Yen Chun; Odebrecht, Carolin; Battefeld, Malte; Linde, Sonja; Donhauser, Karin; Solf, Michael; Kullick, Axel; Gehrlein, Anke

The present corpus, the Tatian Corpus of Deviating Examples (T-CODEX) 2.1, provides morpho-syntactic and information-structural annotation of parts of the Old High German translation attested in MS St. Gallen Cod. 56, traditionally called the OHG Tatian, one of the largest prose texts from the classical OHG period. The corpus was designed and annotated by Project B4 of the Collaborative Research Center on Information Structure at Humboldt University Berlin. It compiles ca. 2,000 deviating examples found in the text portions of the scribes α, β, γ and ε. Each clause structure is stored in a separate file, annotated with the annotation tool EXMARaLDA and searchable via ANNIS, a general-purpose tool for the publication, visualisation and querying of linguistic data collections developed by Project D1 of the Collaborative Research Center on Information Structure at Potsdam University.

CLARIN Metadata summary for B4 Tatian Corpus of Deviating Examples 2.1 (CMDI-based)

Title: B4 Tatian Corpus of Deviating Examples 2.1
Description: The present corpus, the Tatian Corpus of Deviating Examples (T-CODEX) 2.1, provides morpho-syntactic and information-structural annotation of parts of the Old High German translation attested in MS St. Gallen Cod. 56, traditionally called the OHG Tatian, one of the largest prose texts from the classical OHG period. The corpus was designed and annotated by Project B4 of the Collaborative Research Center on Information Structure at Humboldt University Berlin. It compiles ca. 2,000 deviating examples found in the text portions of the scribes α, β, γ and ε. Each clause structure is stored in a separate file, annotated with the annotation tool EXMARaLDA and searchable via ANNIS, a general-purpose tool for the publication, visualisation and querying of linguistic data collections developed by Project D1 of the Collaborative Research Center on Information Structure at Potsdam University.
Publication date: 2014-12-01
Data owner: Prof. Dr. Svetlana Petrova
Contributors: Svetlana Petrova (editor), Karin Donhauser (editor), Carolin Odebrecht (editor), Svetlana Petrova (annotator), Carolin Odebrecht (annotator), Michael Solf (annotator), Yen Chun Chen (annotator), Axel Kullick (annotator), Malte Battefeld (annotator), Sonja Linde (annotator), Anke Gehrlein (annotator)
Project: Collaborative Research Center 632 "Information Structure", German Research Foundation
Keywords: historical texts, religious texts, information structure
Languages: Latin (lat), Old High German (goh)
Size: 11,295 tokens
Segmentation units: other
Annotation types: aboutness (manual), tok (manual), LAT (manual), align (manual), pos (manual), cat (manual), clause-status (manual), gf (manual), syl_no (manual), givenness (manual), top-comm (manual), position (manual), topic-marker (manual), definiteness (manual), foc-bg (manual), foc-marker (manual), context (manual), comment (manual), bibl (manual), meta::writer (manual), meta::corpus-code (manual), meta::page (manual), X::abbreviation (manual), X::sex (manual)
Temporal Coverage: 830-01-01/830-12-31
Spatial Coverage: Fulda, DE
Genre: religious text
Modality: written

Files (30.5 MB)
Name            MD5                               Size
b4.tatian.cmdi  30b844458fbe17a721e5e8c64d0cab81  75.3 kB
b4.tatian.xml   0da9429f4f63e9ed7d9aa786ceac5cab  6.7 kB
b4.tatian.zip   c545664fc1b8bc22664ee72ab57dfe80  30.4 MB
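
The MD5 checksums published with the files can be used to confirm that a download arrived intact. The following is a minimal sketch in Python, assuming the three files were saved under their listed names in a single local directory; the function names `md5_of` and `verify_downloads` are illustrative and not part of the record or of any corpus tooling.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing of this record.
EXPECTED_MD5 = {
    "b4.tatian.cmdi": "30b844458fbe17a721e5e8c64d0cab81",
    "b4.tatian.xml": "0da9429f4f63e9ed7d9aa786ceac5cab",
    "b4.tatian.zip": "c545664fc1b8bc22664ee72ab57dfe80",
}


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each listed file name to 'ok', 'mismatch', or 'missing'.

    `directory` is an assumed local download folder (hypothetical default).
    """
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of(path) == expected:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results


if __name__ == "__main__":
    for name, status in verify_downloads().items():
        print(f"{name}: {status}")
```

A "mismatch" result usually points to an incomplete or corrupted download, in which case fetching the file again is the simplest remedy. MD5 is used here only because it is the checksum the record publishes; it detects accidental corruption but is not a safeguard against deliberate tampering.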
