Dataset Open Access
Hussein Mohammed;
Quang-Vinh Dang
<?xml version='1.0' encoding='utf-8'?> <resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-3" xsi:schemaLocation="http://datacite.org/schema/kernel-3 http://schema.datacite.org/meta/kernel-3/metadata.xsd"> <identifier identifierType="DOI">10.25592/uhhfdm.17932</identifier> <creators> <creator> <creatorName>Hussein Mohammed</creatorName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0001-5020-3592</nameIdentifier> <affiliation>Universität Hamburg</affiliation> </creator> <creator> <creatorName>Quang-Vinh Dang</creatorName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-6715-7112</nameIdentifier> <affiliation>Universität Hamburg</affiliation> </creator> </creators> <titles> <title>Annotated subset of RDR notebooks for CVC development</title> </titles> <publisher>Universität Hamburg</publisher> <publicationYear>2025</publicationYear> <subjects> <subject>page detection</subject> <subject>word detection</subject> <subject>colour recognition</subject> <subject>recognition of writing implement</subject> <subject>visual navigation</subject> <subject>computational visual cataloguing</subject> </subjects> <dates> <date dateType="Issued">2025-09-05</date> </dates> <language>en</language> <resourceType resourceTypeGeneral="Dataset"/> <alternateIdentifiers> <alternateIdentifier alternateIdentifierType="url">https://www.fdr.uni-hamburg.de/record/17932</alternateIdentifier> </alternateIdentifiers> <relatedIdentifiers> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsSupplementedBy">10.25592/uhhfdm.17809</relatedIdentifier> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsSupplementedBy">10.25592/uhhfdm.17613</relatedIdentifier> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsSupplementedBy">10.25592/uhhfdm.17615</relatedIdentifier> <relatedIdentifier relatedIdentifierType="DOI" 
relationType="IsPartOf">10.25592/uhhfdm.17931</relatedIdentifier> </relatedIdentifiers> <version>1</version> <rightsList> <rights rightsURI="https://creativecommons.org/licenses/by/4.0/legalcode">Creative Commons Attribution 4.0 International</rights> <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights> </rightsList> <descriptions> <description descriptionType="Abstract"><p>This dataset is structured into four components, each serving a distinct role in the development of a document analysis system.</p> <ol> <li> <p><strong>Word-level annotations</strong> are provided in the file <code>word_annotations_for_cropped_images.json</code>. These annotations describe the images contained in the <code>cropped_images</code> folder. Each entry specifies the location of a word as a polygon, together with its orientation (horizontal, vertical, or tilted) and the type of writing implement used (ink or pencil). Additional metadata, such as bounding boxes and segmentation areas, is also included.</p> </li> <li> <p><strong>Cropped images</strong> are stored in the <code>cropped_images</code> folder. This set comprises 50 images, each containing only the primary page extracted from the corresponding full notebook scan.</p> </li> <li> <p><strong>Full images</strong> are located in the <code>full_images</code> folder. This collection also contains 50 items, representing the complete notebook scans in which the primary page appears alongside other material.</p> </li> <li> <p><strong>Page-level annotations</strong> are contained in the <code>page_annotations</code> folder. These are provided in YOLO format, with a single class (<code>page</code>) defined in <code>classes.txt</code>. Each annotation file specifies the bounding box of the primary page within the corresponding image in the <code>full_images</code> folder.</p> </li> </ol> <p>The following examples illustrate the annotation structure.
In the JSON file, a typical word annotation records polygon coordinates together with attributes such as <code>&quot;orientation&quot;: &quot;horizontal&quot;</code> and <code>&quot;writing_tool&quot;: &quot;pencil&quot;</code>. In the YOLO annotations, a sample entry such as <code>0 0.499023 0.500776 0.777344 0.816912</code> gives the class index followed by the normalised centre coordinates, width, and height of the primary page bounding box.</p> <p><strong>Acknowledgement:</strong></p> <p>The research for this work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy - EXC 2176 ‘Understanding Written Artefacts: Material, Interaction and Transmission in Manuscript Cultures’, project no. 390893796. The research was conducted within the scope of the Centre for the Study of Manuscript Cultures (CSMC) at Universität Hamburg.</p> <p>We thank Hui Xu for her support in annotating the images.</p></description> </descriptions> </resource>
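The abstract above quotes a sample YOLO entry for the primary page. As a minimal sketch, assuming the usual YOLO text-label layout (class index, then centre x, centre y, width, and height, all normalised to the image size), the entry could be converted to pixel corner coordinates as shown below. The function name `yolo_to_pixel_box`, the `PixelBox` type, and the image size of 1000 x 2000 pixels are hypothetical and chosen only for illustration; the real scan dimensions would have to be read from the corresponding file in `full_images`.

```python
from typing import NamedTuple


class PixelBox(NamedTuple):
    """Axis-aligned box in pixel coordinates (hypothetical helper type)."""
    class_id: int
    left: float
    top: float
    right: float
    bottom: float


def yolo_to_pixel_box(line: str, image_width: int, image_height: int) -> PixelBox:
    """Convert one YOLO label line to pixel corner coordinates.

    Assumes the layout: class x_center y_center width height,
    with the four box values normalised to the range 0..1.
    """
    fields = line.split()
    if len(fields) != 5:
        raise ValueError(f"expected 5 fields, got {len(fields)}: {line!r}")

    class_id = int(fields[0])
    x_center, y_center, width, height = (float(value) for value in fields[1:])

    # Scale the normalised values to pixels.
    cx = x_center * image_width
    cy = y_center * image_height
    half_w = width * image_width / 2
    half_h = height * image_height / 2

    return PixelBox(class_id, cx - half_w, cy - half_h, cx + half_w, cy + half_h)


# Sample entry quoted in the abstract; the image size is an assumption.
sample = "0 0.499023 0.500776 0.777344 0.816912"
box = yolo_to_pixel_box(sample, image_width=1000, image_height=2000)
print(box)
```

With the real image dimensions, the resulting corners could be used to crop the primary page from a full scan, which is presumably how the images in `cropped_images` relate to those in `full_images`.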