Dataset Open Access
Hussein Mohammed;
Quang-Vinh Dang
{"@context":"https://schema.org/","@id":"http://doi.org/10.25592/uhhfdm.17932","@type":"Dataset","creator":[{"@id":"https://orcid.org/0000-0001-5020-3592","@type":"Person","affiliation":"Universit\u00e4t Hamburg","name":"Hussein Mohammed"},{"@id":"https://orcid.org/0000-0002-6715-7112","@type":"Person","affiliation":"Universit\u00e4t Hamburg","name":"Quang-Vinh Dang"}],"datePublished":"2025-09-05","description":"<p>This dataset is structured into four components, each serving a distinct role in the development of a document analysis system.</p>\n\n<ol>\n\t<li>\n\t<p><strong>Word-level annotations</strong> are provided in the file <code>word_annotations_for_cropped_images.json</code>. These annotations describe the images contained in the <code>cropped_images</code> folder. Each entry specifies the location of a word as a polygon, together with its orientation (horizontal, vertical, or tilted) and the type of writing implement used (ink or pencil). Additional metadata, such as bounding boxes and segmentation areas, is also included.</p>\n\t</li>\n\t<li>\n\t<p><strong>Cropped images</strong> are stored in the <code>cropped_images</code> folder. This set comprises 50 images, each containing only the primary page extracted from the corresponding full notebook scans.</p>\n\t</li>\n\t<li>\n\t<p><strong>Full images</strong> are located in the <code>full_images</code> folder. This collection also contains 50 items, representing the complete notebook scans in which the primary page appears alongside other material.</p>\n\t</li>\n\t<li>\n\t<p><strong>Page-level annotations</strong> are contained in the <code>page_annotations</code> folder. These are provided in YOLO format, with a single class (<code>page</code>) defined in <code>classes.txt</code>. Each annotation file specifies the bounding box of the primary page within the corresponding image in the <code>full_images</code> folder.</p>\n\t</li>\n</ol>\n\n<p>Examples illustrate the annotation structure. 
In the JSON file, a typical word annotation records polygon coordinates, the attribute <code>"orientation": "horizontal"</code>, and <code>"writing_tool": "pencil"</code>. In the YOLO annotations, a sample entry such as <code>0 0.499023 0.500776 0.777344 0.816912</code> denotes the normalised coordinates of the primary page bounding box.</p>\n\n<p><strong>Acknowledgement:</strong></p>\n\n<p>The research for this work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy - EXC 2176 ‘Understanding Written Artefacts: Material, Interaction and Transmission in Manuscript Cultures’, project no. 390893796. The research was conducted within the scope of the Centre for the Study of Manuscript Cultures (CSMC) at Universität Hamburg.</p>\n\n<p>We thank Hui Xu for her support in annotating the images.</p>","distribution":[{"@type":"DataDownload","contentUrl":"https://www.fdr.uni-hamburg.de/api/files/f19ccf9b-d18b-4d52-be91-d949824d9cfb/DLA_RMR_AnnotatedSubset.zip","encodingFormat":"zip"},{"@type":"DataDownload","contentUrl":"https://www.fdr.uni-hamburg.de/api/files/f19ccf9b-d18b-4d52-be91-d949824d9cfb/HS01309170_0053.jpg","encodingFormat":"jpg"}],"identifier":"http://doi.org/10.25592/uhhfdm.17932","inLanguage":{"@type":"Language","alternateName":"eng","name":"English"},"keywords":["page detection","word detection","colour recognition","recognition of writing implement","visual navigation","computational visual cataloguing"],"license":"https://creativecommons.org/licenses/by/4.0/legalcode","name":"Annotated subset of RDR notebooks for CVC development","url":"https://www.fdr.uni-hamburg.de/record/17932","version":"1"}
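The YOLO sample entry quoted in the description can be turned into pixel coordinates with a few lines of Python. The following is a minimal sketch, assuming the usual YOLO text layout of `class x_center y_center width height`, with all four values normalised to the image size. The function name `yolo_line_to_pixel_box`, the `PixelBox` type, and the 1000 x 1400 image size are hypothetical and serve only as illustration; the real dimensions have to be read from the corresponding file in `full_images`.

```python
from typing import NamedTuple


class PixelBox(NamedTuple):
    """Axis-aligned box in pixel coordinates (hypothetical helper type)."""
    class_id: int
    left: float
    top: float
    right: float
    bottom: float


def yolo_line_to_pixel_box(line: str, image_width: int, image_height: int) -> PixelBox:
    """Convert one YOLO annotation line to a pixel-space box.

    Assumes the layout `class x_center y_center width height`,
    with the four coordinates normalised to the range [0, 1].
    """
    fields = line.split()
    if len(fields) != 5:
        raise ValueError(f"expected 5 fields, got {len(fields)}: {line!r}")

    class_id = int(fields[0])
    x_center, y_center, width, height = (float(value) for value in fields[1:])

    # Move from centre/size to corner coordinates, then scale to pixels.
    left = (x_center - width / 2) * image_width
    top = (y_center - height / 2) * image_height
    right = (x_center + width / 2) * image_width
    bottom = (y_center + height / 2) * image_height
    return PixelBox(class_id, left, top, right, bottom)


# Sample entry from the dataset description; the image size is an assumption.
sample = "0 0.499023 0.500776 0.777344 0.816912"
box = yolo_line_to_pixel_box(sample, image_width=1000, image_height=1400)
print(box)
```

The resulting corners could then be rounded and passed to an image library to cut the primary page out of a full scan, which is presumably how the files in `cropped_images` relate to those in `full_images`.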