Dataset Open Access

Computational Visual Catalogue (CVC) - Rilke's notebooks - Minimal Example

Hussein Mohammed; Quang-Vinh Dang


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="o">oai:fdr.uni-hamburg.de:17808</subfield>
    <subfield code="p">user-csmc</subfield>
    <subfield code="p">user-uhh</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="a">10.25592/uhhfdm.17613</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="n">doi</subfield>
  </datafield>
  <controlfield tag="005">20250812081000.0</controlfield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2025-06-08</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Computational visual catalogue</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Computational Visual Catalogue (CVC) - Rilke's notebooks - Minimal Example</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">54627</subfield>
    <subfield code="u">https://www.fdr.uni-hamburg.de/record/17808/files/Computational Visual Catalogue.png</subfield>
    <subfield code="z">md5:6d80f39e6e59025e535e6bf95ea4b834</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">62471104</subfield>
    <subfield code="u">https://www.fdr.uni-hamburg.de/record/17808/files/Testset_ScriptSight_v1.5.zip</subfield>
    <subfield code="z">md5:6794baf6cbf12891f09199d565bd400d</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;This small test set consists of 30 images and one JSON file. The images show a few notebook pages by Rainer Maria Rilke, from the Deutsche Literaturarchiv Marbach (DLA), A:Rilke-Archiv Gernsbach. The JSON file was generated computationally using several AI models and contains information extracted automatically from the images about visual properties of the text, such as word location, colour, orientation, and writing implement.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What&amp;rsquo;s new in this version:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;
	&lt;p&gt;The annotations in this JSON file were generated using our enhanced models for improved accuracy in word detection, colour recognition, writing implement recognition, and orientation classification.&lt;/p&gt;
	&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The structure of the JSON file is as follows:&lt;/p&gt;

&lt;p&gt;Root (object)&lt;br&gt;
├─ info (object)&lt;br&gt;
│ &amp;nbsp; ├─ description &amp;nbsp; : string&lt;br&gt;
│ &amp;nbsp; ├─ contributor &amp;nbsp; : string&lt;br&gt;
│ &amp;nbsp; ├─ version &amp;nbsp; &amp;nbsp; &amp;nbsp; : string&lt;br&gt;
│ &amp;nbsp; ├─ year &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;: integer&lt;br&gt;
│ &amp;nbsp; └─ date_created &amp;nbsp;: string &amp;nbsp; &amp;nbsp;# &amp;quot;YYYY-MM-DD&amp;quot;&lt;br&gt;
│&lt;br&gt;
├─ images (array of object)&lt;br&gt;
│ &amp;nbsp; └─ [image] (object)&lt;br&gt;
│ &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ id &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;: integer&lt;br&gt;
│ &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ file_name : string&lt;br&gt;
│ &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ width &amp;nbsp; &amp;nbsp; : integer&lt;br&gt;
│ &amp;nbsp; &amp;nbsp; &amp;nbsp; └─ height &amp;nbsp; &amp;nbsp;: integer&lt;br&gt;
│&lt;br&gt;
└─ annotations (array of object)&lt;br&gt;
&amp;nbsp; &amp;nbsp; └─ [annotation] (object)&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ id &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;: integer&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ image_id &amp;nbsp; &amp;nbsp; &amp;nbsp;: integer&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ category_id &amp;nbsp; : integer&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ bbox &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;: array of 4 numbers &amp;nbsp; &amp;nbsp; &amp;nbsp;# [x, y, width, height]&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ area &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;: number &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; # float&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ segmentation &amp;nbsp;: array of array of number &amp;nbsp;# [[x1, y1, x2, y2, &amp;hellip;]]&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ iscrowd &amp;nbsp; &amp;nbsp; &amp;nbsp; : integer &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;# 0 or 1&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ score &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; : number &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; # float&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ color_name &amp;nbsp; &amp;nbsp;: string&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ color_code &amp;nbsp; &amp;nbsp;: string &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; # e.g. &amp;quot;145-144-122&amp;quot;&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ├─ orientation &amp;nbsp; : string &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; # e.g. &amp;quot;hor&amp;quot; or &amp;quot;ver&amp;quot;&lt;br&gt;
&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; └─ writing_tool &amp;nbsp;: string &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; # e.g. &amp;quot;pcl&amp;quot;&lt;br&gt;
&amp;nbsp;&lt;/p&gt;

&lt;p&gt;See the ScriptSight tool for examples of how this computational visual catalogue can be used.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Acknowledgements:&amp;nbsp;&lt;/strong&gt;&lt;br&gt;
The research for this work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany&amp;rsquo;s Excellence Strategy - EXC 2176 &amp;lsquo;Understanding Written Artefacts: Material, Interaction and Transmission in Manuscript Cultures&amp;rsquo;, project no. 390893796. The research was conducted within the scope of the Centre for the Study of Manuscript Cultures (CSMC) at Universit&amp;auml;t Hamburg.&lt;/p&gt;

&lt;p&gt;The images are provided by the&amp;nbsp;Deutsche Literaturarchiv Marbach (DLA) as part of its collaboration with the CSMC.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-csmc</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-uhh</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Quang-Vinh Dang</subfield>
    <subfield code="u">Universität Hamburg</subfield>
    <subfield code="0">(orcid)0000-0002-6715-7112</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.25592/uhhfdm.17808</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Hussein Mohammed</subfield>
    <subfield code="u">Universität Hamburg</subfield>
    <subfield code="0">(orcid)0000-0001-5020-3592</subfield>
  </datafield>
  <controlfield tag="001">17808</controlfield>
</record>
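
As a rough illustration of how the export above might be consumed, the following Python sketch reads the title, the DOI, and the file entries (URL, size, checksum) from a MARC21 XML record using only the standard library. The helper names `subfield_values` and `list_files` are hypothetical and introduced only for this example. The inline sample is an abbreviated copy of the record above. Reading subfield `s` as a size in bytes and splitting subfield `z` at the first colon into algorithm and digest are assumptions based on how the values look in this record.

```python
import xml.etree.ElementTree as ET

# Namespace declared on the <record> element of the export.
MARC_NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Abbreviated copy of the record above, inlined so the sketch is self-contained.
SAMPLE_RECORD = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Computational Visual Catalogue (CVC) - Rilke's notebooks - Minimal Example</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">54627</subfield>
    <subfield code="u">https://www.fdr.uni-hamburg.de/record/17808/files/Computational Visual Catalogue.png</subfield>
    <subfield code="z">md5:6d80f39e6e59025e535e6bf95ea4b834</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">62471104</subfield>
    <subfield code="u">https://www.fdr.uni-hamburg.de/record/17808/files/Testset_ScriptSight_v1.5.zip</subfield>
    <subfield code="z">md5:6794baf6cbf12891f09199d565bd400d</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.25592/uhhfdm.17808</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
</record>"""


def subfield_values(record, tag, code):
    """Return the text of every subfield `code` in datafields with the given `tag`."""
    path = f"marc:datafield[@tag='{tag}']/marc:subfield[@code='{code}']"
    return [element.text for element in record.findall(path, MARC_NS)]


def list_files(record):
    """Collect one dict per 856 datafield (hypothetical helper)."""
    files = []
    for field in record.findall("marc:datafield[@tag='856']", MARC_NS):
        by_code = {sf.get("code"): sf.text for sf in field.findall("marc:subfield", MARC_NS)}
        # Assumption: subfield "z" has the form "<algorithm>:<hex digest>".
        algorithm, _, digest = by_code["z"].partition(":")
        files.append({
            "url": by_code["u"],
            "size": int(by_code["s"]),  # assumed to be a size in bytes
            "algorithm": algorithm,
            "digest": digest,
        })
    return files


if __name__ == "__main__":
    record = ET.fromstring(SAMPLE_RECORD)
    print(subfield_values(record, "245", "a"))
    print(subfield_values(record, "024", "a"))
    for entry in list_files(record):
        print(entry["url"], entry["size"], entry["algorithm"], entry["digest"])
```

The datafields in the export are not in tag order, so the sketch selects them by `tag` attribute rather than by position.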

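The JSON structure listed in the description can be worked with along the following lines. This is a minimal Python sketch and not the authors' code. The sample document is invented for illustration and only follows the documented field names and types; the values for `color_code`, `orientation`, and `writing_tool` reuse the examples given in the description, and all other values (including the file name) are hypothetical. Reading `color_code` as three integer channels separated by hyphens is an assumption, and the function names are hypothetical.

```python
import json
from collections import defaultdict

# Invented sample that follows the documented structure; values are hypothetical
# except the color_code, orientation and writing_tool examples from the description.
SAMPLE_JSON = """
{
  "info": {"description": "hypothetical sample", "contributor": "n/a",
           "version": "0", "year": 2025, "date_created": "2025-06-08"},
  "images": [
    {"id": 1, "file_name": "page_001.jpg", "width": 2000, "height": 3000}
  ],
  "annotations": [
    {"id": 10, "image_id": 1, "category_id": 1,
     "bbox": [100.0, 200.0, 50.0, 20.0], "area": 1000.0,
     "segmentation": [[100.0, 200.0, 150.0, 200.0, 150.0, 220.0, 100.0, 220.0]],
     "iscrowd": 0, "score": 0.9,
     "color_name": "grey", "color_code": "145-144-122",
     "orientation": "hor", "writing_tool": "pcl"}
  ]
}
"""


def bbox_to_corners(bbox):
    """Convert [x, y, width, height] to (x_min, y_min, x_max, y_max)."""
    x, y, width, height = bbox
    return (x, y, x + width, y + height)


def parse_color_code(code):
    """Split a code such as "145-144-122" into a tuple of integers.

    Assumption: the three parts are integer colour channels.
    """
    return tuple(int(part) for part in code.split("-"))


def annotations_by_image(catalogue):
    """Group annotations under the file name of the image they refer to."""
    file_names = {image["id"]: image["file_name"] for image in catalogue["images"]}
    grouped = defaultdict(list)
    for annotation in catalogue["annotations"]:
        grouped[file_names[annotation["image_id"]]].append(annotation)
    return dict(grouped)


if __name__ == "__main__":
    catalogue = json.loads(SAMPLE_JSON)
    for file_name, annotations in annotations_by_image(catalogue).items():
        for annotation in annotations:
            print(
                file_name,
                bbox_to_corners(annotation["bbox"]),
                parse_color_code(annotation["color_code"]),
                annotation["orientation"],
                annotation["writing_tool"],
            )
```

For the real file, `json.loads(SAMPLE_JSON)` would be replaced by `json.load` on the JSON file in the downloaded archive; its file name is not stated in the record, so it is left out here.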