The DLA-RMR dataset: Annotated subset of RMR notebooks for CVC development

Link:
Autor/in:
Verlag/Körperschaft:
Universität Hamburg
Erscheinungsjahr:
2025
Medientyp:
Datensatz
Schlagworte:
  • page detection
  • word detection
  • colour recognition
  • recognition of writing implement
  • visual navigation
  • computational visual cataloguing
Beschreibung:
  • What’s new in this version: Some annotations were missing one of the visual attributes (orientation or writing implement). All missing attributes have been added in this version.

    This dataset is structured into four components, each serving a distinct role in the development of a document analysis system. 

    1. Word-level annotations are provided in the file word_annotations_for_cropped_images.json. These annotations describe the images contained in the cropped_images folder. Each entry specifies the location of a word as a polygon, together with its orientation (horizontal, vertical, or tilted) and the type of writing implement used (ink or pencil). Additional metadata, such as bounding boxes and segmentation areas, is also included.

    2. Cropped images are stored in the cropped_images folder. This set comprises 50 images, each containing only the primary page extracted from the corresponding full notebook scans.

    3. Full images are located in the full_images folder. This collection also contains 50 items, representing the complete notebook scans in which the primary page appears alongside other material.

    4. Page-level annotations are contained in the page_annotations folder. These are provided in YOLO format, with a single class (page) defined in classes.txt. Each annotation file specifies the bounding box of the primary page within the corresponding image in the full_images folder.

    Examples illustrate the annotation structure. In the JSON file, a typical word annotation records polygon coordinates, the attribute "orientation": "horizontal", and "writing_tool": "pencil". In the YOLO annotations, a sample entry such as 0 0.499023 0.500776 0.777344 0.816912 denotes the normalised coordinates of the primary page bounding box.

    Acknowledgement:

    The research for this work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy - EXC 2176 ‘Understanding Written Artefacts: Material, Interaction and Transmission in Manuscript Cultures’, project no. 390893796. The research was conducted within the scope of the Centre for the Study of Manuscript Cultures (CSMC) at Universität Hamburg.

    The images are taken from notebook pages of Rainer Maria Rilke, from the Deutsche Literaturarchiv Marbach (DLA), A:Rilke-Archiv Gernsbach. 

    We thank Hui Xu for her support in annotating the images.

relatedIdentifier:
DOI 10.25592/uhhfdm.17809 DOI 10.25592/uhhfdm.17613 DOI 10.25592/uhhfdm.17615 DOI 10.25592/uhhfdm.17931
Lizenzen:
  • https://creativecommons.org/licenses/by/4.0/legalcode
  • info:eu-repo/semantics/openAccess
Quellsystem:
Forschungsdatenrepositorium der UHH

Interne Metadaten
Quelldatensatz
oai:fdr.uni-hamburg.de:18089