The Bullinger Dataset: A Writer Adaptation Challenge

Scius-Bertrand, Anna; Ströbel, Phillip; Volk, Martin; Hodel, Tobias; Fischer, Andreas (August 2023). The Bullinger Dataset: A Writer Adaptation Challenge. In: Fink, Gernot A.; Jain, Rajiv; Kise, Koichi; Zanibbi, Richard (eds.) ICDAR 2023. Lecture Notes in Computer Science: Vol. 14187 (pp. 397-410). Cham: Springer 10.1007/978-3-031-41676-7_23

[img] Text
978-3-031-41676-7_23.pdf - Published Version
Restricted to registered users only
Available under License Publisher holds Copyright.

Download (2MB) | Request a copy

One of the main challenges of automatically transcribing large collections of handwritten letters is to cope with the high variability of writing styles present in the collection. In particular, the writing styles of non-frequent writers, who have contributed only few letters, are often missing in the annotated learning samples used for training handwriting recognition systems. In this paper, we introduce the Bullinger dataset for writer adaptation, which is based on the Heinrich Bullinger letter collection from the 16th century, using a subset of 3,622 annotated letters (about 1.2 million words) from 306 writers. We provide baseline results for handwriting recognition with modern recognizers, before and after the application of standard techniques for supervised adaptation of frequent writers and self-supervised adaptation of non-frequent writers.

Item Type:

Conference or Workshop Item (Paper)

Division/Institute:

06 Faculty of Humanities > Other Institutions > Walter Benjamin Kolleg (WBKolleg) > Digital Humanities
06 Faculty of Humanities > Other Institutions > Walter Benjamin Kolleg (WBKolleg)

UniBE Contributor:

Ströbel, Phillip Benjamin, Hodel, Tobias Mathias

Subjects:

100 Philosophy
800 Literature, rhetoric & criticism
900 History

ISBN:

978-3-031-41676-7

Series:

Lecture Notes in Computer Science

Publisher:

Springer

Funders:

[159] Hasler Foundation

Language:

English

Submitter:

Tobias Mathias Hodel

Date Deposited:

12 Sep 2023 07:28

Last Modified:

12 Sep 2023 07:34

Publisher DOI:

10.1007/978-3-031-41676-7_23

BORIS DOI:

10.48350/185720

URI:

https://boris.unibe.ch/id/eprint/185720

Actions (login required)

Edit item Edit item
Provide Feedback