Pre-Modern Data: Applying Language Modeling and Named Entity Recognition on Criminal Records in the City of Bern

Hodel, Tobias; Prada Ziegler, Ismail; Schneider, Christa (2023). Pre-Modern Data: Applying Language Modeling and Named Entity Recognition on Criminal Records in the City of Bern. In: Digital Humanities 2023. Collaboration as Opportunity (DH2023). Graz. 13.07.2023. 10.5281/zenodo.8107616

[img]
Preview
Text
HODEL_Tobias_Pre_Modern_Data__Applying_Language_Modeling_and.pdf - Published Version
Available under License Creative Commons: Attribution (CC-BY).

Download (40kB) | Preview

How can NLP technologies be applied and measured on pre-modern documents? Based on a large handwritten dataset from the tower of Bern, we tested available language models and taggers, showcasing that specific forms of representation and identification need to be found. Relying on cooperation to further improve information retrieval.

Item Type:

Conference or Workshop Item (Paper)

Division/Institute:

06 Faculty of Humanities > Other Institutions > Walter Benjamin Kolleg (WBKolleg) > Digital Humanities
06 Faculty of Humanities > Other Institutions > Walter Benjamin Kolleg (WBKolleg)

UniBE Contributor:

Hodel, Tobias Mathias, Prada Ziegler, Ismail Muhammad, Schneider, Christa

Subjects:

100 Philosophy
800 Literature, rhetoric & criticism
900 History

Language:

German

Submitter:

Tobias Mathias Hodel

Date Deposited:

07 Jul 2023 08:44

Last Modified:

26 Jul 2023 08:57

Publisher DOI:

10.5281/zenodo.8107616

Uncontrolled Keywords:

Named Entity Recognition, Language Modeling, Pre-Modern German

BORIS DOI:

10.48350/184547

URI:

https://boris.unibe.ch/id/eprint/184547

Actions (login required)

Edit item Edit item
Provide Feedback