Experimenting with Training a Neural Network in Transkribus to Recognise Text in a Multilingual and Multi-Authored Manuscript Collection [PDF]
This work aims at developing an optimal strategy to automatically transcribe a large quantity of uncategorised, digitised archival documents when resources include handwritten text by multiple authors and in several languages.
Carlotta Capurro +2 more
doaj +3 more sources
Transcribing Foucault’s handwriting with Transkribus [PDF]
The Foucault Fiches de Lecture (FFL) project aims both to explore and to make available online a large set of Michel Foucault’s reading notes (organized citations, references and comments) held at the BnF since 2013.
Marie-Laure Massot +2 more
doaj +4 more sources
Transcribing Foucault’s handwriting with Transkribus / Transcrire l'écriture de Foucault avec Transkribus [PDF]
The Foucault Fiches de Lecture (FFL) project aims both to explore and to make available online a large set of Michel Foucault’s reading notes (organized citations, references and comments) held at the BnF since 2013.
Marie-Laure Massot +2 more
doaj +2 more sources
Handwritten Text Recognition of Ukrainian Manuscripts in the 21st Century: Possibilities, Challenges, and the Future of the First Generic AI-based Model [PDF]
This article reports on developing and evaluating a generic Handwritten Text Recognition (HTR) model created for the automatic computer-assisted transcription of Ukrainian handwriting publicly available via the HTR platform Transkribus.
Aleksej Tikhonov, Achim Rabus
doaj +4 more sources
Collaborative Workflows for Handwritten Text Recognition in Under-Resourced Manuscript Collections [PDF]
This article addresses important questions that arise when trying to transcribe large and diverse historical manuscript collections, with a focus on under-resourced languages and scripts. Using a pilot study of challenging Tibetan manuscripts, we propose
Marieke Meelen, Rachael M. Griffiths
doaj +3 more sources
Assessing advanced handwritten text recognition engines for digitizing historical documents. [PDF]
This study provides critical insights and evaluates the performance of state-of-the-art Handwritten Text Recognition (HTR) engines (PyLaia, HTR+, IDA, TrOCR-f, and Transkribus’ proprietary Transformer-based “supermodel” Titan) to digitize historical ...
Romein CA +3 more
europepmc +5 more sources
Leveraging OCR and HTR cloud services towards data mobilisation of historical plant names. [PDF]
We present our solution to the problem of how to mobilise (that is, extract and enrich) digital data from the analogue, printed book version of Sir Hans Sloane’s copy of John Ray’s Historia Plantarum, to create the first searchable facility of its kind to ...
Sadek J +6 more
europepmc +3 more sources
Printed Text Recognition for Lexical Lists in Chinese-International Phonetic Alphabet (IPA) Glossing
This study presents a dataset serving as a benchmark for the recognition of printed text in lexical lists using Chinese-IPA glossing. The paper provides an overview of the baseline model, transcription model, and PyLaia engines employed in the research ...
Shihua Li, Nathan Hill
doaj +1 more source
Abstract The past decade has seen tremendous growth and innovation in the use of digital resources, methods, and tools in the history of art and architecture. While digital art history is less developed than text‐based disciplines, the emergence of new digital standards for visual and spatial data, and advances in computer vision are poised to ...
Alexander Brey
wiley +1 more source
Text Recognition for Nepalese Manuscripts in Pracalit Script
This dataset is a model for handwritten text recognition (HTR) of Sanskrit and Newar Nepalese manuscripts in Pracalit script. This paper introduces the state of the field in Newar literature, Newar manuscripts, and HTR engines.
Alexander James O’Neill, Nathan Hill
doaj +1 more source

