Results 1 to 10 of about 29,591 (137)
OCR-IDL: OCR Annotations for Industry Document Library Dataset [PDF]
Pretraining has proven successful in Document Intelligence tasks where deluge of documents are used to pretrain the models only later to be finetuned on downstream tasks. One of the problems of the pretraining approaches is the inconsistent usage of pretraining data with different OCR engines leading to incomparable results between models.
Biten, Ali Furkan +4 more
openaire +3 more sources
A novel scene text recognizer based on Vision-Language Transformer (VLT) is presented. Inspired by Levenshtein Transformer in the area of NLP, the proposed method (named Levenshtein OCR, and LevOCR for short) explores an alternative way for automatically transcribing textual content from cropped natural images.
Da, Cheng, Wang, Peng, Yao, Cong
openaire +3 more sources
OCR17: Ground Truth and Models for 17th c. French Prints (and hopefully more) [PDF]
Machine learning begins with machine teaching: in the following paper, we present the data that we have prepared to kick-start the training of reliable OCR models for 17th century prints written in French. The construction of a representative corpus is a
Simon Gabay +2 more
doaj +1 more source
Distinctive features of recognition for documents printed in the Romanian transitional alphabets [PDF]
In this paper, we summarize the research of digitization of documents printed by Romanian transitional alphabet. These printings are the most original Romanian historical documents, which makes our experience useful when researching OCR methods for ...
Tudor Bumbu +4 more
doaj +1 more source
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model [PDF]
Text is ubiquitous in our visual world, conveying crucial information, such as in documents, websites, and everyday photographs. In this work, we propose UReader, a first exploration of universal OCR-free visually-situated language understanding based on
Jiabo Ye +13 more
semanticscholar +1 more source
The accessibility and reuse of legal data is paramount for promoting transparency, accountability and, ultimately, trust towards governance institutions.
Sotiris Leventis +2 more
doaj +1 more source
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System [PDF]
Optical character recognition (OCR) technology has been widely used in various scenarios, as shown in Figure 1. De-signing a practical OCR system is still a meaningful but chal- lenging task.
Chenxia Li +11 more
semanticscholar +1 more source
OCR-D - Koordinierte Förderinitiative zur Weiterentwicklung von OCR-Verfahren [PDF]
Das Projekt OCR-D hat zum Ziel, das Verfahren der automatischen Texterkennung historischer Texte weiterzuentwickeln. Nach einer primären Phase der Bedarfsanalyse folgt 2018 die Modulprojektphase. Der vorliegende Artikel beschreibt in Kürze das in der ersten Projektphase erarbeitete Funktionsmodell von OCR-D und geht auf die Herausforderungen der ...
Herrmann, Elisa, Stäcker, Thomas
openaire +4 more sources
DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents [PDF]
Information Extraction from visually rich documents is a challenging task that has gained a lot of attention in recent years due to its importance in several document-control based applications and its widespread commercial value.
M. Dhouib, G. Bettaieb, A. Shabou
semanticscholar +1 more source
Towards a general open dataset and model for late medieval Castilian text recognition (HTR/OCR) [PDF]
Submitted to the Journal of Data Mining and Digital Humanities, and accepted. Pending last revisions. Please cite: @article{gille_levenson_2023_towards, author = {Gille Levenson, Matthias}, date = {2023}, journaltitle = {Journal of Data Mining and ...
Matthias Gille Levenson
doaj +1 more source

