Results 1 to 10 of about 29,591 (137)

OCR-IDL: OCR Annotations for Industry Document Library Dataset [PDF]

open access: yesECCV Workshops, 2023
Pretraining has proven successful in Document Intelligence tasks where deluge of documents are used to pretrain the models only later to be finetuned on downstream tasks. One of the problems of the pretraining approaches is the inconsistent usage of pretraining data with different OCR engines leading to incomparable results between models.
Biten, Ali Furkan   +4 more
openaire   +3 more sources

Levenshtein OCR [PDF]

open access: yesEuropean Conference on Computer Vision, 2022
A novel scene text recognizer based on Vision-Language Transformer (VLT) is presented. Inspired by Levenshtein Transformer in the area of NLP, the proposed method (named Levenshtein OCR, and LevOCR for short) explores an alternative way for automatically transcribing textual content from cropped natural images.
Da, Cheng, Wang, Peng, Yao, Cong
openaire   +3 more sources

OCR17: Ground Truth and Models for 17th c. French Prints (and hopefully more) [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2023
Machine learning begins with machine teaching: in the following paper, we present the data that we have prepared to kick-start the training of reliable OCR models for 17th century prints written in French. The construction of a representative corpus is a
Simon Gabay   +2 more
doaj   +1 more source

Distinctive features of recognition for documents printed in the Romanian transitional alphabets [PDF]

open access: yesComputer Science Journal of Moldova, 2023
In this paper, we summarize the research of digitization of documents printed by Romanian transitional alphabet. These printings are the most original Romanian historical documents, which makes our experience useful when researching OCR methods for ...
Tudor Bumbu   +4 more
doaj   +1 more source

UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model [PDF]

open access: yesConference on Empirical Methods in Natural Language Processing, 2023
Text is ubiquitous in our visual world, conveying crucial information, such as in documents, websites, and everyday photographs. In this work, we propose UReader, a first exploration of universal OCR-free visually-situated language understanding based on
Jiabo Ye   +13 more
semanticscholar   +1 more source

Diversification of Legislation Editing Open Software (LEOS) Using Software Agents—Transforming Parliamentary Control of the Hellenic Parliament into Big Open Legal Data

open access: yesBig Data and Cognitive Computing, 2021
The accessibility and reuse of legal data is paramount for promoting transparency, accountability and, ultimately, trust towards governance institutions.
Sotiris Leventis   +2 more
doaj   +1 more source

PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System [PDF]

open access: yesarXiv.org, 2022
Optical character recognition (OCR) technology has been widely used in various scenarios, as shown in Figure 1. De-signing a practical OCR system is still a meaningful but chal- lenging task.
Chenxia Li   +11 more
semanticscholar   +1 more source

OCR-D - Koordinierte Förderinitiative zur Weiterentwicklung von OCR-Verfahren [PDF]

open access: yeso-bib. Das offene Bibliotheksjournal, 2017
Das Projekt OCR-D hat zum Ziel, das Verfahren der automatischen Texterkennung historischer Texte weiterzuentwickeln. Nach einer primären Phase der Bedarfsanalyse folgt 2018 die Modulprojektphase. Der vorliegende Artikel beschreibt in Kürze das in der ersten Projektphase erarbeitete Funktionsmodell von OCR-D und geht auf die Herausforderungen der ...
Herrmann, Elisa, Stäcker, Thomas
openaire   +4 more sources

DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents [PDF]

open access: yesIEEE International Conference on Document Analysis and Recognition, 2023
Information Extraction from visually rich documents is a challenging task that has gained a lot of attention in recent years due to its importance in several document-control based applications and its widespread commercial value.
M. Dhouib, G. Bettaieb, A. Shabou
semanticscholar   +1 more source

Towards a general open dataset and model for late medieval Castilian text recognition (HTR/OCR) [PDF]

open access: yesJournal of Data Mining and Digital Humanities, 2023
Submitted to the Journal of Data Mining and Digital Humanities, and accepted. Pending last revisions. Please cite: @article{gille_levenson_2023_towards, author = {Gille Levenson, Matthias}, date = {2023}, journaltitle = {Journal of Data Mining and ...
Matthias Gille Levenson
doaj   +1 more source

Home - About - Disclaimer - Privacy