Results 81 to 90 of about 5,692 (199)

An Efficient Language-Independent Multi-Font OCR for Arabic Script

open access: yesComputer Science & Information Technology (CS & IT), 2020
Optical Character Recognition (OCR) is the process of extracting digitized text from images of scanned documents. While OCR systems have already matured in many languages, they still have shortcomings in cursive languages with overlapping letters such as the Arabic language. This paper proposes a complete Arabic OCR system that takes a scanned image of
Osman, Hussein   +3 more
openaire   +2 more sources

CNN-optimized text recognition with binary embeddings for Arabic expiry date recognition

open access: yesJournal of Electrical Systems and Information Technology
Recognizing Arabic dot-matrix digits is a challenging problem due to the unique characteristics of dot-matrix fonts, such as irregular dot spacing and varying dot sizes.
Mohamed Lotfy, Ghada Soliman
doaj   +1 more source

PsOCR: Benchmarking large multimodal models for optical character recognition in low-resource pashto language

open access: yesAin Shams Engineering Journal
This paper evaluates the performance of Large Multimodal Models (LMMs) on Optical Character Recognition (OCR) for the low-resource Pashto language. Pashto OCR is challenging due to its cursive Perso-Arabic script and the scarcity of large-scale annotated
Ijazul Haq   +2 more
doaj   +1 more source

Glyph Identification and Character Recognition for Sindhi OCR [PDF]

open access: yesMehran University Research Journal of Engineering and Technology, 2017
A computer can read and write multiple languages and today?s computers are capable of understanding various human languages. A computer can be given instructions through various input methods but OCR (Optical Character Recognition) and handwritten ...
NISAR AHMEDMEMON   +2 more
doaj  

The “Digital Maktaba LP”

open access: yesUmanistica Digitale
Optical Character Recognition (OCR) plays a vital role in digitising and enabling access to historical records in digital libraries. Yet, OCR technologies frequently face challenges when interpreting and categorising intricate document structures ...
Riccardo Amerigo Vigliermo   +3 more
doaj   +1 more source

An Efficient Thinning Algorithm for Arabic OCR Systems

open access: yesSignal & Image Processing : An International Journal, 2012
This paper address an efficient iterative thinning algorithm based on boundary pixels deletion using colour coding for different pixel types. A black pixel is tested by observing neighbouring pixels, and it gives us an efficient way to decide whether the pixel is deleted or not.
openaire   +1 more source

Turkish Optical Character Recognition Under the Lens: A Systematic Review of Language-Specific Challenges, Dataset Scarcity, and Open-Source Limitations

open access: yesIEEE Access
This systematic literature review explores the progress, challenges, and opportunities in the field of Optical Character Recognition (OCR) for the Turkish language.
Mirac Goksu Ozturk   +2 more
doaj   +1 more source

Generative vs. Discriminative Recognition Models for Off-Line Arabic Handwriting

open access: yesSensors, 2018
The majority of handwritten word recognition strategies are constructed on learning-based generative frameworks from letter or word training samples.
Moftah Elzobi, Ayoub Al-Hamadi
doaj   +1 more source

Acceleration of Urdu Optical Character Recognition on Zynq UltraScale+ MPSoC Using Deep Convolutional Neural Network

open access: yesIEEE Access
Deploying deep learning–based optical character recognition (OCR) systems for low-resource, complex-script languages like Urdu remains a major challenge due to high computational costs, lack of annotated datasets, and limited hardware support for ...
Fauzia Yasir, Majida Kazmi
doaj   +1 more source

Classification of Arabic Alphabets Using a Combination of a Convolutional Neural Network and the Morphological Gradient Method

open access: yesمجلة بغداد للعلوم
The field of Optical Character Recognition (OCR) is the process of converting an image of text into a machine-readable text format. The classification of Arabic manuscripts in general is part of this field.
Mouhssine EL ATILLAH   +2 more
doaj   +1 more source

Home - About - Disclaimer - Privacy