Results 11 to 20 of about 1,196,582 (339)

Automated analysis of images in documents for intelligent document search [PDF]

open access: yesInternational Journal on Document Analysis and Recognition (IJDAR), 2009
Authors use images to present a wide variety of important information in documents. For example, two-dimensional (2-D) plots display important data in scientific publications. Often, end-users seek to extract this data and convert it into a machine-processible form so that the data can be analyzed automatically or compared with other existing data ...
Xiaonan Lu   +5 more
openaire   +1 more source

PHTI: Pashto Handwritten Text Imagebase for Deep Learning Applications

open access: yesIEEE Access, 2022
Document Image Analysis (DIA) is one of the research areas of Artificial Intelligence (AI) that converts document images into machine-readable codes. In DIA systems, Optical Character Recognition (OCR) plays a key role in digitizing document images.
Ibrar Hussain   +5 more
doaj   +1 more source

FADIT: Fast Document Image Thresholding

open access: yesAlgorithms, 2020
We propose a fast document image thresholding method (FADIT) and evaluations of the two classic methods for demonstrating the effectiveness of FADIT. We put forward two assumptions: (1) the probability of the occurrence of grayscale text and background ...
Yufang Min, Yaonan Zhang
doaj   +1 more source

Towards Assisting the Visually Impaired: A Review on Techniques for Decoding the Visual Data From Chart Images

open access: yesIEEE Access, 2021
The textual data of a document is supplemented by the graphical information in it. To make communication easier, they contain tables, charts and images. However, it excludes a section of our population - the visually impaired.
K. C. Shahira, A. Lijiya
doaj   +1 more source

Toward Semi-Supervised Graphical Object Detection in Document Images

open access: yesFuture Internet, 2022
The graphical page object detection classifies and localizes objects such as Tables and Figures in a document. As deep learning techniques for object detection become increasingly successful, many supervised deep neural network-based methods have been ...
Goutham Kallempudi   +5 more
doaj   +1 more source

The PAGE (Page Analysis and Ground-Truth Elements) format framework [PDF]

open access: yes, 2010
There is a plethora of established and proposed document representation formats but none that can adequately support individual stages within an entire sequence of document image analysis methods (from document image enhancement to layout analysis to OCR)
Antonacopoulos, A, Pletschacher, S
core   +2 more sources

Unsupervised Exemplar-Based Learning for Improved Document Image Classification

open access: yesIEEE Access, 2019
Many recent state-of-the-art approaches for document image classification are based on supervised feature learning that requires a large amount of labeled training data.
Sherif Abuelwafa   +2 more
doaj   +1 more source

A new Approach for Detection and ExtractionTables in Scanned Document Image using Improved Hough Transform [PDF]

open access: yesEngineering and Technology Journal, 2016
In this paper, an improvement approach of Hough transform for tables detection and extraction from scanned document images is achieved as one of the main stages in document recognition to recognize between original and faked documents.
Hasanen S. Abdullah, Ammar H. Jasim
doaj   +1 more source

Investigating Attention Mechanism for Page Object Detection in Document Images

open access: yesApplied Sciences, 2022
Page object detection in scanned document images is a complex task due to varying document layouts and diverse page objects. In the past, traditional methods such as Optical Character Recognition (OCR)-based techniques have been employed to extract ...
Shivam Naik   +5 more
doaj   +1 more source

Supervised cross-modal factor analysis for multiple modal data classification [PDF]

open access: yes, 2015
In this paper we study the problem of learning from multiple modal data for purpose of document classification. In this problem, each document is composed two different modals of data, i.e., an image and a text. Cross-modal factor analysis (CFA) has been
Bensmail, Halima   +4 more
core   +3 more sources

Home - About - Disclaimer - Privacy