Features for printed document image analysis

Object recognition supported by user interaction for service robots, 2003
This paper presents features for separating text from non-text areas in printed document images. First, it introduces entropic discrimination, i.e., a simple separation using only one feature. It then briefly reviews the texture and geometric discriminant parameters proposed in previous research (2001, 2002).
Jean Duong, Hubert Emptoz, Myriam Côté

Image based typographic analysis of documents

Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93), 2002
An approach to image based typographic analysis of documents is provided. The problem requires a spatial understanding of the document layout as well as knowledge of the proper syntax. The system performs a page synthesis from the stream of formatting commands defined in a DVI file.
David S. Doermann, Richard Furuta

Handwritten document image segmentation and analysis

Pattern Recognition Letters, 1993
The paper proposes a method for automatically segmenting binary images of handwritten documents. The Hough transform (HT) is applied to the source image, and the slope of the handwritten rows is then estimated by analyzing the parameter plane. The inverse HT is used for ‘cutting’ the image and forming ‘strips’, containing the respective handwritten rows ...
Vladimir A. Shapiro   +2 more
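
The slope-estimation step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `estimate_row_slope`, the angle range, and the sum-of-squares peak score are all assumptions made for illustration, and the inverse-HT strip cutting is not covered. For each candidate angle, every ink pixel votes for the offset of the line through it at that angle (one column of the Hough parameter plane); the angle whose votes are most sharply concentrated is taken as the row slope.

```python
import numpy as np

def estimate_row_slope(binary, angles_deg=None):
    """Estimate the dominant row angle (degrees) of a binary image.

    Hypothetical helper for illustration. Angles are measured in image
    coordinates (y grows downward), so a positive angle means the rows
    descend to the right.
    """
    if angles_deg is None:
        # Assumed search range: -15 to +15 degrees in 0.5 degree steps.
        angles_deg = np.arange(-15.0, 15.5, 0.5)
    ys, xs = np.nonzero(binary)
    if ys.size == 0:
        return 0.0  # no ink pixels, nothing to estimate
    best_angle, best_score = 0.0, -1.0
    for a in angles_deg:
        t = np.deg2rad(a)
        # Offset of the line at angle `a` passing through each ink pixel.
        rho = ys * np.cos(t) - xs * np.sin(t)
        bins = np.round(rho).astype(int)
        # Vote histogram over offsets: one column of the parameter plane.
        counts = np.bincount(bins - bins.min())
        # Sharply peaked columns (pixels aligned with the rows) score high.
        score = float(np.sum(counts.astype(np.float64) ** 2))
        if score > best_score:
            best_angle, best_score = float(a), score
    return best_angle

# Usage: three synthetic "rows" drawn at a 5 degree slope.
img = np.zeros((200, 400), dtype=np.uint8)
cols = np.arange(400)
for offset in (40, 80, 120):
    rows = np.round(cols * np.tan(np.deg2rad(5.0)) + offset).astype(int)
    img[rows, cols] = 1
angle = estimate_row_slope(img)
```

Scoring by the sum of squared votes is one common way to measure how peaked a column of the accumulator is; the abstract does not say how the parameter plane is analyzed, so other criteria could equally apply.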

XML Data Representation in Document Image Analysis

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), 2007
This paper presents the XML-based formats ALTO, TEI, and METS used for Digital Libraries and their relevance to data representation in a Document Image Analysis and Recognition (DIAR) process. In the first part we briefly present these formats with a focus on their adequacy for structural representation and modeling of DIAR data.
Abdel Belaid   +2 more

Treatment of Diagrams in Document Image Analysis

2000
Document image analysis is the study of converting documents from paper form to an electronic form that captures the information content of the document. Necessary processing includes recognition of document layout (to determine reading order, and to distinguish text from diagrams), recognition of text (called Optical Character Recognition, OCR), and ...
Dorothea Blostein   +2 more

Watershed Based Document Image Analysis

2010
Document image analysis is used to segment and classify regions of a document image into categories such as text, graphic and background. In this paper we first review existing document image analysis approaches and discuss their limits. Then we adapt the well-known watershed segmentation in order to obtain a very fast and efficient classification ...
Pasha Shadkami, Nicolas Bonnier

Document Image Analysis

2018
This chapter provides an overview of document image analysis (DIA). It aims to present fundamental issues and techniques related to DIA, such as textual processing and graphics processing. The chapter focuses on how research scientists, academicians and industrialists understand the term DIA, and how they have approached it over the past several years.

Document image analysis for digital libraries

Proceedings of the 2006 international workshop on Research issues in digital libraries, 2006
Digital Libraries have many forms -- institutional libraries for information dissemination, document repositories for record-keeping, and personal digital libraries for organizing personal thoughts, knowledge, and course of action. Digital image content (scanned or otherwise) is a substantial component of all of these libraries.

Document images analysis solutions for digital libraries

First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings., 2004
Today the development of digital libraries is reaching technological limits due to the difficulty of automatically processing a growing mass of digitized images of documents from different origins. The main problem is the high cost of the digitization and retro-conversion processes which include image capture and indexation, metadata extraction, image ...
Frank Le Bourgeois   +4 more

Layout Analysis of Document Images

LatinX in AI at Computer Vision and Pattern Recognition Conference 2023, 2023
To extract information of interest from document images, their content must be recognized. To that end, layout analysis is an important step. Layout analysis is concerned with finding page components such as text blocks, tables, formulas, and diagrams, and with determining their logical role.