What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels

open access: yes; Computer Vision and Pattern Recognition, 2021
Scene text recognition (STR) task has a common practice: All state-of-the-art STR models are trained on large synthetic data. In contrast to this practice, training STR models only on fewer real labels (STR with fewer labels) is important when we have to
Jeonghun Baek, Yusuke Matsui, K. Aizawa
semanticscholar   +1 more source

Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition

open access: yes; European Conference on Computer Vision, 2022
Artistic text recognition is an extremely challenging task with a wide range of applications. However, current scene text recognition methods mainly focus on irregular text while have not explored artistic text specifically.
Xudong Xie   +4 more
semanticscholar   +1 more source

A Benchmark of Parsing Vietnamese Publications

open access: yes; IEEE Access, 2022
In recent decades, digital transformation has received growing attention worldwide, that has leveraged the explosion of digitized document data. In this paper, we address the problem of parsing publications, in particular, Vietnamese publications.
Khang Nguyen   +5 more
doaj   +1 more source

Robust Sewer Defect Detection With Text Analysis Based on Deep Learning

open access: yes; IEEE Access, 2022
Sewerage systems play a vital role in building modern cities, providing appropriate ways to release liquid wastes. Due to the rapid expansion of cities, the deterioration of sewage pipes are increasing.
Chanmi Oh   +3 more
doaj   +1 more source

Towards Accurate Scene Text Recognition With Semantic Reasoning Networks

open access: yes; Computer Vision and Pattern Recognition, 2020
Scene text image contains two levels of contents: visual texture and semantic information. Although the previous scene text recognition methods have made great progress over the past few years, the research on mining semantic information to assist text ...
Deli Yu   +5 more
semanticscholar   +1 more source

Toward a Low-Resource Non-Latin-Complete Baseline: An Exploration of Khmer Optical Character Recognition

open access: yes; IEEE Access, 2023
Many existing text recognition methods rely on the structure of Latin characters and words. Such methods may not be able to deal with non-Latin scripts that have highly complex features, such as character stacking, diacritics, ligatures, non-uniform ...
Rina Buoy   +3 more
doaj   +1 more source

MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining

open access: yes; arXiv.org, 2022
Text images contain both visual and linguistic information. However, existing pre-training techniques for text recognition mainly focus on either visual representation learning or linguistic knowledge learning.
Pengyuan Lyu   +9 more
semanticscholar   +1 more source

Text Detection and Recognition for Images of Medical Laboratory Reports With a Deep Learning Approach

open access: yes; IEEE Access, 2020
The adoption of electronic health records (EHRs) is an important step in the development of modern medicine. However, complete health records are not often available during treatment because of the functional problem of the EHR system or information ...
Wenyuan Xue, Qingyong Li, Qiyuan Xue
doaj   +1 more source

MTR-SAM: Visual Multimodal Text Recognition and Sentiment Analysis in Public Opinion Analysis on the Internet

open access: yes; Applied Sciences, 2023
Existing methods for monitoring internet public opinion rely primarily on regular crawling of textual information on web pages but cannot quickly and accurately acquire and identify textual information in images and videos and discriminate sentiment. The
Xing Liu   +8 more
doaj   +1 more source

Primitive Representation Learning for Scene Text Recognition

open access: yes; Computer Vision and Pattern Recognition, 2021
Scene text recognition is a challenging task due to diverse variations of text instances in natural scene images. Conventional methods based on CNN-RNN-CTC or encoder-decoder with attention mechanism may not fully investigate stable and efficient ...
Ruijie Yan   +3 more
semanticscholar   +1 more source
