Results 41 to 50 of about 2,167,785 (318)
Multiscale fully convolutional network‐based approach for multilingual character segmentation
Character segmentation is a challenging task for optical character recognition systems. Traditional methods usually utilize rule‐based algorithms but most of them are not applicable in modern intelligent recognition applications that require high ...
Chao Yu, Jin Liu, Yunhui Li
doaj +1 more source
Optical character recognition without segmentation [PDF]
A segmentation-free approach for off-line optical character recognition is presented. The proposed method performs the recognition by extracting the characters from the whole word, avoiding the segmentation process. A control point set which includes position and attribute vectors is selected for the features.
Mehmet Ali Özdil +2 more
openaire +1 more source
Neural Word Segmentation with Rich Pretraining
Neural word segmentation research has benefited from large-scale raw texts by leveraging them for pretraining character and word embeddings. On the other hand, statistical segmentation research has exploited richer sources of external information, such ...
Dong, Fei, Yang, Jie, Zhang, Yue
core +1 more source
Segmentation of Devanagari Handwritten Characters
The world is fast moving towards digitalization. In the age of super-fast computational capabilities, everything has to be made digitalized so as to make the computer understand and thereby process the given information. Optical character recognition is a method by which the computer is made to learn, understand and interpret the languages used and ...
Neha Sahu, Ankita Srivastav
openaire +1 more source
Radical-Enhanced Chinese Character Embedding
We present a method to leverage radical for learning Chinese character embedding. Radical is a semantic and phonetic component of Chinese character. It plays an important role as characters with the same radical usually have similar semantic meaning and ...
Ji, Zhenzhou +5 more
core +1 more source
Segmentation and Recognition for Historical Tibetan Document Images
As a shining pearl in traditional Tibetan culture, historical Tibetan documents have received extensive attention from historians, linguists and Buddhist scholars.
Longlong Ma +5 more
doaj +1 more source
Domain generation algorithms (DGAs) play an important role in network attacks and can be mainly divided into two types: dictionary-based and character-based.
Shaojie Chen +3 more
doaj +1 more source
ANN-based Innovative Segmentation Method for Handwritten text in Assamese [PDF]
Artificial Neural Network (ANN) s has widely been used for recognition of optically scanned character, which partially emulates human thinking in the domain of the Artificial Intelligence.
Bhattacharyya, Kaustubh +1 more
core +1 more source
Dual Long Short-Term Memory Networks for Sub-Character Representation Learning
Characters have commonly been regarded as the minimal processing unit in Natural Language Processing (NLP). But many non-latin languages have hieroglyphic writing systems, involving a big alphabet with thousands or millions of characters.
Feng, Yi +6 more
core +1 more source
Reading Scene Text in Deep Convolutional Sequences [PDF]
We develop a Deep-Text Recurrent Network (DTRN) that regards scene text reading as a sequence labelling problem. We leverage recent advances of deep convolutional neural networks to generate an ordered high-level sequence from a whole word image ...
He, Pan +4 more
core +1 more source

