Results 291 to 300 of about 287,772 (335)
Some of the next articles are maybe not open access.

Sentiment-based text segmentation

2nd International Conference on Systems and Computer Science, 2013
In this paper, we present a text segmentation system based on the sentiments expressed in the text. The system takes as input plain text (product review for instance) and uses two different resources for tagging the sentiment words: a sentiment words dictionary and SentiWordNet.
Costin-Gabriel Chiru   +1 more
openaire   +1 more source

On automatic text segmentation

Proceedings of the 2014 ACM symposium on Document engineering, 2014
Automatic text segmentation, which is the task of breaking a text into topically-consistent segments, is a fundamental problem in Natural Language Processing, Document Classification and Information Retrieval. Text segmentation can significantly improve the performance of various text mining algorithms, by splitting heterogeneous documents into ...
Boris Dadachev   +2 more
openaire   +1 more source

Evaluating Text Segmentation

2013
This thesis investigates the evaluation of automatic and manual text segmentation. Text segmentation is the process of placing boundaries within text to create segments according to some task-dependent criterion. An example of text segmentation is topical segmentation, which aims to segment a text according to the subjective definition of what ...
openaire   +2 more sources

Text Segmentation

2012
This article discusses electronic text as essentially just a sequence of characters. Text needs to be segmented at least into linguistic units such as words, punctuation, numbers, alphanumerics, etc. This process is called tokenization. The article mentions that most natural language processing techniques require text to be segmented into sentences as ...
openaire   +1 more source

Text segmentation in Polish

5th International Conference on Intelligent Systems Design and Applications (ISDA'05), 2005
In the paper a great importance of text segmentation in natural language engineering and in artificial intelligence systems has been pointed out. It has been shown that in Polish all punctuation marks that end sentences have also other functions in sentences.
openaire   +1 more source

Text Segmentation for MRC Document Compression

IEEE Transactions on Image Processing, 2011
The mixed raster content (MRC) standard (ITU-T T.44) specifies a framework for document compression which can dramatically improve the compression/quality tradeoff as compared to traditional lossy image compression algorithms. The key to MRC compression is the separation of the document into foreground and background layers, represented as a binary ...
Eri, Haneda, Charles A, Bouman
openaire   +2 more sources

Boosting text segmentation via progressive classification

Knowledge and Information Systems, 2007
A novel approach for reconciling tuples stored as free text into an existing attribute schema is proposed. The basic idea is to subject the available text to progressive classification, i.e., a multi-stage classification scheme where, at each intermediate stage, a classifier is learnt that analyzes the textual fragments not reconciled at the end of the
Cesario Eugenio   +4 more
openaire   +2 more sources

Text line segmentation in Chinese handwritten text images

2011 17th Korea-Japan Joint Workshop on Frontiers of Computer Vision (FCV), 2011
In this paper, text line segmentation based on 2-D Tensor Voting is proposed, 2-D tensor voting is used to remove outliers and located center points of connected components. The saliency values and direction of normal vectors are represented information of tensors for segmentation. The text images obtained from HIT-MW dataset.
null Chengdong Zhang, null GueeSang Lee
openaire   +1 more source

Chinese Pinyin-Text Conversion on Segmented Text

2009
Most current research and applications on Pinyin to Chinese word conversion employs a hidden Markov model (HMMs) which in turn uses a character-based language model. The reason is because Chinese texts are written without word boundaries. However in some tasks that involve the Pinyin to Chinese conversion, such as Chinese text proofreading, the ...
Wei Liu, Louise Guthrie
openaire   +1 more source

Lexical segments in text

2001
Editors’ introduction Berber Sardinha’s paper deals with a problem, namely text segmentation, which connects at several points with those of the other contributors to this volume. Like Scott, Sinclair and Coulthard, Berber Sardinha is interested in understanding the computer’s understanding of text, or rather the computer’s failure to handle the ...
openaire   +1 more source

Home - About - Disclaimer - Privacy