Sentiment-based text segmentation
2nd International Conference on Systems and Computer Science, 2013
In this paper, we present a text segmentation system based on the sentiments expressed in the text. The system takes plain text as input (a product review, for instance) and uses two different resources for tagging the sentiment words: a sentiment words dictionary and SentiWordNet.
Costin-Gabriel Chiru +1 more
On automatic text segmentation
Proceedings of the 2014 ACM Symposium on Document Engineering, 2014
Automatic text segmentation, which is the task of breaking a text into topically-consistent segments, is a fundamental problem in Natural Language Processing, Document Classification and Information Retrieval. Text segmentation can significantly improve the performance of various text mining algorithms, by splitting heterogeneous documents into ...
Boris Dadachev +2 more
2013
This thesis investigates the evaluation of automatic and manual text segmentation. Text segmentation is the process of placing boundaries within text to create segments according to some task-dependent criterion. An example of text segmentation is topical segmentation, which aims to segment a text according to the subjective definition of what ...
2012
This article discusses electronic text as essentially just a sequence of characters. Text needs to be segmented at least into linguistic units such as words, punctuation, numbers, alphanumerics, etc. This process is called tokenization. The article mentions that most natural language processing techniques require text to be segmented into sentences as ...
5th International Conference on Intelligent Systems Design and Applications (ISDA'05), 2005
The paper points out the great importance of text segmentation in natural language engineering and in artificial intelligence systems. It shows that in Polish, all punctuation marks that end sentences also have other functions within sentences.
Text Segmentation for MRC Document Compression
IEEE Transactions on Image Processing, 2011
The mixed raster content (MRC) standard (ITU-T T.44) specifies a framework for document compression which can dramatically improve the compression/quality tradeoff as compared to traditional lossy image compression algorithms. The key to MRC compression is the separation of the document into foreground and background layers, represented as a binary ...
Eri Haneda, Charles A. Bouman
Boosting text segmentation via progressive classification
Knowledge and Information Systems, 2007
A novel approach for reconciling tuples stored as free text into an existing attribute schema is proposed. The basic idea is to subject the available text to progressive classification, i.e., a multi-stage classification scheme where, at each intermediate stage, a classifier is learnt that analyzes the textual fragments not reconciled at the end of the
Cesario Eugenio +4 more
Text line segmentation in Chinese handwritten text images
2011 17th Korea-Japan Joint Workshop on Frontiers of Computer Vision (FCV), 2011
In this paper, text line segmentation based on 2-D tensor voting is proposed. 2-D tensor voting is used to remove outliers and to locate the center points of connected components. The saliency values and normal-vector directions of the tensors provide the information used for segmentation. The text images are obtained from the HIT-MW dataset.
Chengdong Zhang, GueeSang Lee
Chinese Pinyin-Text Conversion on Segmented Text
2009
Most current research and applications on Pinyin to Chinese word conversion employ hidden Markov models (HMMs), which in turn use a character-based language model. The reason is that Chinese texts are written without word boundaries. However, in some tasks that involve Pinyin to Chinese conversion, such as Chinese text proofreading, the ...
Wei Liu, Louise Guthrie
2001
Editors’ introduction: Berber Sardinha’s paper deals with a problem, namely text segmentation, which connects at several points with those of the other contributors to this volume. Like Scott, Sinclair and Coulthard, Berber Sardinha is interested in understanding the computer’s understanding of text, or rather the computer’s failure to handle the ...

