Results 221 to 230 of about 54,658 (262)
Some of the next articles are maybe not open access.

Contextual Entropy and Text Categorization

2006 Fourth Latin American Web Congress, 2006
In this paper we describe a new approach to text categorization, our focus is in the amount of information (the entropy) in the text. The entropy is computed with the empirical distribution of words in the text. We provide the system with a manually segmented collection of documents in different categories.
Moises Garcia   +2 more
openaire   +1 more source

Performing Text Categorization on Manifold

2006 IEEE International Conference on Systems, Man and Cybernetics, 2006
Text categorization has become the key technology in organizing and processing the large amount of text information. It normally involves an extremely high dimensional space, which makes most existing approaches generate highly biased estimates so as to reduce the classification accuracy.
Guihua Wen, Gan Chen, Lijun Jiang
openaire   +1 more source

Summary evaluation and text categorization

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, 2003
In general terms the evaluation of a summary depends on how close it is to the chief points in the source text. This begets the question as to what are the chief points in the source text and how is this information used in itself in identifying the source text. This is crucially important when we discuss automatic evaluation of summaries.
Khurshid Ahmad 0001   +2 more
openaire   +1 more source

Experiments with a hierarchical text categorizer

2004 IEEE International Conference on Fuzzy Systems (IEEE Cat. No.04CH37542), 2005
HITEC is a hierarchical text categorizer tool that is based on UFEX (universal feature extractor) algorithm. This paper presents experiments on the effectiveness of HITEC on several natural languages (English, German) and with various kinds of text corpora.
Domonkos Tikk   +2 more
openaire   +1 more source

On-line handwritten text categorization

SPIE Proceedings, 2009
As new innovative devices, accepting or producing on-line documents, emerge, managing facilities for these kinds of documents such as topic spotting are required. This means that we should be able to perform text categorization of on-line documents. The textual data available in on-line documents can be extracted through online recognition, a process ...
Peña Saldarriaga, Sebastián   +2 more
openaire   +1 more source

Feature annotation for text categorization

Proceedings of the CUBE International Information Technology Conference, 2012
In text categorization, feature extraction is one of the major strategies that aim at making text classifiers more efficient and accurate. Selecting quickly a suitable strategy for feature extraction out of many strategies proposed by previous studies is difficult. In this paper, we propose an efficient entity extraction approach for feature extraction
Yashodhara V. Haribhakta   +2 more
openaire   +1 more source

Power Law for Text Categorization

2013
Text categorization (TC) is a challenging issue, and the corresponding algorithms can be used in many applications. This paper addresses the online multi-category TC problem abstracted from the applications of online binary TC and batch multi-category TC. Most applications are concerned about the space-time performance of TC algorithms.
Wuying Liu, Lin Wang 0020, Mianzhu Yi
openaire   +1 more source

Text Categorization for Vietnamese Documents

2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, 2009
Many machine learning methods have been proposed for text categorization, but most research has applied them to English documents. Vietnamese is a different language with different features and it is not clear whether the standard methods will work on the categorization of Vietnamese documents.
Giang-Son Nguyen   +2 more
openaire   +1 more source

Associative Classification in Text Categorization

2005
Text categorization has become one of the key techniques for handling and organizing text data. This model is used to classify new article to its most relevant category. In this paper, we propose a novel associative classification algorithm ACTC for text categorization.
Jian Chen 0011   +3 more
openaire   +1 more source

UNCERTAINTY AND TERM SELECTION IN TEXT CATEGORIZATION

International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 2003
This paper discusses the notion of Uncertainty, which has a prominent place in the theory and experimental practice of modern Physics. It argues that the awareness of Uncertainty may also be of tremendous importance to the field of Information Retrieval, and in particular Text Categorization. As an application of Uncertainty in Text Categorization, a
Charles M. E. E. Peters   +1 more
openaire   +2 more sources

Home - About - Disclaimer - Privacy