Some of the following articles may not be open access.
Contextual Entropy and Text Categorization
2006 Fourth Latin American Web Congress, 2006
In this paper we describe a new approach to text categorization that focuses on the amount of information (the entropy) in the text. The entropy is computed from the empirical distribution of words in the text. We provide the system with a manually segmented collection of documents in different categories.
Moises Garcia +2 more
Performing Text Categorization on Manifold
2006 IEEE International Conference on Systems, Man and Cybernetics, 2006
Text categorization has become a key technology for organizing and processing large amounts of text information. It normally involves an extremely high-dimensional space, which causes most existing approaches to generate highly biased estimates and thereby reduces classification accuracy.
Guihua Wen, Gan Chen, Lijun Jiang
Summary evaluation and text categorization
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, 2003
In general terms, the evaluation of a summary depends on how close it is to the chief points in the source text. This raises the question of what the chief points in the source text are and how this information is itself used in identifying the source text. This is crucially important when we discuss automatic evaluation of summaries.
Khurshid Ahmad +2 more
Experiments with a hierarchical text categorizer
2004 IEEE International Conference on Fuzzy Systems (IEEE Cat. No.04CH37542), 2005
HITEC is a hierarchical text categorizer tool based on the UFEX (universal feature extractor) algorithm. This paper presents experiments on the effectiveness of HITEC on several natural languages (English, German) and with various kinds of text corpora.
Domonkos Tikk +2 more
On-line handwritten text categorization
SPIE Proceedings, 2009
As new, innovative devices that accept or produce on-line documents emerge, management facilities for these kinds of documents, such as topic spotting, are required. This means that we should be able to perform text categorization of on-line documents. The textual data available in on-line documents can be extracted through online recognition, a process ...
Peña Saldarriaga, Sebastián +2 more
Feature annotation for text categorization
Proceedings of the CUBE International Information Technology Conference, 2012
In text categorization, feature extraction is one of the major strategies for making text classifiers more efficient and accurate. Quickly selecting a suitable feature extraction strategy from the many proposed in previous studies is difficult. In this paper, we propose an efficient entity extraction approach for feature extraction
Yashodhara V. Haribhakta +2 more
Power Law for Text Categorization
2013
Text categorization (TC) is a challenging issue, and the corresponding algorithms can be used in many applications. This paper addresses the online multi-category TC problem abstracted from the applications of online binary TC and batch multi-category TC. Most applications are concerned about the space-time performance of TC algorithms.
Wuying Liu, Lin Wang, Mianzhu Yi
Text Categorization for Vietnamese Documents
2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, 2009
Many machine learning methods have been proposed for text categorization, but most research has applied them to English documents. Vietnamese is a different language with different features and it is not clear whether the standard methods will work on the categorization of Vietnamese documents.
Giang-Son Nguyen +2 more
Associative Classification in Text Categorization
2005
Text categorization has become one of the key techniques for handling and organizing text data. A text categorization model is used to classify a new article into its most relevant category. In this paper, we propose a novel associative classification algorithm, ACTC, for text categorization.
Jian Chen +3 more
UNCERTAINTY AND TERM SELECTION IN TEXT CATEGORIZATION
International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 2003
This paper discusses the notion of Uncertainty, which has a prominent place in the theory and experimental practice of modern Physics. It argues that the awareness of Uncertainty may also be of tremendous importance to the field of Information Retrieval, and in particular Text Categorization. As an application of Uncertainty in Text Categorization, a
Charles M. E. E. Peters +1 more

