Results 11 to 20 of about 346,822 (289)
A Superior Arabic Text Categorization Deep Model (SATCDM)
Categorizing Arabic text documents is considered an important research topic in the field of Natural Language Processing (NLP) and Machine Learning (ML).
M. Alhawarat, Ahmad O. Aseeri
doaj +1 more source
Noisy text categorization [PDF]
This work presents categorization experiments performed over noisy texts. By noisy, we mean any text obtained through an extraction process (affected by errors) from media other than digital texts (e.g., transcriptions of speech recordings extracted with a recognition system).
openaire +2 more sources
Using bag-of-concepts to improve the performance of support vector machines in text categorization [PDF]
This paper investigates the use of concept-based representations for text categorization. We introduce a new approach to create concept-based text representations, and apply it to a standard text categorization collection. The representations are used as
Cöster, Rickard, Sahlgren, Magnus
core +3 more sources
Categorization and Conceptualization of Space in Descriptive Text
The relevance of the article is due to the importance of studying spatial semantics in the new scientific paradigm. The possibility of studying genre varieties of description (description-landscape, description-interior, description-portrait, description
Y. N. Varfolomeeva
doaj +1 more source
An efficient approach for textual data classification using deep learning
Text categorization is an effective activity that can be accomplished using a variety of classification algorithms. In machine learning, the classifier is built by learning the features of categories from a set of preset training data.
Abdullah Alqahtani +6 more
doaj +1 more source
Machine Learning in Automated Text Categorization [PDF]
The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last ten years, due to the increased availability of documents in digital form and the ensuing need to organize them.
ANDROUTSOPOULOS I. +106 more
core +4 more sources
Cross-Lingual Text Categorization [PDF]
This article deals with the problem of Cross-Lingual Text Categorization (CLTC), which arises when documents in different languages must be classified according to the same classification tree. We describe practical and cost-effective solutions for automatic Cross-Lingual Text Categorization, both in case a sufficient number of training examples is ...
Bel Rafecas, Núria +2 more
openaire +2 more sources
Parallel noise eliminate: A parallel noise elimination algorithm for massive text categorization
Noise data in text are one of the main factors affecting the quality of text categorization. A parallel noise data elimination algorithm based on principal component analysis method and term frequency-inverse document frequency method for the noise data ...
Xiaojuan Hu +3 more
doaj +1 more source
Fragments and text categorization [PDF]
We introduce two novel methods of text categorization in which documents are split into fragments. We conducted experiments on English, French and Czech. In all cases, the problems referred to a binary document classification. We find that both methods increase the accuracy of text categorization.
Jan Blaták +2 more
openaire +1 more source
Adaptive text mining: Inferring structure from sequences [PDF]
Text mining is about inferring structure from sequences representing natural language text, and may be defined as the process of analyzing text to extract information that is useful for particular purposes.
Witten, Ian H.
core +2 more sources

