Results 11 to 20 of about 54,658 (262)
A Superior Arabic Text Categorization Deep Model (SATCDM)
Categorizing Arabic text documents is considered an important research topic in the field of Natural Language Processing (NLP) and Machine Learning (ML).
M. Alhawarat, Ahmad O. Aseeri
doaj +1 more source
Noisy text categorization [PDF]
This work presents categorization experiments performed over noisy texts. By noisy, we mean any text obtained through an extraction process (affected by errors) from media other than digital texts (e.g., transcriptions of speech recordings extracted with a recognition system).
openaire +3 more sources
Sparse representations for text categorization [PDF]
Sparse representations (SRs) are often used to characterize a test signal using few support training examples, and allow the number of supports to be adapted to the specific signal being categorized. Given the good performance of SRs compared to other classifiers for both image classification and phonetic classification, in this paper, we extended the ...
Tara N. Sainath +5 more
openaire +1 more source
Categorization and Conceptualization of Space in Descriptive Text
The relevance of the article is due to the importance of studying spatial semantics in the new scientific paradigm. The possibility of studying genre varieties of description (description-landscape, description-interior, description-portrait, description
Y. N. Varfolomeeva
doaj +1 more source
An efficient approach for textual data classification using deep learning
Text categorization is an effective activity that can be accomplished using a variety of classification algorithms. In machine learning, the classifier is built by learning the features of categories from a set of preset training data.
Abdullah Alqahtani +6 more
doaj +1 more source
Cross-Lingual Text Categorization [PDF]
This article deals with the problem of Cross-Lingual Text Categorization (CLTC), which arises when documents in different languages must be classified according to the same classification tree. We describe practical and cost-effective solutions for automatic Cross-Lingual Text Categorization, both in case a sufficient number of training examples is ...
Bel Rafecas, Núria +2 more
openaire +2 more sources
Parallel noise eliminate: A parallel noise elimination algorithm for massive text categorization
Noise data in text are one of the main factors affecting the quality of text categorization. A parallel noise data elimination algorithm based on principal component analysis method and term frequency-inverse document frequency method for the noise data ...
Xiaojuan Hu +3 more
doaj +1 more source
Improving Large-Scale k-Nearest Neighbor Text Categorization with Label Autoencoders
In this paper, we introduce a multi-label lazy learning approach to deal with automatic semantic indexing in large document collections in the presence of complex and structured label vocabularies with high inter-label correlation. The proposed method is
Francisco J. Ribadas-Pena +2 more
doaj +1 more source
Text Categorization with Latent Dirichlet Allocation [PDF]
This paper focuses on the text categorization of Slovak text corpora using latent Dirichlet allocation. Our goal is to build text subcorpora that contain similar text documents.
ZLACKÝ Daniel +3 more
doaj
Keyword extraction for text categorization [PDF]
Text categorization (TC) is one of the main applications of machine learning. Many methods have been proposed, such as Rocchio method, Naive bayes based method, and SVM based text classification method. These methods learn labeled text documents and then construct a classifier. A new coming text document's category can be predicted.
Jiyuan An, Yi-Ping Phoebe Chen
openaire +1 more source

