Results 21 to 30 of about 346,822 (289)

Improving Large-Scale k-Nearest Neighbor Text Categorization with Label Autoencoders

open access: yesMathematics, 2022
In this paper, we introduce a multi-label lazy learning approach to deal with automatic semantic indexing in large document collections in the presence of complex and structured label vocabularies with high inter-label correlation. The proposed method is
Francisco J. Ribadas-Pena   +2 more
doaj   +1 more source

Text Categorization with Latent Dirichlet Allocation [PDF]

open access: yesJournal of Electrical and Electronics Engineering, 2014
This paper focuses on the text categorization of Slovak text corpora using latent Dirichlet allocation. Our goal is to build text subcorpora that contain similar text documents.
ZLACKÝ Daniel   +3 more
doaj  

Improving Term Weighting Schemes for Short Text Classification in Vector Space Model

open access: yesIEEE Access, 2019
Short text is one of the predominant forms of communication with unique characteristics such as short length, high sparsity, and lack of shared context and word co-occurrence.
Surender Singh Samant   +2 more
doaj   +1 more source

PAAD: POLITICAL ARABIC ARTICLES DATASET FOR AUTOMATIC TEXT CATEGORIZATION

open access: yesIraqi Journal for Computers and Informatics, 2020
Now day’s text Classification and Sentiment analysis is considered as one of the popular Natural Language Processing (NLP) tasks. This kind of technique plays significant role in human activities and has impact on the daily behaviours.
Dhafar Hamed Abd   +2 more
doaj   +1 more source

Neural Discourse Structure for Text Categorization

open access: yes, 2017
We show that discourse structure, as defined by Rhetorical Structure Theory and provided by an existing discourse parser, benefits text categorization.
Ji, Yangfeng, Smith, Noah
core   +1 more source

Neural Text Categorizer for Exclusive Text Categorization

open access: yesJournal of Information Processing Systems, 2008
This research proposes a new neural network for text categorization which uses alternative representations of documents to numerical vectors. Since the proposed neural network is intended originally only for text categorization, it is called NTC (Neural Text Categorizer) in this research.
openaire   +2 more sources

Categorization of Unorganized Text Corpora for better Domain-Specific Language Modeling

open access: yesAdvances in Electrical and Electronic Engineering, 2013
This paper describes the process of categorization of unorganized text data gathered from the Internet to the in-domain and out-of-domain data for better domain-specific language modeling and speech recognition.
Jan Stas   +3 more
doaj   +1 more source

Non-Standard Words as Features for Text Categorization

open access: yes, 2014
This paper presents categorization of Croatian texts using Non-Standard Words (NSW) as features. Non-Standard Words are: numbers, dates, acronyms, abbreviations, currency, etc.
Beliga, Slobodan   +1 more
core   +1 more source

Text categorization by fuzzy domain adaptation [PDF]

open access: yes, 2013
Machine learning methods have attracted attention of researches in computational fields such as classification/categorization. However, these learning methods work under the assumption that the training and test data distributions are identical.
Behbood, V, Lu, J, Zhang, G
core   +1 more source

Text and Hypertext Categorization [PDF]

open access: yes, 2009
Automatic categorization of text documents has become an important area of research in the last two decades, with features that make it significantly more difficult than the traditional classification tasks studied in machine learning. A more recent development is the need to classify hypertext documents, most notably web pages.
Houda Benbrahim, Max Bramer
openaire   +1 more source

Home - About - Disclaimer - Privacy