Results 11 to 20 of about 59,770 (264)

Topic Detection Based on Sentence Embeddings and Agglomerative Clustering with Markov Moment

open access: yesFuture Internet, 2020
The paper is dedicated to solving the problem of optimal text classification in the area of automated detection of typology of texts. In conventional approaches to topicality-based text classification (including topic modeling), the number of clusters is
Svetlana S. Bodrunova   +4 more
doaj   +1 more source

Using IR Techniques to Improve Automated Text Classification [PDF]

open access: yes, 2004
This paper performs a study on the pre-processing phase of the automated text classification problem. We use the linear Support Vector Machine paradigm applied to datasets written in the English and the European Portuguese languages – the Reuters and the Portuguese Attorney General’s Office datasets, respectively.
Gonçalves, Teresa, Quaresma, Paulo
openaire   +2 more sources

Theme of Khmer document classification in the field of agriculture based on the use of Naïve Bayes method with keywords [PDF]

open access: yesE3S Web of Conferences
By the empower of the technology and the Internet, there are huge amount of electronic text documents becoming available from day to day. Seeking information in the field of agriculture numerous collection is required well organized documentations that ...
Sovila Srun   +5 more
doaj   +1 more source

Annif Analyzer Shootout: Comparing text lemmatization methods for automated subject indexing

open access: yesCode4Lib Journal, 2022
Automated text classification is an important function for many AI systems relevant to libraries, including automated subject indexing and classification.
Osma Suominen, Ilkka Koskenniemi
doaj  

Comparing automated text classification methods

open access: yesInternational Journal of Research in Marketing, 2019
Abstract Online social media drive the growth of unstructured text data. Many marketing applications require structuring this data at scales non-accessible to human coding, e.g., to detect communication shifts in sentiment or other researcher-defined content categories.
Hartmann, Jochen   +3 more
openaire   +1 more source

Sketching a “low-cost” text-classification technique for text topics in English [PDF]

open access: yesIbérica, 2014
The aim of this paper is to sketch a potential methodology for automatic text classification which allows text topic discrimination as a prior step to new case assignment to previously established text topics.
Pascual Cantos Gómez
doaj   +2 more sources

TEXT CLASSIFICATION BASED ON FUZZY RADIAL BASIS FUNCTION

open access: yesIraqi Journal for Computers and Informatics, 2019
Automated classification of text into predefined categories has always been considered as a vital method in the natural language processing field. In this paper new methods based on Radial Basis Function (RBF) and Fuzzy Radial Basis Function (FRBF) are ...
Zuhair Ali
doaj   +1 more source

Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and Health

open access: yesFrontiers in Digital Health, 2021
Linking clinical narratives to standardized vocabularies and coding systems is a key component of unlocking the information in medical text for analysis.
Denis Newman-Griffis   +2 more
doaj   +1 more source

AutoWS: Automated Weak Supervision Framework for Text Classification

open access: yes, 2023
Creating large, good quality labeled data has become one of the major bottlenecks for developing machine learning applications. Multiple techniques have been developed to either decrease the dependence of labeled data (zero/few-shot learning, weak supervision) or to improve the efficiency of labeling process (active learning).
Bohra, Abhinav   +2 more
openaire   +2 more sources

Bag of Words and Embedding Text Representation Methods for Medical Article Classification

open access: yesInternational Journal of Applied Mathematics and Computer Science, 2023
Text classification has become a standard component of automated systematic literature review (SLR) solutions, where articles are classified as relevant or irrelevant to a particular literature study topic.
Cichosz Paweł
doaj   +1 more source

Home - About - Disclaimer - Privacy