Results 1 to 10 of about 229,295 (345)
Machine learning for semi-automated scoping reviews
Scoping reviews are a type of research synthesis that aim to map the literature on a particular topic or research area. Though originally intended to provide a quick overview of a field of research, scoping review teams have been overwhelmed in recent ...
Sharon Mozgai +6 more
doaj +1 more source
Unsupervised Text Topic-Related Gene Extraction for Large Unbalanced Datasets
There is a common notion that traditional unsupervised feature extraction algorithms follow the assumption that the distribution of the different clusters in a dataset is balanced.
Jing-Tao, Sun +11 more
core +1 more source
Guided Semi-Supervised Non-Negative Matrix Factorization
Classification and topic modeling are popular techniques in machine learning that extract information from large-scale datasets. By incorporating a priori information such as labels or important features, methods have been developed to perform ...
Pengyu Li +6 more
doaj +1 more source
MMT: A Multilingual and Multi-Topic Indian Social Media Dataset
Social media plays a significant role in cross-cultural communication. A vast amount of this occurs in code-mixed and multilingual form, posing a significant challenge to Natural Language Processing (NLP) tools for processing such information, like language identification, topic modeling, and named-entity recognition.
Dwip Dalal +2 more
openaire +2 more sources
Predicting protein function via multi-label supervised topic model on gene ontology
As the biological datasets accumulate rapidly, computational methods designed to automate protein function prediction are critically needed. The problem of protein function prediction can be considered as a multi-label classification problem resulting in
Lin Liu +4 more
doaj +1 more source
TLATR: Automatic Topic Labeling Using Automatic (Domain-Specific) Term Recognition
Topic modeling is a probabilistic graphical model for discovering latent topics in text corpora by using multinomial distributions of topics over words. Topic labeling is used to assign meaningful labels for the discovered topics.
Ciprian-Octavian Truica +1 more
doaj +1 more source
Normalized Datasets of Harnack’s Reconstruction of Marcion’s 'Gospel'
These two datasets are the first born-digital, normalized, peer-reviewed datasets of Harnack’s classic reconstruction of Marcion’s 'Gospel'. The first consists of human-readable postclassical Greek, the second of lemmatized and morphologically tagged ...
Mark G. Bilby
doaj +1 more source
Traditional text classification models, such as text kernels, primarily consider the syntactic aspects of text data. This paper introduces Topic-Weighted Kernels, a new text analytics framework that combines global topical themes with word-level ...
Nikhil V. Chandran +2 more
doaj +1 more source
Topic-Conversation Relevance (TCR) Dataset and Benchmarks
To be published in 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and ...
Yaran Fan +3 more
openaire +3 more sources
Concept Extraction and Clustering for Topic Digital Library Construction [PDF]
This paper is to introduce a new approach to build topic digital library using concept extraction and document clustering. Firstly, documents in a special domain are automatically produced by document classification approach.
Dan, Wu, Chengzhi, Zhang
core

