Results 51 to 60 of about 561,016 (281)
Mining academic publications to automatically identify data sources
Background Discovering suitable datasets is an important part of health research, particularly for projects working with cohort data, but with the proliferation of so many national and international initiatives, it is becoming increasingly difficult for ...
Athanasios Anastasiou, Karen Tingay
doaj +1 more source
Machine learning for semi-automated scoping reviews
Scoping reviews are a type of research synthesis that aim to map the literature on a particular topic or research area. Though originally intended to provide a quick overview of a field of research, scoping review teams have been overwhelmed in recent ...
Sharon Mozgai +6 more
doaj +1 more source
A Semantic Graph-Based Approach for Mining Common Topics From Multiple Asynchronous Text Streams [PDF]
In the age of Web 2.0, a substantial amount of unstructured content are distributed through multiple text streams in an asynchronous fashion, which makes it increasingly difficult to glean and distill useful information.
Guo Weiwei +6 more
core +1 more source
Guided Semi-Supervised Non-Negative Matrix Factorization
Classification and topic modeling are popular techniques in machine learning that extract information from large-scale datasets. By incorporating a priori information such as labels or important features, methods have been developed to perform ...
Pengyu Li +6 more
doaj +1 more source
Fake news detection: a survey of evaluation datasets
Fake news detection has gained increasing importance among the research community due to the widespread diffusion of fake news through media platforms. Many dataset have been released in the last few years, aiming to assess the performance of fake news ...
Arianna D’ulizia +3 more
semanticscholar +1 more source
Emergent Leadership Detection Across Datasets [PDF]
Automatic detection of emergent leaders in small groups from nonverbal behaviour is a growing research topic in social signal processing but existing methods were evaluated on single datasets -- an unrealistic assumption for real-world applications in ...
Bulling, Andreas, Müller, Philipp
core +3 more sources
Predicting protein function via multi-label supervised topic model on gene ontology
As the biological datasets accumulate rapidly, computational methods designed to automate protein function prediction are critically needed. The problem of protein function prediction can be considered as a multi-label classification problem resulting in
Lin Liu +4 more
doaj +1 more source
TLATR: Automatic Topic Labeling Using Automatic (Domain-Specific) Term Recognition
Topic modeling is a probabilistic graphical model for discovering latent topics in text corpora by using multinomial distributions of topics over words. Topic labeling is used to assign meaningful labels for the discovered topics.
Ciprian-Octavian Truica +1 more
doaj +1 more source
Scalable Topical Phrase Mining from Text Corpora
While most topic modeling algorithms model text corpora with unigrams, human interpretation often relies on inherent grouping of terms into phrases. As such, we consider the problem of discovering topical phrases of mixed lengths.
El-Kishky, Ahmed +4 more
core +1 more source
Using patterns position distribution for software failure detection [PDF]
Pattern-based software failure detection is an important topic of research in recent years. In this method, a set of patterns from program execution traces are extracted, and represented as features, while their occurrence frequencies are treated as the ...
Agrawal R. +20 more
core +3 more sources

