Results 31 to 40 of about 1,551,832 (303)
ClimaText: A Dataset for Climate Change Topic Detection
Accepted for the Tackling Climate Change with Machine Learning Workshop at NeurIPS ...
Varini, Francesco+3 more
openaire +3 more sources
News Topic Detection Based on Capsule Semantic Graph
Most news topic detection methods use word-based methods, which easily ignore the relationship among words and have semantic sparsity, resulting in low topic detection accuracy.
Shuang Yang, Yan Tang
doaj +1 more source
TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel Topic Clusters [PDF]
Topic taxonomies, which represent the latent topic (or category) structure of document collections, provide valuable knowledge of contents in many applications such as web search and information filtering. Recently, several unsupervised methods have been developed to automatically construct the topic taxonomy from a text corpus, but it is challenging ...
arxiv +1 more source
Mining academic publications to automatically identify data sources
Background Discovering suitable datasets is an important part of health research, particularly for projects working with cohort data, but with the proliferation of so many national and international initiatives, it is becoming increasingly difficult for ...
Athanasios Anastasiou, Karen Tingay
doaj +1 more source
Normalized Datasets of Harnack’s Reconstruction of Marcion’s 'Gospel'
These two datasets are the first born-digital, normalized, peer-reviewed datasets of Harnack’s classic reconstruction of Marcion’s 'Gospel'. The first consists of human-readable postclassical Greek, the second of lemmatized and morphologically tagged ...
Mark G. Bilby
doaj +1 more source
Scientific Dataset Discovery via Topic-level Recommendation
Data intensive research requires the support of appropriate datasets. However, it is often time-consuming to discover usable datasets matching a specific research topic. We formulate the dataset discovery problem on an attributed heterogeneous graph, which is composed of paper-paper citation, paper-dataset citation, and also paper content.
Altaf, Basmah+2 more
openaire +2 more sources
Exploiting Long-Term Dependency for Topic Sentiment Analysis
Most existing unsupervised approaches to detect topic sentiment in social texts consider only the text sequences in corpus and put aside social dynamics, as leads to algorithm’s disability to discover true sentiment of social users. To address the
Faliang Huang+3 more
doaj +1 more source
A latent topic‐aware network for dense video captioning
Multiple events in a long untrimmed video possess the characteristics of similarity and continuity. These characteristics can be considered as a kind of topic semantic information, which probably behaves as same sports, similar scenes, same objects etc ...
Tao Xu+3 more
doaj +1 more source
Guided Semi-Supervised Non-Negative Matrix Factorization
Classification and topic modeling are popular techniques in machine learning that extract information from large-scale datasets. By incorporating a priori information such as labels or important features, methods have been developed to perform ...
Pengyu Li+6 more
doaj +1 more source
Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations [PDF]
Topic models have been the prominent tools for automatic topic discovery from text corpora. Despite their effectiveness, topic models suffer from several limitations including the inability of modeling word ordering information in documents, the difficulty of incorporating external linguistic knowledge, and the lack of both accurate and efficient ...
arxiv +1 more source