Results 31 to 40 of about 1,551,832 (303)

ClimaText: A Dataset for Climate Change Topic Detection

open access: yes, 2020
Accepted for the Tackling Climate Change with Machine Learning Workshop at NeurIPS ...
Varini, Francesco   +3 more
openaire   +3 more sources

News Topic Detection Based on Capsule Semantic Graph

open access: yesBig Data Mining and Analytics, 2022
Most news topic detection methods use word-based methods, which easily ignore the relationship among words and have semantic sparsity, resulting in low topic detection accuracy.
Shuang Yang, Yan Tang
doaj   +1 more source

TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel Topic Clusters [PDF]

open access: yes, 2022
Topic taxonomies, which represent the latent topic (or category) structure of document collections, provide valuable knowledge of contents in many applications such as web search and information filtering. Recently, several unsupervised methods have been developed to automatically construct the topic taxonomy from a text corpus, but it is challenging ...
arxiv   +1 more source

Mining academic publications to automatically identify data sources

open access: yesInternational Journal of Population Data Science, 2018
Background Discovering suitable datasets is an important part of health research, particularly for projects working with cohort data, but with the proliferation of so many national and international initiatives, it is becoming increasingly difficult for ...
Athanasios Anastasiou, Karen Tingay
doaj   +1 more source

Normalized Datasets of Harnack’s Reconstruction of Marcion’s 'Gospel'

open access: yesJournal of Open Humanities Data, 2021
These two datasets are the first born-digital, normalized, peer-reviewed datasets of Harnack’s classic reconstruction of Marcion’s 'Gospel'. The first consists of human-readable postclassical Greek, the second of lemmatized and morphologically tagged ...
Mark G. Bilby
doaj   +1 more source

Scientific Dataset Discovery via Topic-level Recommendation

open access: yes, 2021
Data intensive research requires the support of appropriate datasets. However, it is often time-consuming to discover usable datasets matching a specific research topic. We formulate the dataset discovery problem on an attributed heterogeneous graph, which is composed of paper-paper citation, paper-dataset citation, and also paper content.
Altaf, Basmah   +2 more
openaire   +2 more sources

Exploiting Long-Term Dependency for Topic Sentiment Analysis

open access: yesIEEE Access, 2020
Most existing unsupervised approaches to detect topic sentiment in social texts consider only the text sequences in corpus and put aside social dynamics, as leads to algorithm’s disability to discover true sentiment of social users. To address the
Faliang Huang   +3 more
doaj   +1 more source

A latent topic‐aware network for dense video captioning

open access: yesIET Computer Vision, 2023
Multiple events in a long untrimmed video possess the characteristics of similarity and continuity. These characteristics can be considered as a kind of topic semantic information, which probably behaves as same sports, similar scenes, same objects etc ...
Tao Xu   +3 more
doaj   +1 more source

Guided Semi-Supervised Non-Negative Matrix Factorization

open access: yesAlgorithms, 2022
Classification and topic modeling are popular techniques in machine learning that extract information from large-scale datasets. By incorporating a priori information such as labels or important features, methods have been developed to perform ...
Pengyu Li   +6 more
doaj   +1 more source

Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations [PDF]

open access: yes, 2022
Topic models have been the prominent tools for automatic topic discovery from text corpora. Despite their effectiveness, topic models suffer from several limitations including the inability of modeling word ordering information in documents, the difficulty of incorporating external linguistic knowledge, and the lack of both accurate and efficient ...
arxiv   +1 more source

Home - About - Disclaimer - Privacy