Results 231 to 240 of about 61,609 (261)
Some of the next articles are maybe not open access.
2010 IEEE International Conference on Data Mining, 2010
In this paper we propose a framework of topic modeling ensembles, a novel solution to combine the models learned by topic modeling over each partition of the whole corpus. It has the potentials for applications such as distributed topic modeling for large corpora, and incremental topic modeling for rapidly growing corpora.
Zhiyong Shen +3 more
openaire +1 more source
In this paper we propose a framework of topic modeling ensembles, a novel solution to combine the models learned by topic modeling over each partition of the whole corpus. It has the potentials for applications such as distributed topic modeling for large corpora, and incremental topic modeling for rapidly growing corpora.
Zhiyong Shen +3 more
openaire +1 more source
Topic-to-Topic Modeling for COVID-19 Mortality
2021 IEEE 9th International Conference on Healthcare Informatics (ICHI), 2021We examine a cohort of 4307 COVID-19 case fatalities from a de-identified national registry in the U.S. using Latent Dirichlet Allocation and group each patient by topic based on their pre-existing conditions in the years prior to infection and again during the last three weeks of life.
Jeffrey Humpherys +6 more
openaire +1 more source
Foundations and Trends® in Information Retrieval, 2017
How can a single person understand what’s going on in a collection of millions of documents? This is an increasingly widespread problem: sifting through an organization’s e-mails, understanding a decade worth of newspapers, or characterizing a scientific field’s research. This monograph explores the ways that humans and computers make sense of document
Jordan L. Boyd-Graber +2 more
openaire +1 more source
How can a single person understand what’s going on in a collection of millions of documents? This is an increasingly widespread problem: sifting through an organization’s e-mails, understanding a decade worth of newspapers, or characterizing a scientific field’s research. This monograph explores the ways that humans and computers make sense of document
Jordan L. Boyd-Graber +2 more
openaire +1 more source
Communications of the ACM, 2011
Surveying a suite of algorithms that offer a solution to managing large document archives.
openaire +1 more source
Surveying a suite of algorithms that offer a solution to managing large document archives.
openaire +1 more source
Latent topic model for image annotation by modeling topic correlation
2013 IEEE International Conference on Multimedia and Expo (ICME), 2013For the task of image annotation, traditional probabilistic topic models based on Latent Dirichlet Allocation (LDA) [1], assume that an image is a mixture of latent topics. An inevitable limitation of LDA is the inability to model topic correlation since topic proportions of an image are generated independently. Motivated by Correlated Topic Model (CTM)
Xing Xu 0001 +2 more
openaire +1 more source
Group topic model: organizing topics into groups
Information Retrieval Journal, 2014Latent Dirichlet allocation defines hidden topics to capture latent semantics in text documents. However, it assumes that all the documents are represented by the same topics, resulting in the "forced topic" problem. To solve this problem, we developed a group latent Dirichlet allocation (GLDA).
Ximing Li 0002 +4 more
openaire +1 more source
Discovering Coherent Topics with Entity Topic Models
2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI), 2016Probabilistic topic models are powerful techniques which are widely used for discovering topics or semantic content from a large collection of documents. However, because topic models are entirely unsupervised, they may lead to topics that are not understandable in applications.
Mehdi Allahyari, Krys J. Kochut
openaire +1 more source
2013
Multiplex document networks have multiple types of links such as citation and coauthor links between scientific papers. Inferring thematic topics from multiplex document networks requires quantifying and balancing the influence from different types of links. It is therefore a problem of considerable interest and represents significant challenges.
Juan Yang +2 more
openaire +1 more source
Multiplex document networks have multiple types of links such as citation and coauthor links between scientific papers. Inferring thematic topics from multiplex document networks requires quantifying and balancing the influence from different types of links. It is therefore a problem of considerable interest and represents significant challenges.
Juan Yang +2 more
openaire +1 more source
Applying a Burst Model to Detect Bursty Topics in a Topic Model
2012This paper focuses on two types of modeling of information flow in news stream, namely, burst analysis and topic modeling. First, when one wants to detect a kind of topics that are paid much more attention than usual, it is usually necessary for him/her to carefully watch every article in news stream at every moment.
Yusuke Takahashi +6 more
openaire +1 more source
Increasing Topic Coherence by Aggregating Topic Models
2016In this paper, we introduce a novel method for aggregating multiple topic models to produce an aggregate model that contains topics with greater coherence than individual models. When generating a topic model a number of parameters must be specified. Depending on the parameters chosen the resulting topics can be very general or very specific.
Stuart J. Blair +2 more
openaire +1 more source

