Selection of the Optimal Number of Topics for LDA Topic Model—Taking Patent Policy Analysis as an Example

open access: yes; Entropy, 2021
This study constructs a comprehensive index to effectively judge the optimal number of topics in the LDA topic model. Based on the requirements for selecting the number of topics, a comprehensive judgment index of perplexity, isolation, stability, and ...
Jingxian Gan, Yong Qi
doaj   +6 more sources

An adaptive method for determining the optimal number of topics in topic modeling

open access: yes; PeerJ Computer Science
Topic models have been successfully applied to information classification and retrieval. The difficulty in applying these technologies successfully lies in selecting the appropriate number of topics for a given corpus.
Yang Xu   +3 more
doaj   +5 more sources

Topic models with elements of neural networks: investigation of stability, coherence, and determining the optimal number of topics

open access: yes; PeerJ Computer Science
Topic modeling is a widely used instrument for the analysis of large text collections. In the last few years, neural topic models and models with word embeddings have been proposed to increase the quality of topic solutions.
Sergei Koltcov   +3 more
doaj   +5 more sources

Identifying the optimal number of topics in text mining: a case study on reindeer pastoralism literature

open access: yes; Italian Journal of Animal Science
Text mining and topic analysis algorithms, which group textual contents in the most efficient way, are becoming increasingly useful for summarising the main information contained in large data corpora of complex scientific fields.
Barbara Contiero   +2 more
doaj   +3 more sources

Analysis and tuning of hierarchical topic models based on Renyi entropy approach

open access: yes; PeerJ Computer Science, 2021
Hierarchical topic modeling is a potentially powerful instrument for determining topical structures of text collections that additionally allows constructing a hierarchy representing the levels of topic abstractness.
Sergei Koltcov   +3 more
doaj   +2 more sources

Estimation of Topic Similarity and Its Application to Measuring Stability of Topic Modeling

open access: yes; Jiàoyù zīliào yǔ túshūguǎn xué, 2022
Topic modeling stability is a measurement of the extent to which models produced by the same modeling approach for the same corpus and with the same initial conditions have similar topics.
Sung-Chien Lin
doaj   +1 more source

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

open access: yes; Bernoulli, 2020
We propose a new method of estimation in topic models that is not a variation on the existing simplex-finding algorithms and that estimates the number of topics K from the observed data. We derive new finite-sample minimax lower bounds for the estimation of A, as well as new upper bounds for our proposed estimator. We describe the scenarios where our
Bing, Xin   +2 more
openaire   +4 more sources

An Adaptive LDA Optimal Topic Number Selection Method in News Topic Identification

open access: yes; IEEE Access, 2023
Nowadays, news text information is exploding, and people need more and more heterogeneous news content. Therefore, news text topic identification is needed to help viewers quickly and accurately screen and filter news related to their interests to save ...
Mingming Zheng   +3 more
doaj   +1 more source

The Number of Topics Optimization: Clustering Approach

open access: yes; Machine Learning and Knowledge Extraction, 2019
Although topic models have been used to build clusters of documents for more than ten years, there is still a problem of choosing the optimal number of topics. The authors analyzed many fundamental studies undertaken on the subject in recent years. The main problem is the lack of a stable metric of the quality of topics obtained during the construction
Fedor Krasnov, Anastasiia Sen
openaire   +1 more source

Evaluation of the Optimal Topic Classification for Social Media Data Combined with Text Semantics: A Case Study of Public Opinion Analysis Related to COVID-19 with Microblogs

open access: yes; ISPRS International Journal of Geo-Information, 2021
Online public opinion reflects social conditions and public attitudes regarding special social events. Therefore, analyzing the temporal and spatial distributions of online public opinion topics can contribute to understanding issues of public concern ...
Qin Liang, Chunchun Hu, Si Chen
doaj   +1 more source
