Selection of the Optimal Number of Topics for LDA Topic Model—Taking Patent Policy Analysis as an Example [PDF]
This study constructs a comprehensive index to effectively judge the optimal number of topics in the LDA topic model. Based on the requirements for selecting the number of topics, a comprehensive judgment index of perplexity, isolation, stability, and ...
Jingxian Gan, Yong Qi
doaj +6 more sources
An adaptive method for determining the optimal number of topics in topic modeling [PDF]
Topic models have been successfully applied to information classification and retrieval. The difficulty in successfully applying these technologies is to select the appropriate number of topics for a given corpus.
Yang Xu +3 more
doaj +5 more sources
Topic models with elements of neural networks: investigation of stability, coherence, and determining the optimal number of topics [PDF]
Topic modeling is a widely used instrument for the analysis of large text collections. In the last few years, neural topic models and models with word embeddings have been proposed to increase the quality of topic solutions.
Sergei Koltcov +3 more
doaj +5 more sources
Text mining and topic analysis algorithms which group textual contents in the most efficient way, are becoming increasingly useful to summarise the main information contained in large data corpus of complex scientific fields.
Barbara Contiero +2 more
doaj +3 more sources
Analysis and tuning of hierarchical topic models based on Renyi entropy approach [PDF]
Hierarchical topic modeling is a potentially powerful instrument for determining topical structures of text collections that additionally allows constructing a hierarchy representing the levels of topic abstractness.
Sergei Koltcov +3 more
doaj +2 more sources
Estimation of Topic Similarity and Its Application to Measuring Stability of Topic Modeling [PDF]
Topic modeling stability is a measurement of the extent to which models produced by the same modeling approach for the same corpus and with the same initial conditions have similar topics.
Sung-Chien Lin
doaj +1 more source
A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics [PDF]
We propose a new method of estimation in topic models, that is not a variation on the existing simplex finding algorithms, and that estimates the number of topics K from the observed data. We derive new finite sample minimax lower bounds for the estimation of A, as well as new upper bounds for our proposed estimator. We describe the scenarios where our
Bing, Xin +2 more
openaire +4 more sources
An Adaptive LDA Optimal Topic Number Selection Method in News Topic Identification
Nowadays, news text information is exploding, and people need more and more heterogeneous news content. Therefore, news text topic identification is needed to help viewers quickly and accurately screen and filter news related to their interests to save ...
Mingming Zheng +3 more
doaj +1 more source
The Number of Topics Optimization: Clustering Approach [PDF]
Although topic models have been used to build clusters of documents for more than ten years, there is still a problem of choosing the optimal number of topics. The authors analyzed many fundamental studies undertaken on the subject in recent years. The main problem is the lack of a stable metric of the quality of topics obtained during the construction
Fedor Krasnov, Anastasiia Sen
openaire +1 more source
Online public opinion reflects social conditions and public attitudes regarding special social events. Therefore, analyzing the temporal and spatial distributions of online public opinion topics can contribute to understanding issues of public concern ...
Qin Liang, Chunchun Hu, Si Chen
doaj +1 more source

