
Selection of the Optimal Number of Topics for LDA Topic Model—Taking Patent Policy Analysis as an Example [PDF]

open access: gold
Entropy, 2021
This study constructs a comprehensive index to effectively judge the optimal number of topics in the LDA topic model. Based on the requirements for selecting the number of topics, a comprehensive judgment index of perplexity, isolation, stability, and ...
Jingxian Gan, Yong Qi
doaj   +7 more sources

An adaptive method for determining the optimal number of topics in topic modeling [PDF]

open access: gold
PeerJ Computer Science
Topic models have been successfully applied to information classification and retrieval. The difficulty in successfully applying these technologies is to select the appropriate number of topics for a given corpus.
Yang Xu   +3 more
doaj   +5 more sources

Topic models with elements of neural networks: investigation of stability, coherence, and determining the optimal number of topics [PDF]

open access: gold
PeerJ Computer Science
Topic modeling is a widely used instrument for the analysis of large text collections. In the last few years, neural topic models and models with word embeddings have been proposed to increase the quality of topic solutions.
Sergei Koltcov   +3 more
doaj   +6 more sources

An Adaptive LDA Optimal Topic Number Selection Method in News Topic Identification [PDF]

open access: gold
IEEE Access, 2023
Nowadays, news text information is exploding, and people need more and more heterogeneous news content. Therefore, news text topic identification is needed to help viewers quickly and accurately screen and filter news related to their interests to save ...
Mingming Zheng   +3 more
doaj   +3 more sources

Identifying the optimal number of topics in text mining: a case study on reindeer pastoralism literature

open access: gold
Italian Journal of Animal Science
Text mining and topic analysis algorithms, which group textual contents in the most efficient way, are becoming increasingly useful to summarise the main information contained in large data corpora of complex scientific fields.
Barbara Contiero   +2 more
doaj   +4 more sources

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics [PDF]

open access: bronze
Bernoulli, 2020
We propose a new method of estimation in topic models, that is not a variation on the existing simplex finding algorithms, and that estimates the number of topics K from the observed data. We derive new finite sample minimax lower bounds for the estimation of A, as well as new upper bounds for our proposed estimator. We describe the scenarios where our
Bing, Xin   +2 more
  +9 more sources

Fractal approach for determining the optimal number of topics in the field of topic modeling

open access: gold
Journal of Physics: Conference Series, 2019
In this paper we apply multifractal formalism to the analysis of statistical behaviour of topic models under condition of varying number of topics. Our analysis reveals the existence of two self-similar regions and one transition region in the function of density-of-states depending on the number of topics.
Ignatenko, Vera   +3 more
openaire   +5 more sources

A data-driven analysis to determine the optimal number of topics 'K' for latent Dirichlet allocation model

open access: gold
Indonesian Journal of Electrical Engineering and Computer Science
Topic modeling is an unsupervised machine learning technique successfully used to classify and retrieve textual data. However, the performance of topic models is sensitive to selecting optimal hyperparameters, the number of topics 'K' and Dirichlet priors 'α' and 'β.' This data-driven analysis aims to determine the optimum number of topics, 'K,' within
Astha Goyal, Indu Kashyap
openaire   +3 more sources

The Optimal Number of Topics Detection and their Assessment in Multiple Current Events and Trends Based Datasets Using Topic Models

open access: closed
International Journal for Research in Applied Science and Engineering Technology
This project explores the application of sophisticated topic models to determine the ideal amount of subjects and evaluate them across several datasets pertaining to trends and current events. We propose a comprehensive approach that leverages the topic models such as Latent Dirichlet Allocation (LDA), Non-negative Matrix Factorization (NMF), Latent ...
G. Manasa
openaire   +2 more sources

Topic modeling and content analysis of people’s anxiety-related concerns raised on a computer-mediated health platform [PDF]

open access: yes
Scientific Reports
Background: About one in four Chinese people might suffer or have already suffered from anxiety conditions, with a lifetime prevalence rate of 4.8%. However, many of those who could have benefited from psychological or pharmacological treatments fail to ...
Yi Liu   +7 more
doaj   +2 more sources
