
Analysis and tuning of hierarchical topic models based on Renyi entropy approach [PDF]

open access: yes
PeerJ Computer Science, 2021
Hierarchical topic modeling is a potentially powerful instrument for determining topical structures of text collections that additionally allows constructing a hierarchy representing the levels of topic abstractness.
Sergei Koltcov   +3 more
doaj   +2 more sources

Estimation of Topic Similarity and Its Application to Measuring Stability of Topic Modeling [PDF]

open access: yes
Jiàoyù zīliào yǔ túshūguǎn xué, 2022
Topic modeling stability is a measurement of the extent to which models produced by the same modeling approach for the same corpus and with the same initial conditions have similar topics.
Sung-Chien Lin
doaj   +1 more source

The Number of Topics Optimization: Clustering Approach [PDF]

open access: yes
Machine Learning and Knowledge Extraction, 2019
Although topic models have been used to build clusters of documents for more than ten years, there is still a problem of choosing the optimal number of topics. The authors analyzed many fundamental studies undertaken on the subject in recent years. The main problem is the lack of a stable metric of the quality of topics obtained during the construction
Fedor Krasnov, Anastasiia Sen
openaire   +1 more source

Evaluation of the Optimal Topic Classification for Social Media Data Combined with Text Semantics: A Case Study of Public Opinion Analysis Related to COVID-19 with Microblogs

open access: yes
ISPRS International Journal of Geo-Information, 2021
Online public opinion reflects social conditions and public attitudes regarding special social events. Therefore, analyzing the temporal and spatial distributions of online public opinion topics can contribute to understanding issues of public concern ...
Qin Liang, Chunchun Hu, Si Chen
doaj   +1 more source

Renormalization Analysis of Topic Models

open access: yes
Entropy, 2020
In practice, to build a machine learning model of big data, one needs to tune model parameters. The process of parameter tuning involves extremely time-consuming and computationally expensive grid search.
Sergei Koltcov, Vera Ignatenko
doaj   +1 more source

Analisis Topik Tagar Covidindonesia pada Instagram Menggunakan Latent Dirichlet Allocation [Topic Analysis of the Covidindonesia Hashtag on Instagram Using Latent Dirichlet Allocation]

open access: yes
JISKA (Jurnal Informatika Sunan Kalijaga), 2022
In this era, technology is increasingly sophisticated, as evidenced by the number of people using the internet via cell phones, laptops, and other communication tools.
Kevin Rafi Adjie Putra Santoso   +3 more
doaj   +1 more source

Mining Method of Academic Research Hotspot Based on Spark [PDF]

open access: yes
Jisuanji gongcheng, 2019
By optimizing the Latent Dirichlet Allocation (LDA) topic model in Spark Machine Learning Library (MLlib), this paper proposes an improved mining method of academic research hotspots. LDA is used to model the keywords of academic papers. The optimal number of
ZHANG Cong, YI Xiushuang, ZHU Minghao, WANG Xingwei
doaj   +1 more source

Evaluating Topic Modeling for Saudi Newspapers Texts Using LDA: A Computational Linguistics Study

open access: yes
Journal of Umm Al-Qura University for Language Sciences and Literature, 2022
This paper is in the field of natural language processing. It applies an unsupervised machine learning approach to identify the latent topics in Saudi newspapers, using one of the most important unsupervised topic modeling algorithms.
Afrah Altamimi
doaj   +1 more source

Urban Public Transportation Perspective in Meta-Analysis Study

open access: yes
Media Komunikasi Teknik Sipil, 2021
Urban public transportation is a transportation system developed for the public interest that prioritizes the optimal integration of various resources and infrastructure in order to achieve a sustainable city guided by green technology.
Sri Sarjana
doaj   +1 more source

Topic detection with recursive consensus clustering and semantic enrichment

open access: yes
Humanities & Social Sciences Communications, 2023
Extracting meaningful information from short texts like tweets has proved to be a challenging task. Literature on topic detection focuses mostly on methods that try to guess the plausible words that describe topics whose number has been decided in ...
Vincenzo De Leo   +5 more
doaj   +1 more source
