An investigation of term weighting approaches for microblog retrieval [PDF]
The use of effective term frequency weighting and document length normalisation strategies have been shown over a number of decades to have a significant positive effect for document retrieval.
Ferguson, Paul +4 more
core +2 more sources
Classification of Hadith According to Its Content Based on Supervised Learning Algorithms
Given the importance of the Prophet’s Hadith for Muslims all over the world, where it is the second source of Islam after the Qur’an and the fundamental resource of legislation in the Islam community.
Hammam M. Abdelaal +2 more
doaj +1 more source
Probabilistic models of information retrieval based on measuring the divergence from randomness [PDF]
We introduce and create a framework for deriving probabilistic models of Information Retrieval. The models are nonparametric models of IR obtained in the language model approach.
Allan J. +20 more
core +1 more source
PREFERENCE BASED TERM WEIGHTING FOR ARABIC FIQH DOCUMENT RANKING
In document retrieval, besides the suitability of query with search results, there is also a subjective user assessment that is expected to be a deciding factor in document ranking.
Khadijah Fahmi Hayati Holle +2 more
doaj +1 more source
USTW Vs. STW: A Comparative Analysis for Exam Question Classification based on Bloom’s Taxonomy
Bloom’s Taxonomy (BT) is widely used in educational institutions to produce high-quality exam papers to evaluate students’ knowledge at different cognitive levels.
Mohammed Osman Gani +3 more
doaj +1 more source
Evaluation of Term Ranking Algorithms for Pseudo-Relevance Feedback in MEDLINE Retrieval [PDF]
ObjectivesThe purpose of this study was to investigate the effects of query expansion algorithms for MEDLINE retrieval within a pseudo-relevance feedback framework.MethodsA number of query expansion algorithms were tested using various term ranking ...
Sooyoung Yoo, Jinwook Choi
doaj +1 more source
Term Weighting Schemes for Slovak Text Document Clustering [PDF]
Text representation is the task of transforming the textual data into a multidimensional space with corresponding weights for every word. Wehave tested several widely used term weighting methods on manually created database from Slovak Wikipedia articles.
ZLACKÝ Daniel +3 more
doaj
Successful modeling and prediction depend on effective methods for the extraction of domain-relevant variables. This paper proposes a methodology for identifying domain-specific terms. The proposed methodology relies on a collection of documents labeled
Mariano Maisonnave +3 more
doaj +1 more source
Binned Term Count: An Alternative to Term Frequency for Text Categorization
In text categorization, a well-known problem related to document length is that larger term counts in longer documents cause classification algorithms to become biased.
Farhan Shehzad +5 more
doaj +1 more source
Volatility Prediction using Financial Disclosures Sentiments with Word Embedding-based IR Models [PDF]
Volatility prediction--an essential concept in financial markets--has recently been addressed using sentiment analysis methods. We investigate the sentiment of annual disclosures of companies in stock markets to forecast volatility.
Anderson, Linda +5 more
core +3 more sources

