Results 171 to 180 of about 8,324 (219)
Some of the next articles are maybe not open access.
Improvement and Application of TF * IDF Algorithm
2011The traditional TF-IDF probability model is a relatively simple formula. For a few words which are commonly used and not stop words in a paper,it is lack of better differentiate and is not suitable for many specific cases, such as news advertising service module, about extraction of key words of the article, according to the deficiencies and the demand
Ji-Rui Li, Yan-Fang Mao, Kai Yang
openaire +1 more source
Clustering the topics using TF-IDF for model fusion
Proceedings of the 2nd PhD workshop on Information and knowledge management, 2008Users tend to express their queries in various ways: sometimes they use more general terms, sometimes more specific terms. Information retrieval systems need to be able to accommodate this variety of user needs. Some retrieval models perform better when the queries are general, others perform better when the queries are more specific, and others when a
Muath Alzghool, Diana Zaiu Inkpen
openaire +1 more source
The Automatic option of inference rules for the fuzzy TF-IDF
2020 IEEE 2nd International Conference on Electronics, Control, Optimization and Computer Science (ICECOCS), 2020For several NLP tasks, we use the Fuzzy Inference System (FIS) to develop a set of decisional systems. Still, the selection rules FIS process needs some modification to be more useful. In this paper, we use FIS to produce terms weight for electronic documents basing on the popular TF-IDF (term frequency-inverse term frequency) components.
Mariem Bounabi +2 more
openaire +1 more source
An information-theoretic perspective of tf–idf measures
Information Processing & Management, 2003zbMATH Open Web Interface contents unavailable due to conflicting licenses.
openaire +1 more source
Chinese readability assessment using TF-IDF and SVM
2011 International Conference on Machine Learning and Cybernetics, 2011This paper proposes a simple yet effective method to automatically determine the readability of Chinese articles. We use mutual information to select the most important terms from the training data, calculate TF-IDF values based on those terms, and use those values as features for SVM to build classification models that identify articles suitable for ...
Yaw-Huei Chen, Yi-Han Tsai, Yu-Ta Chen
openaire +1 more source
Research on case reasoning method based on TF-IDF
International Journal of System Assurance Engineering and Management, 2021With the continuous expansion of the application field of Case-Based Reasoning (CBR) technology, it is increasingly difficult for programmers to acquire and express professional knowledge. Therefore, this paper first gives a structured expression of professional knowledge, and combines the Case-Based Reasoning method with the scientific measurement of ...
openaire +1 more source
Movie genre classification using TF-IDF and SVM
Proceedings of the 2019 7th International Conference on Information Technology: IoT and Smart City, 2019This paper studies the classification principle and process of SVM algorithm, and classifies the text containing movie information, so as to achieve the research purpose of movie classification. It focuses on the various steps that need to be completed in the process of classification, such as text word segmentation, feature engineering, text ...
Ning Fei, Yangyang Zhang
openaire +1 more source
From Tf-Idf to Learning-to-Rank
2016Ranking a set of documents based on their relevances with respect to a given query is a central problem of information retrieval (IR). Traditionally people have been using unsupervised scoring methods like tf-idf, BM25, Language Model etc., but recently supervised machine learning framework is being used successfully to learn a ranking function, which ...
Muhammad Ibrahim, Manzur Murshed
openaire +1 more source
Improvement of TF-IDF Algorithm Based on Knowledge Graph
2018 IEEE 16th International Conference on Software Engineering Research, Management and Applications (SERA), 2018The TF-IDF algorithm is commonly used for text information retrieval and data mining. The traditional TF-IDF algorithm does not consider the domain characteristics of the article, and does not consider the distribution ratio. Currently, the solution proposed by many scholars only solves the problems of distribution ratio and the like, and does not ...
Yanpeng Wang +4 more
openaire +1 more source
Ontology-Based Genes Similarity Calculation with TF-IDF
2012The Gene Ontology (GO) provides a controlled vocabulary of terms for describing genes from different data resources. In this paper, we proposed a novel method determining semantic similarity of genes based on GO. The key principle of our method relies on the introduction of Term Frequency (TF) and Inverse Document Frequency (IDF) to quantify the ...
Yue Huang, Mingxin Gan, Rui Jiang 0001
openaire +1 more source

