
Analyse de documents avec TF-IDF

open access: yes, The Programming Historian en Français, 2022
This lesson presents a natural language processing and information retrieval method called Term Frequency - Inverse Document Frequency (tf-idf).
Matthew J. Lavin
doaj   +1 more source

Comparative Evaluation of NLP-Based Approaches for Linking CAPEC Attack Patterns from CVE Vulnerability Information

open access: yes, Applied Sciences, 2022
Vulnerability and attack information must be collected to assess the severity of vulnerabilities and prioritize countermeasures against cyberattacks quickly and accurately.
Kenta Kanakogi   +8 more
doaj   +1 more source

Why inverse document frequency? [PDF]

open access: yes, Second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies 2001 - NAACL '01, 2001
Inverse Document Frequency (IDF) is a popular measure of a word's importance. The IDF invariably appears in a host of heuristic measures used in information retrieval. However, so far the IDF has itself been a heuristic. In this paper, we show IDF to be optimal in a principled sense.
openaire   +2 more sources

Impact of Different Approaches to Preparing Notes for Analysis With Natural Language Processing on the Performance of Prediction Models in Intensive Care

open access: yes, Critical Care Explorations, 2021
OBJECTIVES: To evaluate whether different approaches to note text preparation (known as preprocessing) can impact machine learning model performance for mortality prediction in the ICU. DESIGN: ...
Malini Mahendra, MD   +5 more
doaj   +1 more source

Automatic Annotation of Images in Persian Scientific Documents Based on Text Analysis Methods

open access: yes, Iranian Journal of Information Processing & Management, 2022
In this paper a new method for annotating images in Persian scientific documents is suggested. Images in scientific documents contain valuable information.
Azadeh Fakhrzadeh   +2 more
doaj  

Term Weighting Schemes for Slovak Text Document Clustering [PDF]

open access: yes, Journal of Electrical and Electronics Engineering, 2013
Text representation is the task of transforming textual data into a multidimensional space with corresponding weights for every word. We have tested several widely used term weighting methods on a manually created database of Slovak Wikipedia articles.
ZLACKÝ Daniel   +3 more
doaj  

Text Classification Using Document-Relational Graph Convolutional Networks

open access: yes, IEEE Access, 2022
Graph Convolutional Networks (GCNs) have received considerable attention in the field of artificial machine intelligence (AMI) and natural language processing research because they can build more sophisticated accompanying graph structures than ...
Chongyi Liu, Xiangyu Wang, Honglei Xu
doaj   +1 more source

Study of Statistical Text Representation Methods for Performance Improvement of a Hierarchical Attention Network

open access: yes, Applied Sciences, 2021
To effectively process textual data, many approaches have been proposed to create text representations. The transformation of a text into a form of numbers that can be computed using computers is crucial for further applications in downstream tasks such ...
Adam Wawrzyński, Julian Szymański
doaj   +1 more source

DETEKSI EMOSI MEDIA SOSIAL MENGGUNAKAN TERM FREQUENCY- INVERSE DOCUMENT FREQUENCY

open access: yes, CSRID (Computer Science Research and Its Development Journal), 2021
Nowadays, people tend to express their opinions and emotions through social media. The openness of expression on social media blurs a person's personal boundaries. People no longer hesitate to write about their private lives in status updates for others to see.
Arif Nur Rohman   +3 more
openaire   +2 more sources

Algoritma Term Frequency – Inverse Document Frequency (TF-IDF) dan K-Means Clustering Untuk Menentukan Kategori Dokumen [PDF]

open access: yes, 2022
Technology is developing rapidly; one result is a growing number of documents in the form of research articles. Searching for documents in a repository takes a long time if they are not stored grouped by document category.
Arifin, Rizal   +4 more
core  
