Results 11 to 20 of about 17,818 (198)

Levenshtein Distances Fail to Identify Language Relationships Accurately [PDF]

open access: yesComputational Linguistics, 2011
The Levenshtein distance is a simple distance metric derived from the number of edit operations needed to transform one string into another. This metric has received recent attention as a means of automatically classifying languages into genealogical subgroups.
Simon J Greenhill
openaire   +5 more sources

DETEKSI PLAGIARISME MENGGUNAKAN ALGORITMA LEVENSHTEIN DISTANCE

open access: yesJurnal Teknologi Informasi Universitas Lambung Mangkurat (JTIULM), 2021
Deteksi kesamaan dokumen untuk sistem plagiarisme termasuk dalam riset Natural Language Processing dalam bidang kecerdasan buatan. Plagiarisme banyak terjadi pada dokumen di lingkungan akademisi, begitupun yang terjadi pada PSMTS ULM. Deteksi plagiarisme diperlukan agar menjaga orisinalitas dari hasil tesis mahasiswa.
null Yuslena   +2 more
openaire   +2 more sources

A Levenshtein distance-based method for word segmentation in corpus augmentation of geoscience texts

open access: yesAnnals of GIS, 2023
For geoscience text, rich domain corpora have become the basis of improving the model performance in word segmentation. However, the lack of domain-specific corpus with annotation labelled has become a major obstacle to professional information mining in
Jinqu Zhang   +6 more
doaj   +1 more source

Scientific reference style using rule-based machine learning

open access: yesIJAIN (International Journal of Advances in Intelligent Informatics), 2023
Regular Expressions (RegEx) can be employed as a technique for supervised learning to define and search for specific patterns inside text. This work devised a method that utilizes regular expressions to convert the reference style of academic papers into
Afrida Helen   +2 more
doaj   +1 more source

Similarity Hashing Based on Levenshtein Distances [PDF]

open access: yes, 2014
It is increasingly common in forensic investigations to use automated pre-processing techniques to reduce the massive volumes of data that are encountered. This is typically accomplished by comparing fingerprints (typically cryptographic hashes) of files against existing databases.
Breitinger, Frank   +3 more
openaire   +2 more sources

Distance Measure for Controlled Random Tests

open access: yesДоклады Белорусского государственного университета информатики и радиоэлектроники, 2022
The problem of constructing characteristics of the difference between test sequences is investigated. Its relevance for generating controlled random tests and the complexity of finding difference measures for symbolic tests are substantiated.
V. N. Yarmolik   +2 more
doaj   +1 more source

Damerau Levenshtein Distance for Indonesian Spelling Correction

open access: yesJurnal Informatika, 2019
Word correction used to find an incorrect word in writing. Levenshtein distance is one of algorithm to correcting typing error. It is an algorithm that calculates a difference between two strings.
Puji Santoso   +3 more
doaj   +1 more source

A Novel String Distance Function based on Most Frequent K Characters [PDF]

open access: yes, 2014
This study aims to publish a novel similarity metric to increase the speed of comparison operations. Also the new metric is suitable for distance-based operations among strings.
Altun, Oguz   +3 more
core   +1 more source

Analisis Komparatif Pengukuran Kemiripan Artikel Ilmiah menggunakan Jaccard dan Levenshtein serta Blocking

open access: yesJuTISI (Jurnal Teknik Informatika dan Sistem Informasi), 2023
Mesin pencarian artikel telah memudahkan akademisi melakukan studi literatur. Namun, mudah bukan berarti akurat. Untuk topik niche tertentu, hasil pencarian sering kali belum sesuai.
Muhammad Rizqi Nur   +2 more
doaj   +1 more source

Normalisasi Kata Tidak Baku yang Tidak Disingkat dengan Jarak Perubahan

open access: yesJurnal Nasional Teknik Elektro dan Teknologi Informasi, 2019
Voice assistant technology is growing rapidly and its use has begun to spread to daily use. However, voice assistant usages are still limited to standard conversation languages.
I Gusti Bagus Baskara Nugraha   +1 more
doaj   +1 more source

Home - About - Disclaimer - Privacy