Levenshtein Distances Fail to Identify Language Relationships Accurately
The Levenshtein distance is a simple distance metric derived from the number of edit operations needed to transform one string into another. This metric has received recent attention as a means of automatically classifying languages into genealogical subgroups.
Simon J Greenhill
openaire +5 more sources
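As an illustration of the metric described in the abstract above, here is a minimal Python sketch of the standard dynamic-programming computation of Levenshtein distance. It is not code from the listed paper. The function name `levenshtein` and the unit cost for every edit operation are assumptions made for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn a into b (unit costs assumed)."""
    if len(a) < len(b):
        a, b = b, a  # the distance is symmetric; keep the row short

    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute, or match at no cost
            ))
        previous = current
    return previous[-1]


print(levenshtein("kitten", "sitting"))  # 3
```

Only two rows of the table are kept at a time, so memory grows with the shorter string while running time remains proportional to the product of the two lengths.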
DETEKSI PLAGIARISME MENGGUNAKAN ALGORITMA LEVENSHTEIN DISTANCE (Plagiarism Detection Using the Levenshtein Distance Algorithm)
Document similarity detection for plagiarism systems falls within Natural Language Processing research in the field of artificial intelligence. Plagiarism is common in documents in academic settings, including at PSMTS ULM. Plagiarism detection is needed to safeguard the originality of students' theses.
Yuslena +2 more
openaire +2 more sources
A Levenshtein distance-based method for word segmentation in corpus augmentation of geoscience texts
For geoscience text, rich domain corpora have become the basis of improving the model performance in word segmentation. However, the lack of domain-specific corpus with annotation labelled has become a major obstacle to professional information mining in
Jinqu Zhang +6 more
doaj +1 more source
Scientific reference style using rule-based machine learning
Regular Expressions (RegEx) can be employed as a technique for supervised learning to define and search for specific patterns inside text. This work devised a method that utilizes regular expressions to convert the reference style of academic papers into
Afrida Helen +2 more
doaj +1 more source
Similarity Hashing Based on Levenshtein Distances
It is increasingly common in forensic investigations to use automated pre-processing techniques to reduce the massive volumes of data that are encountered. This is typically accomplished by comparing fingerprints (typically cryptographic hashes) of files against existing databases.
Breitinger, Frank +3 more
openaire +2 more sources
Distance Measure for Controlled Random Tests
The problem of constructing characteristics of the difference between test sequences is investigated. Its relevance for generating controlled random tests and the complexity of finding difference measures for symbolic tests are substantiated.
V. N. Yarmolik +2 more
doaj +1 more source
Damerau Levenshtein Distance for Indonesian Spelling Correction
Word correction is used to find incorrect words in writing. Levenshtein distance is one algorithm for correcting typing errors. It calculates the difference between two strings.
Puji Santoso +3 more
doaj +1 more source
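To illustrate how the Damerau variant named in the title above differs from plain Levenshtein distance, here is a minimal Python sketch of the restricted form, often called optimal string alignment. It counts a swap of two adjacent characters as a single edit. This is an assumption-laden example, not the listed paper's implementation: the function name `osa_distance`, the unit costs, and the choice of the restricted variant are all illustrative.

```python
def osa_distance(a: str, b: str) -> int:
    """Optimal string alignment distance: Levenshtein edits plus
    transposition of two adjacent characters, each at unit cost."""
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all i characters
    for j in range(cols):
        d[0][j] = j  # insert all j characters

    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution or match
            )
            # Adjacent transposition: the last two characters are swapped.
            if (i > 1 and j > 1
                    and a[i - 1] == b[j - 2]
                    and a[i - 2] == b[j - 1]):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]


# The typo "mkaan" for the Indonesian word "makan" is one swap away.
print(osa_distance("mkaan", "makan"))  # 1
```

Plain Levenshtein distance would score the same typo as 2, which is why a transposition-aware variant is a common choice for ranking spelling candidates. The restricted form never edits the same substring twice, so it can exceed the unrestricted Damerau-Levenshtein distance for some pairs, such as "ca" and "abc".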
A Novel String Distance Function based on Most Frequent K Characters
This study aims to publish a novel similarity metric to increase the speed of comparison operations. Also the new metric is suitable for distance-based operations among strings.
Altun, Oguz +3 more
core +1 more source
Article search engines have made it easier for academics to conduct literature reviews. However, easy does not mean accurate. For certain niche topics, search results are often not a good match.
Muhammad Rizqi Nur +2 more
doaj +1 more source
Normalisasi Kata Tidak Baku yang Tidak Disingkat dengan Jarak Perubahan (Normalization of Non-Abbreviated Non-Standard Words Using Edit Distance)
Voice assistant technology is growing rapidly and its use has begun to spread to daily use. However, voice assistant usages are still limited to standard conversation languages.
I Gusti Bagus Baskara Nugraha +1 more
doaj +1 more source

