Results 21 to 30 of about 83,132 (184)

Improving database quality through eliminating duplicate records

open access: yesData Science Journal, 2006
Redundant or duplicate data are the most troublesome problem in database management and applications. Approximate field matching is the key solution to resolve the problem by identifying semantically equivalent string values in syntactically different ...
Mingzhen Wei   +2 more
doaj   +1 more source

Approximate String Matching with SIMD

open access: yesThe Computer Journal, 2021
Abstract We consider the $k$ mismatches version of approximate string matching for a single pattern and multiple patterns. For these problems, we present new algorithms utilizing the single instruction multiple data (SIMD) instruction set extensions for patterns of up to 32 characters. We apply SIMD computation in three ways: in counting
Fiori, Fernando J.   +3 more
openaire   +2 more sources

APPLYING A Q-GRAM BASED MULTIPLE STRING MATCHING ALGORITHM FOR APPROXIMATE MATCHING

open access: yesInformatyka, Automatyka, Pomiary w Gospodarce i Ochronie Środowiska, 2017
We consider the application of multiple pattern matching (Multi AOSO on q-Grams) algorithm for approximate pattern matching. We propose the on-line approach which translates the problem from approximate pattern matching into a multiple pattern one ...
Robert Susik
doaj   +1 more source

CDKAM: a taxonomic classification tool using discriminative k-mers and approximate matching strategies

open access: yesBMC Bioinformatics, 2020
Background Current taxonomic classification tools use exact string matching algorithms that are effective to tackle the data from the next generation sequencing technology.
Van-Kien Bui, Chaochun Wei
doaj   +1 more source

Predictive Quantization and Symbolic Dynamics

open access: yesAlgorithms, 2022
Capturing long-term statistics of signals and time series is important for modeling recurrent phenomena, especially when such recurrences are a-periodic and can be characterized by the approximate repetition of variable length motifs, such as patterns in
Shlomo Dubnov
doaj   +1 more source

Compressed computations using wavelets for hidden Markov models with continuous observations.

open access: yesPLoS ONE, 2023
Compression as an accelerant of computation is increasingly recognized as an important component in engineering fast real-world machine learning methods for big data; c.f., its impact on genome-scale approximate string matching. Previous work showed that
Luca Bello   +2 more
doaj   +2 more sources

Optical Character Recognition (OCR) enhancement using an approximate string matching technique

open access: yesEngineering and Applied Science Research, 2018
Many researchershavefocusedon improving optical character recognition (OCR) efficiency by developing new techniques using image processing based methodologies.
Kraisak Kesorn   +1 more
doaj   +1 more source

A Technique for Discovering Similarities between Texts Based on Extracting Features from the Text [PDF]

open access: yesمجلة جامعة الانبار للعلوم الصرفة, 2019
The discovery of the similarity between two texts is very important and useful in many applications. The similarity between texts is the core research area of dataset, data warehouse, and data mining.
Alaa Abdalqahar Jihad, Mortadha M. Hamad
doaj   +1 more source

Business Process Automation: A Workflow Incorporating Optical Character Recognition and Approximate String and Pattern Matching for Solving Practical Industry Problems

open access: yesApplied System Innovation, 2019
Companies are relying more on artificial intelligence and machine learning in order to enhance and automate existing business processes. While the power of OCR (Optical Character Recognition) technologies can be harnessed for the digitization of image ...
Coenrad de Jager, Marinda Nel
doaj   +1 more source

Improved Approximate String Matching and Regular Expression Matching on Ziv-Lempel Compressed Texts

open access: yes, 2007
We study the approximate string matching and regular expression matching problem for the case when the text to be searched is compressed with the Ziv-Lempel adaptive dictionary compression schemes.
A. Amir   +15 more
core   +4 more sources

Home - About - Disclaimer - Privacy