Results 271 to 280 of about 21,703 (289)
Some of the next articles are maybe not open access.
Quicker Similarity Joins in Metric Spaces
2013We consider the join operation in metric spaces. Given two sets A and B of objects drawn from some universe $\mathbb U$ , we want to compute the set $A \Join B = \{a,b \in A \times B\;|\;da,b \leq r\}$ efficiently, where $d : \mathbb U \times \mathbb U \to \mathbb R^+$ is a metric distance function and r∈ℝ+ is user supplied query ...
Braithwaite Billy, Fredriksson Kimmo
openaire +1 more source
An Efficient Similarity Join Algorithm with Cosine Similarity Predicate
2010Given a large collection of objects, finding all pairs of similar objects, namely similarity join, is widely used to solve various problems in many application domains.Computation time of similarity join is critical issue, since similarity join requires computing similarity values for all possible pairs of objects.
Dongjoo Lee +3 more
openaire +1 more source
Parallelizing String Similarity Join Algorithms
2018A key operation in data cleaning and integration is the use of string similarity join (SSJ) algorithms to identify and remove duplicates or similar records within data sets. With the advent of big data, a natural question is how to parallelize SSJ algorithms.
Ling-Chih Yao, Lipyeow Lim
openaire +1 more source
String Similarity Join with Different Thresholds
2015String similarity join is an essential operation of many applications that need to find all similar string pairs from given two collections. The existing approaches are using the uniform and predefined similarity thresholds. While in real applications, regarding that the longer string pairs typically tolerate many more typos, it is necessary to apply ...
Chuitian Rong, Xiangling Zhang
openaire +1 more source
GPU Acceleration of Set Similarity Joins
2015We propose a scheme of efficient set similarity joins on Graphics Processing Units GPUs. Due to the rapid growth and diversification of data, there is an increasing demand for fast execution of set similarity joins in applications that vary from data integration to plagiarism detection.
Mateus S. H. Cruz +3 more
openaire +1 more source
A distributed framework for large-scale semantic trajectory similarity join
Multimedia Tools and Applications, 2023Ruijie Tian, Weishi Zhang, Li Jiajun
exaly
Dynamic Set Similarity Join: An Update Log Based Approach
IEEE Transactions on Knowledge and Data Engineering, 2023Chengcheng Yang, Lisi Chen, Hao Wang
exaly
Parallel set similarity join on big data based on Locality-Sensitive Hashing
Science of Computer Programming, 2017Mohammad Karim Sohrabi
exaly
Top-k Similarity Join in Heterogeneous Information Networks
IEEE Transactions on Knowledge and Data Engineering, 2015Yun Xiong, Yangyong Zhu, Philip S Yu
exaly
Extending string similarity join to tolerant fuzzy token matching
ACM Transactions on Database Systems, 2014Jiannan Wang, Guoliang Li, Liguoliang
exaly

