Results 221 to 230 of about 30,335 (280)

Learning to deduplicate

Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries, 2006
Identifying record replicas in Digital Libraries and other types of digital repositories is fundamental to improve the quality of their content and services as well as to yield eventual sharing efforts. Several deduplication strategies are available, but most of them rely on manually chosen settings to combine evidence used to identify records as being
Moisés G. de Carvalho   +3 more
openaire   +1 more source

Lazy Exact Deduplication

ACM Transactions on Storage, 2016
Deduplication aims to reduce duplicate data in storage systems by removing redundant copies of data blocks, which are compared to one another using fingerprints. However, repeated on-disk fingerprint lookups lead to high disk traffic, which results in a bottleneck.
Jingwei Ma   +6 more
openaire   +1 more source

Home - About - Disclaimer - Privacy