Quality and complexity measures for data linkage and deduplication [PDF]
Summary. Deduplicating one data set or linking several data sets are increasingly important tasks in the data preparation steps of many data mining projects. The aim of such linkages is to match all records relating to the same entity.
C Shearer +17 more
core +1 more source
Hematopoietic (stem) cells—The elixir of life?
The aging of HSCs (hematopoietic stem cells) and the blood system leads to the decline of other organs. Rejuvenating aged HSCs improves the function of the blood system, slowing the aging of the heart, kidney, brain, and liver, and the occurrence of age‐related diseases.
Emilie L. Cerezo +4 more
wiley +1 more source
A Comparison of Blocking Methods for Record Linkage
Record linkage seeks to merge databases and to remove duplicates when unique identifiers are not available. Most approaches use blocking techniques to reduce the computational complexity associated with record linkage.
A. Goldenberg +10 more
core +1 more source
Accuracy of Probabilistic Linkage Using the Enhanced Matching System for Public Health and Epidemiological Studies [PDF]
The Enhanced Matching System (EMS) is a probabilistic record linkage program developed by the tuberculosis section at Public Health England to match data for individuals across two datasets. This paper outlines how EMS works and investigates its accuracy
Abubakar, I +3 more
core +3 more sources
Bridging the gap: Multi‐stakeholder perspectives of molecular diagnostics in oncology
Although molecular diagnostics is transforming cancer care, implementing novel technologies remains challenging. This study identifies unmet needs and technology requirements through a two‐step stakeholder involvement. Liquid biopsies for monitoring applications and predictive biomarker testing emerge as key unmet needs. Technology requirements vary by
Jorine Arnouts +8 more
wiley +1 more source
A Bayesian Approach to Graphical Record Linkage and De-duplication [PDF]
We propose an unsupervised approach for linking records across arbitrarily many files, while simultaneously detecting duplicate records within files. Our key innovation involves the representation of the pattern of links between records as a bipartite ...
Fienberg, Stephen E. +2 more
core +2 more sources
Consenting to health record linkage: evidence from a multi-purpose longitudinal survey of a general population [PDF]
Background: The British Household Panel Survey (BHPS) is the first long-running UK longitudinal survey with a non-medical focus and a sample covering the whole age range to have asked for permission to link to a range of administrative health records ...
AR Tate +18 more
core +3 more sources
This study indicates that Merkel cell carcinoma (MCC) does not originate from Merkel cells, and identifies gene, protein, and cellular expression of immune‐linked and neuroendocrine markers in primary and metastatic MCC tumor samples, linked to Merkel cell polyomavirus (MCPyV) status, with enrichment of B‐cell and other immune cell
Richie Jeremian +10 more
wiley +1 more source
A hierarchical Bayesian approach to record linkage and population size problems
We propose and illustrate a hierarchical Bayesian approach for matching statistical records observed on different occasions. We show how this model can be profitably adopted both in record linkage problems and in capture-recapture setups, where the size
Liseo, Brunero; Tancredi, Andrea
core +1 more source
Sociodemographic and Health Characteristics, Rather Than Primary Care Supply, are Major Drivers of Geographic Variation in Preventable Hospitalizations in Australia [PDF]
ACKNOWLEDGMENTS: The authors thank the many thousands of people participating in the 45 and Up Study. The authors also thank the Sax Institute, the NSW Ministry of Health, and the NSW Register of Births, Deaths, and Marriages for allowing access to the ...
Blyth, Fiona M +5 more
core +1 more source

