Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A scaling approach to record linkage.

Literature DB >> 28303597

A scaling approach to record linkage.

Harvey Goldstein^1,2, Katie Harron³, Mario Cortina-Borja².

Abstract

With increasing availability of large datasets derived from administrative and other sources, there is an increasing demand for the successful linking of these to provide rich sources of data for further analysis. Variation in the quality of identifiers used to carry out linkage means that existing approaches are often based upon 'probabilistic' models, which are based on a number of assumptions, and can make heavy computational demands. In this paper, we suggest a new approach to classifying record pairs in linkage, based upon weights (scores) derived using a scaling algorithm. The proposed method does not rely on training data, is computationally fast, requires only moderate amounts of storage and has intuitive appeal.

Entities: Gene Species

Keywords: correspondence analysis; data linkage; record linkage; scaling

Mesh：

Year: 2017 PMID： 28303597 PMCID： PMC6205620 DOI： 10.1002/sim.7287

Source DB: PubMed Journal: Stat Med ISSN： 0277-6715 Impact factor: 2.373

4 in total

1. Record linkage: statistical models for matching computer records.

Authors: J B Copas; F J Hilton
Journal: J R Stat Soc Ser A Stat Soc Date: 1990 Impact factor: 2.483

2. Ignoring dependency between linking variables and its impact on the outcome of probabilistic record linkage studies.

Authors: Miranda Tromp; Nora Méray; Anita C J Ravelli; Johannes B Reitsma; Gouke J Bonsel
Journal: J Am Med Inform Assoc Date: 2008-06-25 Impact factor: 4.497

3. The analysis of record-linked data using multiple imputation with data value priors.

Authors: Harvey Goldstein; Katie Harron; Angie Wade
Journal: Stat Med Date: 2012-07-17 Impact factor: 2.373

4. Linkage, evaluation and analysis of national electronic healthcare data: application to providing enhanced blood-stream infection surveillance in paediatric intensive care.

Authors: Katie Harron; Harvey Goldstein; Angie Wade; Berit Muller-Pebody; Roger Parslow; Ruth Gilbert
Journal: PLoS One Date: 2013-12-20 Impact factor: 3.240

4 in total

5 in total

A scaling approach to record linkage.

1. Record linkage: statistical models for matching computer records.

2. Ignoring dependency between linking variables and its impact on the outcome of probabilistic record linkage studies.

3. The analysis of record-linked data using multiple imputation with data value priors.

4. Linkage, evaluation and analysis of national electronic healthcare data: application to providing enhanced blood-stream infection surveillance in paediatric intensive care.

1. Assessing data linkage quality in cohort studies.

2. Demystifying probabilistic linkage: Common myths and misconceptions.

3. On the Accuracy and Scalability of Probabilistic Data Linkage Over the Brazilian 114 Million Cohort.

4. Linkage of Hospital Records and Death Certificates by a Search Engine and Machine Learning.

5. A guide to evaluating linkage quality for the analysis of linked data.