Literature DB >> 22061295

A transparent and transportable methodology for evaluating Data Linkage software.

Anna Ferrante1, James Boyd.   

Abstract

There has been substantial growth in Data Linkage (DL) activities in recent years. This reflects growth in both the demand for, and the supply of, linked or linkable data. Increased utilisation of DL "services" has brought with it increased need for impartial information about the suitability and performance capabilities of DL software programs and packages. Although evaluations of DL software exist; most have been restricted to the comparison of two or three packages. Evaluations of a large number of packages are rare because of the time and resource burden placed on the evaluators and the need for a suitable "gold standard" evaluation dataset. In this paper we present an evaluation methodology that overcomes a number of these difficulties. Our approach involves the generation and use of representative synthetic data; the execution of a series of linkages using a pre-defined linkage strategy; and the use of standard linkage quality metrics to assess performance. The methodology is both transparent and transportable, producing genuinely comparable results. The methodology was used by the Centre for Data Linkage (CDL) at Curtin University in an evaluation of ten DL software packages. It is also being used to evaluate larger linkage systems (not just packages). The methodology provides a unique opportunity to benchmark the quality of linkages in different operational environments.
Copyright © 2011 Elsevier Inc. All rights reserved.

Mesh:

Year:  2011        PMID: 22061295     DOI: 10.1016/j.jbi.2011.10.006

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  13 in total

1.  Data linkage infrastructure for cross-jurisdictional health-related research in Australia.

Authors:  James H Boyd; Anna M Ferrante; Christine M O'Keefe; Alfred J Bass; Sean M Randall; James B Semmens
Journal:  BMC Health Serv Res       Date:  2012-12-29       Impact factor: 2.655

2.  Estimating parameters for probabilistic linkage of privacy-preserved datasets.

Authors:  Adrian P Brown; Sean M Randall; Anna M Ferrante; James B Semmens; James H Boyd
Journal:  BMC Med Res Methodol       Date:  2017-07-10       Impact factor: 4.615

3.  Sociodemographic differences in linkage error: an examination of four large-scale datasets.

Authors:  Sean Randall; Adrian Brown; James Boyd; Rainer Schnell; Christian Borgs; Anna Ferrante
Journal:  BMC Health Serv Res       Date:  2018-09-03       Impact factor: 2.655

4.  A profile of the Centre for Health Record Linkage.

Authors:  K Irvine; R Hall; L Taylor
Journal:  Int J Popul Data Sci       Date:  2019-11-29

5.  The effect of data cleaning on record linkage quality.

Authors:  Sean M Randall; Anna M Ferrante; James H Boyd; James B Semmens
Journal:  BMC Med Inform Decis Mak       Date:  2013-06-05       Impact factor: 2.796

6.  Accuracy and completeness of patient pathways--the benefits of national data linkage in Australia.

Authors:  James H Boyd; Sean M Randall; Anna M Ferrante; Jacqueline K Bauer; Kevin McInneny; Adrian P Brown; Katrina Spilsbury; Margo Gillies; James B Semmens
Journal:  BMC Health Serv Res       Date:  2015-08-08       Impact factor: 2.655

7.  A guide to evaluating linkage quality for the analysis of linked data.

Authors:  Katie L Harron; James C Doidge; Hannah E Knight; Ruth E Gilbert; Harvey Goldstein; David A Cromwell; Jan H van der Meulen
Journal:  Int J Epidemiol       Date:  2017-10-01       Impact factor: 7.196

8.  Challenges in administrative data linkage for research.

Authors:  Katie Harron; Chris Dibben; James Boyd; Anders Hjern; Mahmoud Azimaee; Mauricio L Barreto; Harvey Goldstein
Journal:  Big Data Soc       Date:  2017-12-05

9.  Evaluation of approximate comparison methods on Bloom filters for probabilistic linkage.

Authors:  A P Brown; S M Randall; J H Boyd; A M Ferrante
Journal:  Int J Popul Data Sci       Date:  2019-05-23

10.  Population Data Centre Profiles: Centre for Data Linkage.

Authors:  J H Boyd; S M Randall; A P Brown; M Maller; D Botes; M Gillies; A Ferrante
Journal:  Int J Popul Data Sci       Date:  2020-03-11
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.