Literature DB >> 32049329

A hybrid approach to record linkage using a combination of deterministic and probabilistic methodology.

Toan C Ong1, Lindsey M Duca2, Michael G Kahn1, Tessa L Crume2.   

Abstract

OBJECTIVE: The disjointed healthcare system and the nonexistence of a universal patient identifier across systems necessitates accurate record linkage (RL). We aim to describe the implementation and evaluation of a hybrid record linkage method in a statewide surveillance system for congenital heart disease.
MATERIALS AND METHODS: Clear-text personally identifiable information on individuals in the Colorado Congenital Heart Disease surveillance system was obtained from 5 electronic health record and medical claims data sources. Two deterministic methods and 1 probabilistic RL method using first name, last name, social security number, date of birth, and house number were initially implemented independently and then sequentially in a hybrid approach to assess RL performance.
RESULTS: 16 480 nonunique individuals with congenital heart disease were ascertained. Deterministic linkage methods, when performed independently, yielded 4505 linked pairs (consisting of 2 records linked together within or across data sources). Probabilistic RL, using 3 initial characters of last name and gender for blocking, yielded 6294 linked pairs when executed independently. Using a hybrid linkage routine resulted in 6451 linkages and an additional 18%-24% correct linked pairs as compared to the independent methods. A hybrid linkage routine resulted in higher recall and F-measure scores compared to probabilistic and deterministic methods performed independently. DISCUSSION: The hybrid approach resulted in increased linkage accuracy and identified pairs of linked record that would have otherwise been missed when using any independent linkage technique.
CONCLUSION: When performing RL within and across disparate data sources, the hybrid RL routine outperformed independent deterministic and probabilistic methods.
© The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  congenital heart disease; data harmonization; hybrid; patient matching; record linkage

Year:  2020        PMID: 32049329      PMCID: PMC7647290          DOI: 10.1093/jamia/ocz232

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  15 in total

1.  An empirical comparison of record linkage procedures.

Authors:  Shanti Gomatam; Randy Carter; Mario Ariet; Glenn Mitchell
Journal:  Stat Med       Date:  2002-05-30       Impact factor: 2.373

2.  Extending the Fellegi-Sunter probabilistic record linkage method for approximate field comparators.

Authors:  Scott L DuVall; Richard A Kerber; Alun Thomas
Journal:  J Biomed Inform       Date:  2009-08-13       Impact factor: 6.317

3.  Case study of linking dental and medical healthcare records.

Authors:  Mary Kay Theis; Robert J Reid; Monica Chaudhari; Katherine M Newton; Leslie Spangler; David C Grossman; Ronald E Inge
Journal:  Am J Manag Care       Date:  2010-02-01       Impact factor: 2.229

4.  Improving record linkage performance in the presence of missing linkage data.

Authors:  Toan C Ong; Michael V Mannino; Lisa M Schilling; Michael G Kahn
Journal:  J Biomed Inform       Date:  2014-02-10       Impact factor: 6.317

5.  Design and implementation of a privacy preserving electronic health record linkage tool in Chicago.

Authors:  Abel N Kho; John P Cashy; Kathryn L Jackson; Adam R Pah; Satyender Goel; Jörn Boehnke; John Eric Humphries; Scott Duke Kominers; Bala N Hota; Shannon A Sims; Bradley A Malin; Dustin D French; Theresa L Walunas; David O Meltzer; Erin O Kaleba; Roderick C Jones; William L Galanter
Journal:  J Am Med Inform Assoc       Date:  2015-06-23       Impact factor: 4.497

6.  Probabilistic linkage of large public health data files.

Authors:  M A Jaro
Journal:  Stat Med       Date:  1995 Mar 15-Apr 15       Impact factor: 2.373

7.  Using global unique identifiers to link autism collections.

Authors:  Stephen B Johnson; Glen Whitney; Matthew McAuliffe; Hailong Wang; Evan McCreedy; Leon Rozenblit; Clark C Evans
Journal:  J Am Med Inform Assoc       Date:  2010 Nov-Dec       Impact factor: 4.497

8.  Linking clinical registry data with administrative data using indirect identifiers: implementation and validation in the congenital heart surgery population.

Authors:  Sara K Pasquali; Jeffrey P Jacobs; Gregory J Shook; Sean M O'Brien; Matthew Hall; Marshall L Jacobs; Karl F Welke; J William Gaynor; Eric D Peterson; Samir S Shah; Jennifer S Li
Journal:  Am Heart J       Date:  2010-12       Impact factor: 4.749

9.  The development of a data-matching algorithm to define the 'case patient'.

Authors:  Shelley Cox; Rohan Martin; Piyali Somaia; Karen Smith
Journal:  Aust Health Rev       Date:  2013-02       Impact factor: 1.990

10.  Fine-grained record integration and linkage tool.

Authors:  Pawel Jurczyk; James J Lu; Li Xiong; Janet D Cragan; Adolfo Correa
Journal:  Birth Defects Res A Clin Mol Teratol       Date:  2008-11
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.