Literature DB >> 19683070

Extending the Fellegi-Sunter probabilistic record linkage method for approximate field comparators.

Scott L DuVall1, Richard A Kerber, Alun Thomas.   

Abstract

Probabilistic record linkage is a method commonly used to determine whether demographic records refer to the same person. The Fellegi-Sunter method is a probabilistic approach that uses field weights based on log likelihood ratios to determine record similarity. This paper introduces an extension of the Fellegi-Sunter method that incorporates approximate field comparators in the calculation of field weights. The data warehouse of a large academic medical center was used as a case study. The approximate comparator extension was compared with the Fellegi-Sunter method in its ability to find duplicate records previously identified in the data warehouse using different demographic fields and matching cutoffs. The approximate comparator extension misclassified 25% fewer pairs and had a larger Welch's T statistic than the Fellegi-Sunter method for all field sets and matching cutoffs. The accuracy gain provided by the approximate comparator extension grew as less information was provided and as the matching cutoff increased. Given the ubiquity of linkage in both clinical and research settings, the incremental improvement of the extension has the potential to make a considerable impact.

Entities:  

Mesh:

Year:  2009        PMID: 19683070     DOI: 10.1016/j.jbi.2009.08.004

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  17 in total

1.  A benchmark comparison of deterministic and probabilistic methods for defining manual review datasets in duplicate records reconciliation.

Authors:  Erel Joffe; Michael J Byrne; Phillip Reeder; Jorge R Herskovic; Craig W Johnson; Allison B McCoy; Dean F Sittig; Elmer V Bernstam
Journal:  J Am Med Inform Assoc       Date:  2013-05-23       Impact factor: 4.497

2.  Using Probabilistic Record Linkage of Structured and Unstructured Data to Identify Duplicate Cases in Spontaneous Adverse Event Reporting Systems.

Authors:  Kory Kreimeyer; David Menschik; Scott Winiecki; Wendy Paul; Faith Barash; Emily Jane Woo; Meghna Alimchandani; Deepa Arya; Craig Zinderman; Richard Forshee; Taxiarchis Botsis
Journal:  Drug Saf       Date:  2017-07       Impact factor: 5.606

3.  Optimized dual threshold entity resolution for electronic health record databases--training set size and active learning.

Authors:  Erel Joffe; Michael J Byrne; Phillip Reeder; Jorge R Herskovic; Craig W Johnson; Allison B McCoy; Elmer V Bernstam
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16

4.  Private medical record linkage with approximate matching.

Authors:  Elizabeth Durham; Yuan Xue; Murat Kantarcioglu; Bradley Malin
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

5.  Colorectal Cancer in Inflammatory Bowel Diseases: A Population-Based Study in Utah.

Authors:  N Jewel Samadder; John F Valentine; Stephen Guthery; Harminder Singh; Charles N Bernstein; Yuan Wan; Jathine Wong; Kenneth Boucher; Lisa Pappas; Kerry Rowe; Mary Bronner; Cornelia M Ulrich; Randall W Burt; Karen Curtin; Ken R Smith
Journal:  Dig Dis Sci       Date:  2017-01-03       Impact factor: 3.199

6.  A practical approach to achieve private medical record linkage in light of public resources.

Authors:  Mehmet Kuzu; Murat Kantarcioglu; Elizabeth Ashley Durham; Csaba Toth; Bradley Malin
Journal:  J Am Med Inform Assoc       Date:  2012-07-30       Impact factor: 4.497

7.  A hybrid approach to record linkage using a combination of deterministic and probabilistic methodology.

Authors:  Toan C Ong; Lindsey M Duca; Michael G Kahn; Tessa L Crume
Journal:  J Am Med Inform Assoc       Date:  2020-04-01       Impact factor: 4.497

8.  Linking clinical research data to population databases.

Authors:  Linda S Edelman; Jia-Wen Guo; Alison Fraser; Susan L Beck
Journal:  Nurs Res       Date:  2013 Nov-Dec       Impact factor: 2.381

9.  Embracing the Sparse, Noisy, and Interrelated Aspects of Patient Demographics for use in Clinical Medical Record Linkage.

Authors:  Stephen M Ash; King Ip-Lin
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2015-03-25

10.  Linkage, evaluation and analysis of national electronic healthcare data: application to providing enhanced blood-stream infection surveillance in paediatric intensive care.

Authors:  Katie Harron; Harvey Goldstein; Angie Wade; Berit Muller-Pebody; Roger Parslow; Ruth Gilbert
Journal:  PLoS One       Date:  2013-12-20       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.