
Accuracy of Probabilistic Linkage Using the Enhanced Matching System for Public Health and Epidemiological Studies.

Robert W Aldridge, Kunju Shaji, Andrew C Hayward, Ibrahim Abubakar.

Abstract

BACKGROUND: The Enhanced Matching System (EMS) is a probabilistic record linkage program developed by the tuberculosis section at Public Health England to match data for individuals across two datasets. This paper outlines how EMS works and investigates its accuracy for linkage across public health datasets.
METHODS: EMS is a configurable Microsoft SQL Server database program. To examine the accuracy of EMS, two public health databases were matched using National Health Service (NHS) numbers as a gold standard unique identifier. Probabilistic linkage was then performed on the same two datasets without inclusion of NHS number. Sensitivity analyses were carried out to examine the effect of varying matching process parameters.
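The abstract does not describe how EMS scores candidate pairs, so the following is only a generic sketch of probabilistic linkage in the Fellegi-Sunter style, not EMS code. The field names, the m- and u-probabilities, and the threshold are all hypothetical values chosen for illustration.

```python
import math

# Hypothetical per-field probabilities (illustrative only, not from EMS):
#   m = P(field agrees | the two records belong to the same person)
#   u = P(field agrees | the two records belong to different people)
FIELD_PROBS = {
    "surname": (0.95, 0.01),
    "date_of_birth": (0.97, 0.003),
    "postcode": (0.80, 0.001),
}


def match_weight(rec_a, rec_b, field_probs=FIELD_PROBS):
    """Sum log2 agreement or disagreement weights across the compared fields.

    Simplification: a missing value is treated as a disagreement.
    """
    total = 0.0
    for field, (m, u) in field_probs.items():
        value_a = rec_a.get(field)
        value_b = rec_b.get(field)
        if value_a is not None and value_a == value_b:
            total += math.log2(m / u)              # agreement weight (positive)
        else:
            total += math.log2((1 - m) / (1 - u))  # disagreement weight (negative)
    return total


def classify(weight, threshold):
    """Declare a link when the total weight reaches the chosen threshold."""
    return "link" if weight >= threshold else "non-link"


# Toy records with no identifier in common other than the compared fields.
a = {"surname": "SMITH", "date_of_birth": "1980-01-02", "postcode": "AB1 2CD"}
b = {"surname": "SMITH", "date_of_birth": "1980-01-02", "postcode": "ZZ9 9ZZ"}
c = {"surname": "JONES", "date_of_birth": "1975-06-30", "postcode": "ZZ9 9ZZ"}

print(classify(match_weight(a, b), threshold=5.0))
print(classify(match_weight(a, c), threshold=5.0))
```

Varying the threshold or dropping a field such as the postcode is one way to mimic, on toy data, the kind of sensitivity analysis the methods describe.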
RESULTS: Exact matching using NHS number between two datasets (containing 5931 and 1759 records) identified 1071 matched pairs. EMS probabilistic linkage identified 1068 record pairs. The sensitivity of probabilistic linkage was calculated as 99.5% (95% CI: 98.9, 99.8), specificity 100.0% (95% CI: 99.9, 100.0), positive predictive value 99.8% (95% CI: 99.3, 100.0), and negative predictive value 99.9% (95% CI: 99.8, 100.0). Probabilistic matching was most accurate when including address variables and using the automatically generated threshold for determining links with manual review.
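As a rough illustration of how such accuracy figures can be derived, the sketch below compares a set of gold-standard pairs with a set of probabilistically linked pairs. The function names and toy pairs are hypothetical. Specificity and negative predictive value are left out because they need a count of true non-matches, and the abstract does not say how that count was defined.

```python
def confusion_counts(gold_pairs, linked_pairs):
    """Count true positives, false positives and false negatives.

    Each pair is a (record_id_in_dataset_A, record_id_in_dataset_B) tuple.
    """
    gold = set(gold_pairs)
    linked = set(linked_pairs)
    tp = len(gold & linked)   # links confirmed by the gold standard
    fp = len(linked - gold)   # links the gold standard does not support
    fn = len(gold - linked)   # gold-standard pairs that were missed
    return tp, fp, fn


def sensitivity_and_ppv(tp, fp, fn):
    """Return (sensitivity, positive predictive value) from the counts."""
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    return sensitivity, ppv


# Toy example (hypothetical identifiers).
gold = {(1, "a"), (2, "b"), (3, "c")}
linked = {(1, "a"), (2, "b"), (4, "d")}
tp, fp, fn = confusion_counts(gold, linked)
print(sensitivity_and_ppv(tp, fp, fn))

# Assumed counts, chosen only to be consistent with the rounded figures
# in the abstract (1071 gold pairs, 1068 declared links). The abstract
# does not report this breakdown.
sens, ppv = sensitivity_and_ppv(tp=1066, fp=2, fn=5)
print(round(sens * 100, 1), round(ppv * 100, 1))
```

The second call uses an assumed split of 1066 true links, 2 false links and 5 missed pairs; other splits could round to the same percentages.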
CONCLUSION: With the establishment of national electronic datasets across health and social care, EMS enables previously unanswerable research questions to be tackled with confidence in the accuracy of the linkage process. When a small sample is matched into a very large database (such as national records of hospital attendance), the positive predictive value or sensitivity may be lower than in this analysis, depending on the prevalence of matches between the databases. Despite this possible limitation, probabilistic linkage has great potential to be used where exact matching using a common identifier is not possible, including in low-income settings, and for vulnerable groups such as homeless populations, where the absence of unique identifiers and lower data quality have historically hindered the ability to identify individuals across datasets.
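The dependence of positive predictive value on match prevalence can be illustrated with Bayes' rule. The sketch below covers only the positive predictive value part of that caveat. The sensitivity and specificity used are hypothetical inputs, not the study's estimates; a specificity slightly below 100% is assumed because a value of exactly 100% would keep the positive predictive value at 100% at any prevalence.

```python
def ppv_at_prevalence(sensitivity, specificity, prevalence):
    """Positive predictive value from Bayes' rule.

    prevalence is the proportion of compared candidates that are true matches.
    """
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    return true_pos / (true_pos + false_pos)


# Hypothetical accuracy values held fixed while prevalence falls.
SENSITIVITY = 0.995
SPECIFICITY = 0.999

for prevalence in (0.5, 0.1, 0.01, 0.001):
    ppv = ppv_at_prevalence(SENSITIVITY, SPECIFICITY, prevalence)
    print(f"prevalence={prevalence:<6} PPV={ppv:.3f}")
```

With these assumed inputs the positive predictive value falls steadily as true matches become rarer, which is the pattern the conclusion warns about for a small sample matched into a very large database.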

Year:  2015        PMID: 26302242      PMCID: PMC4547731          DOI: 10.1371/journal.pone.0136179

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


