Literature DB >> 24935712

Evaluating latent class models with conditional dependence in record linkage.

Joanne Daggy1, Huiping Xu, Siu Hui, Shaun Grannis.   

Abstract

Record linkage methods commonly use a traditional latent class model to classify record pairs from different sources as true matches or non-matches. This approach was first formally described by Fellegi and Sunter and assumes that the agreement in fields is independent conditional on the latent class. Consequences of violating the conditional independence assumption include bias in parameter estimates from the model. We sought to further characterize the impact of conditional dependence on the overall misclassification rate, sensitivity, and positive predictive value in the record linkage problem when the conditional independence assumption is violated. Additionally, we evaluate various methods to account for the conditional dependence. These methods include loglinear models with appropriate interaction terms identified through the correlation residual plot as well as Gaussian random effects models. The proposed models are used to link newborn screening data obtained from a health information exchange. On the basis of simulations, loglinear models with interaction terms demonstrated the best misclassification rate, although this type of model cannot accommodate other data features such as continuous measures for agreement. Results indicate that Gaussian random effects models, which can handle additional data features, perform better than assuming conditional independence and in some situations perform as well as the loglinear model with interaction terms.
Copyright © 2014 John Wiley & Sons, Ltd.

Keywords:  latent class; loglinear model; random effects; record linkage

Mesh:

Year:  2014        PMID: 24935712     DOI: 10.1002/sim.6230

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  3 in total

1.  A simple two-step procedure using the Fellegi-Sunter model for frequency-based record linkage.

Authors:  Huiping Xu; Xiaochun Li; Shaun Grannis
Journal:  J Appl Stat       Date:  2021-05-04       Impact factor: 1.416

2.  Evaluation of real-world referential and probabilistic patient matching to advance patient identification strategy.

Authors:  Shaun J Grannis; Jennifer L Williams; Suranga Kasthuri; Molly Murray; Huiping Xu
Journal:  J Am Med Inform Assoc       Date:  2022-07-12       Impact factor: 7.942

3.  Embracing the Sparse, Noisy, and Interrelated Aspects of Patient Demographics for use in Clinical Medical Record Linkage.

Authors:  Stephen M Ash; King Ip-Lin
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2015-03-25
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.