Literature DB >> 22807060

Methods for analyzing data from probabilistic linkage strategies based on partially identifying variables.

M H P Hof1, A H Zwinderman.   

Abstract

In record linkage studies, unique identifiers are often not available, and therefore, the linkage procedure depends on combinations of partially identifying variables with low discriminating power. As a consequence, wrongly linked covariate and outcome pairs will be created and bias further analysis of the linked data. In this article, we investigated two estimators that correct for linkage error in regression analysis. We extended the estimators developed by Lahiri and Larsen and also suggested a weighted least squares approach to deal with linkage error. We considered both linear and logistic regression problems and evaluated the performance of both methods with simulations. Our results show that all wrong covariate and outcome pairs need to be removed from the analysis in order to calculate unbiased regression coefficients in both approaches. This removal requires strong assumptions on the structure of the data. In addition, the bias significantly increases when the assumptions do not hold and wrongly linked records influence the coefficient estimation. Our simulations showed that both methods had similar performance in linear regression problems. With logistic regression problems, the weighted least squares method showed less bias. Because the specific structure of the data in record linkage problems often leads to different assumptions, it is necessary that the analyst has prior knowledge on the nature of the data. These assumptions are more easily introduced in the weighted least squares approach than in the Lahiri and Larsen estimator.
Copyright © 2012 John Wiley & Sons, Ltd.

Mesh:

Year:  2012        PMID: 22807060     DOI: 10.1002/sim.5498

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  6 in total

1.  Error adjustments for file linking methods using encrypted unique client identifier (eUCI) with application to recently released prisoners who are HIV+.

Authors:  R Gutman; C J Sammartino; T C Green; B T Montague
Journal:  Stat Med       Date:  2015-07-21       Impact factor: 2.373

2.  A MULTIPLE IMPUTATION PROCEDURE FOR RECORD LINKAGE AND CAUSAL INFERENCE TO ESTIMATE THE EFFECTS OF HOME-DELIVERED MEALS.

Authors:  Mingyang Shan; Kali S Thomas; Roee Gutman
Journal:  Ann Appl Stat       Date:  2021-03-18       Impact factor: 1.959

3.  Effect of parental and ART treatment characteristics on perinatal outcomes.

Authors:  M Pontesilli; M H Hof; A C J Ravelli; A J van Altena; A T Soufan; B W Mol; E H Kostelijk; E Slappendel; D Consten; A E P Cantineau; L A J van der Westerlaken; W van Inzen; J C M Dumoulin; L Ramos; E B Baart; F J M Broekmans; P M Rijnders; M H J M Curfs; S Mastenbroek; S Repping; T J Roseboom; R C Painter
Journal:  Hum Reprod       Date:  2021-05-17       Impact factor: 6.918

4.  ATLAS: an automated association test using probabilistically linked health records with application to genetic studies.

Authors:  Harrison G Zhang; Boris P Hejblum; Griffin M Weber; Nathan P Palmer; Susanne E Churchill; Peter Szolovits; Shawn N Murphy; Katherine P Liao; Isaac S Kohane; Tianxi Cai
Journal:  J Am Med Inform Assoc       Date:  2021-11-25       Impact factor: 7.942

5.  Evaluating bias due to data linkage error in electronic healthcare records.

Authors:  Katie Harron; Angie Wade; Ruth Gilbert; Berit Muller-Pebody; Harvey Goldstein
Journal:  BMC Med Res Methodol       Date:  2014-03-05       Impact factor: 4.615

6.  The high risk for type 2 diabetes among ethnic minority populations is not explained by low-grade inflammation.

Authors:  Mirthe Muilwijk; Max Nieuwdorp; Marieke B Snijder; Michel H P Hof; Karien Stronks; Irene G M van Valkengoed
Journal:  Sci Rep       Date:  2019-12-27       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.