Literature DB >> 31158554

Statistical supervised meta-ensemble algorithm for medical record linkage.

Kha Vo1, Jitendra Jonnagaddala2, Siaw-Teng Liaw3.   

Abstract

Identifying unique patients across multiple care facilities or services is a major challenge in providing continuous care and undertaking health research. Identifying and linking patients without compromising privacy and security is an emerging issue in the big data era. The large quantity and complexity of the patient data emphasize the need for effective linkage methods that are both scalable and accurate. In this study, we aim to develop and evaluate an ensemble classification method using the three most typically used supervised learning methods, namely support vector machines, logistic regression and standard feed-forward neural networks, to link records that belong to the same patient across multiple service locations. Our ensemble method is the combination of bagging and stacking. Each base learner's critical hyperparameters were selected through grid search technique. Two synthetic datasets were used in this study namely FEBRL and ePBRN. ePBRN linkage dataset was based on linkage errors noticed in the Australian primary care setting. The overall linkage performance was determined by assessing the blocking performance and classification performance. Our ensemble method outperformed the base learners in all evaluation metrics on one dataset. More specifically, the precision, which is average of individual precision scores in case of base learners increased from 90.70% to 94.85% in FEBRL, and from 62.17% to 99.28% in ePBRN. Similarly, the F-score increased from 94.92% to 98.18% in FEBRL, and from 72.99% to 91.72% in ePBRN. Our experiments suggest that we can significantly improve the linkage performance of individual algorithms by employing ensemble strategies.
Copyright © 2019 Elsevier Inc. All rights reserved.

Entities:  

Year:  2019        PMID: 31158554     DOI: 10.1016/j.jbi.2019.103220

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  5 in total

1.  Virtual Learning Environment of the Brazilian Health System (AVASUS): Efficiency of Results, Impacts, and Contributions.

Authors:  Ricardo A M Valentim; Carlos A P de Oliveira; Eloiza S G Oliveira; Eduardo L Ribeiro; Soneide M da Costa; Ione R D Morais; Felipe R Dos S Fernandes; Alexandre R Caitano; Cristine M G Gusmão; Aliete Cunha-Oliveira; Maria C F D Rêgo; Karilany D Coutinho; Daniele M S Barros; Ricardo B Ceccim
Journal:  Front Med (Lausanne)       Date:  2022-06-02

2.  Improving the Accuracy of Ensemble Machine Learning Classification Models Using a Novel Bit-Fusion Algorithm for Healthcare AI Systems.

Authors:  Sashikala Mishra; Kailash Shaw; Debahuti Mishra; Shruti Patil; Ketan Kotecha; Satish Kumar; Simi Bajaj
Journal:  Front Public Health       Date:  2022-05-04

3.  The OpenDeID corpus for patient de-identification.

Authors:  Jitendra Jonnagaddala; Aipeng Chen; Sean Batongbacal; Chandini Nekkantti
Journal:  Sci Rep       Date:  2021-10-07       Impact factor: 4.379

4.  Moving with the Times: The Health Science Alliance (HSA) Biobank, Pathway to Sustainability.

Authors:  Carmel M Quinn; Mamta Porwal; Nicola S Meagher; Anusha Hettiaratchi; Carl Power; Jitendra Jonnaggadala; Sue McCullough; Stephanie Macmillan; Katrina Tang; Winston Liauw; David Goldstein; Nikolajs Zeps; Philip J Crowe
Journal:  Biomark Insights       Date:  2021-03-27

5.  Statin Prescription Patterns and Associations with Subclinical Inflammation.

Authors:  Preetham Kadappu; Jitendra Jonnagaddala; Siaw-Teng Liaw; Blake J Cochran; Kerry-Anne Rye; Kwok Leung Ong
Journal:  Medicina (Kaunas)       Date:  2022-08-14       Impact factor: 2.948

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.