Literature DB >> 29027512

Challenges associated with missing data in electronic health records: A case study of a risk prediction model for diabetes using data from Slovenian primary care.

Gregor Stiglic, Primoz Kocbek, Nino Fijacko1, Aziz Sheikh2,3, Majda Pajnkihar1.   

Abstract

The increasing availability of data stored in electronic health records brings substantial opportunities for advancing patient care and population health. This is, however, fundamentally dependant on the completeness and quality of data in these electronic health records. We sought to use electronic health record data to populate a risk prediction model for identifying patients with undiagnosed type 2 diabetes mellitus. We, however, found substantial (up to 90%) amounts of missing data in some healthcare centres. Attempts at imputing for these missing data or using reduced dataset by removing incomplete records resulted in a major deterioration in the performance of the prediction model. This case study illustrates the substantial wasted opportunities resulting from incomplete records by simulation of missing and incomplete records in predictive modelling process. Government and professional bodies need to prioritise efforts to address these data shortcomings in order to ensure that electronic health record data are maximally exploited for patient and population benefit.

Entities:  

Keywords:  databases and data mining; electronic health records; missing data; primary care; quality control; type 2 diabetes

Year:  2017        PMID: 29027512     DOI: 10.1177/1460458217733288

Source DB:  PubMed          Journal:  Health Informatics J        ISSN: 1460-4582            Impact factor:   2.681


  7 in total

1.  Recurrent Neural Network based Time-Series Modeling for Long-term Prognosis Following Acute Traumatic Brain Injury.

Authors:  Amin Nayebi; Sindhu Tipirneni; Brandon Foreman; Jonathan Ratcliff; Chandan K Reddy; Vignesh Subbian
Journal:  AMIA Annu Symp Proc       Date:  2022-02-21

2.  Missing data in primary care research: importance, implications and approaches.

Authors:  Miguel Marino; Jennifer Lucas; Emile Latour; John D Heintzman
Journal:  Fam Pract       Date:  2021-03-29       Impact factor: 2.267

3.  The National ReferAll Database: An Open Dataset of Exercise Referral Schemes Across the UK.

Authors:  James Steele; Matthew Wade; Robert J Copeland; Stuart Stokes; Rachel Stokes; Steven Mann
Journal:  Int J Environ Res Public Health       Date:  2021-04-30       Impact factor: 3.390

4.  Unanticipated Respiratory Compromise and Unplanned Intubations on General Medical and Surgical Floors.

Authors:  Armando D Bedoya; Nrupen A Bhavsar; Bhargav Adagarla; Courtney B Page; Benjamin A Goldstein; Neil R MacIntyre
Journal:  Respir Care       Date:  2020-03-10       Impact factor: 2.339

5.  Identifying early-measured variables associated with APACHE IVa providing incorrect in-hospital mortality predictions for critical care patients.

Authors:  Shuo Feng; Joel A Dubin
Journal:  Sci Rep       Date:  2021-11-12       Impact factor: 4.379

6.  Building interpretable models for polypharmacy prediction in older chronic patients based on drug prescription records.

Authors:  Simon Kocbek; Primoz Kocbek; Andraz Stozer; Tina Zupanic; Tudor Groza; Gregor Stiglic
Journal:  PeerJ       Date:  2018-10-12       Impact factor: 2.984

7.  Data Mining in Healthcare: Applying Strategic Intelligence Techniques to Depict 25 Years of Research Development.

Authors:  Maikel Luis Kolling; Leonardo B Furstenau; Michele Kremer Sott; Bruna Rabaioli; Pedro Henrique Ulmi; Nicola Luigi Bragazzi; Leonel Pablo Carvalho Tedesco
Journal:  Int J Environ Res Public Health       Date:  2021-03-17       Impact factor: 3.390

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.