Literature DB >> 28323112

Strategies for handling missing clinical data for automated surgical site infection detection from the electronic health record.

Zhen Hu1, Genevieve B Melton2, Elliot G Arsoniadis2, Yan Wang1, Mary R Kwaan3, Gyorgy J Simon4.   

Abstract

Proper handling of missing data is important for many secondary uses of electronic health record (EHR) data. Data imputation methods can be used to handle missing data, but their use for analyzing EHR data is limited and specific efficacy for postoperative complication detection is unclear. Several data imputation methods were used to develop data models for automated detection of three types (i.e., superficial, deep, and organ space) of surgical site infection (SSI) and overall SSI using American College of Surgeons National Surgical Quality Improvement Project (NSQIP) Registry 30-day SSI occurrence data as a reference standard. Overall, models with missing data imputation almost always outperformed reference models without imputation that included only cases with complete data for detection of SSI overall achieving very good average area under the curve values. Missing data imputation appears to be an effective means for improving postoperative SSI detection using EHR clinical data.
Copyright © 2017 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Electronic health records; Missing data; Surgical site infections

Mesh:

Year:  2017        PMID: 28323112      PMCID: PMC5474942          DOI: 10.1016/j.jbi.2017.03.009

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  28 in total

1.  Multiple imputation by chained equations: what is it and how does it work?

Authors:  Melissa J Azur; Elizabeth A Stuart; Constantine Frangakis; Philip J Leaf
Journal:  Int J Methods Psychiatr Res       Date:  2011-03       Impact factor: 4.035

2.  Improving risk-adjusted measures of surgical site infection for the national healthcare safety network.

Authors:  Yi Mu; Jonathan R Edwards; Teresa C Horan; Sandra I Berrios-Torres; Scott K Fridkin
Journal:  Infect Control Hosp Epidemiol       Date:  2011-09-01       Impact factor: 3.254

3.  Combining Fourier and lagged k-nearest neighbor imputation for biomedical time series data.

Authors:  Shah Atiqur Rahman; Yuxiao Huang; Jan Claassen; Nathaniel Heintzman; Samantha Kleinberg
Journal:  J Biomed Inform       Date:  2015-10-21       Impact factor: 6.317

4.  Compass: a hybrid method for clinical and biobank data mining.

Authors:  K Krysiak-Baltyn; T Nordahl Petersen; K Audouze; Niels Jørgensen; L Angquist; S Brunak
Journal:  J Biomed Inform       Date:  2013-10-26       Impact factor: 6.317

5.  Surgical site infections and cost in obese patients undergoing colorectal surgery.

Authors:  Elizabeth C Wick; Kenzo Hirose; Andrew D Shore; Jeanne M Clark; Susan L Gearhart; Jonathan Efron; Martin A Makary
Journal:  Arch Surg       Date:  2011-05-16

6.  The impact of surgical site infection on the development of incisional hernia and small bowel obstruction in colorectal surgery.

Authors:  Bryce W Murray; Daisha J Cipher; Thai Pham; Thomas Anthony
Journal:  Am J Surg       Date:  2011-09-14       Impact factor: 2.565

Review 7.  Missing data analysis using multiple imputation: getting to the heart of the matter.

Authors:  Yulei He
Journal:  Circ Cardiovasc Qual Outcomes       Date:  2010-01

8.  Missing data imputation using statistical and machine learning methods in a real breast cancer problem.

Authors:  José M Jerez; Ignacio Molina; Pedro J García-Laencina; Emilio Alba; Nuria Ribelles; Miguel Martín; Leonardo Franco
Journal:  Artif Intell Med       Date:  2010-07-16       Impact factor: 5.326

9.  Electronic health record-based detection of risk factors for Clostridium difficile infection relapse.

Authors:  Courtney Hebert; Hongyan Du; Lance R Peterson; Ari Robicsek
Journal:  Infect Control Hosp Epidemiol       Date:  2013-02-13       Impact factor: 3.254

10.  The prevention and handling of the missing data.

Authors:  Hyun Kang
Journal:  Korean J Anesthesiol       Date:  2013-05-24
View more
  15 in total

1.  Causal Phenotyping for Susceptibility to Cardiotoxicity from Antineoplastic Breast Cancer Medications.

Authors:  Deyu Sun; Gyorgy J Simon; Steven Skube; Anne H Blaes; Genevieve B Melton; Rui Zhang
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

2.  Secondary analysis of electronic health records in critical care medicine.

Authors:  Sven Van Poucke; Alberto Alexander Gayle; Milan Vukicevic
Journal:  Ann Transl Med       Date:  2018-02

3.  Development and Validation of Machine Learning Models for Prediction of 1-Year Mortality Utilizing Electronic Medical Record Data Available at the End of Hospitalization in Multicondition Patients: a Proof-of-Concept Study.

Authors:  Nishant Sahni; Gyorgy Simon; Rashi Arora
Journal:  J Gen Intern Med       Date:  2018-01-30       Impact factor: 5.128

4.  Predicting Missing Values in Medical Data via XGBoost Regression.

Authors:  Xinmeng Zhang; Chao Yan; Cheng Gao; Bradley A Malin; You Chen
Journal:  J Healthc Inform Res       Date:  2020-08-03

5.  A Nuisance-Free Inference Procedure Accounting for the Unknown Missingness with Application to Electronic Health Records.

Authors:  Jiwei Zhao; Chi Chen
Journal:  Entropy (Basel)       Date:  2020-10-14       Impact factor: 2.524

6.  ImputEHR: A Visualization Tool of Imputation for the Prediction of Biomedical Data.

Authors:  Yi-Hui Zhou; Ehsan Saghapour
Journal:  Front Genet       Date:  2021-07-02       Impact factor: 4.599

7.  DPARD: rationale, design and initial results from the Dutch national diabetes registry.

Authors:  Jessica C G Bak; Dick Mul; Erik H Serné; Harold W de Valk; Theo C J Sas; Petronella H Geelhoed-Duijvestijn; Mark H H Kramer; Max Nieuwdorp; Carianne L Verheugt
Journal:  BMC Endocr Disord       Date:  2021-06-16       Impact factor: 2.763

8.  Similarity-based health risk prediction using Domain Fusion and electronic health records data.

Authors:  Jia Guo; Chi Yuan; Ning Shang; Tian Zheng; Natalie A Bello; Krzysztof Kiryluk; Chunhua Weng; Shuang Wang
Journal:  J Biomed Inform       Date:  2021-02-19       Impact factor: 8.000

9.  Maximizing the reusability of gene expression data by predicting missing metadata.

Authors:  Pei-Yau Lung; Dongrui Zhong; Xiaodong Pang; Yan Li; Jinfeng Zhang
Journal:  PLoS Comput Biol       Date:  2020-11-06       Impact factor: 4.475

10.  Combining structured and unstructured data for predictive models: a deep learning approach.

Authors:  Dongdong Zhang; Changchang Yin; Jucheng Zeng; Xiaohui Yuan; Ping Zhang
Journal:  BMC Med Inform Decis Mak       Date:  2020-10-29       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.