Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Cleansing and Imputation of Body Mass Index Data and Its Impact on a Machine Learning Based Prediction Model.

Literature DB >> 29726427

Cleansing and Imputation of Body Mass Index Data and Its Impact on a Machine Learning Based Prediction Model.

Stefanie Jauk¹, Diether Kramer², Werner Leodolter².

Abstract

BACKGROUND: A challenge of using electronic health records for secondary analyses is data quality. Body mass index (BMI) is an important predictor for various diseases but often not documented properly.
OBJECTIVES: The aim of our study is to perform data cleansing on BMI values and to find the best method for an imputation of missing values in order to increase data quality. Further, we want to assess the effect of changes in data quality on the performance of a prediction model based on machine learning.
METHODS: After data cleansing on BMI data, we compared machine learning methods and statistical methods in their accuracy of imputed values using the root mean square error. In a second step, we used three variations of BMI data as a training set for a model predicting the occurrence of delirium.
RESULTS: Neural network and linear regression models performed best for imputation. There were no changes in model performance for different BMI input data.
CONCLUSION: Although data quality issues may lead to biases, it does not always affect performance of secondary analyses.

Entities: Disease

Keywords: Electronic health records; body mass index; data cleansing; data imputation; machine learning; predictive modelling

Mesh：

Year: 2018 PMID： 29726427

Source DB: PubMed Journal: Stud Health Technol Inform ISSN： 0926-9630

Keyword Cloud
Cited

2 in total

1. Automatic Classification of Sarcopenia Level in Older Adults: A Case Study at Tijuana General Hospital.

Authors: Cristián Castillo-Olea; Begonya García-Zapirain Soto; Christian Carballo Lozano; Clemente Zuñiga
Journal: Int J Environ Res Public Health Date: 2019-09-06 Impact factor: 3.390

2. Body Mass Index Variable Interpolation to Expand the Utility of Real-world Administrative Healthcare Claims Database Analyses.

Authors: Bingcao Wu; Wing Chow; Monish Sakthivel; Onkar Kakade; Kartikeya Gupta; Debra Israel; Yen-Wen Chen; Aarti Susan Kuruvilla
Journal: Adv Ther Date: 2021-01-11 Impact factor: 3.845

2 in total