Literature DB >> 27925597

Tackling Missing Data in Community Health Studies Using Additive LS-SVM Classifier.

Guanjin Wang, Zhaohong Deng, Kup-Sze Choi.   

Abstract

Missing data is a common issue in community health and epidemiological studies. Direct removal of samples with missing data can lead to reduced sample size and information bias, which deteriorates the significance of the results. While data imputation methods are available to deal with missing data, they are limited in performance and could introduce noises into the dataset. Instead of data imputation, a novel method based on additive least square support vector machine (LS-SVM) is proposed in this paper for predictive modeling when the input features of the model contain missing data. The method also determines simultaneously the influence of the features with missing values on the classification accuracy using the fast leave-one-out cross-validation strategy. The performance of the method is evaluated by applying it to predict the quality of life (QOL) of elderly people using health data collected in the community. The dataset involves demographics, socioeconomic status, health history, and the outcomes of health assessments of 444 community-dwelling elderly people, with 5% to 60% of data missing in some of the input features. The QOL is measured using a standard questionnaire of the World Health Organization. Results show that the proposed method outperforms four conventional methods for handling missing data-case deletion, feature deletion, mean imputation, and K-nearest neighbor imputation, with the average QOL prediction accuracy reaching 0.7418. It is potentially a promising technique for tackling missing data in community health research and other applications.

Entities:  

Mesh:

Year:  2016        PMID: 27925597     DOI: 10.1109/JBHI.2016.2634587

Source DB:  PubMed          Journal:  IEEE J Biomed Health Inform        ISSN: 2168-2194            Impact factor:   5.772


  4 in total

1.  Development of a Healthcare Information System for Community Care of Older Adults and Evaluation of Its Acceptance and Usability.

Authors:  Kup-Sze Choi; Sze-Ho Chan; Cho-Lik Ho; Marek Matejak
Journal:  Digit Health       Date:  2022-06-20

2.  Linking clinotypes to phenotypes and genotypes from laboratory test results in comprehensive physical exams.

Authors:  Sisi Zeng; Ni Cao; Thanh Nguyen; Tongbin Zhang; Geoffrey Fox; Chuandi Pan; Jake Y Chen
Journal:  BMC Med Inform Decis Mak       Date:  2021-02-24       Impact factor: 3.298

3.  Embedded Data Imputation for Environmental Intelligent Sensing: A Case Study.

Authors:  Laura Erhan; Mario Di Mauro; Ashiq Anjum; Ovidiu Bagdasar; Wei Song; Antonio Liotta
Journal:  Sensors (Basel)       Date:  2021-11-23       Impact factor: 3.576

4.  A computer-aid multi-task light-weight network for macroscopic feces diagnosis.

Authors:  Ziyuan Yang; Lu Leng; Ming Li; Jun Chu
Journal:  Multimed Tools Appl       Date:  2022-02-28       Impact factor: 2.577

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.