Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Missing data imputation on biomedical data using deeply learned clustering and L2 regularized regression based on symmetric uncertainty.

Literature DB >> 34998512

Missing data imputation on biomedical data using deeply learned clustering and L2 regularized regression based on symmetric uncertainty.

Abstract

Big data era in healthcare led to the generation of high dimensional datasets like genomic datasets, electronic health records etc. One among the critical issues to be addressed in such datasets is handling incomplete data that may yield misleading results if not handled properly. Imputation is considered to be an effective way when the missing data rate is high. While imputation accuracy and classification accuracy are the two important metrics generally considered by most of the imputation techniques, high dimensional datasets such as genomic datasets motivated the need for imputation techniques that are also computationally efficient and preserves the structure of the dataset. This paper proposes a novel approach to missing data imputation in biomedical datasets using an ensemble of deeply learned clustering and L2 regularized regression based on symmetric uncertainty. The experiments are conducted with different proportion of missing data on both genomic and non-genomic biomedical datasets for different types of missingness pattern. Our proposed approach is compared with seven proven baseline imputation methods and two recently proposed imputation approaches. The results show that the proposed approach outperforms the other approaches considered in our experimentation in terms of imputation accuracy and computational efficiency despite preserving the structure of the dataset. Thus, the overall classification accuracy of the biomedical classification tasks is also improved when our proposed missing data imputation technique is used.

Entities: Chemical

Keywords: Biomedical datasets; Deeply learned clustering; L2 regularization; Missing data imputation

Mesh：

Year: 2021 PMID： 34998512 DOI： 10.1016/j.artmed.2021.102214

Source DB: PubMed Journal: Artif Intell Med ISSN： 0933-3657 Impact factor: 5.326

Keyword Cloud
Cited

2 in total

1. A Sequential Machine Learning-cum-Attention Mechanism for Effective Segmentation of Brain Tumor.

Authors: Tahir Mohammad Ali; Ali Nawaz; Attique Ur Rehman; Rana Zeeshan Ahmad; Abdul Rehman Javed; Thippa Reddy Gadekallu; Chin-Ling Chen; Chih-Ming Wu
Journal: Front Oncol Date: 2022-06-01 Impact factor: 5.738

2. A survey on COVID-19 impact in the healthcare domain: worldwide market implementation, applications, security and privacy issues, challenges and future prospects.

Authors: Tanzeela Shakeel; Shaista Habib; Wadii Boulila; Anis Koubaa; Abdul Rehman Javed; Muhammad Rizwan; Thippa Reddy Gadekallu; Mahmood Sufiyan
Journal: Complex Intell Systems Date: 2022-05-31

2 in total