Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Efficient treatment of outliers and class imbalance for diabetes prediction.

Literature DB >> 32498997

Efficient treatment of outliers and class imbalance for diabetes prediction.

Abstract

Learning from outliers and imbalanced data remains one of the major difficulties for machine learning classifiers. Among the numerous techniques dedicated to tackle this problem, data preprocessing solutions are known to be efficient and easy to implement. In this paper, we propose a selective data preprocessing approach that embeds knowledge of the outlier instances into artificially generated subset to achieve an even distribution. The Synthetic Minority Oversampling TEchnique (SMOTE) was used to balance the training data by introducing artificial minority instances. However, this was not before the outliers were identified and oversampled (irrespective of class). The aim is to balance the training dataset while controlling the effect of outliers. The experiments prove that such selective oversampling empowers SMOTE, ultimately leading to improved classification performance.

Entities: Disease

Keywords: Data preprocessing; Imbalanced data; Machine learning; Outlier detection; Oversampling; SMOTE

Year: 2020 PMID： 32498997 DOI： 10.1016/j.artmed.2020.101815

Source DB: PubMed Journal: Artif Intell Med ISSN： 0933-3657 Impact factor: 5.326

Keyword Cloud
Cited

3 in total

Efficient treatment of outliers and class imbalance for diabetes prediction.

1. Predicting CoVID-19 community mortality risk using machine learning and development of an online prognostic tool.

2. Predictive Analysis of Diabetes-Risk with Class Imbalance.

Review 3. Instrumented Analysis of the Sit-to-Stand Movement for Geriatric Screening: A Systematic Review.