Efficient treatment of outliers and class imbalance for diabetes prediction.

Nonso Nnamoko, Ioannis Korkontzelos.

Abstract

Learning from outliers and imbalanced data remains one of the major difficulties for machine learning classifiers. Among the numerous techniques dedicated to tackling this problem, data preprocessing solutions are known to be efficient and easy to implement. In this paper, we propose a selective data preprocessing approach that embeds knowledge of the outlier instances into an artificially generated subset to achieve an even distribution. The outliers were first identified and oversampled (irrespective of class). The Synthetic Minority Oversampling TEchnique (SMOTE) was then used to balance the training data by introducing artificial minority instances. The aim is to balance the training dataset while controlling the effect of outliers. The experiments prove that such selective oversampling empowers SMOTE, ultimately leading to improved classification performance.
Copyright © 2020 The Authors. Published by Elsevier B.V. All rights reserved.
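
The two-stage order described in the abstract (oversample outliers first, then apply SMOTE) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how outliers are detected or how they are oversampled, so the interquartile-range rule in `iqr_outlier_flags`, the single duplication of each outlier, and the function name `selective_oversample` are all assumptions made for illustration. The `smote` function is a basic version of the technique (interpolation between a minority instance and one of its nearest minority neighbours) and assumes at least two minority instances.

```python
import random
import statistics


def iqr_outlier_flags(X, k=1.5):
    """Flag rows with any feature outside [Q1 - k*IQR, Q3 + k*IQR].

    Assumption: the IQR rule is used here only as a stand-in detector.
    """
    flags = [False] * len(X)
    for j in range(len(X[0])):
        col = [row[j] for row in X]
        q1, _, q3 = statistics.quantiles(col, n=4)
        lo, hi = q1 - k * (q3 - q1), q3 + k * (q3 - q1)
        for i, v in enumerate(col):
            if v < lo or v > hi:
                flags[i] = True
    return flags


def smote(minority, n_new, k=3, rng=random):
    """Basic SMOTE: interpolate between a minority point and one of its
    k nearest minority neighbours (squared Euclidean distance)."""
    synthetic = []
    for _ in range(n_new):
        p = rng.choice(minority)
        neighbours = sorted(
            (q for q in minority if q is not p),
            key=lambda q: sum((a - b) ** 2 for a, b in zip(p, q)),
        )[:k]
        q = rng.choice(neighbours)
        gap = rng.random()  # position along the segment from p to q
        synthetic.append([a + gap * (b - a) for a, b in zip(p, q)])
    return synthetic


def selective_oversample(X, y, seed=0):
    """Hypothetical helper: oversample outliers, then balance with SMOTE."""
    rng = random.Random(seed)
    flags = iqr_outlier_flags(X)
    # Stage 1: oversample outliers irrespective of class.
    # Assumption: each outlier is simply duplicated once.
    X_aug = list(X) + [x for x, f in zip(X, flags) if f]
    y_aug = list(y) + [c for c, f in zip(y, flags) if f]
    # Stage 2: generate synthetic minority instances until classes are even.
    counts = {c: y_aug.count(c) for c in set(y_aug)}
    minority_label = min(counts, key=counts.get)
    deficit = max(counts.values()) - counts[minority_label]
    minority = [x for x, c in zip(X_aug, y_aug) if c == minority_label]
    new = smote(minority, deficit, rng=rng)
    return X_aug + new, y_aug + [minority_label] * len(new)
```

Only the training split should be passed to a function like this; the test data stays untouched so that the evaluation reflects the original class distribution.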

Keywords:  Data preprocessing; Imbalanced data; Machine learning; Outlier detection; Oversampling; SMOTE

Year:  2020        PMID: 32498997     DOI: 10.1016/j.artmed.2020.101815

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326

