Literature DB >> 32674629

Improved Overlap-based Undersampling for Imbalanced Dataset Classification with Application to Epilepsy and Parkinson's Disease.

Pattaramon Vuttipittayamongkol1, Eyad Elyan1.   

Abstract

Classification of imbalanced datasets has attracted substantial research interest over the past decades. Imbalanced datasets are common in several domains such as health, finance, security and others. A wide range of solutions to handle imbalanced datasets focus mainly on the class distribution problem and aim at providing more balanced datasets by means of resampling. However, existing literature shows that class overlap has a higher negative impact on the learning process than class distribution. In this paper, we propose overlap-based undersampling methods for maximizing the visibility of the minority class instances in the overlapping region. This is achieved by the use of soft clustering and the elimination threshold that is adaptable to the overlap degree to identify and eliminate negative instances in the overlapping region. For more accurate clustering and detection of overlapped negative instances, the presence of the minority class at the borderline areas is emphasized by means of oversampling. Extensive experiments using simulated and real-world datasets covering a wide range of imbalance and overlap scenarios including extreme cases were carried out. Results show significant improvement in sensitivity and competitive performance with well-established and state-of-the-art methods.

Entities:  

Keywords:  Class overlap; Parkinson’s disease; adaptive threshold; classification; epilepsy; fuzzy C-means; imbalanced data; undersampling

Mesh:

Year:  2020        PMID: 32674629     DOI: 10.1142/S0129065720500434

Source DB:  PubMed          Journal:  Int J Neural Syst        ISSN: 0129-0657            Impact factor:   5.866


  1 in total

1.  A novel early diagnostic framework for chronic diseases with class imbalance.

Authors:  Xiaohan Yuan; Shuyu Chen; Chuan Sun; Lu Yuwen
Journal:  Sci Rep       Date:  2022-05-21       Impact factor: 4.996

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.