Literature DB >> 25474818

Diversified Sensitivity-Based Undersampling for Imbalance Classification Problems.

Wing W Y Ng, Junjie Hu, Daniel S Yeung, Shaohua Yin, Fabio Roli.   

Abstract

Undersampling is a widely adopted method to deal with imbalance pattern classification problems. Current methods mainly depend on either random resampling on the majority class or resampling at the decision boundary. Random-based undersampling fails to take into consideration informative samples in the data while resampling at the decision boundary is sensitive to class overlapping. Both techniques ignore the distribution information of the training dataset. In this paper, we propose a diversified sensitivity-based undersampling method. Samples of the majority class are clustered to capture the distribution information and enhance the diversity of the resampling. A stochastic sensitivity measure is applied to select samples from both clusters of the majority class and the minority class. By iteratively clustering and sampling, a balanced set of samples yielding high classifier sensitivity is selected. The proposed method yields a good generalization capability for 14 UCI datasets.

Year:  2014        PMID: 25474818     DOI: 10.1109/TCYB.2014.2372060

Source DB:  PubMed          Journal:  IEEE Trans Cybern        ISSN: 2168-2267            Impact factor:   11.448


  5 in total

1.  Discriminative Ridge Machine: A Classifier for High-Dimensional Data or Imbalanced Data.

Authors:  Chong Peng; Qiang Cheng
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2021-06-02       Impact factor: 14.255

2.  A Metaheuristic Optimization Approach for Parameter Estimation in Arrhythmia Classification from Unbalanced Data.

Authors:  Juan Carlos Carrillo-Alarcón; Luis Alberto Morales-Rosales; Héctor Rodríguez-Rángel; Mariana Lobato-Báez; Antonio Muñoz; Ignacio Algredo-Badillo
Journal:  Sensors (Basel)       Date:  2020-06-02       Impact factor: 3.576

3.  Speech Emotion Recognition Based on Selective Interpolation Synthetic Minority Over-Sampling Technique in Small Sample Environment.

Authors:  Zhen-Tao Liu; Bao-Han Wu; Dan-Yun Li; Peng Xiao; Jun-Wei Mao
Journal:  Sensors (Basel)       Date:  2020-04-17       Impact factor: 3.576

4.  Predicting reference soil groups using legacy data: A data pruning and Random Forest approach for tropical environment (Dano catchment, Burkina Faso).

Authors:  Kpade O L Hounkpatin; Karsten Schmidt; Felix Stumpf; Gerald Forkuor; Thorsten Behrens; Thomas Scholten; Wulf Amelung; Gerhard Welp
Journal:  Sci Rep       Date:  2018-07-02       Impact factor: 4.379

5.  Radial Basis Function Neural Network with Localized Stochastic-Sensitive Autoencoder for Home-Based Activity Recognition.

Authors:  Wing W Y Ng; Shichao Xu; Ting Wang; Shuai Zhang; Chris Nugent
Journal:  Sensors (Basel)       Date:  2020-03-08       Impact factor: 3.576

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.