
Developing new fitness functions in genetic programming for classification with unbalanced data.

Urvesh Bhowan, Mark Johnston, Mengjie Zhang.

Abstract

Machine learning algorithms such as genetic programming (GP) can evolve biased classifiers when data sets are unbalanced. Data sets are unbalanced when at least one class is represented by only a small number of training examples (called the minority class) while other classes make up the majority. In this scenario, classifiers can have good accuracy on the majority class but very poor accuracy on the minority class(es) due to the influence that the larger majority class has on traditional training criteria in the fitness function. This paper aims to both highlight the limitations of the current GP approaches in this area and develop several new fitness functions for binary classification with unbalanced data. Using a range of real-world classification problems with class imbalance, we empirically show that these new fitness functions evolve classifiers with good performance on both the minority and majority classes. Our approaches use the original unbalanced training data in the GP learning process, without the need to artificially balance the training examples from the two classes (e.g., via sampling).
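The bias described in the abstract can be illustrated with a small sketch. The abstract does not specify the new fitness functions, so the code below only contrasts overall accuracy with a weighted average of the per-class accuracies, a commonly used alternative that may differ from the paper's own definitions. The function names, the weight `w`, and the example counts are assumptions made for illustration.

```python
def overall_accuracy(tp, fn, tn, fp):
    """Fraction of all examples classified correctly (traditional criterion)."""
    total = tp + fn + tn + fp
    return (tp + tn) / total if total else 0.0


def average_class_accuracy(tp, fn, tn, fp, w=0.5):
    """Weighted average of minority and majority class accuracies.

    tp/fn count minority-class examples, tn/fp count majority-class examples.
    w is a hypothetical weight on the minority class (0.5 = equal weighting).
    """
    minority_acc = tp / (tp + fn) if (tp + fn) else 0.0
    majority_acc = tn / (tn + fp) if (tn + fp) else 0.0
    return w * minority_acc + (1 - w) * majority_acc


# Assumed example: 10 minority and 990 majority examples, and a classifier
# that labels every example as majority.
counts = dict(tp=0, fn=10, tn=990, fp=0)
print(overall_accuracy(**counts))        # high despite missing every minority example
print(average_class_accuracy(**counts))  # penalized for the ignored minority class
```

Under overall accuracy the trivial classifier looks nearly perfect because the majority class dominates the count; averaging the two class accuracies exposes the failure on the minority class without resampling the training data.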

Year:  2011        PMID: 21954215     DOI: 10.1109/TSMCB.2011.2167144

Source DB:  PubMed          Journal:  IEEE Trans Syst Man Cybern B Cybern        ISSN: 1083-4419


  2 in total

1.  A Novel Ensemble Method for Imbalanced Data Learning: Bagging of Extrapolation-SMOTE SVM.

Authors:  Qi Wang; ZhiHao Luo; JinCai Huang; YangHe Feng; Zhong Liu
Journal:  Comput Intell Neurosci       Date:  2017-01-30

2.  On the use of multi-objective evolutionary classifiers for breast cancer detection.

Authors:  Laura Dioşan; Anca Andreica; Irina Voiculescu
Journal:  PLoS One       Date:  2022-07-19       Impact factor: 3.752

