Literature DB >> 19095540

Exploratory undersampling for class-imbalance learning.

Xu-Ying Liu1, Jianxin Wu, Zhi-Hua Zhou.   

Abstract

Undersampling is a popular method in dealing with class-imbalance problems, which uses only a subset of the majority class and thus is very efficient. The main deficiency is that many majority class examples are ignored. We propose two algorithms to overcome this deficiency. EasyEnsemble samples several subsets from the majority class, trains a learner using each of them, and combines the outputs of those learners. BalanceCascade trains the learners sequentially, where in each step, the majority class examples that are correctly classified by the current trained learners are removed from further consideration. Experimental results show that both methods have higher Area Under the ROC Curve, F-measure, and G-mean values than many existing class-imbalance learning methods. Moreover, they have approximately the same training time as that of undersampling when the same number of weak classifiers is used, which is significantly faster than other methods.

Year:  2008        PMID: 19095540     DOI: 10.1109/TSMCB.2008.2007853

Source DB:  PubMed          Journal:  IEEE Trans Syst Man Cybern B Cybern        ISSN: 1083-4419


  101 in total

1.  Improving predictions in imbalanced data using Pairwise Expanded Logistic Regression.

Authors:  Xiaoqian Jiang; Robert El-Kareh; Lucila Ohno-Machado
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  Analysis of sampling techniques for imbalanced data: An n = 648 ADNI study.

Authors:  Rashmi Dubey; Jiayu Zhou; Yalin Wang; Paul M Thompson; Jieping Ye
Journal:  Neuroimage       Date:  2013-10-29       Impact factor: 6.556

3.  Confirm or refute?: A comparative study on citation sentiment classification in clinical research publications.

Authors:  Halil Kilicoglu; Zeshan Peng; Shabnam Tafreshi; Tung Tran; Graciela Rosemblat; Jodi Schneider
Journal:  J Biomed Inform       Date:  2019-02-10       Impact factor: 6.317

4.  A unified methodology based on sparse field level sets and boosting algorithms for false positives reduction in lung nodules detection.

Authors:  Soudeh Saien; Hamid Abrishami Moghaddam; Mohsen Fathian
Journal:  Int J Comput Assist Radiol Surg       Date:  2017-08-09       Impact factor: 2.924

5.  Imbalanced class learning in epigenetics.

Authors:  M Muksitul Haque; Michael K Skinner; Lawrence B Holder
Journal:  J Comput Biol       Date:  2014-05-05       Impact factor: 1.479

6.  An ensemble learning method for asthma control level detection with leveraging medical knowledge-based classifier and supervised learning.

Authors:  Roghaye Khasha; Mohammad Mehdi Sepehri; Seyed Alireza Mahdaviani
Journal:  J Med Syst       Date:  2019-04-26       Impact factor: 4.460

7.  Concordance between Composite International Diagnostic Interview and self-reports of depressive symptoms: a re-analysis.

Authors:  Tom Rosenström; Marko Elovainio; Markus Jokela; Sami Pirkola; Seppo Koskinen; Olavi Lindfors; Liisa Keltikangas-Järvinen
Journal:  Int J Methods Psychiatr Res       Date:  2015-07-03       Impact factor: 4.035

8.  Optimal breast cancer diagnostic strategy using combined ultrasound and diffuse optical tomography.

Authors:  K M Shihab Uddin; Menghao Zhang; Mark Anastasio; Quing Zhu
Journal:  Biomed Opt Express       Date:  2020-04-24       Impact factor: 3.732

9.  Development and validation of an electronic medical record-based alert score for detection of inpatient deterioration outside the ICU.

Authors:  Patricia Kipnis; Benjamin J Turk; David A Wulf; Juan Carlos LaGuardia; Vincent Liu; Matthew M Churpek; Santiago Romero-Brufau; Gabriel J Escobar
Journal:  J Biomed Inform       Date:  2016-09-20       Impact factor: 6.317

10.  An Imbalanced Learning based MDR-TB Early Warning System.

Authors:  Sheng Li; Bo Tang; Haibo He
Journal:  J Med Syst       Date:  2016-05-21       Impact factor: 4.460

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.