Literature DB >> 20876935

Ensemble learning with active example selection for imbalanced biomedical data classification.

Sangyoon Oh1, Min Su Lee, Byoung-Tak Zhang.   

Abstract

In biomedical data, the imbalanced data problem occurs frequently and causes poor prediction performance for minority classes. It is because the trained classifiers are mostly derived from the majority class. In this paper, we describe an ensemble learning method combined with active example selection to resolve the imbalanced data problem. Our method consists of three key components: 1) an active example selection algorithm to choose informative examples for training the classifier, 2) an ensemble learning method to combine variations of classifiers derived by active example selection, and 3) an incremental learning scheme to speed up the iterative training procedure for active example selection. We evaluate the method on six real-world imbalanced data sets in biomedical domains, showing that the proposed method outperforms both the random under sampling and the ensemble with under sampling methods. Compared to other approaches to solving the imbalanced data problem, our method excels by 0.03-0.15 points in AUC measure.

Mesh:

Year:  2011        PMID: 20876935     DOI: 10.1109/TCBB.2010.96

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  9 in total

1.  Learning from data: recognizing glaucomatous defect patterns and detecting progression from visual field measurements.

Authors:  Siamak Yousefi; Michael H Goldbaum; Madhusudhanan Balasubramanian; Felipe A Medeiros; Linda M Zangwill; Jeffrey M Liebmann; Christopher A Girkin; Robert N Weinreb; Christopher Bowd
Journal:  IEEE Trans Biomed Eng       Date:  2014-04-01       Impact factor: 4.538

Review 2.  Overcome support vector machine diagnosis overfitting.

Authors:  Henry Han; Xiaoqian Jiang
Journal:  Cancer Inform       Date:  2014-12-09

3.  Imbalanced target prediction with pattern discovery on clinical data repositories.

Authors:  Tak-Ming Chan; Yuxi Li; Choo-Chiap Chiau; Jane Zhu; Jie Jiang; Yong Huo
Journal:  BMC Med Inform Decis Mak       Date:  2017-04-20       Impact factor: 2.796

4.  Imbalanced biomedical data classification using self-adaptive multilayer ELM combined with dynamic GAN.

Authors:  Liyuan Zhang; Huamin Yang; Zhengang Jiang
Journal:  Biomed Eng Online       Date:  2018-12-04       Impact factor: 2.819

5.  Risk factor analysis of device-related infections: value of re-sampling method on the real-world imbalanced dataset.

Authors:  Xiang-Fei Feng; Ling-Chao Yang; Li-Zhuang Tan; Yi-Gang Li
Journal:  BMC Med Inform Decis Mak       Date:  2019-09-11       Impact factor: 2.796

6.  An ensemble learning with active sampling to predict the prognosis of postoperative non-small cell lung cancer patients.

Authors:  Danqing Hu; Huanyao Zhang; Shaolei Li; Huilong Duan; Nan Wu; Xudong Lu
Journal:  BMC Med Inform Decis Mak       Date:  2022-09-19       Impact factor: 3.298

7.  A computational pipeline for the development of multi-marker bio-signature panels and ensemble classifiers.

Authors:  Oliver P Günther; Virginia Chen; Gabriela Cohen Freue; Robert F Balshaw; Scott J Tebbutt; Zsuzsanna Hollander; Mandeep Takhar; W Robert McMaster; Bruce M McManus; Paul A Keown; Raymond T Ng
Journal:  BMC Bioinformatics       Date:  2012-12-08       Impact factor: 3.169

8.  Diagnostic biases in translational bioinformatics.

Authors:  Henry Han
Journal:  BMC Med Genomics       Date:  2015-08-01       Impact factor: 3.063

9.  Application of Convolutional Neural Network in the Diagnosis of Jaw Tumors.

Authors:  Wiwiek Poedjiastoeti; Siriwan Suebnukarn
Journal:  Healthc Inform Res       Date:  2018-07-31
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.