Literature DB >> 19068445

SVMs modeling for highly imbalanced classification.

Yuchun Tang, Yan-Qing Zhang, Nitesh V Chawla, Sven Krasser.   

Abstract

Traditional classification algorithms can be limited in their performance on highly unbalanced data sets. A popular stream of work for countering the problem of class imbalance has been the application of a sundry of sampling strategies. In this paper, we focus on designing modifications to support vector machines (SVMs) to appropriately tackle the problem of class imbalance. We incorporate different "rebalance" heuristics in SVM modeling, including cost-sensitive learning, and over- and undersampling. These SVM-based strategies are compared with various state-of-the-art approaches on a variety of data sets by using various metrics, including G-mean, area under the receiver operating characteristic curve, F-measure, and area under the precision/recall curve. We show that we are able to surpass or match the previously known best algorithms on each data set. In particular, of the four SVM variations considered in this paper, the novel granular SVMs-repetitive undersampling algorithm (GSVM-RU) is the best in terms of both effectiveness and efficiency. GSVM-RU is effective, as it can minimize the negative effect of information loss while maximizing the positive effect of data cleaning in the undersampling process. GSVM-RU is efficient by extracting much less support vectors and, hence, greatly speeding up SVM prediction.

Entities:  

Mesh:

Year:  2008        PMID: 19068445     DOI: 10.1109/TSMCB.2008.2002909

Source DB:  PubMed          Journal:  IEEE Trans Syst Man Cybern B Cybern        ISSN: 1083-4419


  58 in total

1.  Positive-unlabeled ensemble learning for kinase substrate prediction from dynamic phosphoproteomics data.

Authors:  Pengyi Yang; Sean J Humphrey; David E James; Yee Hwa Yang; Raja Jothi
Journal:  Bioinformatics       Date:  2015-09-22       Impact factor: 6.937

2.  Integrating new data balancing technique with committee networks for imbalanced data: GRSOM approach.

Authors:  Danaipong Chetchotsak; Sirorat Pattanapairoj; Banchar Arnonkijpanich
Journal:  Cogn Neurodyn       Date:  2015-07-31       Impact factor: 5.082

3.  Parameter estimation of atherosclerotic tissue optical properties from three-dimensional intravascular optical coherence tomography.

Authors:  Madhusudhana Gargesha; Ronny Shalev; David Prabhu; Kentaro Tanaka; Andrew M Rollins; Marco Costa; Hiram G Bezerra; David L Wilson
Journal:  J Med Imaging (Bellingham)       Date:  2015-01-02

4.  Augmentation of Physician Assessments with Multi-Omics Enhances Predictability of Drug Response: A Case Study of Major Depressive Disorder.

Authors:  Arjun Athreya; Ravishankar Iyer; Drew Neavin; Liewei Wang; Richard Weinshilboum; Rima Kaddurah-Daouk; John Rush; Mark Frye; William Bobo
Journal:  IEEE Comput Intell Mag       Date:  2018-07-20       Impact factor: 11.356

5.  Logistic Regression Confined by Cardinality-Constrained Sample and Feature Selection.

Authors:  Ehsan Adeli; Xiaorui Li; Dongjin Kwon; Yong Zhang; Kilian M Pohl
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2019-02-26       Impact factor: 6.226

6.  Optimal classification for the diagnosis of duchenne muscular dystrophy images using support vector machines.

Authors:  Ming-Huan Zhang; Jun-Shan Ma; Ying Shen; Ying Chen
Journal:  Int J Comput Assist Radiol Surg       Date:  2015-10-17       Impact factor: 2.924

7.  Computer-aided assessment of regional abdominal fat with food residue removal in CT.

Authors:  Sokratis Makrogiannis; Giorgio Caturegli; Christos Davatzikos; Luigi Ferrucci
Journal:  Acad Radiol       Date:  2013-11       Impact factor: 3.173

8.  Accept/decline decision module for the liver simulated allocation model.

Authors:  Sang-Phil Kim; Diwakar Gupta; Ajay K Israni; Bertram L Kasiske
Journal:  Health Care Manag Sci       Date:  2014-08-30

9.  A novel method for mining highly imbalanced high-throughput screening data in PubChem.

Authors:  Qingliang Li; Yanli Wang; Stephen H Bryant
Journal:  Bioinformatics       Date:  2009-10-13       Impact factor: 6.937

Review 10.  Getting the most out of PubChem for virtual screening.

Authors:  Sunghwan Kim
Journal:  Expert Opin Drug Discov       Date:  2016-08-05       Impact factor: 6.098

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.