Literature DB >> 18363775

ROC-based utility function maximization for feature selection and classification with applications to high-dimensional protease data.

Zhenqiu Liu1, Ming Tan.   

Abstract

SUMMARY: In medical diagnosis, the diseased and nondiseased classes are usually unbalanced and one class may be more important than the other depending on the diagnosis purpose. Most standard classification methods, however, are designed to maximize the overall accuracy and cannot incorporate different costs to different classes explicitly. In this article, we propose a novel nonparametric method to directly maximize the weighted specificity and sensitivity of the receiver operating characteristic curve. Combining advances in machine learning, optimization theory, and statistics, the proposed method has excellent generalization property and assigns different error costs to different classes explicitly. We present experiments that compare the proposed algorithms with support vector machines and regularized logistic regression using data from a study on HIV-1 protease as well as six public available datasets. Our main conclusion is that the performance of proposed algorithm is significantly better in most cases than the other classifiers tested. Software package in MATLAB is available upon request.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18363775     DOI: 10.1111/j.1541-0420.2008.01015.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  5 in total

1.  Plasma phospholipids identify antecedent memory impairment in older adults.

Authors:  Mark Mapstone; Amrita K Cheema; Massimo S Fiandaca; Xiaogang Zhong; Timothy R Mhyre; Linda H MacArthur; William J Hall; Susan G Fisher; Derick R Peterson; James M Haley; Michael D Nazar; Steven A Rich; Dan J Berlau; Carrie B Peltz; Ming T Tan; Claudia H Kawas; Howard J Federoff
Journal:  Nat Med       Date:  2014-03-09       Impact factor: 53.440

2.  Survival associated pathway identification with group Lp penalized global AUC maximization.

Authors:  Zhenqiu Liu; Laurence S Magder; Terry Hyslop; Li Mao
Journal:  Algorithms Mol Biol       Date:  2010-08-16       Impact factor: 1.405

3.  A comparative study of variable selection methods in the context of developing psychiatric screening instruments.

Authors:  Feihan Lu; Eva Petkova
Journal:  Stat Med       Date:  2013-08-11       Impact factor: 2.373

4.  Deep Learning in Label-free Cell Classification.

Authors:  Claire Lifan Chen; Ata Mahjoubfar; Li-Chia Tai; Ian K Blaby; Allen Huang; Kayvan Reza Niazi; Bahram Jalali
Journal:  Sci Rep       Date:  2016-03-15       Impact factor: 4.379

5.  Hellinger distance-based stable sparse feature selection for high-dimensional class-imbalanced data.

Authors:  Guang-Hui Fu; Yuan-Jiao Wu; Min-Jie Zong; Jianxin Pan
Journal:  BMC Bioinformatics       Date:  2020-03-23       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.