Literature DB >> 25395692

Optimizing area under the ROC curve using semi-supervised learning.

Shijun Wang1, Diana Li1, Nicholas Petrick2, Berkman Sahiner2, Marius George Linguraru3, Ronald M Summers1.   

Abstract

Receiver operating characteristic (ROC) analysis is a standard methodology to evaluate the performance of a binary classification system. The area under the ROC curve (AUC) is a performance metric that summarizes how well a classifier separates two classes. Traditional AUC optimization techniques are supervised learning methods that utilize only labeled data (i.e., the true class is known for all data) to train the classifiers. In this work, inspired by semi-supervised and transductive learning, we propose two new AUC optimization algorithms hereby referred to as semi-supervised learning receiver operating characteristic (SSLROC) algorithms, which utilize unlabeled test samples in classifier training to maximize AUC. Unlabeled samples are incorporated into the AUC optimization process, and their ranking relationships to labeled positive and negative training samples are considered as optimization constraints. The introduced test samples will cause the learned decision boundary in a multidimensional feature space to adapt not only to the distribution of labeled training data, but also to the distribution of unlabeled test data. We formulate the semi-supervised AUC optimization problem as a semi-definite programming problem based on the margin maximization theory. The proposed methods SSLROC1 (1-norm) and SSLROC2 (2-norm) were evaluated using 34 (determined by power analysis) randomly selected datasets from the University of California, Irvine machine learning repository. Wilcoxon signed rank tests showed that the proposed methods achieved significant improvement compared with state-of-the-art methods. The proposed methods were also applied to a CT colonography dataset for colonic polyp classification and showed promising results.

Entities:  

Keywords:  AUC; RankBoost; Receiver operating characteristic; SSLROC; SVMROC; Semi-supervised learning; Semidefinite programming; Transfer learning

Year:  2015        PMID: 25395692      PMCID: PMC4226543          DOI: 10.1016/j.patcog.2014.07.025

Source DB:  PubMed          Journal:  Pattern Recognit        ISSN: 0031-3203            Impact factor:   7.740


  14 in total

1.  Ensemble learning via negative correlation.

Authors:  Y Liu; X Yao
Journal:  Neural Netw       Date:  1999-12

Review 2.  Improving the accuracy of CTC interpretation: computer-aided detection.

Authors:  Ronald M Summers
Journal:  Gastrointest Endosc Clin N Am       Date:  2010-04

3.  Computed tomographic virtual colonoscopy to screen for colorectal neoplasia in asymptomatic adults.

Authors:  Perry J Pickhardt; J Richard Choi; Inku Hwang; James A Butler; Michael L Puckett; Hans A Hildebrandt; Roy K Wong; Pamela A Nugent; Pauline A Mysliwiec; William R Schindler
Journal:  N Engl J Med       Date:  2003-12-01       Impact factor: 91.245

4.  Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms.

Authors: 
Journal:  Neural Comput       Date:  1998-09-15       Impact factor: 2.026

5.  Caveats and pitfalls of ROC analysis in clinical microarray research (and how to avoid them).

Authors:  Daniel Berrar; Peter Flach
Journal:  Brief Bioinform       Date:  2011-03-21       Impact factor: 11.622

6.  Massive-training artificial neural network coupled with Laplacian-eigenfunction-based dimensionality reduction for computer-aided detection of polyps in CT colonography.

Authors:  Kenji Suzuki; Jun Zhang; Jianwu Xu
Journal:  IEEE Trans Med Imaging       Date:  2010-06-21       Impact factor: 10.048

Review 7.  Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine.

Authors:  M H Zweig; G Campbell
Journal:  Clin Chem       Date:  1993-04       Impact factor: 8.327

8.  Introduction to sample size determination and power analysis for clinical trials.

Authors:  J M Lachin
Journal:  Control Clin Trials       Date:  1981-06

9.  Seeing is believing: video classification for computed tomographic colonography using multiple-instance learning.

Authors:  Shijun Wang; Matthew T McKenna; Tan B Nguyen; Joseph E Burns; Nicholas Petrick; Berkman Sahiner; Ronald M Summers
Journal:  IEEE Trans Med Imaging       Date:  2012-05       Impact factor: 10.048

10.  Combining Statistical and Geometric Features for Colonic Polyp Detection in CTC Based on Multiple Kernel Learning.

Authors:  Shijun Wang; Jianhua Yao; Nicholas Petrick; Ronald M Summers
Journal:  Int J Comput Intell Appl       Date:  2010-01-01
View more
  1 in total

1.  The diagnostic accuracy of artificial intelligence in thoracic diseases: A protocol for systematic review and meta-analysis.

Authors:  Yi Yang; Gang Jin; Yao Pang; Wenhao Wang; Hongyi Zhang; Guangxin Tuo; Peng Wu; Zequan Wang; Zijiang Zhu
Journal:  Medicine (Baltimore)       Date:  2020-02       Impact factor: 1.817

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.