Literature DB >> 15554662

Active learning with support vector machine applied to gene expression data for cancer classification.

Ying Liu1.   

Abstract

There is growing interest in the application of machine learning techniques in bioinformatics. The supervised machine learning approach has been widely applied to bioinformatics and gained a lot of success in this research area. With this learning approach researchers first develop a large training set, which is a time-consuming and costly process. Moreover, the proportion of the positive examples and negative examples in the training set may not represent the real-world data distribution, which causes concept drift. Active learning avoids these problems. Unlike most conventional learning methods where the training set used to derive the model remains static, the classifier can actively choose the training data and the size of training set increases. We introduced an algorithm for performing active learning with support vector machine and applied the algorithm to gene expression profiles of colon cancer, lung cancer, and prostate cancer samples. We compared the classification performance of active learning with that of passive learning. The results showed that employing the active learning method can achieve high accuracy and significantly reduce the need for labeled training instances. For lung cancer classification, to achieve 96% of the total positives, only 31 labeled examples were needed in active learning whereas in passive learning 174 labeled examples were required. That meant over 82% reduction was realized by active learning. In active learning the areas under the receiver operating characteristic (ROC) curves were over 0.81, while in passive learning the areas under the ROC curves were below 0.50.

Entities:  

Mesh:

Year:  2004        PMID: 15554662     DOI: 10.1021/ci049810a

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  34 in total

1.  Functional census of mutation sequence spaces: the example of p53 cancer rescue mutants.

Authors:  Samuel A Danziger; S Joshua Swamidass; Jue Zeng; Lawrence R Dearth; Qiang Lu; Jonathan H Chen; Jianlin Cheng; Vinh P Hoang; Hiroto Saigo; Ray Luo; Pierre Baldi; Rainer K Brachmann; Richard H Lathrop
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2006 Apr-Jun       Impact factor: 3.710

2.  Choosing where to look next in a mutation sequence space: Active Learning of informative p53 cancer rescue mutants.

Authors:  Samuel A Danziger; Jue Zeng; Ying Wang; Rainer K Brachmann; Richard H Lathrop
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

3.  Discovery of antibiotics-derived polymers for gene delivery using combinatorial synthesis and cheminformatics modeling.

Authors:  Thrimoorthy Potta; Zhuo Zhen; Taraka Sai Pavan Grandhi; Matthew D Christensen; James Ramos; Curt M Breneman; Kaushal Rege
Journal:  Biomaterials       Date:  2013-12-10       Impact factor: 12.479

4.  Cross-topic learning for work prioritization in systematic review creation and update.

Authors:  Aaron M Cohen; Kyle Ambert; Marian McDonagh
Journal:  J Am Med Inform Assoc       Date:  2009-06-30       Impact factor: 4.497

5.  An active learning framework for enhancing identification of non-artifactual intracranial pressure waveforms.

Authors:  Murad Megjhani; Ayham Alkhachroum; Kalijah Terilli; Jenna Ford; Clio Rubinos; Julie Kromm; Brendan K Wallace; E Sander Connolly; David Roh; Sachin Agarwal; Jan Claassen; Raghav Padmanabhan; Xiao Hu; Soojin Park
Journal:  Physiol Meas       Date:  2019-01-18       Impact factor: 2.833

6.  Active Learning-based corpus annotation--the PathoJen experience.

Authors:  Udo Hahn; Elena Beisswanger; Ekaterina Buyko; Erik Faessler
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

7.  Co-expression network analysis of Down's syndrome based on microarray data.

Authors:  Jianping Zhao; Zhengguo Zhang; Shumin Ren; Yanan Zong; Xiangdong Kong
Journal:  Exp Ther Med       Date:  2016-06-17       Impact factor: 2.447

8.  Efficient modeling and active learning discovery of biological responses.

Authors:  Armaghan W Naik; Joshua D Kangas; Christopher J Langmead; Robert F Murphy
Journal:  PLoS One       Date:  2013-12-17       Impact factor: 3.240

9.  Active learning for clinical text classification: is it better than random sampling?

Authors:  Rosa L Figueroa; Qing Zeng-Treitler; Long H Ngo; Sergey Goryachev; Eduardo P Wiechmann
Journal:  J Am Med Inform Assoc       Date:  2012-06-15       Impact factor: 4.497

10.  Detection and significance of serum protein markers of small-cell lung cancer.

Authors:  Mingyong Han; Qi Liu; Jiekai Yu; Shu Zheng
Journal:  J Clin Lab Anal       Date:  2008       Impact factor: 2.352

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.