Literature DB >> 20671323

Nonnegative principal component analysis for cancer molecular pattern discovery.

Xiaoxu Han1.   

Abstract

As a well-established feature selection algorithm, principal component analysis (PCA) is often combined with the state-of-the-art classification algorithms to identify cancer molecular patterns in microarray data. However, the algorithm's global feature selection mechanism prevents it from effectively capturing the latent data structures in the high-dimensional data. In this study, we investigate the benefit of adding nonnegative constraints on PCA and develop a nonnegative principal component analysis algorithm (NPCA) to overcome the global nature of PCA. A novel classification algorithm NPCA-SVM is proposed for microarray data pattern discovery. We report strong classification results from the NPCA-SVM algorithm on five benchmark microarray data sets by direct comparison with other related algorithms. We have also proved mathematically and interpreted biologically that microarray data will inevitably encounter overfitting for an SVM/PCA-SVM learning machine under a Gaussian kernel. In addition, we demonstrate that nonnegative principal component analysis can be used to capture meaningful biomarkers effectively.

Entities:  

Mesh:

Year:  2010        PMID: 20671323     DOI: 10.1109/TCBB.2009.36

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  11 in total

1.  A Self-Training Subspace Clustering Algorithm under Low-Rank Representation for Cancer Classification on Gene Expression Data.

Authors:  Chun-Qiu Xia; Ke Han; Yong Qi; Yang Zhang; Dong-Jun Yu
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2017-06-06       Impact factor: 3.710

2.  Transcriptome marker diagnostics using big data.

Authors:  Henry Han; Ying Liu
Journal:  IET Syst Biol       Date:  2016-02       Impact factor: 1.615

3.  Nonnegative principal component analysis for mass spectral serum profiles and biomarker discovery.

Authors:  Henry Han
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

4.  Multi-resolution independent component analysis for high-performance tumor classification and biomarker discovery.

Authors:  Henry Han; Xiao-Li Li
Journal:  BMC Bioinformatics       Date:  2011-02-15       Impact factor: 3.169

5.  Learning a weighted meta-sample based parameter free sparse representation classification for microarray data.

Authors:  Bo Liao; Yan Jiang; Guanqun Yuan; Wen Zhu; Lijun Cai; Zhi Cao
Journal:  PLoS One       Date:  2014-08-12       Impact factor: 3.240

Review 6.  Overcome support vector machine diagnosis overfitting.

Authors:  Henry Han; Xiaoqian Jiang
Journal:  Cancer Inform       Date:  2014-12-09

7.  Disease Biomarker Query from RNA-Seq Data.

Authors:  Henry Han; Xiaoqian Jiang
Journal:  Cancer Inform       Date:  2014-10-14

8.  Distribution based Fuzzy Estimate Spectral Clustering for Cancer Detection with Protein Sequence and Structural Motifs

Authors:  Thenmozhi K; Karthikeyani Visalakshi N; Shanthi S
Journal:  Asian Pac J Cancer Prev       Date:  2018-07-27

9.  Diagnostic biases in translational bioinformatics.

Authors:  Henry Han
Journal:  BMC Med Genomics       Date:  2015-08-01       Impact factor: 3.063

10.  Derivative component analysis for mass spectral serum proteomic profiles.

Authors:  Henry Han
Journal:  BMC Med Genomics       Date:  2014-05-08       Impact factor: 3.063

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.