Literature DB >> 16926220

Eigengene-based linear discriminant model for tumor classification using gene expression microarray data.

Ronglai Shen1, Debashis Ghosh, Arul Chinnaiyan, Zhaoling Meng.   

Abstract

MOTIVATION: The nearest shrunken centroids classifier has become a popular algorithm in tumor classification problems using gene expression microarray data. Feature selection is an embedded part of the method to select top-ranking genes based on a univariate distance statistic calculated for each gene individually. The univariate statistics summarize gene expression profiles outside of the gene co-regulation network context, leading to redundant information being included in the selection procedure.
RESULTS: We propose an Eigengene-based Linear Discriminant Analysis (ELDA) to address gene selection in a multivariate framework. The algorithm uses a modified rotated Spectral Decomposition (SpD) technique to select 'hub' genes that associate with the most important eigenvectors. Using three benchmark cancer microarray datasets, we show that ELDA selects the most characteristic genes, leading to substantially smaller classifiers than the univariate feature selection based analogues. The resulting de-correlated expression profiles make the gene-wise independence assumption more realistic and applicable for the shrunken centroids classifier and other diagonal linear discriminant type of models. Our algorithm further incorporates a misclassification cost matrix, allowing differential penalization of one type of error over another. In the breast cancer data, we show false negative prognosis can be controlled via a cost-adjusted discriminant function. AVAILABILITY: R code for the ELDA algorithm is available from author upon request.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16926220     DOI: 10.1093/bioinformatics/btl442

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  15 in total

1.  A glance at DNA microarray technology and applications.

Authors:  Amir Ata Saei; Yadollah Omidi
Journal:  Bioimpacts       Date:  2011-08-04

2.  Bias-corrected diagonal discriminant rules for high-dimensional classification.

Authors:  Song Huang; Tiejun Tong; Hongyu Zhao
Journal:  Biometrics       Date:  2010-12       Impact factor: 2.571

3.  Comparative analysis of cis-regulation following stroke and seizures in subspaces of conserved eigensystems.

Authors:  Michal Dabrowski; Norbert Dojer; Malgorzata Zawadzka; Jakub Mieczkowski; Bozena Kaminska
Journal:  BMC Syst Biol       Date:  2010-06-17

4.  Investigating Transcriptional Dynamics Changes and Time-Dependent Marker Gene Expression in the Early Period After Skeletal Muscle Injury in Rats.

Authors:  Kang Ren; Liangliang Wang; Liang Wang; Qiuxiang Du; Jie Cao; Qianqian Jin; Guoshuai An; Na Li; Lihong Dang; Yingjie Tian; Yingyuan Wang; Junhong Sun
Journal:  Front Genet       Date:  2021-06-17       Impact factor: 4.599

5.  TESTING SIGNIFICANCE OF FEATURES BY LASSOED PRINCIPAL COMPONENTS.

Authors:  Daniela M Witten; Robert Tibshirani
Journal:  Ann Appl Stat       Date:  2008-09-01       Impact factor: 2.083

6.  Molecular classifiers for acute kidney transplant rejection in peripheral blood by whole genome gene expression profiling.

Authors:  S M Kurian; A N Williams; T Gelbart; D Campbell; T S Mondala; S R Head; S Horvath; L Gaber; R Thompson; T Whisenant; W Lin; P Langfelder; E H Robison; R L Schaffer; J S Fisher; J Friedewald; S M Flechner; L K Chan; A C Wiseman; H Shidban; R Mendez; R Heilman; M M Abecassis; C L Marsh; D R Salomon
Journal:  Am J Transplant       Date:  2014-04-11       Impact factor: 8.086

7.  A quantitative system for discriminating induced pluripotent stem cells, embryonic stem cells and somatic cells.

Authors:  Anyou Wang; Ying Du; Qianchuan He; Chunxiao Zhou
Journal:  PLoS One       Date:  2013-02-13       Impact factor: 3.240

8.  Gene network modular-based classification of microarray samples.

Authors:  Pingzhao Hu; Shelley B Bull; Hui Jiang
Journal:  BMC Bioinformatics       Date:  2012-06-25       Impact factor: 3.169

9.  Combining multiple hypothesis testing and affinity propagation clustering leads to accurate, robust and sample size independent classification on gene expression data.

Authors:  Argiris Sakellariou; Despina Sanoudou; George Spyrou
Journal:  BMC Bioinformatics       Date:  2012-10-17       Impact factor: 3.169

10.  Optimality driven nearest centroid classification from genomic data.

Authors:  Alan R Dabney; John D Storey
Journal:  PLoS One       Date:  2007-10-03       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.