Developing optimal prediction models for cancer classification using gene expression data.

Mat Soukup, Jae K Lee.

Abstract

Microarrays can provide genome-wide expression patterns for various cancers, especially for tumor sub-types that may exhibit substantially different patient prognoses. Several approaches that use such gene expression data have been proposed to classify tumor sub-types accurately. These classification methods are not robust and often depend on a particular training sample for modelling, which raises issues in using them to administer proper treatment for a future patient. We propose to construct an optimal, robust prediction model for classifying cancer sub-types using gene expression data. Our model is constructed in a step-wise fashion using cross-validated quadratic discriminant analysis. At each step, all identified models are validated on an independent sample of patients to develop a robust model for future data. We apply the proposed methods to two cancer microarray data sets: the acute leukemia data of Golub et al. and the colon cancer data of Alon et al. We found that the dimensionality of our optimal prediction models is relatively small in these cases, and that our prediction models with one or two gene factors outperform, or perform comparably to, other methods based on 50 or more predictive gene factors, especially on independent samples. The methodology is implemented in R and Splus. The source code can be obtained at http://hesweb1.med.virginia.edu/bioinformatics.
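
The step-wise, cross-validated selection described in the abstract can be illustrated with a minimal sketch. This is not the authors' R/Splus code: it is written in Python, uses synthetic data in place of the leukemia and colon data sets, and every name in it (`stepwise_qda`, `loo_error`, `holdout_error`, `make_data`) is hypothetical. It assumes leave-one-out cross-validation as the selection criterion and a small ridge term to keep covariance matrices invertible; the paper's exact criterion and stopping rule are not given in the abstract. For brevity, the independent sample is used once on the final model rather than at every step.

```python
import math
import random

def _mean_cov(rows):
    """Sample mean vector and covariance matrix (n - 1 denominator)."""
    n, d = len(rows), len(rows[0])
    mu = [sum(r[j] for r in rows) / n for j in range(d)]
    cov = [[sum((r[a] - mu[a]) * (r[b] - mu[b]) for r in rows) / (n - 1)
            for b in range(d)] for a in range(d)]
    return mu, cov

def _inv_logdet(m, ridge=1e-6):
    """Gauss-Jordan inverse and log-determinant of a small covariance matrix.
    The ridge on the diagonal is an assumption to avoid singular matrices."""
    d = len(m)
    a = [[m[i][j] + (ridge if i == j else 0.0) for j in range(d)]
         + [1.0 if i == j else 0.0 for j in range(d)] for i in range(d)]
    logdet = 0.0
    for c in range(d):
        p = a[c][c]
        logdet += math.log(p)
        a[c] = [v / p for v in a[c]]
        for r in range(d):
            if r != c:
                f = a[r][c]
                a[r] = [vr - f * vc for vr, vc in zip(a[r], a[c])]
    return [row[d:] for row in a], logdet

def qda_fit(X, y):
    """Fit one Gaussian (own mean and covariance) per class."""
    model = {}
    for k in set(y):
        rows = [x for x, lab in zip(X, y) if lab == k]
        mu, cov = _mean_cov(rows)
        inv, logdet = _inv_logdet(cov)
        model[k] = (mu, inv, logdet, math.log(len(rows) / len(X)))
    return model

def qda_predict(model, x):
    """Return the class with the largest quadratic discriminant score."""
    def score(k):
        mu, inv, logdet, logprior = model[k]
        diff = [xi - mi for xi, mi in zip(x, mu)]
        d = len(diff)
        quad = sum(diff[a] * inv[a][b] * diff[b]
                   for a in range(d) for b in range(d))
        return logprior - 0.5 * logdet - 0.5 * quad
    return max(model, key=score)

def loo_error(X, y, genes):
    """Leave-one-out misclassification rate using only the given gene columns."""
    Z = [[row[g] for g in genes] for row in X]
    wrong = 0
    for i in range(len(Z)):
        model = qda_fit(Z[:i] + Z[i + 1:], y[:i] + y[i + 1:])
        wrong += qda_predict(model, Z[i]) != y[i]
    return wrong / len(Z)

def holdout_error(X_tr, y_tr, X_te, y_te, genes):
    """Misclassification rate on an independent sample for a fixed gene set."""
    model = qda_fit([[r[g] for g in genes] for r in X_tr], y_tr)
    wrong = sum(qda_predict(model, [r[g] for g in genes]) != lab
                for r, lab in zip(X_te, y_te))
    return wrong / len(X_te)

def stepwise_qda(X, y, max_genes=2):
    """Greedy forward selection: add the gene that most lowers the LOO error."""
    selected, best_err = [], 1.0
    for _ in range(max_genes):
        candidates = [g for g in range(len(X[0])) if g not in selected]
        err, g = min((loo_error(X, y, selected + [g]), g) for g in candidates)
        if err >= best_err:  # no improvement: stop growing the model
            break
        selected.append(g)
        best_err = err
    return selected, best_err

def make_data(seed, n_per_class=20, n_genes=5, informative=2, shift=6.0):
    """Synthetic stand-in for expression data: one gene separates the classes."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            row = [rng.gauss(0.0, 1.0) for _ in range(n_genes)]
            row[informative] += shift * label
            X.append(row)
            y.append(label)
    return X, y

X_train, y_train = make_data(seed=0)
X_test, y_test = make_data(seed=1)  # independent sample
selected, cv_err = stepwise_qda(X_train, y_train)
test_err = holdout_error(X_train, y_train, X_test, y_test, selected)
print(selected, cv_err, test_err)
```

On real microarray data the candidate pool would contain thousands of genes, so a pre-filtering step and a faster QDA implementation would likely be needed; the greedy loop itself stays the same.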

Year:  2004        PMID: 15290759     DOI: 10.1142/s0219720004000351

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  6 in total

1.  Classifying gene expression profiles from pairwise mRNA comparisons.

Authors:  Donald Geman; Christian d'Avignon; Daniel Q Naiman; Raimond L Winslow
Journal:  Stat Appl Genet Mol Biol       Date:  2004-08-30

2.  Personalized medicine in breast cancer: a systematic review.

Authors:  Sang-Hoon Cho; Jongsu Jeon; Seung Il Kim
Journal:  J Breast Cancer       Date:  2012-09-28       Impact factor: 3.588

3.  Data mining in genomics.

Authors:  Jae K Lee; Paul D Williams; Sooyoung Cheon
Journal:  Clin Lab Med       Date:  2008-03       Impact factor: 1.935

4.  Evaluating microarray-based classifiers: an overview.

Authors:  A-L Boulesteix; C Strobl; T Augustin; M Daumer
Journal:  Cancer Inform       Date:  2008-02-29

5.  A simple method to combine multiple molecular biomarkers for dichotomous diagnostic classification.

Authors:  Manju R Mamtani; Tushar P Thakre; Mrunal Y Kalkonde; Manik A Amin; Yogeshwar V Kalkonde; Amit P Amin; Hemant Kulkarni
Journal:  BMC Bioinformatics       Date:  2006-10-10       Impact factor: 3.169

6.  Pathway and network approaches for identification of cancer signature markers from omics data. (Review)

Authors:  Jinlian Wang; Yiming Zuo; Yan-Gao Man; Itzhak Avital; Alexander Stojadinovic; Meng Liu; Xiaowei Yang; Rency S Varghese; Mahlet G Tadesse; Habtom W Ressom
Journal:  J Cancer       Date:  2015-01-01       Impact factor: 4.207
