Literature DB >> 19183004

Integrating biological knowledge with gene expression profiles for survival prediction of cancer.

Xi Chen1, Lily Wang.   

Abstract

Due to the large variability in survival times between cancer patients and the plethora of genes on microarrays unrelated to outcome, building accurate prediction models that are easy to interpret remains a challenge. In this paper, we propose a general strategy for improving performance and interpretability of prediction models by integrating gene expression data with prior biological knowledge. First, we link gene identifiers in expression dataset with gene annotation databases such as Gene Ontology (GO). Then we construct "supergenes" for each gene category by summarizing information from genes related to outcome using a modified principal component analysis (PCA) method. Finally, instead of using genes as predictors, we use these supergenes representing information from each gene category as predictors to predict survival outcome. In addition to identifying gene categories associated with outcome, the proposed approach also carries out additional within-category selection to select important genes within each gene set. We show, using two real breast cancer microarray datasets, that the prediction models constructed based on gene sets (or pathway) information outperform the prediction models based on expression values of single genes, with improved prediction accuracy and interpretability.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19183004      PMCID: PMC3198940          DOI: 10.1089/cmb.2008.12TT

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  39 in total

1.  Microarray gene expression data with linked survival phenotypes: diffuse large-B-cell lymphoma revisited.

Authors:  Mark R Segal
Journal:  Biostatistics       Date:  2005-11-11       Impact factor: 5.899

2.  Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data.

Authors:  Jiang Gui; Hongzhe Li
Journal:  Bioinformatics       Date:  2005-04-06       Impact factor: 6.937

Review 3.  From signatures to models: understanding cancer using microarrays.

Authors:  Eran Segal; Nir Friedman; Naftali Kaminski; Aviv Regev; Daphne Koller
Journal:  Nat Genet       Date:  2005-06       Impact factor: 38.330

4.  Development and evaluation of therapeutically relevant predictive classifiers using gene expression profiling.

Authors:  Richard Simon
Journal:  J Natl Cancer Inst       Date:  2006-09-06       Impact factor: 13.506

Review 5.  p53, apoptosis and axon-guidance molecules.

Authors:  H Arakawa
Journal:  Cell Death Differ       Date:  2005-08       Impact factor: 15.828

6.  Group testing for pathway analysis improves comparability of different microarray datasets.

Authors:  Theodora Manoli; Norbert Gretz; Hermann-Josef Gröne; Marc Kenzelmann; Roland Eils; Benedikt Brors
Journal:  Bioinformatics       Date:  2006-08-07       Impact factor: 6.937

7.  An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival.

Authors:  Lance D Miller; Johanna Smeds; Joshy George; Vinsensius B Vega; Liza Vergara; Alexander Ploner; Yudi Pawitan; Per Hall; Sigrid Klaar; Edison T Liu; Jonas Bergh
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-02       Impact factor: 11.205

8.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

Review 9.  Dual role of transforming growth factor beta in mammary tumorigenesis and metastatic progression.

Authors:  Rebecca S Muraoka-Cook; Nancy Dumont; Carlos L Arteaga
Journal:  Clin Cancer Res       Date:  2005-01-15       Impact factor: 12.531

10.  Estrogen up-regulates neuropeptide Y Y1 receptor expression in a human breast cancer cell line.

Authors:  Hassane Amlal; Somia Faroqui; Ambikaipakan Balasubramaniam; Sulaiman Sheriff
Journal:  Cancer Res       Date:  2006-04-01       Impact factor: 12.701

View more
  21 in total

Review 1.  Triple-negative breast cancer: present challenges and new perspectives.

Authors:  Franca Podo; Lutgarde M C Buydens; Hadassa Degani; Riet Hilhorst; Edda Klipp; Ingrid S Gribbestad; Sabine Van Huffel; Hanneke W M van Laarhoven; Jan Luts; Daniel Monleon; Geert J Postma; Nicole Schneiderhan-Marra; Filippo Santoro; Hans Wouters; Hege G Russnes; Therese Sørlie; Elda Tagliabue; Anne-Lise Børresen-Dale
Journal:  Mol Oncol       Date:  2010-04-24       Impact factor: 6.603

2.  Improving biomarker list stability by integration of biological knowledge in the learning process.

Authors:  Tiziana Sanavia; Fabio Aiolli; Giovanni Da San Martino; Andrea Bisognin; Barbara Di Camillo
Journal:  BMC Bioinformatics       Date:  2012-03-28       Impact factor: 3.169

3.  Integrating gene expression and GO classification for PCA by preclustering.

Authors:  Jorn R De Haan; Ester Piek; Rene C van Schaik; Jacob de Vlieg; Susanne Bauerschmidt; Lutgarde M C Buydens; Ron Wehrens
Journal:  BMC Bioinformatics       Date:  2010-03-26       Impact factor: 3.169

4.  An Integrative Pathway-based Clinical-genomic Model for Cancer Survival Prediction.

Authors:  Xi Chen; Lily Wang; Hemant Ishwaran
Journal:  Stat Probab Lett       Date:  2010-09-07       Impact factor: 0.870

5.  Interpreting personal transcriptomes: personalized mechanism-scale profiling of RNA-seq data.

Authors:  Alan Perez-Rathke; Haiquan Li; Yves A Lussier
Journal:  Pac Symp Biocomput       Date:  2013

6.  A comparative study on gene-set analysis methods for assessing differential expression associated with the survival phenotype.

Authors:  Seungyeoun Lee; Jinheum Kim; Sunho Lee
Journal:  BMC Bioinformatics       Date:  2011-09-26       Impact factor: 3.169

7.  Single sample expression-anchored mechanisms predict survival in head and neck cancer.

Authors:  Xinan Yang; Kelly Regan; Yong Huang; Qingbei Zhang; Jianrong Li; Tanguy Y Seiwert; Ezra E W Cohen; H Rosie Xing; Yves A Lussier
Journal:  PLoS Comput Biol       Date:  2012-01-26       Impact factor: 4.475

8.  Lung cancer gene expression database analysis incorporating prior knowledge with support vector machine-based classification method.

Authors:  Peng Guan; Desheng Huang; Miao He; Baosen Zhou
Journal:  J Exp Clin Cancer Res       Date:  2009-07-18

9.  Knowledge Driven Variable Selection (KDVS) - a new approach to enrichment analysis of gene signatures obtained from high-throughput data.

Authors:  Grzegorz Zycinski; Annalisa Barla; Margherita Squillario; Tiziana Sanavia; Barbara Di Camillo; Alessandro Verri
Journal:  Source Code Biol Med       Date:  2013-01-09

10.  Structural and functional-annotation of an equine whole genome oligoarray.

Authors:  Lauren A Bright; Shane C Burgess; Bhanu Chowdhary; Cyprianna E Swiderski; Fiona M McCarthy
Journal:  BMC Bioinformatics       Date:  2009-10-08       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.