Literature DB >> 17940531

Protein subnetwork markers improve prediction of cancer outcome.

Charles Auffray.   

Abstract

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17940531      PMCID: PMC2063583          DOI: 10.1038/msb4100183

Source DB:  PubMed          Journal:  Mol Syst Biol        ISSN: 1744-4292            Impact factor:   11.429


× No keyword cloud information.
The reliability of gene predictors of cancer outcome has been recently questioned, pointing to deficiencies in experimental design, insufficient statistical power due to small sample size, and flaws in predictor generation and performance assessment, with proposed guidelines to overcome these limitations (Ntzani and Ioannidis, 2003; Michiels , Dupuy and Simon, 2007). Now, in a recent article published in Molecular Systems Biology (Chuang ), a complementary strategy has been proposed based on integration of expression profiles with protein interactions, demonstrating that more reproducible and robust predictors can be generated with the additional benefit of including mutated genes which are excluded in the classical analyses, and also providing models for the molecular mechanisms involved in metastasis formation. This is achieved through combination of mRNA expression profiles with curated protein–protein interaction data, which became recently available (Rual ), leveraging methods for modular subnetwork identification and biological validation (Segal ; Poyatos and Hurst, 2004). During the past decade, transcriptome analysis has been used increasingly to monitor expression profiles of extensive collections of genes in cancer samples, providing insights into the molecular mechanisms underlying cancer development and outcome. Gene signatures developed from these data sets allowed characterization of cancer types and stages, formation of metastasis or response to therapy with the potential to complement or even outperform traditional cellular, molecular and clinical markers. In early-diagnosed primary breast cancer, gene predictors from several studies appeared capable of informing clinicians on the likelihood of metastasis formation, so that they could eventually restrict exposure of their patients to aggressive chemotherapy after surgical removal of the tumor to those cases with poor prognosis. The expected benefit is to avoid unnecessary secondary effects of the treatment in the large majority of patients for which chemotherapy is not beneficial. While large-scale clinical trials are underway to establish such predictors as prognosis markers for widespread use in the clinical setting, a number of observations have questioned their validity, stability or robustness. For example, cross-comparison of the predictors from two landmark studies (van't Veer ; Wang ) revealed a very limited overlap with only 3 genes out of 70 or 76 in common, and much less successful predictions on the other sample collection. Extensive statistical re-analysis of the two data sets suggested that differences in the samples analyzed, the microarray platforms or the data analysis schemes used were not sufficient to account for these discrepancies, arguing for the need of analyzing thousands of samples to obtain largely overlapping gene signatures (Ein-Dor ). The novel strategy described by Trey Ideker and co-workers (Chuang ) is based on the identification of protein interaction subnetworks with coherent expression patterns of their component genes, which can distinguish the samples of patients which developed distant metastasis after surgery from those that did not. This is achieved by overlaying protein–protein interaction data on the gene expression profiles, generating an activity score for protein subnetworks in all patient samples, then computing the mutual information between activity scores and metastasis potential to assess their discriminative potential (Figure 1). The prospect is to use the selected subnetworks as novel markers for prognosis of metastasis formation in newly diagnosed breast cancer patients instead of the predictors based on collections of non-interconnected genes.
Figure 1

Identification of protein subnetwork markers. Gene expression profiles from metastatic and nonmetastatic tumor sample are overlayed onto a protein–protein interaction network. Iterative exploration of all possible protein subnetworks generates an activity score for protein subnetworks in all patient samples. Computing the mutual information between activity scores and metastasis potential enables the identification of subnetworks with a high discriminative potential. Statistical significance of subnetwork discriminative power is assessed by comparison with three null distributions obtained by randomizing gene labels, network structure and sample labels, respectively. (Figure courtesy of Trey Ideker).

Of course, iterative exploration of the high-dimensional space of all possible protein subnetworks seeded at all nodes of a highly branched and interconnected network requires the use of a greedy algorithm. Moreover, it is essential at all steps of the process to assess statistically the significance and performance of the selected subnetworks for their discriminative power when compared to randomly generated or permutated networks and sample classes on one hand, and to the established predictors on the other hand. This is also computationally demanding, but not a limiting factor of the approach with the ever-increasing high-performance computing power becoming available within laboratory workstations. The findings that are very encouraging is that subnetwork markers developed by this approach overlap much more extensively and are more accurate in the classification of metastasis than the previous predictors. An added value of the approach is that the selected protein subnetworks are significantly enriched in protein involved in common biological processes, thus providing potential insights into the molecular mechanisms involved in metastasis formation. In addition, the vast majority of the selected subnetworks contain highly interconnected proteins encoded by genes that are not by themselves discriminative, because they are not detected as differentially expressed. These include a significant number of previously identified breast cancer susceptibility genes absent in the established predictors. The approach proposed by Chuang should now be complemented by integration with other high-precision data sets (Hwang ). The reported advances are significant, but they also have limitations: the subnetwork markers selected from the two data sets still overlap only very partially, and contain only a minority of the cancer susceptibility genes. It is well possible that the level and/or modulation of expression of many of the functionally relevant genes are of low magnitude and beyond the detection capabilities of current microarray technologies. This suggests that there is a wealth of unexploited information in these and other similar data sets available in human and other species, and calls for re-analysis of sufficiently powered studies when high-quality protein–protein interaction data is available. Even more profound insights can be expected when novel and more sensitive measurement technologies will become available.
  11 in total

1.  Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data.

Authors:  Eran Segal; Michael Shapira; Aviv Regev; Dana Pe'er; David Botstein; Daphne Koller; Nir Friedman
Journal:  Nat Genet       Date:  2003-06       Impact factor: 38.330

2.  A data integration methodology for systems biology: experimental verification.

Authors:  Daehee Hwang; Jennifer J Smith; Deena M Leslie; Andrea D Weston; Alistair G Rust; Stephen Ramsey; Pedro de Atauri; Andrew F Siegel; Hamid Bolouri; John D Aitchison; Leroy Hood
Journal:  Proc Natl Acad Sci U S A       Date:  2005-11-21       Impact factor: 11.205

3.  Prediction of cancer outcome with microarrays: a multiple random validation strategy.

Authors:  Stefan Michiels; Serge Koscielny; Catherine Hill
Journal:  Lancet       Date:  2005 Feb 5-11       Impact factor: 79.321

4.  Towards a proteome-scale map of the human protein-protein interaction network.

Authors:  Jean-François Rual; Kavitha Venkatesan; Tong Hao; Tomoko Hirozane-Kishikawa; Amélie Dricot; Ning Li; Gabriel F Berriz; Francis D Gibbons; Matija Dreze; Nono Ayivi-Guedehoussou; Niels Klitgord; Christophe Simon; Mike Boxem; Stuart Milstein; Jennifer Rosenberg; Debra S Goldberg; Lan V Zhang; Sharyl L Wong; Giovanni Franklin; Siming Li; Joanna S Albala; Janghoo Lim; Carlene Fraughton; Estelle Llamosas; Sebiha Cevik; Camille Bex; Philippe Lamesch; Robert S Sikorski; Jean Vandenhaute; Huda Y Zoghbi; Alex Smolyar; Stephanie Bosak; Reynaldo Sequerra; Lynn Doucette-Stamm; Michael E Cusick; David E Hill; Frederick P Roth; Marc Vidal
Journal:  Nature       Date:  2005-09-28       Impact factor: 49.962

5.  Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer.

Authors:  Liat Ein-Dor; Or Zuk; Eytan Domany
Journal:  Proc Natl Acad Sci U S A       Date:  2006-04-03       Impact factor: 11.205

Review 6.  Critical review of published microarray studies for cancer outcome and guidelines on statistical analysis and reporting.

Authors:  Alain Dupuy; Richard M Simon
Journal:  J Natl Cancer Inst       Date:  2007-01-17       Impact factor: 13.506

7.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

Review 8.  Predictive ability of DNA microarrays for cancer outcomes and correlates: an empirical assessment.

Authors:  Evangelia E Ntzani; John P A Ioannidis
Journal:  Lancet       Date:  2003-11-01       Impact factor: 79.321

9.  How biologically relevant are interaction-based modules in protein networks?

Authors:  Juan F Poyatos; Laurence D Hurst
Journal:  Genome Biol       Date:  2004-11-01       Impact factor: 13.583

10.  Network-based classification of breast cancer metastasis.

Authors:  Han-Yu Chuang; Eunjung Lee; Yu-Tsueng Liu; Doheon Lee; Trey Ideker
Journal:  Mol Syst Biol       Date:  2007-10-16       Impact factor: 11.429

View more
  13 in total

1.  Omics-based molecular target and biomarker identification.

Authors:  Zhang-Zhi Hu; Hongzhan Huang; Cathy H Wu; Mira Jung; Anatoly Dritschilo; Anna T Riegel; Anton Wellstein
Journal:  Methods Mol Biol       Date:  2011

2.  A transcriptome-proteome integrated network identifies endoplasmic reticulum thiol oxidoreductase (ERp57) as a hub that mediates bone metastasis.

Authors:  Naiara Santana-Codina; Rafael Carretero; Rebeca Sanz-Pamplona; Teresa Cabrera; Emre Guney; Baldo Oliva; Philippe Clezardin; Omar E Olarte; Pablo Loza-Alvarez; Andrés Méndez-Lucas; Jose Carlos Perales; Angels Sierra
Journal:  Mol Cell Proteomics       Date:  2013-04-26       Impact factor: 5.911

3.  Finding lost genes in GWAS via integrative-omics analysis reveals novel sub-networks associated with preterm birth.

Authors:  Douglas Brubaker; Yu Liu; Junye Wang; Huiqing Tan; Ge Zhang; Bo Jacobsson; Louis Muglia; Sam Mesiano; Mark R Chance
Journal:  Hum Mol Genet       Date:  2016-12-01       Impact factor: 6.150

Review 4.  Relative expression analysis for molecular cancer diagnosis and prognosis.

Authors:  James A Eddy; Jaeyun Sung; Donald Geman; Nathan D Price
Journal:  Technol Cancer Res Treat       Date:  2010-04

Review 5.  In silico models of cancer.

Authors:  Lucas B Edelman; James A Eddy; Nathan D Price
Journal:  Wiley Interdiscip Rev Syst Biol Med       Date:  2010 Jul-Aug

6.  Relative expression analysis for identifying perturbed pathways.

Authors:  James A Eddy; Donald Geman; Nathan D Price
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2009

7.  Identifying tightly regulated and variably expressed networks by Differential Rank Conservation (DIRAC).

Authors:  James A Eddy; Leroy Hood; Nathan D Price; Donald Geman
Journal:  PLoS Comput Biol       Date:  2010-05-27       Impact factor: 4.475

8.  Multi-level reproducibility of signature hubs in human interactome for breast cancer metastasis.

Authors:  Chen Yao; Hongdong Li; Chenggui Zhou; Lin Zhang; Jinfeng Zou; Zheng Guo
Journal:  BMC Syst Biol       Date:  2010-11-09

9.  Discovery and scoring of protein interaction subnetworks discriminative of late stage human colon cancer.

Authors:  Rod K Nibbe; Sanford Markowitz; Lois Myeroff; Rob Ewing; Mark R Chance
Journal:  Mol Cell Proteomics       Date:  2008-12-19       Impact factor: 5.911

Review 10.  The evolving role of mass spectrometry in cancer biomarker discovery.

Authors:  Pei Wang; Jeffrey R Whiteaker; Amanda G Paulovich
Journal:  Cancer Biol Ther       Date:  2009-06-06       Impact factor: 4.742

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.