Literature DB >> 24713437

HyperModules: identifying clinically and phenotypically significant network modules with disease mutations for biomarker discovery.

Alvin Leung1, Gary D Bader1, Jüri Reimand1.   

Abstract

SUMMARY: Correlating disease mutations with clinical and phenotypic information such as drug response or patient survival is an important goal of personalized cancer genomics and a first step in biomarker discovery. HyperModules is a network search algorithm that finds frequently mutated gene modules with significant clinical or phenotypic signatures from biomolecular interaction networks.
AVAILABILITY AND IMPLEMENTATION: HyperModules is available in Cytoscape App Store and as a command line tool at www.baderlab.org/Sofware/HyperModules. CONTACT: Juri.Reimand@utoronto.ca or Gary.Bader@utoronto.ca SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24713437      PMCID: PMC4103591          DOI: 10.1093/bioinformatics/btu172

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Establishing functional links between genetic variation and human disease is a key goal of cancer genome sequencing (Gonzalez-Perez ) and genome-wide association studies (Hardy and Singleton, 2009). Complex diseases like cancer are often driven by infrequent changes in multiple genes in pathways (Vogelstein ). Network analysis helps interpret mutations in systems context and find disease genes, pathways and biomarkers for precision medicine (Barabasi ). Discovery of modules (subnetworks) in biological networks helps isolate systems with disease-related properties and reduces interactome complexity. A growing number of methods are available for this purpose. A landmark paper combines gene expression signatures with protein–protein interactions (PPI) to find predictive modules of cancer outcome (Chuang ). The NETBAG method studies genetic associations and copy number variants to find autism-related modules (Gilman ). HotNet detects frequently mutated pathways in networks (Vandin , 2012). Net-Cox builds prognostic cancer signatures in network analysis of gene expression data (Zhang ). The Reactome FI Cytoscape plugin uncovers prognostic gene modules from networks and gene expression data (Wu and Stein, 2012). Network-based stratification predicts tumor subtypes from mutations in network regions (Hofree ). Such modules maximize a feature of genes such as differential expression, disease mutation frequency or enrichment of interactions. Because clinical profiles of patients are increasingly available in cancer genomics efforts such as the The Cancer Genome Atlas (TCGA) pan-cancer project (Weinstein ), new methods are needed to discover multivariate biomarkers in networks. We recently analyzed cancer mutations in phosphorylation signaling and found that kinase–substrate networks are informative of patient survival and therapy response (Reimand and Bader, 2013; Reimand ). In particular, we found network modules with rare mutations in ovarian cancer patients with improved prognosis. We created the HyperModules method to systematically discover clinically correlated modules from gene and protein networks (Reimand and Bader, 2013) based on our earlier work on functional subnetwork discovery (Altmae ; Reimand ). Here we present the previously unavailable software in open-source Java as a command line tool for automated work and a Cytoscape app for interactive graphical analysis.

2 SOFTWARE

HyperModules assumes that clinically informative mutations of complex disease occur in systems of closely interacting genes. The greedy network search algorithm focuses on a local network area, defined by a central seed node (a mutated gene) and its surrounding subnetwork. All mutated genes are sequentially considered as seeds in module discovery. Search starts from the seed and grows the module toward increased benefit by adding connected genes that best improve clinical significance. This objective is driven by statistical tests where patients defined by the module are compared with other patients. Categorical clinical variables are studied with Fisher’s exact test and survival times with log-rank test. Cox regression is currently not supported; however, we plan to add this feature in the future. To establish statistical significance of detected modules, we build a null distribution by searching networks with permuted gene names. Each module of the true network is quantified with an empirical P-value reflecting the fraction of seed-specific modules from shuffled networks exceeding the significance of the true module. This removes artifacts of the greedy strategy and corrects for topological features such as highly connected nodes. The analysis pipeline is outlined in Figure 1. Interaction networks are loaded into Cytoscape using standard features (Shannon ). HyperModules requires gene mutations and patient clinical information in two tables. The user selects type of clinical analysis and columns in data table (survival time or variable such as tumor relapse). Survival analysis requires follow-up time and vital status of patients. Detected modules are studied further with network visualization, survival curves and data export. We tested HyperModules on protein networks to find survival modules with cancer mutations. We extracted three human PPI networks of variable size from iRefWeb (Turner ), and five cancer mutation datasets from the International Cancer Genome Consortium portal version 12 (Supplementary Fig. S1). For example, network analysis of 30 000 interactions with 121 liver cancer patients, 686 mutated genes and 10 000 permutations takes 10 min on an 8-core computer with 16 GB RAM. HyperModules is thus applicable to a range of networks and mutation datasets.
Fig. 1.

HyperModules requires three inputs—(1) mutated genes in patients, (2) patient clinical information and (3) protein or gene network. Search is performed for all mutated genes as seeds (4). Network visualization, clinical variable statistics and data export facilitate further analysis (5)

HyperModules requires three inputs—(1) mutated genes in patients, (2) patient clinical information and (3) protein or gene network. Search is performed for all mutated genes as seeds (4). Network visualization, clinical variable statistics and data export facilitate further analysis (5)

3 EXAMPLE ANALYSIS

An example dataset is provided in Supplementary File S1. It comprises 183 ovarian cancer patients from the TCGA study (Cancer Genome Atlas Research Network, 2008) and the network of 4823 kinase–substrate interactions from our earlier study (Reimand and Bader, 2013). The ovarian cancer mutations are restricted to 163 proteins with single nucleotide variants affecting protein phosphorylation sites or kinase domains. Two sets of modules were computed with 10 000 network permutations and are shown in Supplementary Figures S2–S3. First, the search for survival correlations in the kinase–substrate network with log-rank test identified 19 modules, where associated patients have significantly different survival rates compared with other patients in the cohort (empirical P ≤ 0.05). Second, the categorical variable search with Fisher’s exact test revealed fivemodules with significant enrichment of alive patients (empirical P ≤ 0.05). The modules are also summarized in Supplementary Table S1.

4 DISCUSSION

HyperModules is a biological network-mining algorithm that reveals modules of interacting genes with clinically informative disease mutations. Diverse biomolecular interaction networks can be analyzed, including PPI networks, gene regulatory networks and curated biological pathways. Disease mutations are also broadly defined. Although we initially studied cancer point mutations, other types of alterations such as copy number and gene expression changes can be used. HyperModules finds correlations with groups of genes where mutations may be infrequent but the signature strengthens through network integration. Such modules are not often directly usable as biomarkers because of small sample size; however, we believe that our approach helps discover genes and pathways as potential multivariate biomarkers for further experiments.
  19 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

2.  Rare de novo variants associated with autism implicate a large functional network of genes involved in formation and function of synapses.

Authors:  Sarah R Gilman; Ivan Iossifov; Dan Levy; Michael Ronemus; Michael Wigler; Dennis Vitkup
Journal:  Neuron       Date:  2011-06-09       Impact factor: 17.173

Review 3.  Network medicine: a network-based approach to human disease.

Authors:  Albert-László Barabási; Natali Gulbahce; Joseph Loscalzo
Journal:  Nat Rev Genet       Date:  2011-01       Impact factor: 53.242

4.  Algorithms for detecting significantly mutated pathways in cancer.

Authors:  Fabio Vandin; Eli Upfal; Benjamin J Raphael
Journal:  J Comput Biol       Date:  2011-03       Impact factor: 1.479

5.  Discovery of mutated subnetworks associated with clinical data in cancer.

Authors:  Fabio Vandin; Patrick Clay; Eli Upfal; Benjamin J Raphael
Journal:  Pac Symp Biocomput       Date:  2012

6.  iRefWeb: interactive analysis of consolidated protein interaction data and their supporting evidence.

Authors:  Brian Turner; Sabry Razick; Andrei L Turinsky; James Vlasblom; Edgard K Crowdy; Emerson Cho; Kyle Morrison; Ian M Donaldson; Shoshana J Wodak
Journal:  Database (Oxford)       Date:  2010-10-12       Impact factor: 3.451

7.  Integrated genomic analyses of ovarian carcinoma.

Authors: 
Journal:  Nature       Date:  2011-06-29       Impact factor: 49.962

8.  Genomewide association studies and human disease.

Authors:  John Hardy; Andrew Singleton
Journal:  N Engl J Med       Date:  2009-04-15       Impact factor: 91.245

9.  A network module-based method for identifying cancer prognostic signatures.

Authors:  Guanming Wu; Lincoln Stein
Journal:  Genome Biol       Date:  2012-12-10       Impact factor: 13.583

10.  GraphWeb: mining heterogeneous biological networks for gene modules with functional significance.

Authors:  Jüri Reimand; Laur Tooming; Hedi Peterson; Priit Adler; Jaak Vilo
Journal:  Nucleic Acids Res       Date:  2008-05-06       Impact factor: 16.971

View more
  13 in total

1.  Pathway and network analysis of cancer genomes.

Authors:  Pau Creixell; Jüri Reimand; Syed Haider; Guanming Wu; Tatsuhiro Shibata; Miguel Vazquez; Ville Mustonen; Abel Gonzalez-Perez; John Pearson; Chris Sander; Benjamin J Raphael; Debora S Marks; B F Francis Ouellette; Alfonso Valencia; Gary D Bader; Paul C Boutros; Joshua M Stuart; Rune Linding; Nuria Lopez-Bigas; Lincoln D Stein
Journal:  Nat Methods       Date:  2015-07       Impact factor: 28.547

2.  Computational methods and opportunities for phosphorylation network medicine.

Authors:  Yian Ann Chen; Steven A Eschrich
Journal:  Transl Cancer Res       Date:  2014-06-01       Impact factor: 1.241

3.  Integrative analysis of mutational and transcriptional profiles reveals driver mutations of metastatic breast cancers.

Authors:  Ji-Hyun Lee; Xing-Ming Zhao; Ina Yoon; Jin Young Lee; Nam Hoon Kwon; Yin-Ying Wang; Kyung-Min Lee; Min-Joo Lee; Jisun Kim; Hyeong-Gon Moon; Yongho In; Jin-Kao Hao; Kyung-Mii Park; Dong-Young Noh; Wonshik Han; Sunghoon Kim
Journal:  Cell Discov       Date:  2016-08-30       Impact factor: 10.849

4.  Frequent mutations in acetylation and ubiquitination sites suggest novel driver mechanisms of cancer.

Authors:  Soumil Narayan; Gary D Bader; Jüri Reimand
Journal:  Genome Med       Date:  2016-05-12       Impact factor: 11.117

Review 5.  Computational Methods for Characterizing Cancer Mutational Heterogeneity.

Authors:  Fabio Vandin
Journal:  Front Genet       Date:  2017-06-14       Impact factor: 4.599

Review 6.  From sequencing data to gene functions: co-functional network approaches.

Authors:  Jung Eun Shim; Tak Lee; Insuk Lee
Journal:  Anim Cells Syst (Seoul)       Date:  2017-01-31       Impact factor: 1.815

7.  Network signatures link hepatic effects of anti-diabetic interventions with systemic disease parameters.

Authors:  Thomas Kelder; Lars Verschuren; Ben van Ommen; Alain J van Gool; Marijana Radonjic
Journal:  BMC Syst Biol       Date:  2014-09-11

8.  Gene integrated set profile analysis: a context-based approach for inferring biological endpoints.

Authors:  Jeanne Kowalski; Bhakti Dwivedi; Scott Newman; Jeffery M Switchenko; Rini Pauly; David A Gutman; Jyoti Arora; Khanjan Gandhi; Kylie Ainslie; Gregory Doho; Zhaohui Qin; Carlos S Moreno; Michael R Rossi; Paula M Vertino; Sagar Lonial; Leon Bernal-Mizrachi; Lawrence H Boise
Journal:  Nucleic Acids Res       Date:  2016-01-29       Impact factor: 16.971

Review 9.  Biological Networks for Cancer Candidate Biomarkers Discovery.

Authors:  Wenying Yan; Wenjin Xue; Jiajia Chen; Guang Hu
Journal:  Cancer Inform       Date:  2016-09-04

10.  The 'dnet' approach promotes emerging research on cancer patient survival.

Authors:  Hai Fang; Julian Gough
Journal:  Genome Med       Date:  2014-08-26       Impact factor: 11.117

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.