| Literature DB >> 28415609 |
Bo Gao1,2, Guojun Li1,2, Juntao Liu1, Yang Li1, Xiuzhen Huang2,3.
Abstract
It is widely accepted that cancer is driven by accumulated somatic mutations during the lifetime of an individual. Cancer mutations may target relatively small number of cell functional modules. The heterogeneity in different cancer patients makes it difficult to identify driver mutations or functional modules related to cancer. It is biologically desired to be capable of identifying cancer pathway modules through coordination between coverage and exclusivity. There have been a few approaches developed for this purpose, but they all have limitations in practice due to their computational complexity and prediction accuracy. We present a network based approach, CovEx, to predict the specific patient oriented modules by 1) discovering candidate modules for each considered gene, 2) extracting significant candidates by harmonizing coverage and exclusivity and, 3) further selecting the patient oriented modules based on a set cover model. Applying CovEx to pan-cancer datasets spanning 12 cancer types collecting from public database TCGA, it demonstrates significant superiority over the current leading competitors in performance. It is published under GNU GENERAL PUBLIC LICENSE and the source code is available at: https://sourceforge.net/projects/cancer-pathway/files/.Entities:
Keywords: coverage; driver gene; exclusivity; network module; pan-cancer
Mesh:
Year: 2017 PMID: 28415609 PMCID: PMC5482642 DOI: 10.18632/oncotarget.16433
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Flowchart of CovEx
In the first phase, a binary mutation matrix is built according to the mutation data; an influence network containing the mutated genes only is constructed based on an annotated PPI network; each candidate module is identified within a local influence network rooted at a node. In the second phase, the CovEx value for each candidate module is calculated and modules with small CovEx values are filtered. In the third phase, a minimum set cover model is applied to identify those patient specific crucial modules.
Figure 2Comparison Results of CovEx and HotNet2
The HotNet2 results are obtained based on the mutation frequency score on the same dataset and PPI networks. A1, B1 are obtained on HINT+HI2012; A2, B2 on iRefIndex; and A3, B3 on Multinet. A1, A2, A3 are obtained with λ=0; and B1, B2, B3 with λ = 1. k = 2, k = 3, k = 4, k = 5 correspond to different single CovEx predictions, and k = 2, 3, 4, 5 corresponds to CovEx predictions which are obtained by combining the four predictions for specific λ and PPI network.
Comparison results of CovEx and HotNet2 for single cancer types
| NC | SC | AC | NH | SH | AH | |
|---|---|---|---|---|---|---|
| BLCA | 30 | 16 | 53.3% | 147 | 42 | 28.6% |
| BRCA | 116 | 53 | 45.7% | 50 | 18 | 36.0% |
| COADREAD | 19 | 13 | 68.4% | 76 | 19 | 25.0% |
| GBM | 43 | 24 | 55.8% | 25 | 11 | 44.0% |
| HNSC | 66 | 30 | 45.5% | 93 | 23 | 24.7% |
| KIRC | 76 | 32 | 42.1% | 23 | 8 | 34.8% |
| LAML | 73 | 38 | 52.1% | 42 | 28 | 66.7% |
| LUAD | 52 | 28 | 53.8% | 240 | 50 | 20.8% |
| LUSC | 27 | 18 | 66.7% | 103 | 27 | 26.2% |
| OV | 28 | 15 | 53.5% | 25 | 7 | 28.0% |
| UCEC | 20 | 15 | 75.0% | 73 | 27 | 37.0% |
* The first column corresponds to all the analyzed cancer types. For each cancer type, the columns NC, SC, AC and NH, SH, AH correspond to the number of output genes, sensitivities and accuracies of CovEx and HotNet2, respectively.
Figure 3A candidate module mutation matrix
In the matrix, 8 patients which have exactly one mutated gene are module exclusive patients. To calculate the exclusivity value Ex of the module, we first calculate the ratio between number of mutated module exclusive patients and number of all mutated patients for each gene. We get 1/4 = 0.25 for the first and the fourth gene, 3/4 = 0.75 for the second and the third gene. The exclusivity value Ex of the module is (0.25 + 0.75 + 0.75 + 0.25)/4 = 0.5. While the coverage value Cov of the module is 11/11 = 1, the CovEx value of the module is 1*0.5 = 0.5.
Pseudocode of the algorithm for discovering crucial candidate modules
| * A patient is said to be covered by the current |
| Choose a module |
| Reset |
| Return |