| Literature DB >> 31790386 |
Vu V H Pham1, Lin Liu1, Cameron P Bracken2,3, Gregory J Goodall2,3, Qi Long4, Jiuyong Li1, Thuc D Le1.
Abstract
A key task in cancer genomics research is to identify cancer driver genes. As these genes initialise and progress cancer, understanding them is critical in designing effective cancer interventions. Although there are several methods developed to discover cancer drivers, most of them only identify coding drivers. However, non-coding RNAs can regulate driver mutations to develop cancer. Hence, novel methods are required to reveal both coding and non-coding cancer drivers. In this paper, we develop a novel framework named Controllability based Biological Network Analysis (CBNA) to uncover coding and non-coding cancer drivers (i.e. miRNA cancer drivers). CBNA integrates different genomic data types, including gene expression, gene network, mutation data, and contains a two-stage process: (1) Building a network for a condition (e.g. cancer condition) and (2) Identifying drivers. The application of CBNA to the BRCA dataset demonstrates that it is more effective than the existing methods in detecting coding cancer drivers. In addition, CBNA also predicts 17 miRNA drivers for breast cancer. Some of these predicted miRNA drivers have been validated by literature and the rest can be good candidates for wet-lab validation. We further use CBNA to detect subtype-specific cancer drivers and several predicted drivers have been confirmed to be related to breast cancer subtypes. Another application of CBNA is to discover epithelial-mesenchymal transition (EMT) drivers. Of the predicted EMT drivers, 7 coding and 6 miRNA drivers are in the known EMT gene lists.Entities:
Year: 2019 PMID: 31790386 PMCID: PMC6907873 DOI: 10.1371/journal.pcbi.1007538
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Coding cancer drivers and genes with mutations.
Genes with driver mutations are cancer drivers. Some genes which do not bear mutations but regulate driver mutations to progress cancer are also considered as cancer drivers.
Fig 2An illustration of CBNA.
(1) Building the network for a condition: (a) Prepare matched miRNA and TF/mRNA expression data, (b) Build miRNA-TF-mRNA network where nodes represent miRNAs/TFs/mRNAs and an edge between two nodes indicates there is a significant Pearson correlation between the expression of the two nodes, (c) Create the network by combining the miRNA-TF-mRNA network with the PPI network and other existing databases, and (2) Identifying coding and miRNA drivers: (a) Detect critical nodes, (b) Identify candidate cancer drivers.
Fig 3Determining the directions of edges in the miRNA-TF-mRNA regulatory network.
In a miRNA-TF-mRNA regulatory network, miRNAs can regulate TFs and mRNAs, TFs can regulate miRNAs and mRNAs, TFs/mRNAs can regulate other TFs/mRNAs. This motif is adapted from the work of [66]. In addition, the databases used to filter out edges of the network are shown on arrows.
Fig 4Characterising the controllability of the miRNA-TF-mRNA network.
(A) Identification of critical, ordinary, and redundant nodes in the network. (B) Average in-degree and accumulative in-degree distribution (i.e. the in-degree i with the probability p means that the probability to pick a node which has in-degree larger than or equal to i is p) for three different node types. (C) Average out-degree and accumulative out-degree distribution for three different node types.
Fig 5Validation using CGC.
The cancer drivers predicted by each method are validated against CGC. Each bar in the chart indicates the number of validated coding driver genes for each method.
Fig 6Comparison of Precision, Recall, and F1Score for the top ranking genes predicted by OncodriveCLUST, ActiveDriver, OncodriveFM, DriverNet, DawnRank, NetSig, and CBNA.
In each diagram, the x-axis is the number of the top ranking genes. The y-axis is the value of Precision, Recall, or F1 Score.
Fig 7Evaluation based on the total number of predicted driver genes.
(A) Number of predicted drivers, (B) Fraction of validated drivers in the CGC and raw count of predicted drivers indicated on top of each bar.
Fig 8CBNA using different adjusted p-value cutoffs.
The cancer drivers predicted by CBNA with different adjusted p-value cutoffs are validated by the CGC. Each bar in the figure shows the number of validated coding cancer drivers of CBNA with a cutoff.
Fig 9Overlap between different methods.
The diagram shows the overlap among the four methods in their top 50, 100, 150, and 200 predicted drivers. For each of the four cases, the horizontal bars at the bottom left show the numbers of predicted cancer drivers validated by the CGC for the four methods; the vertical bars and the dotted lines together indicate the numbers of validated cancer drivers which overlap with each other.
Fig 10Validation using a well-curated set of breast cancer drivers.
The cancer drivers predicted by the methods are validated by a well-curated set of breast cancer drivers. Each bar in the figure shows the number of validated coding cancer drivers of each method.
The top 20 mutated coding drivers using mutation density.
| No. | Predicted driver | Mutation density | In CGC? |
|---|---|---|---|
| 1 | 0.0127275 | ✓ | |
| 2 | 0.0041422 | ✓ | |
| 3 | 0.0034876 | ✓ | |
| 4 | 0.0026619 | ||
| 5 | 0.0025943 | ||
| 6 | 0.0025526 | ||
| 7 | 0.0024289 | ||
| 8 | 0.0022676 | ||
| 9 | 0.0019795 | ||
| 10 | 0.0018190 | ||
| 11 | 0.0013765 | ||
| 12 | 0.0012686 | ||
| 13 | 0.0012274 | ||
| 14 | 0.0011784 | ||
| 15 | 0.0010925 | ||
| 16 | 0.0010813 | ✓ | |
| 17 | 0.0010451 | ||
| 18 | 0.0010043 | ✓ | |
| 19 | 0.0009942 | ||
| 20 | 0.0009913 |
Fig 11Identification of coding and miRNA cancer drivers.
The chart shows the percentage of different types of cancer drivers identified by CBNA from the BRCA dataset.
miRNA BRCA drivers predicted by CBNA.
| No. | Predicted driver | Confirmed | References |
|---|---|---|---|
| 1 | ✓ | [ | |
| 2 | ✓ | [ | |
| 3 | ✓ | [ | |
| 4 | ✓ | [ | |
| 5 | ✓ | [ | |
| 6 | |||
| 7 | ✓ | [ | |
| 8 | ✓ | [ | |
| 9 | ✓ | [ | |
| 10 | ✓ | [ | |
| 11 | ✓ | [ | |
| 12 | ✓ | [ | |
| 13 | ✓ | [ | |
| 14 | |||
| 15 | |||
| 16 | |||
| 17 | ✓ | [ |
Predicted drivers which are specific to each breast cancer subtype.
| Subtype | Coding drivers with mutations (Top 10) | Coding drivers without mutations | miRNA drivers |
|---|---|---|---|
| Luminal A | |||
| Luminal B | |||
| Basal | |||
| Her2 | |||
| Normal-like |
Top 20 coding and 17 miRNA drivers predicted for EMT in breast cancer.
| Coding drivers | miRNA drivers |
|---|---|