| Literature DB >> 31760937 |
Wenxiang Zhang1, Xiujuan Lei Ieee Member2, Chen Bian1.
Abstract
BACKGROUND: It's a very urgent task to identify cancer genes that enables us to understand the mechanisms of biochemical processes at a biomolecular level and facilitates the development of bioinformatics. Although a large number of methods have been proposed to identify cancer genes at recent times, the biological data utilized by most of these methods is still quite less, which reflects an insufficient consideration of the relationship between genes and diseases from a variety of factors.Entities:
Keywords: Identify cancer genes; Multiple biological data; Quadruple layer heterogeneous network; Two-rounds random walk with restart
Mesh:
Substances:
Year: 2019 PMID: 31760937 PMCID: PMC6876101 DOI: 10.1186/s12859-019-3123-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Detail information of the data
| Description | Value |
|---|---|
| Number of nodes in PPI network | 9519 |
| Number of interactions in PPI network | 37,048 |
| Number of nodes in pathway network | 10,717 |
| Number of interactions in pathway network | 302,546 |
| Number of nodes in gene network | 13,596 |
| Number of interactions in gene network | 331,127 |
| Number of protein complexes | 2298 |
| Number of proteins in protein complexes | 3169 |
| Number of nodes in cancer-cancer similarity network | 76 |
| Number of interactions in cancer-cancer similarity network | 155 |
| Number of genes associated with cancer | 160 |
| Number of gene-cancer associations | 251 |
| Number of nodes in microRNA functional similarity network | 940 |
| Number of edges in microRNA functional similarity network | 8385 |
| Number of microRNA in microRNA-gene interactions | 736 |
| Number of genes in microRNA-gene interactions | 2566 |
| Number of microRNA-gene interactions | 8046 |
| Number of microRNA in microRNA-cancer associations | 810 |
| Number of cancers in microRNA-cancer associations | 38 |
| Number of microRNA-cancer associations | 4297 |
| Number of nodes in lncRNA functional similarity network | 700 |
| Number of edges in lncRNA functional similarity network | 5349 |
| Number of lncRNA in lncRNA-gene interactions | 207 |
| Number of genes in lncRNA-gene interactions | 114 |
| Number of lncRNA-gene interactions | 1122 |
| Number of lncRNA in lncRNA-cancer associations | 347 |
| Number of cancers in lncRNA-cancer associations | 40 |
| Number of lncRNA-cancer associations | 839 |
| Number of lncRNA in microRNA-lncRNA interactions | 31 |
| Number of microRNA in microRNA-lncRNA interactions | 45 |
| Number of microRNA-lncRNA interactions | 146 |
Fig. 1The framework of TRWR-MB: a showing data processing, which contains constructing merged network, microRNA network, cancer disease network, and lncRNA network. b constructing a quadruple layer heterogeneous network based on (a), and calculating the transition matrix based on the quadruple layer heterogeneous network and protein complexes. c identifying cancer-related gene by TRWR-MB
The AUC result for α ∈ [0, 1] with an increment of 0.1
| α | 0 | 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | 1 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| γ | AUC | |||||||||||
| 0.1 | 0.8961 | 0.8996 | 0.9013 | 0.9024 | 0.9027 | 0.9037 | 0.9035 | 0.9036 | 0.9034 | 0.9028 | 0.9032 | |
| 0.2 | 0.9004 | 0.9045 | 0.9060 | 0.9070 | 0.9082 | 0.9087 | 0.9088 | 0.9086 | 0.9086 | 0.9084 | 0.9078 | |
| 0.3 | 0.9027 | 0.9068 | 0.9086 | 0.9094 | 0.9099 | 0.9104 | 0.9111 | 0.9115 | 0.9112 | 0.9105 | 0.9086 | |
| 0.4 | 0.9039 | 0.9082 | 0.9098 | 0.9103 | 0.9110 | 0.9117 | 0.9121 | 0.9128 | 0.9126 | 0.9117 | 0.9083 | |
| 0.5 | 0.9047 | 0.9088 | 0.9104 | 0.9111 | 0.9115 | 0.9120 | 0.9125 | 0.9124 | 0.9132 | 0.9127 | 0.9075 | |
| 0.6 | 0.9061 | 0.91020 | 0.9118 | 0.9132 | 0.9136 | 0.9148 | 0.9152 | 0.9158 | 0.9170 | 0.9138 | ||
| 0.7 | 0.9055 | 0.9098 | 0.9110 | 0.9119 | 0.9125 | 0.9130 | 0.9136 | 0.9140 | 0.9149 | 0.9160 | 0.9109 | |
| 0.8 | 0.9054 | 0.9089 | 0.9100 | 0.9112 | 0.9114 | 0.9121 | 0.9126 | 0.9133 | 0.9139 | 0.9148 | 0.9090 | |
| 0.9 | 0.9047 | 0.9082 | 0.9090 | 0.9097 | 0.9103 | 0.9110 | 0.9115 | 0.9116 | 0.9126 | 0.9134 | 0.9073 | |
δ = 0.5, ηi = 0.25, σ = 0.6, 0.9178 is bold, which represent the best of auc value
Fig. 2The histogram of AUC for all results
The number of cancer genes in the top k% of Rank_score
| Algorithms | TOP 5% | TOP 7% | TOP 10% | TOP 15% |
|---|---|---|---|---|
| TRWR-MB | ||||
| RWRMH | 20 | 24 | 43 | |
| RWRM | 21 | 23 | 28 | 42 |
| RWRH | 20 | 20 | 27 | 41 |
| RWR | 18 | 22 | 28 | 38 |
where k is equal to 5, 7, 10 and 15, respectively; The five position in bold represent the best reseult
Fig. 3Comparison between TRWR-MB and other algorithms
The prediction result of new cancer genes
| Rank | Breast cancer (MIM:114480) | Lung cancer (MIM:211980) | Colon Cancer (MIM:114500) | Prostate cancer (MIM:176807) | Leukemia (MIM:601626) | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Gene | PMID | Gene | PMID | Gene | PMID | Gene | PMID | Gene | PMID | |
| 1 | BRCA1 | 25,329,591 | TP53 | 27,182,622 | STK11 | – | TP53 | 27,375,016 | PDGFRB | 29,133,777 |
| 2 | NF1 | – | EXT1 | 30,032,850 | MLH1 | 28,224,663 | RNASEL | – | BCR | – |
| 3 | PTEN | 28,844,858 | BLM | – | FH | – | HSPA1A | – | NF1 | – |
| 4 | AXIN2 | 26,514,524 | PIK3R1 | – | NFKBIB | – | FGFR3 | – | PTPN11 | 27,859,216 |
| 5 | PLAG1 | – | MAPK12 | – | MSH2 | 28,537,674 | MAD2L1 | – | CBL | 28,082,680 |
| 6 | FOXO1 | 28,397,066 | PIK3C2A | – | OAZ1 | – | CTNNB1 | 29,229,583 | ARHGAP26 | – |
| 7 | GPC3 | – | PIK3C2B | – | PIK3R1 | – | EGFR | 27,793,843 | IL12RB2 | – |
| 8 | WT1 | 29,016,617 | RAF1 | 28,884,046 | HRAS | – | STK11 | – | MAPK12 | – |
| 9 | CAV1 | 25,945,613 | NF1 | 24,535,670 | KRAS | 27,338,794 | MYC | – | TP53 | 27,959,731 |
| 10 | DICER1 | 26,460,550 | CNKSR1 | – | GSK3B | – | MAX | 29,108,267 | DOT1L | 27,294,782 |
If the cancer-related genes aren’t verified by literature, the correspond PMIDs are marked as -
Fig. 4Network represent of breast cancer gene and new top 10 breast cancer gene