| Literature DB >> 28155630 |
Pi-Jing Wei1, Di Zhang2, Junfeng Xia3, Chun-Hou Zheng4.
Abstract
BACKGROUND: Cancer is a complex disease which is characterized by the accumulation of genetic alterations during the patient's lifetime. With the development of the next-generation sequencing technology, multiple omics data, such as cancer genomic, epigenomic and transcriptomic data etc., can be measured from each individual. Correspondingly, one of the key challenges is to pinpoint functional driver mutations or pathways, which contributes to tumorigenesis, from millions of functional neutral passenger mutations.Entities:
Keywords: Cancer; Driver genes; Expression data; Interaction network; Mutation data
Mesh:
Year: 2016 PMID: 28155630 PMCID: PMC5259866 DOI: 10.1186/s12859-016-1332-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Schematic of the LNDriver. Genes in somatic mutations are firstly applied to GAM to filter long genes and then they will combine with CNV to construct mutation matrix. The bipartite graph is constructed based on mutation data, expression data and gene-gene influence network, where the blue nodes on the left bipartite graph represent the mutated gene and the black nodes on the right represent the outlying patient-gene events from the gene expression matrix. Then greedy algorithm is applied to identify candidate driver genes. Finally, enrichment analysis is employed to these candidates to explore their roles in pathways
Description of datasets
| Tumor type | Number of tumor expression samples | Number of somatic mutation samples | Samples of tumor expression∩somatic samples |
|---|---|---|---|
| THCA | 513 | 435 | 433 |
| HNSC | 522 | 509 | 501 |
| KIRC | 534 | 417 | 415 |
Fig. 2a HNSC precision. b HNSC recall. c HNSC F1score. d KIRC precision. e KIRC recall. f KIRC F1score. g THCA precision. h THCA recall. i THCA F1score. The comparison of precision, recall and F1score for top ranking genes in LNDriver and other methods. The X axis represents the number of top ranking genes and the Y axis represents the score of the precision, recall and F1score respectively
The rare driver genes in HNSC
| Rank | Gene | Cases with mutations | Mutation frequency (%) |
| CGC gene |
|---|---|---|---|---|---|
| 14 |
| 10 | 1.996008 | 0.011832 | YES |
| 15 |
| 9 | 1.796407 | 0.012938 | YES |
| 18 |
| 7 | 1.397206 | 0.016769 | NO |
| 22 |
| 4 | 0.798403 | 0.019237 | NO |
| 23 |
| 5 | 0.998004 | 0.030388 | YES |
| 24 |
| 8 | 1.596806 | 0.032729 | YES |
| 30 |
| 3 | 0.598802 | 0.044476 | NO |
| 32 |
| 4 | 0.798403 | 0.048264 | NO |
The rare driver genes in KIRC
| Rank | Gene | Cases with mutations | Mutation frequency (%) |
| CGC genes |
|---|---|---|---|---|---|
| 3 |
| 2 | 0.481928 | 0.001378 | NO |
| 5 |
| 7 | 1.686747 | 0.003100 | YES |
| 6 |
| 6 | 1.445783 | 0.003214 | YES |
| 7 |
| 4 | 0.963855 | 0.004018 | NO |
| 8 |
| 2 | 0.481928 | 0.00551 | YES |
| 9 |
| 5 | 1.204819 | 0.007462 | NO |
| 11 |
| 1 | 0.240964 | 0.010332 | YES |
| 12 |
| 5 | 1.204819 | 0.011135 | NO |
| 13 |
| 2 | 0.481928 | 0.011135 | NO |
| 14 |
| 4 | 0.963855 | 0.012513 | YES |
| 15 |
| 4 | 0.963855 | 0.012513 | NO |
| 16 |
| 1 | 0.240964 | 0.013546 | NO |
| 17 |
| 5 | 1.204819 | 0.016186 | YES |
| 19 |
| 1 | 0.240964 | 0.018138 | YES |
| 21 |
| 2 | 0.481928 | 0.021008 | NO |
| 22 |
| 2 | 0.481928 | 0.021008 | NO |
| 23 |
| 5 | 1.204819 | 0.021008 | NO |
| 24 |
| 5 | 1.204819 | 0.023533 | YES |
| 25 |
| 3 | 0.722892 | 0.023533 | YES |
| 26 |
| 1 | 0.240964 | 0.027322 | NO |
| 27 |
| 5 | 1.204819 | 0.027322 | YES |
| 29 |
| 3 | 0.722892 | 0.032717 | NO |
| 30 |
| 1 | 0.240964 | 0.039031 | NO |
| 31 |
| 2 | 0.481928 | 0.039031 | NO |
| 32 |
| 4 | 0.963855 | 0.039031 | NO |
| 33 |
| 8 | 1.927711 | 0.044886 | NO |
| 34 |
| 2 | 0.481928 | 0.044886 | YES |
| 35 |
| 1 | 0.240964 | 0.044886 | NO |
| 36 |
| 7 | 1.686747 | 0.044886 | NO |
The rare driver genes in THCA
| Rank | Gene | Cases with mutations | Mutation frequency (%) |
| CGC genes |
|---|---|---|---|---|---|
| 3 |
| 6 | 1.385681 | 0.000101 | YES |
| 4 |
| 3 | 0.692841 | 0.000101 | YES |
| 6 |
| 2 | 0.461894 | 0.002121 | NO |
| 7 |
| 2 | 0.461894 | 0.004141 | NO |
| 8 |
| 2 | 0.461894 | 0.005858 | YES |
| 9 |
| 3 | 0.692841 | 0.006868 | YES |
| 10 |
| 1 | 0.230947 | 0.008484 | NO |
| 11 |
| 1 | 0.230947 | 0.009191 | NO |
| 12 |
| 2 | 0.461894 | 0.009191 | YES |
| 14 |
| 1 | 0.230947 | 0.009191 | NO |
| 15 |
| 1 | 0.230947 | 0.009696 | NO |
| 16 |
| 6 | 1.385681 | 0.010706 | YES |
| 17 |
| 4 | 0.923788 | 0.018079 | YES |
| 18 |
| 1 | 0.230947 | 0.020099 | NO |
| 19 |
| 8 | 1.847575 | 0.020099 | YES |
| 20 |
| 5 | 1.154734 | 0.022725 | YES |
| 21 |
| 1 | 0.230947 | 0.022725 | YES |
| 22 |
| 1 | 0.230947 | 0.026462 | NO |
| 24 |
| 5 | 1.154734 | 0.035754 | NO |
| 25 |
| 6 | 1.385681 | 0.041713 | YES |
| 26 |
| 4 | 0.923788 | 0.041713 | NO |
| 27 |
| 7 | 1.616628 | 0.041713 | YES |
| 28 |
| 5 | 1.154734 | 0.041713 | NO |
| 29 |
| 5 | 1.154734 | 0.041713 | NO |
| 30 |
| 2 | 0.461894 | 0.041713 | NO |
| 31 |
| 2 | 0.461894 | 0.041713 | NO |
| 32 |
| 1 | 0.230947 | 0.041713 | NO |
| 33 |
| 1 | 0.230947 | 0.048783 | NO |
| 34 |
| 1 | 0.230947 | 0.048783 | NO |