| Literature DB >> 29662161 |
Hyeongmin Kim1, Yong-Min Kim2.
Abstract
To discover functional gene clusters across cancers, we performed a systematic pan-cancer analysis of 33 cancer types. We identified genes that were associated with somatic mutations and were the cores of a co-expression network. We found that multiple cancer types have relatively exclusive hub genes individually; however, the hub genes cooperate with each other based on their functional relationship. When we built a protein-protein interaction network of hub genes and found nine functional gene clusters across cancer types, the gene clusters divided not only the region of the network map, but also the function of the network by their distinct roles related to the development and progression of cancer. This functional relationship between the clusters and cancers was underpinned by the high expression of module genes and enrichment of programmed cell death, and known candidate cancer genes. In addition to protein-coding hub genes, non-coding hub genes had a possible relationship with cancer. Overall, our approach of investigating cancer genes enabled finding pan-cancer hub genes and common functional gene clusters shared by multiple cancer types based on the expression status of the primary tumour and the functional relationship of genes in the biological network.Entities:
Mesh:
Year: 2018 PMID: 29662161 PMCID: PMC5902616 DOI: 10.1038/s41598-018-24379-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic diagram of the research process.
Figure 2Number of variants of pan-cancer-wide selected genes (PSGs) per aliquot.
Figure 3Single-depth protein-protein interaction (PPI) network of protein-coding pan-cancer-wide selected genes (pcPSGs).
Figure 4Gene clustering of single-depth PPI network of pcPSGs and gene ontology (GO) term summarisation of gene clusters. (a) Gene clusters of nine subnetworks are shown in different colours. The colour indicator is located in the top right corner. The genes that are not in subnetworks are grey colour. (b) GO term summary of gene clusters of subnetworks are presented as a tree map. The name of the subnetwork is on top of the tree map, and the relative size of blocks shows the significance of enrichment of the GO term. Similar GO terms were combined and grouped into a large block of the same colour. The term of most uniqueness in the large block is shown on the white box and located at the centre of the block.
Figure 5Expression patterns of subnetwork genes.
Figure 6Level of cluster occupation of TCGA datasets in subnetworks and distribution of cluster occupation ratios. (a) Occupation ratio of TCGA dataset is shown for each cluster. The length of bar indicates occupation ratio of the dataset that takes the number of shared genes into account. (b) For all occupation ratio of (a), the ratio against their rank is shown. (c) A stair-step plot shows the occupation ratio against the rank for each cluster. (d) The smoothing plot using loess regression of (c). The indicator of colours for clusters is on the bottom right corner.
The number of programmed cell death (PCD) genes and enrichment analysis.
| No. of genes | No. of PCD genes | No. of type 1 PCD genes | No. of type 2 PCD genes | No. of type 3 PCD genes | |
|---|---|---|---|---|---|
| Cluster 1 | 342 | 0 | 0 | 0 | 0 |
| Cluster 2 | 536 | 13 | 5 | 7 | 6 |
| Cluster 3 | 69 | 1 | 0 | 0 | 1 |
| Cluster 4 | 158 | 0 | 0 | 0 | 0 |
| Cluster 5 | 151 | 6 | 6*** | 0 | 1 |
| Cluster 6 | 521 | 28**,*** | 19**,*** | 16**,*** | 8 |
| Cluster 7 | 399 | 29**,*** | 18**,*** | 18**,*** | 6 |
| Cluster 8 | 857 | 49**,*** | 36**,*** | 28**,*** | 10 |
| Cluster 9 | 38 | 4**,*** | 1 | 3**,*** | 0 |
| Clusters | 2,219 | 79* | 50* | 38* | 21 |
| Not in clusters | 4,597 | 124 | 64 | 43 | 37 |
| Network | 6,816 | 203* | 114* | 81* | 58* |
| Not in network | 50,472 | 113 | 28 | 50 | 41 |
| Pan-cancer-wide selected genes | 4,546 | 43* | 19* | 15 | 20* |
| Non-pan-cancer-wide selected genes | 52,742 | 273 | 123 | 116 | 79 |
*PCD genes were enriched in genes in clusters, networks, and PSGs compared to other genes not in clusters, networks, or PSGs.
**PCD genes were enriched compared to other genes in clusters.
***PCD genes were enriched compared to other genes in networks.
The number of known and candidate cancer genes and enrichment analysis.
| No. of genes | No. of known cancer genes | No. of candidate cancer genes | |
|---|---|---|---|
| Cluster 1 | 342 | 0 | 30 |
| Cluster 2 | 536 | 21 | 40 |
| Cluster 3 | 69 | 1 | 5 |
| Cluster 4 | 158 | 2 | 6 |
| Cluster 5 | 151 | 7 | 16 |
| Cluster 6 | 521 | 41** | 59** |
| Cluster 7 | 399 | 33** | 47** |
| Cluster 8 | 857 | 89** | 92** |
| Cluster 9 | 38 | 1 | 6 |
| Clusters | 2,219 | 131* | 194* |
| Not in clusters | 4,597 | 201 | 296 |
| Network | 6,816 | 332* | 490* |
| Not in network | 50,472 | 176 | 544 |
| Pan-cancer-wide selected genes | 4,546 | 67* | 278* |
| Not in pan-cancer-wide selected genes | 52,742 | 441 | 756 |
*Known and candidate cancer genes were enriched in clusters, networks, and PSGs compared to other genes.
**Known and candidate cancer genes were enriched in these clusters compared to other clusters and other genes in networks.