| Literature DB >> 26519468 |
Pora Kim1, Feixiong Cheng1, Junfei Zhao1, Zhongming Zhao2.
Abstract
Accumulating evidence has demonstrated that rewiring of metabolism in cells is an important hallmark of cancer. The percentage of patients killed by metabolic disorder has been estimated to be 30% of the advanced-stage cancer patients. Thus, a systematic annotation of cancer cell metabolism genes is imperative. Here, we present ccmGDB (Cancer Cell Metabolism Gene DataBase), a comprehensive annotation database for cell metabolism genes in cancer, available at http://bioinfo.mc.vanderbilt.edu/ccmGDB. We assembled, curated, and integrated genetic, genomic, transcriptomic, proteomic, biological network and functional information for over 2000 cell metabolism genes in more than 30 cancer types. In total, we integrated over 260 000 somatic alterations including non-synonymous mutations, copy number variants and structural variants. We also integrated RNA-Seq data in various primary tumors, gene expression microarray data in over 1000 cancer cell lines and protein expression data. Furthermore, we constructed cancer or tissue type-specific, gene co-expression based protein interaction networks and drug-target interaction networks. Using these systematic annotations, the ccmGDB portal site provides 6 categories: gene summary, phenotypic information, somatic mutations, gene and protein expression, gene co-expression network and drug pharmacological information with a user-friendly interface for browsing and searching. ccmGDB is developed and maintained as a useful resource for the cancer research community.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26519468 PMCID: PMC4702820 DOI: 10.1093/nar/gkv1128
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Annotation entry statistics for all cell metabolism genes
| Data type | # Entries | # cmGenesa | # ccmGenesb |
|---|---|---|---|
| Total 2071 (%) | Total 514 (%) | ||
| Cancer genes | # genes | ||
| Oncogenesc | 41 | 41 (2.0%) | 41 (8.0%) |
| Tumor suppressor genesd | 92 | 92 (4.4%) | 92 (17.9%) |
| Cancer Gene Censuse | 50 | 50 (2.4%) | 50 (9.7%) |
| Cancer genesf | 382 | 382 (18.4%) | 382 (74.3%) |
| Network of cancer genesg | 133 | 133 (6.4%) | 133 (25.9%) |
| Significantly mutated driver genesh | 110 | 110 (5.3%) | 110 (21.4%) |
| Pathway | # pathways (# genes) | ||
| KEGGi | 42 (922) | 922 (44.5%) | 210 (40.9%) |
| REACTOMEj | 27 (1597) | 1597 (77.1%) | 406 (79.0%) |
| Interactionk | # interactions | ||
| Physical interactionl | 679 507 | 1968 (95.0%) | 481 (93.6%) |
| Metabolic interactionm | 21 353 | 1149 (55.5%) | 245 (47.7%) |
| Signaling interactionn | 78 548 | 1131 (54.6%) | 361 (70.2%) |
| Expression | # samples | ||
| CCLEo | 1037 | 1893 (91.4%) | 488 (95.0%) |
| TCGAp | 4150 (tumor) | 2061 (99.5%) | 514 (100%) |
| 461 (normal) | |||
| RPPAq | 4775 | 24 (1.2%) | 21 (4.1%) |
| Mutation | # mutations | ||
| TCGAr | 102 399 SNVss | 2026 (97.8%) | 508 (98.8%) |
| COSMICt | 151 238 SNVs | 2040 (98.5%) | 510 (99.2%) |
| 5916 Indelsu | 1213 (58.5%) | 340 (66.1%) | |
| 6288 CNVsv | 1836 (88.6%) | 461 (90.0%) | |
| 1971 SVsw | 782 (37.7%) | 225 (43.8%) | |
| Chitars2.0x | 4729 chimeric transcripts | 1392 (67.2%) | 392 (76.3%) |
| Molecule | # molecules | ||
| DrugBanky | 4059 drugs | 946 (45.7%) | 269 (52.3%) |
| UniProtz | 2062 proteins | 2069 (99.9%) | 514 (100%) |
aCell metabolism genes.
bCancer cell metabolism genes.
cOncogenes from Cancer Genes.
dTumor suppressors from TSGene.
eCancer genes from Census of human cancer genes.
fCancer genes from CancerGenes. gCancer genes from NCG4.0.
hSignificantly mutated genes per 18 TCGA cancer types from 12 articles.
iCell metabolism related pathway in KEGG.
jCell metabolism related pathway in REACTOME.
kPathwayCommons interaction.
lGenes having ‘interacts-with’, ‘reacts-with and neighbor-of’ interactions among PathwayCommons.
mGenes having ‘catalysis-precedes’ interactions among PathwayCommons.
nGenes having ‘controls-production-of’, ‘in-complex-with’, ‘controls-state-change-of’, ‘controls-phosphorylation-of’, ‘controls-transport-of’, ‘controls-expression-of’, ‘consumption-controlled-by’, ‘controls-transport-of-chemical’ and ‘chemical-affects’ interactions among PathwayCommons.
oGene expression for cancer cell lines of 24 cancer types.
pRNA-seq data for primary tumor and normal samples.
qProtein expression values.
rMutations called for TCGA exome-seq data by TCGA investigators.
sSomatic nucleotide variations.
tAll types of variants collected in COSMIC.
uInsertions and/or deletions.
vCopy number variations.
wStructural variants.
xHuman chimeric transcripts.
yRelated drug with the cmGene.
zUniversal protein ID for the cmGene.
Figure 1.Overview of ccmGDB. Cancer cell metabolism gene database is composed of 6 categorized annotations from the integration of genotypic data, phenotypic data, pharmacological data and network analysis for all 2071 cell metabolism genes.
Figure 2.Mutation category in ccmGDB. (A) Structural variants annotation for PTEN. A Circos plot based on chromosomes and detailed information including cancer type specific statistics and fusion gene information is provided. (B) Copy number variations annotation for PTEN. Copy number gain is colored in red and copy number loss in green. (C) Somatic single nucleotide variations and small insertions and deletions for IDH1 such as mutation frequency per tissue and protein structure based representation.
Figure 3.Expression category in ccmGDB. Using this category, user can compare the expression level per cancer/tissue type at a glance. (A) Gene expression plot of mTOR for cancer cell lines using CCLE data. (B) Protein expression plot of PTEN using TCPA data. (C) Differential gene expression plot of SLC2A1 for primary cancer tissues using TCGA data. (D) Correlation plot between gene expression and copy number of mTOR for TCGA data.
Figure 4.Gene–gene network category and pharmacological category. (A) Co-expressed protein interaction network using the top 20 co-expressed genes for IDH1. By gene set enrichment analysis (GSEA) of the 20 genes with the cancer/tissue type-specific information in this category, user can infer differentially activated pathways. The target gene is colored in red and other cmGenes in orange. (B) Enriched KEGG pathway information using all interacting genes from PathwayCommons. (C) Pharmacological information for SLC2A1. Gene-centric network, drug-centric networks and detailed information for each drug including the two-dimensional structure information are provided.