Pora Kim, Aekyung Park, Guangchun Han, Hua Sun, Peilin Jia, Zhongming Zhao.
Abstract
Tissue-specific gene expression is critical to understanding biological processes, physiological conditions, and disease. Identifying tissue-specific genes (TissGenes) and using them appropriately can provide important insights into disease mechanisms and organ-specific therapeutic targets. To better understand the tissue-specific features of each cancer type and to advance the discovery of clinically relevant genes or mutations, we built TissGDB (Tissue specific Gene DataBase in cancer), available at http://zhaobioinfo.org/TissGDB. We collected and curated 2461 TissGenes across 22 tissue types that matched the 28 cancer types of The Cancer Genome Atlas (TCGA), drawing on three representative tissue-specific gene expression resources: The Human Protein Atlas (HPA), Tissue-specific Gene Expression and Regulation (TiGER), and Genotype-Tissue Expression (GTEx). For these 2461 TissGenes, we performed gene expression, somatic mutation, and prognostic marker-based analyses across the 28 cancer types using TCGA data. Our analyses identified hundreds of TissGenes, including genes that universally kept or lost tissue-specific expression, that carry additional features: cancer type-specific isoform expression, fusion with oncogenes or tumor suppressor genes, and markers of protective or risk prognosis. TissGDB provides seven categories of annotations: TissGeneSummary, TissGeneExp, TissGene-miRNA, TissGeneMut, TissGeneNet, TissGeneProg, and TissGeneClin.
Year: 2018 PMID: 29036590 PMCID: PMC5753286 DOI: 10.1093/nar/gkx850
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1. Overview of TissGDB. (A) Venn diagram of TissGenes among the three representative tissue-specific gene expression resources. We selected the 2461 TissGenes that were present in at least two of the three resources. Of these, 546 genes were present in all three resources (confident TissGenes, cTissGenes). (B) The overall expression distribution of TissGenes across the 28 cancer types, shown by principal component analysis (PCA).
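The selection rule in Figure 1A (keep a gene if at least two of the three resources report it, and call it confident if all three do) can be sketched as a simple counting step. This is only an illustration. The function name `select_tissgenes` and the toy gene lists are hypothetical, and the record does not state whether the overlap was computed per gene or per gene-tissue pair, so the per-gene matching here is an assumption.

```python
from collections import Counter


def select_tissgenes(resources, min_support=2):
    """Split genes by how many resources report them.

    resources: dict mapping a resource name to a set of gene symbols.
    Returns (genes in >= min_support resources, genes in all resources).
    """
    # Count, for each gene, the number of resources that list it.
    counts = Counter(gene for genes in resources.values() for gene in set(genes))
    tissgenes = {g for g, n in counts.items() if n >= min_support}
    confident = {g for g, n in counts.items() if n == len(resources)}
    return tissgenes, confident


# Toy gene lists for illustration only (NOT the real HPA/TiGER/GTEx content).
toy_resources = {
    "HPA": {"ALB", "INS", "MYH7"},
    "TiGER": {"ALB", "INS", "TTN"},
    "GTEx": {"ALB", "MYH7", "GFAP"},
}

tissgenes, confident = select_tissgenes(toy_resources)
print(sorted(tissgenes))
print(sorted(confident))
```

With the toy input, the genes seen in only one resource are dropped, and the confident set is by construction a subset of the selected set, mirroring the relation between the 546 cTissGenes and the 2461 TissGenes.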
Figure 2. Overview of TissGene annotations in TissGDB. (A) Gene expression-based annotations of TissGenes. (B) Somatic mutation-based annotations of TissGenes. (C) Prognosis-based annotations of TissGenes.
Annotation entry statistics for TissGenes and cTissGenes
| Data type | # entries | # TissGenes [a] (% of total 2461) | # cTissGenes [b] (% of total 546) |
|---|---|---|---|
| Tissue-specific genes | # genes | | |
| HPA [c] | 2050 | 1498 (60.9%) | 546 (100.0%) |
| TiGER [d] | 3090 | 1728 (70.2%) | 546 (100.0%) |
| GTEx [e] | 6039 | 2242 (91.1%) | 546 (100.0%) |
| Cancer genes | | | |
| CCG [f] | 4050 | 443 (18.0%) | 87 (15.9%) |
| Expression | # genes | | |
| TCGA [g] | 20 530 | 2444 (99.3%) | 546 (100.0%) |
| GTEx | 56 318 | 2461 (100.0%) | 546 (100.0%) |
| Mutation | # genes | | |
| TCGA | 39 571 | 2461 (100.0%) | 546 (100.0%) |
| Copy number variation | # genes | | |
| TCGA | 24 776 | 2461 (100.0%) | 546 (100.0%) |
| Fusion gene | # genes | | |
| ChimerDB3.0 [h] | 10 713 | 1393 (56.6%) | 293 (53.7%) |
| TCGA Fusion Gene Data Portal [i] | 7765 | 718 (29.2%) | 155 (28.4%) |
| Survival analysis | # clin.info | | |
| TCGA | 11 896 | 2461 (100.0%) | 546 (100.0%) |
| Molecule | # molecules | | |
| DrugBank [j] | 8206 drugs | 218 (8.9%) | 61 (11.2%) |
| UniProt [k] | 2374 proteins | 2446 (99.4%) | 545 (99.8%) |
| Phenotype | # phenotype | | |
| DisGeNet [l] | 15 094 disease ID | 1844 (74.9%) | 434 (79.5%) |

[a] Tissue-specific genes (TissGenes).
[b] Confident TissGenes (cTissGenes).
[c] The Human Protein Atlas.
[d] Tissue-specific Gene Expression and Regulation (TiGER).
[e] Genotype-Tissue Expression.
[f] Catalogue of cancer genes.
[g] The Cancer Genome Atlas.
[h] ChimerDB3.0: an enhanced database for fusion genes from cancer transcriptome and literature data mining.
[i] TCGA fusion gene data portal.
[j] Drugs related to the TissGenes, from the DrugBank database.
[k] The Universal Protein Resource (UniProt).
[l] Gene-level disease annotation from the DisGeNet database.
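The percentages in the table are each count divided by the relevant total (2461 TissGenes or 546 cTissGenes). As a quick sanity check, a few rows can be recomputed as follows. The helper name `pct` and the choice of rows are illustrative, and one-decimal rounding is assumed from the way the table reports values.

```python
TOTAL_TISSGENES = 2461
TOTAL_CTISSGENES = 546


def pct(count, total):
    """Percentage of `total` covered by `count`, rounded to one decimal."""
    return round(100.0 * count / total, 1)


# (TissGene count, cTissGene count) copied from three rows of the table.
rows = {
    "HPA": (1498, 546),
    "CCG": (443, 87),
    "DrugBank": (218, 61),
}

for name, (n_tiss, n_ctiss) in rows.items():
    print(name, pct(n_tiss, TOTAL_TISSGENES), pct(n_ctiss, TOTAL_CTISSGENES))
```

The recomputed values agree with the table entries for these rows (for example, 1498 of 2461 is 60.9% and 87 of 546 is 15.9%).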
Figure 3. TissGenes that keep or lose tissue specificity in cancer. From the gene expression patterns of all TissGenes across the 28 cancer types, we identified 294 TissGenes that keep tissue specificity (TissGenes-KTS) and 209 TissGenes that lose tissue specificity (TissGenes-LTS). (A) The percentage and number of TissGenes-KTS and TissGenes-LTS across the 28 cancer types. (B) Enriched biological processes of TissGenes-KTS per cancer type. (C) Enriched biological processes of TissGenes-LTS.