| Literature DB >> 29208006 |
Qifeng Wang1,2, Hualei Gan1,2, Chengshu Chen3, Yifeng Sun3, Jinying Chen3, Midie Xu1,2, Weiwei Weng1,2, Liyu Cao4, Qinghua Xu5,6, Jian Wang7,8.
Abstract
BACKGROUND: Renal cancers account for more than 3% of all adult malignancies and cause more than 23,400 deaths per year in China alone. The four most common types of kidney tumours include clear cell, papillary, chromophobe and benign oncocytoma. These histological subtypes vary in their clinical course and prognosis, and different clinical strategies have been developed for their management. Some kidney tumours can be very difficult to distinguish based on the pathological assessment of morphology and immunohistochemistry.Entities:
Keywords: Gene expression profiling; Microarray; Next-generation sequencing; Quantitative real-time PCR; Renal cell carcinomas
Mesh:
Year: 2017 PMID: 29208006 PMCID: PMC5717815 DOI: 10.1186/s13046-017-0651-9
Source DB: PubMed Journal: J Exp Clin Cancer Res ISSN: 0392-9078
Fig. 1Flow diagram of gene expression signature identification and performance assessment. Gene expression profiles were retrieved from the U133A/ U133APlus2 microarray through a bioinformatics approach. Gene expression analyses were performed in the training set (six GEO microarray datasets) first and then validated in TCGA cohort two validation sets (and the FUSCC cohort)
Summary of sample information
| Samples | Training set | Test Set 1 | Test Set 2 | |||
|---|---|---|---|---|---|---|
| n | % | n | % | n | % | |
| Normal tissue | 35 | 11.9 | 129 | 12.6 | 8 | 6.2 |
| RCC subtypes | ||||||
| ccRCC | 106 | 35.9 | 534 | 52.4 | 26 | 20.2 |
| chRCC | 42 | 14.2 | 66 | 6.5 | 40 | 31 |
| pRCC | 66 | 22.4 | 291 | 28.5 | 28 | 21.7 |
| Oncocytoma | 46 | 15.6 | 0 | 0 | 27 | 20.9 |
| Total | 295 | 100 | 1020 | 100 | 129 | 100 |
Descripotion of 44 genes annotation
| Gene Symbol | Gene description | Cytoband |
|---|---|---|
| ABCA8 | ATP-binding cassette, sub-family A (ABC1), member 8 | 17q24 |
| AKR1C2 | aldo-keto reductase family 1, member C2 | 10p15-p14 |
| ALDOB | aldolase B, fructose-bisphosphate | 9q21.3-q22.2 |
| ANGPTL4 | angiopoietin-like 4 | 19p13.3 |
| AQP6 | aquaporin 6, kidney specific | 12q13 |
| ASS1 | argininosuccinate synthase 1 | 9q34.1 |
| ATP6V0A4 | ATPase, H+ transporting, lysosomal V0 subunit a4 | 7q34 |
| C7 | complement component 7 | 5p13 |
| CALB1 | calbindin 1, 28 kDa | 8q21.3 |
| CLDN8 | claudin 8 | 21q22.11 |
| CRYAB | crystallin, alpha B | 11q22.3-q23.1 |
| DEFB1 | defensin, beta 1 | 8p23.1 |
| DHRS2 | dehydrogenase/reductase (SDR family) member 2 | 14q11.2 |
| FLRT3 | fibronectin leucine rich transmembrane protein 3 | 20p11 |
| FOSB | FBJ murine osteosarcoma viral oncogene homolog B | 19q13.32 |
| GSTA1 | glutathione S-transferase alpha 1 | 6p12.1 |
| HILPDA | hypoxia inducible lipid droplet-associated | 7q32.1 |
| IGFBP1 | insulin-like growth factor binding protein 1 | 7p12.3 |
| IGFBP6 | insulin-like growth factor binding protein 6 | 12q13 |
| KRT7 | keratin 7, type II | 12q13.13 |
| LCN2 | lipocalin 2 | 9q34 |
| MAL | mal, T-cell differentiation protein | 2q11.1 |
| MAOB | monoamine oxidase B | Xp11.23 |
| MMP7 | matrix metallopeptidase 7 (matrilysin, uterine) | 11q21-q22 |
| MT1G | metallothionein 1G | 16q13 |
| NDUFA4L2 | NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 4-like 2 | 12q13.3 |
| NNMT | nicotinamide N-methyltransferase | 11q23.1 |
| PAH | phenylalanine hydroxylase | 12q22-q24.2 |
| PCP4 | Purkinje cell protein 4 | 21q22.2 |
| PLIN2 | perilipin 2 | 9p22.1 |
| RHCG | Rh family, C glycoprotein | 15q25 |
| RNF128 | ring finger protein 128, E3 ubiquitin protein ligase | Xq22.3 |
| S100A2 | S100 calcium binding protein A2 | 1q21 |
| SERPINA5 | serpin peptidase inhibitor, clade A (alpha-1 antiproteinase, antitrypsin), member 5 | 14q32.1 |
| SFTPB | surfactant protein B | 2p12-p11.2 |
| SLC12A1 | solute carrier family 12 (sodium/potassium/chloride transporter), member 1 | 15q15-q21.1 |
| SLC18A2 | solute carrier family 18 (vesicular monoamine transporter), member 2 | 10q25 |
| STAP1 | signal transducing adaptor family member 1 | 4q13.2 |
| TACSTD2 | tumor-associated calcium signal transducer 2 | 1p32 |
| TFPI2 | tissue factor pathway inhibitor 2 | 7q22 |
| TMEM255A | transmembrane protein 255A | Xq24 |
| UMOD | uromodulin | 16p12.3 |
| VCAN | versican | 5q14.3 |
| ZNF395 | zinc finger protein 395 | 8p21.1 |
Fig. 2Hierarchical clustering analysis of 44-gene expression data in the training set and test sets. a Hierarchical clustering of 295 samples from the training set. Normalized gene expression intensities were shifted to mean = 0, and rescaled to STD = 1 to enhance the expression differences. The average linkage hierarchical clustering method was performed where the metric of similarity was Pearson’s correlation between every pair of samples. The right panel indicates the official symbol of 44 genes. The left panel shows a dendrogram of hierarchical clustering of these genes. Colored pixels capture the magnitude of the expression for any gene, where shades of red and blue represent over-expression and under-expression, respectively, relative to the mean for each gene. The upper panel shows a dendrogram of hierarchical clustering of samples. The histological type of each sample is indicated in the bottom panel, with chromophobe tumours shown in purple, clear cell tumours shown in orange, oncocytoma samples indicated in yellow, papillary tumours in pink, and adjacent tissue samples in blue. The samples clustered into five groups that closely follow the histological types. Among the four tumour subtypes, the oncocytoma and chromophobe samples cluster together, whereas the conventional tumours show a higher degree of similarity to papillary tumours. b Hierarchical clustering of 1020 samples from the Test Set 1. c Hierarchical clustering of 129 samples from the Test Set 2
GO and KEGG pathway analysis of 44 gene
| Data source | Feature ID | Name | Genes |
|
|---|---|---|---|---|
| GO Molecular Function | GO:0031995 | insulin-like growth factor II binding | IGFBP1, IGFBP6 | 1.25E-04 |
| GO Molecular Function | GO:0031994 | insulin-like growth factor I binding | IGFBP1, IGFBP6 | 1.60E-04 |
| GO Molecular Function | GO:0005539 | glycosaminoglycan binding | SERPINA5, VCAN | 1.00E-03 |
| GO Molecular Function | GO:0016597 | amino acid binding | ASS1, PAH | 1.00E-03 |
| GO Biological Process | GO:0007588 | excretion | AQP6, ATP6V0A4, SLC12A1, UMOD | 1.76E-06 |
| GO Biological Process | GO:0071242 | cellular response to ammonium ion | ASS1, SLC18A2 | 4.50E-06 |
| GO Biological Process | GO:0048878 | chemical homeostasis | SLC12A1, UMOD | 1.35E-05 |
| GO Biological Process | GO:0051412 | response to corticosterone | FOSB, MAOB, SLC18A2 | 1.38E-05 |
| GO Cellular Component | GO:0005615 | extracellular space | ANGPTL4, DEFB1, FLRT3, HILPDA, IGFBP1, IGFBP6, LCN2, MMP7, SERPINA5, SFTPB, TACSTD2, UMOD, VCAN | 8.42E-07 |
| GO Cellular Component | GO:0016324 | apical plasma membrane | AQP6, ATP6V0A4, MAL, RHCG, SLC12A1, UMOD | 2.02E-05 |
| GO Cellular Component | GO:0005576 | extracellular region | ANGPTL4, C7, DEFB1, IGFBP1, IGFBP6, LCN2, MMP7, PLIN2, SERPINA5, TFPI2, UMOD, VCAN | 3.77E-05 |
| GO Cellular Component | GO:0005578 | proteinaceous extracellular matrix | ANGPTL4, FLRT3, MMP7, TFPI2, VCAN | 1.43E-04 |
| Cancer Gene Index [CGI] | C4863 | prostate cancer | AKR1C2, ANGPTL4, ASS1, GSTA1, IGFBP1, IGFBP6, LCN2, MAL, MT1G, S100A2, TFPI2, VCAN | 3.20E-06 |
| DISEASE DB (Univ of Copenhagen) | DOID:6245 | Renal oncocytoma | AQP6, CLDN8, KRT7 | 6.14E-06 |
| Cancer Gene Index [CGI] | C9385 | renal-cell cancer | ASS1, DEFB1, IGFBP6, LCN2, SERPINA5, UMOD, VCAN | 1.41E-05 |
| Cancer Gene Index [CGI] | C2978 | cysts | GSTA1, IGFBP1, IGFBP6, LCN2, MAL, UMOD | 1.45E-05 |
| KEGG | path:hsa05030 | Cocaine addiction | FOSB, MAOB, SLC18A2 | 1.68E-04 |
| REACTOME | REACT_15518 | Transmembrane transport of small molecules | ABCA8, AQP6, ATP6V0A4, LCN2, RHCG, SLC12A1, SLC18A2 | 2.28E-04 |
| KEGG | path:hsa05031 | Amphetamine addiction | FOSB, MAOB, SLC18A2 | 4.19E-04 |
| KEGG | path:hsa01230 | Biosynthesis of amino acids | ALDOB, ASS1, PAH | 1.00E-03 |
| KEGG | path:hsa00360 | Phenylalanine metabolism | MAOB, PAH | 1.00E-03 |
Fig. 3Protein-protein interaction network of the 44-gene set. The network of protein-protein interactions of the 44-gene set inferred using the IMEx Interactome Database. Blue nodes indicate the proteins present in the 44-gene set, whereas grey nodes represent proteins not in the 44-gene set. The size of the node is proportional to the degree of connections. The large nodes represent a few high-degree hub nodes, while most small nodes have only a few connections
Top 20 enriched pathways of 55 genes within network
| Feature ID | Name | Data source | Genes |
|
|---|---|---|---|---|
| WP366 | TGF beta Signaling Pathway | WIKIPATHWAYS | APP ATF2 EP300 FOSB HDAC1 MYC PTK2 SP1 TP53 | 4.97E-07 |
| WP138 | Androgen receptor signaling pathway | WIKIPATHWAYS | EP300 HDAC1 KAT5 MDM2 PTK2 RELA SP1 | 4.27E-06 |
| path:hsa05202 | Transcriptional misregulation in cancer | KEGG | CEBPB HDAC1 MDM2 MYC PTK2 RELA SP1 TP53 | 2.02E-05 |
| WP2377 | Integrated Pancreatic Cancer Pathway | WIKIPATHWAYS | APP EGR1 EP300 HNF4A MDM2 MYC SP1 TP53 | 3.49E-05 |
| path:hsa05169 | Epstein-Barr virus infection | KEGG | ATF2 CD44 EP300 HDAC1 MDM2 MYC RELA TP53 | 4.12E-05 |
| path:hsa05030 | Cocaine addiction | KEGG | ATF2 FOSB MAOB RELA SLC18A2 | 1.33E-04 |
| REACT_169274 | Cellular Senescence | REACTOME | CEBPB MDM2 RELA SP1 TERF1 TP53 UBC | 3.43E-04 |
| path:hsa05031 | Amphetamine addiction | KEGG | ATF2 FOSB HDAC1 MAOB SLC18A2 | 4.35E-04 |
| REACT_169325 | Oncogene Induced Senescence | REACTOME | MDM2 SP1 TP53 UBC | 4.68E-04 |
| REACT_120734 | SMAD2/SMAD3:SMAD4 heterotrimer regulates transcription | REACTOME | HDAC1 MYC SP1 UBC | 1.00E-03 |
| path:hsa05220 | Chronic myeloid leukemia | KEGG | HDAC1 MDM2 MYC RELA TP53 | 1.00E-03 |
| REACT_120956 | Cellular responses to stress | REACTOME | CEBPB EP300 MDM2 RELA SP1 TERF1 TP53 UBC | 1.00E-03 |
| WP1984 | Integrated Breast Cancer Pathway | WIKIPATHWAYS | EP300 HDAC1 MDM2 MYC SP1 TP53 | 1.00E-03 |
| WP254 | Apoptosis | WIKIPATHWAYS | IGF2 MDM2 MYC RELA TP53 | 1.00E-03 |
| REACT_121061 | Transcriptional activity of SMAD2/SMAD3:SMAD4 heterotrimer | REACTOME | HDAC1 MYC SP1 UBC | 1.00E-03 |
| path:hsa05166 | HTLV-I infection | KEGG | ATF2 EGR1 EP300 KAT5 MYC RELA TP53 | 2.00E-03 |
| WP399 | Wnt Signaling Pathway and Pluripotency | WIKIPATHWAYS | CD44 EP300 MMP7 MYC TP53 | 2.00E-03 |
| REACT_118780 | NOTCH1 Intracellular Domain Regulates Transcription | REACTOME | EP300 HDAC1 MYC UBC | 2.00E-03 |
| REACT_299 | Signaling by NOTCH | REACTOME | EP300 HDAC1 MYC TP53 UBC | 3.00E-03 |
| h_arfPathway | Tumor Suppressor Arf Inhibits Ribosomal Biogenesis | BIOCARTA | MDM2 MYC TP53 | 3.00E-03 |
Performance characteristics of the 44-gene expression signature in two test sets
| Test Set 1 | Test Set 2 | |||||
|---|---|---|---|---|---|---|
| n | Sensitivity | Specificity | n | Sensitivity | Specificity | |
| Normal | 129 | 94.6% | 100.0% | 8 | 100.0% | 98.3% |
| ccRCC | 534 | 94.2% | 95.7% | 26 | 96.2% | 96.1% |
| chRCC | 66 | 90.9% | 97.8% | 40 | 92.5% | 97.8% |
| pRCC | 291 | 92.1% | 97.8% | 28 | 89.3% | 99.0% |
| Oncocytoma | / | / | / | 27 | 92.6% | 98.2% |
| Total | 1020 | Overall accuracy = 93.4% | 129 | Overall accuracy = 93.0% | ||