| Literature DB >> 35817785 |
Chinthalapally V Rao1,2,3, Chao Xu4,5, Yuting Zhang6,7, Adam S Asch8,9, Hiroshi Y Yamada10,11,12.
Abstract
Genomic instability (GI) in cancer facilitates cancer evolution and is an exploitable target for therapy purposes. However, specific genes involved in cancer GI remain elusive. Causal genes for GI via expressions have not been comprehensively identified in colorectal cancers (CRCs). To fill the gap in knowledge, we developed a data mining strategy (Gene Expression to Copy Number Alterations; "GE-CNA"). Here we applied the GE-CNA approach to 592 TCGA CRC datasets, and identified 500 genes whose expression levels associate with CNA. Among these, 18 were survival-critical (i.e., expression levels correlate with significant differences in patients' survival). Comparison with previous results indicated striking differences between lung adenocarcinoma and CRC: (a) less involvement of overexpression of mitotic genes in generating genomic instability in the colon and (b) the presence of CNA-suppressing pathways, including immune-surveillance, was only partly similar to those in the lung. Following 13 genes (TIGD6, TMED6, APOBEC3D, EP400NL, B3GNT4, ZNF683, FOXD4, FOXD4L1, PKIB, DDB2, MT1G, CLCN3, CAPS) were evaluated as potential drug development targets (hazard ratio [> 1.3 or < 0.5]). Identification of specific CRC genomic instability genes enables researchers to develop GI targeting approach. The new results suggest that the "targeting genomic instability and/or aneuploidy" approach must be tailored for specific organs.Entities:
Mesh:
Year: 2022 PMID: 35817785 PMCID: PMC9273645 DOI: 10.1038/s41598-022-15692-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Identifying genes associated with Copy Number Alterations in colon adenocarcinoma with the “Gene Expression to Copy Number Alterations” (“GE-CNA”) approach. For all genes, we recorded CNA for high expressor tumors (N = 10) and for low expressor tumors (N = 10). The CNA from the “high expressor” and “low expressor” groups were compared using unpaired t-test for each gene, testing the correlation between gene expression and numbers of CNA (q-value < 0.05). Genes whose high expression was associated with high CNA were annotated as CNA suppressors, while genes whose high expression was associated with low CNA were annotated as CNA suppressors. Genes specifically associated with a type of CNA ([a] amplification/insertion [amp/ins] CNA, often associated with Microsatellite Instability [MIN], and [b] deletion CNA, often associated with mitotic error-mediated Chromosome Instability [CIN]), were identified. Figure was generated with cBioportal (https://www.cbioportal.org/datasets).
Figure 2Pathway analysis of CNA suppressors. The 247 CNA facilitator genes in Supplementary Table 1 did not show significant enrichment in a pathway. The 253 CNA suppressor genes in Supplementary Table 2 were further subcategorized to amplification/insertion CNA suppressors (Supplementary Table 5) and deletion CNA suppressors (Supplementary Table 6). Amp/ins CNA suppressors include only 23 genes, while deletion CNA suppressors include 253 genes, suggesting that CRC cells with amplification/insertion CNA and deletion CNA are suppressed through different modalities. Deletion CNA suppressor genes show enrichment in the (A) Antigen Presentation Pathway, (B) Interferon signaling pathway, and (C) JAK-STAT signaling pathway, suggesting that CRC cells carrying CIN-associated deletion CNA are targeted by these immune-associated pathways and that they represent an immunosurveillance mechanism of CIN cells in CRC. Purple highlighting indicates particular genes with significant GE-CNA correlations and/or a cluster of such genes in the IPA pathways. Figures were generated with IPA (Ingenuity Pathway Analysis, QIAGEN, Inc., https://www.qiagenbioinformatics.com/products/ingenuity-pathway-analysis).
Figure 3Lung adenocarcinomas carry higher CNA than do CRCs at all stages and in both types of CNA (amp/ins CNA and deletion CNA). (A) At all stages, lung adenocarcinomas carry higher numbers of CNA (all types of CNA) than do CRCs (green: lung adenocarcinoma, orange: CRC). The difference is particularly notable at earlier stages. For stages 1–3, the difference was statistically significant (Bonferroni corrected p-value < 0.05). The trend is the same in both (B) Amp/ins CNA and (C) deletion CNA. Figures were generated from R v4.0.3 (https://www.R-project.org/).
Data for Gene Expression and Copy Number Alteration (GE-CNA) on initially-identified 27 “survival critical” genes.
| High expression group | Low expression group | |||||
|---|---|---|---|---|---|---|
| Average # of CNVs | SD | Average # of CNVs | SD | |||
| CAPS | 14,892.8 | 4412.615 | 2347.6 | 2412.918 | 1.66E-06 | 0.001333 |
| CCDC115 | 14,359.3 | 5092.752 | 3448.8 | 4751.292 | 0.000104 | 0.011347 |
| ATP6AP1 | 12,126.8 | 4855.072 | 3390.3 | 3034.437 | 0.000218 | 0.017405 |
| NBEAP1 | 14,700.2 | 4783.089 | 6138 | 3591.707 | 0.000311 | 0.021176 |
| SPANXC | 14,463 | 1873.704 | 6901.6 | 4500.857 | 0.000361 | 0.022546 |
| TIGD6 | 14,767.4 | 4393.891 | 6187.2 | 4481.801 | 0.00041 | 0.023795 |
| C7ORF13 | 13,171.1 | 5601.803 | 3521 | 4346.452 | 0.000484 | 0.025349 |
| TMEM184A | 12,130.2 | 2904.1 | 4049.2 | 4924.642 | 0.00048 | 0.025349 |
| F8A1 | 14,166.3 | 4575.244 | 5996.1 | 4508.695 | 0.0008 | 0.032097 |
| LZTS3 | 11,336.4 | 4583.305 | 3650 | 4075.844 | 0.000933 | 0.034692 |
| OLMALINC | 13,573.4 | 6549.111 | 4271.7 | 3132.232 | 0.00139 | 0.042547 |
| WARS | 1849.9 | 2042.03 | 12,385.9 | 3529.064 | 8.72E-07 | 0.001118 |
| FOXD4L1 | 3028.6 | 3274.165 | 13,484.1 | 3968.791 | 5.65E-06 | 0.002873 |
| VWA5B2 | 5133 | 3112.157 | 12,829.7 | 2520.75 | 1.16E-05 | 0.004001 |
| DDB2 | 3286.4 | 3690.682 | 14,951.7 | 5218.157 | 2.72E-05 | 0.006084 |
| EPOR | 3647 | 3455.342 | 11,305.9 | 2558.743 | 3.27E-05 | 0.00635 |
| ROBO3 | 4185.4 | 3290.998 | 12,872.1 | 4243.073 | 8.69E-05 | 0.01051 |
| PKIB | 2970 | 4398.217 | 11,411.5 | 3568.829 | 0.000193 | 0.016055 |
| TMED6 | 4789.4 | 4295.765 | 12,499.6 | 3371.77 | 0.000339 | 0.021989 |
| APOBEC3D | 2925.1 | 2451.516 | 13,097.9 | 6163.835 | 0.00042 | 0.023986 |
| B3GNT4 | 5193.3 | 4830.085 | 13,538.8 | 3636.482 | 0.000437 | 0.024567 |
| CLCN3 | 5522 | 3753.278 | 12,298.4 | 3620.061 | 0.00066 | 0.029537 |
| FOXD4 | 4987 | 4658.925 | 12,645.9 | 3707.015 | 0.000789 | 0.032024 |
| ZNF683 | 3835 | 4311.881 | 10,868.4 | 3498.788 | 0.000892 | 0.033785 |
| EP400P1 | 3915.2 | 2512.353 | 12,900.2 | 6281.908 | 0.001276 | 0.040754 |
| KLHDC7B | 5436.4 | 5035.776 | 14,321.6 | 5600.079 | 0.001555 | 0.044346 |
| MT1G | 6955.3 | 4964.891 | 14,235.3 | 3544.305 | 0.001618 | 0.045402 |
The data on GE-CNA correlation (see Fig. 1 for details) for the select 27 genes. There are significant differences in CNVs (= CNAs) between high expressor and low expressor of the select 27 genes. Genes whose high expression is associated with higher CNV/CNA are annotated as CNA facilitators. Genes whose high expression is associated with lower CNV/CNA are annotated as CNA suppressors.
List of 18 (27) survival critical genes.
| Gene | Expression in altered group | Expression in unaltered group | test | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Survival-critical | Alterations | Drug needed | Entrez_id | Table | hr | Mean | SD | Median | n | mean | SD | Median | Wilcox | t-test | |||
| Decrease correlates with | |||||||||||||||||
| CAPS | Calcyphosine, Ca2 + -binding, ion transport, V-ATPase assembly | Poor survival | Enhancer | 828 | 4 | 1.907922041 | 0.375986 | 7 | − 0.4095 | 0.183021 | − 0.4771 | 585 | 0.15685 | 1.479923 | − 0.2992 | 0.206713 | 6.88E-06 |
| CCDC115 | Coiled-coil domain-containing 115, ER, ion pump | ||||||||||||||||
| ATP6AP1 | ATPaseH + transporting Accessory protein 1, vacuolar ATPase | ||||||||||||||||
| NBEAP1 | Neurobeachin pseudogene 1 | ||||||||||||||||
| SPANXC | SPANX family member C, testis-specific, metastasis/cancer antigen | ||||||||||||||||
| TIGD6 | Tigger Transposable element derived 6, transposon, similar to cenpB | Better survival | 81,789 | 3 | 0.442975 | 11 | − 0.35823 | 0.733154 | − 0.5256 | 581 | − 0.05557 | 1.20006 | − 0.269 | 0.45003 | 0.208536 | ||
| C7ORF13 | Testis-specific, facilitates migration, EMT via mir | ||||||||||||||||
| TMEM184A | Transmembrane protein 184A, vesicle transport. Heparin receptor | ||||||||||||||||
| F8A1 | Coagulation factor VIII associated 1, vesicular transport of early endosome | ||||||||||||||||
| LZTS3 | Leucine zipper tumor suppressor family member 3, tumor suppressor | ||||||||||||||||
| OLMALINC | Oligodendrocyte maturation-associated long intergenic non-coding RNA | ||||||||||||||||
| WARS | Tryptophanyl-tRNA synthase1, damage-induced cytokine, immunomodulator | Poor survival | Enhancer | 7453 | 5 | 0.814291 | 9 | 0.559444 | 1.566013 | − 0.1172 | 583 | − 0.15584 | 0.785742 | − 0.3405 | 0.057173 | 0.208338 | |
| FOXD4L1 | Forkhead Box D4 like 1, TF | Better survival | 200,350 | 6 | 1.426462336 | 0.4651 | 21 | 0.637481 | 1.37034 | 0.1306 | 571 | − 0.06577 | 0.898184 | − 0.2856 | 0.002956 | 0.029834 | |
| VWA5B2 | Von Willebrand factor A domain containing 5B2, | Poor survival | Enhancer | 90,113 | 6 | 1.444342328 | 0.482426 | 22 | − 0.07453 | 0.132804 | − 0.1124 | 570 | − 0.02217 | 0.902549 | − 0.12405 | 0.144395 | 0.269443 |
| DDB2 | Damage specific RNA binding protein 2, UV damage repair, Xeroderma | Poor survival | Enhancer | 1643 | 6 | 0.995514 | 4 | 0.78545 | 0.68143 | 0.60775 | 588 | − 0.10943 | 0.937722 | − 0.3888 | 0.02689 | 0.077632 | |
| EPOR | Erythropoietin receptor, JAK2-MAPK, PI3K, STAT signaling | Poor survival | Enhancer | 2057 | 6 | 0.928492 | 7 | − 0.02714 | 0.422443 | 0.1963 | 585 | − 0.05042 | 0.908843 | − 0.2833 | 0.362072 | 0.891349 | |
| ROBO3 | Roundabout guidance receptor 3, migration, neurite outgrowth | Better survival | 64,221 | 6 | 1.058267165 | 0.888419 | 38 | 0.452355 | 1.426736 | 0.10805 | 554 | − 0.08564 | 0.853085 | − 0.34145 | 0.00036 | 0.027128 | |
| PKIB | cAMP-dependent PK inhibitor beta, <—> PKA, PI3K/AKT signaling | Better survival | 5570 | 6 | 1.468143772 | 0.597215 | 11 | 0.186236 | 1.018755 | − 0.0177 | 581 | − 0.02932 | 0.908974 | − -0.3012 | 0.488256 | 0.501509 | |
| TMED6 | Transmembrane p24 trafficking protein 6 | Better survival | 146,456 | 6 | 0.994421 | 2 | − 0.51305 | 0.20301 | − 0.51305 | 590 | − 0.02848 | 1.070154 | − 0.3208 | 0.467364 | 0.157419 | ||
| APOBEC3D | Apolipoprotein B mRNA editing enzyme catalytic subunit 3D, retrovirus inhibition | Better survival | 140,564 | 6 | 4.550147633 | 0.004547 | 12 | 0.790867 | 2.18234 | 0.0305 | 580 | -0.14183 | 0.883672 | -0.3564 | 0.101112 | 0.167277 | |
| B3GNT4 | UDP glcNAc betaGal 1,3-N-acetylglucosaminyl transferase 4, Golgi, TM | Better survival | 79,369 | 6 | 2.35404515 | 0.103688 | 13 | 0.611408 | 1.947388 | − 0.0383 | 579 | − 0.02153 | 1.074378 | − 0.3863 | 0.249043 | 0.265232 | |
| CLCN3 | Chloride voltage-gated channel 3, endosomal protein trafficking, ion channel | Poor survival | Enhancer | 1182 | 6 | 3.564447662 | 0.001042 | 29 | − 0.53016 | 0.970466 | − 0.4802 | 563 | − 0.24354 | 1.005001 | − 0.3456 | 0.168123 | 0.131658 |
| FOXD4 | Forkhead Box 4, TF, CRC progression | Better survival | 2298 | 6 | 1.788201318 | 0.220313 | 25 | 0.782232 | 1.357949 | 0.5783 | 567 | -0.0695 | 0.897691 | -0.345 | 9.77E-05 | 0.00468 | |
| ZNF683 | Zinc Finger 683, immune system signaling, regulates memory T, NK, NKT cells | Better survival | 257,101 | 6 | 1.957393583 | 0.370498 | 14 | 1.003221 | 1.980712 | 0.2154 | 578 | − 0.11031 | 0.896775 | − 0.35535 | 0.001668 | 0.05578 | |
| EP400P1 | Ep400 pseudogene 1 | Better survival | 347,918 | 6 | 3.791644858 | 0.067105 | 10 | 1.12437 | 1.337051 | 1.49015 | 582 | − 0.02512 | 1.08331 | − 0.18395 | 0.005293 | 0.02379 | |
| KLHDC7B | Kelch domain containing 7B, <—> cul1, cul3, oncogenic | Poor survival | Enhancer | 113,730 | 6 | 0.80687 | 30 | 0.32065 | 1.37536 | 0.02375 | 562 | − 0.071 | 0.905335 | − 0.3043 | 0.003778 | 0.133445 | |
| MT1G | Metallothionein 1G, can inhibit pro-inhibitory cytokines | Poor survival | Enhancer | 4495 | 6 | 2.477913164 | 0.218678 | 4 | − 0.44585 | 0.146572 | − 0.46165 | 588 | − 0.03724 | 0.935006 | − 0.32245 | 0.382064 | 0.004617 |
CNA facilitator/suppressor affecting patients’ survival; total 27 genes for which expression levels correlate with both CNA and survival (11 for CNA facilitator, 16 for CNA suppressor). Genes are shown indicating which category/Supplementary Table they are from. After subsequent analysis, nine genes that did not show significance association after adjusting covariates were omitted from Hazard Ratio (HR) calculations. For example, the gene expression of TIGD6 is significantly associated with survival after adjustment of age and stage. But the altered and non-altered group of TIGD6 is not significantly associated with survival after adjustment of age and stage. We also found the gene expression of TIGD6 in altered and non-altered group is not significantly different. The result is interpreted to show that the observed altered status of TIGD6 does not affect/impact its gene expression in TCGA data, but its expression may associate with the survival.
Column G: Highlighted in bold: HR < 1 (for which expression alterations decrease risk). HR > 1 (for which expression alterations increase risk).
Figure 4CNA facilitator/suppressor genes affecting patients’ survival (“survival-critical”). For 18 genes, expression levels correlate with both CNA and patients’ survival in CRC (i.e., “survival-critical” genes). The genes represent potential targets for drug development. There are four categories, as follows. (A) Lower altered expression with improved survival. For TMED6 and TIDG6, lower expression was associated with improved survival; thus, they are potential inhibitor development targets. Hazard ratio (HR) < 1 (i.e., decreased risk). “Altered” (red), “Not Altered” (green). (B) Higher altered expression with improved survival. For DDB2, WARS, and KLHDC7B, higher expression was associated with improved survival; thus, they are potential enhancer development targets. (C) Lower altered expression with decreased survival. For MT1G, CLCN3, and CAPS, lower expression was associated with decreased patients’ survival. For HR > 1, expression alterations increase risk. For estimating magnitude of HR, small, medium, and large HRs comparing two groups would be approximately 1.3, 1.9, and 2.8, respectively[34]. (D) Higher altered expression with decreased survival. For APOBEC3D, EP400NL, B3GNT4, ZNF683, FOXD4, FOXD4L1, and PKIB, higher expression was associated with decreased survival; thus, they are potential targets for inhibitors. (E) ROBO3 is consistently shown to be over-expressed in CRCs. This finding is corroborated by the present study. However, the impact of ROBO3 expression on patients’ survival in CRCs is small (not trivial, but possibly inconsequential) with HR1.058. Figures were generated with cBioportal and with R v4.0.3.