| Literature DB >> 30949194 |
Michael Chigaev1, Hui Yu1, David C Samuels2, Quanhu Sheng3, Olufunmilola Oyebamiji1, Scott Ness1, Wei Yue1, Ying-Yong Zhao4, Yan Guo1.
Abstract
RNA editing is phenomenon that occurs in both protein coding and non-coding RNAs. Increasing evidence have shown that adenosine-to-inosine RNA editing can potentially rendering substantial functional effects throughout the genome. Using RNA editing datasets from two large consortiums: The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GTEx) project, we quantitatively analyzed human genome-wide RNA editing events derived from tumor or normal tissues. Generally, a common RNA editing site tends to have a higher editing level in tumors as compared to normal samples. Of the 14 tumor-normal-paired cancer types examined, Eleven of the 14 cancers tested had overall increased RNA editing levels in the tumors. The editomes in cancer or normal tissues were dissected by genomic locations, and significant RNA editing locational difference was found between cancerous and healthy subjects. Additionally, our results indicated a significant correlation between the RNA editing rate and the gene density across chromosomes, highlighted hyper RNA editing clusters through visualization of running RNA editing rates along chromosomes, and identified hyper RNA edited genes (protein-coding genes, lincRNAs, and pseudogenes) that embody a large portion of cancer prognostic predictors. This study reinforces the potential functional effects of RNA editing in protein-coding genes, and also makes a strong foundation for further exploration of RNA editing's roles in non-coding regions.Entities:
Keywords: A-to-I; GTEx; RNA editing; TCGA; adenosine to inosine; cancer; non-coding RNAs
Year: 2019 PMID: 30949194 PMCID: PMC6435843 DOI: 10.3389/fgene.2019.00211
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1(A) Histogram which displays the number of times RNA editing positions have been observed in the 14 cancer types. Majority of the RNA editing positions were observed in one cancer type. (B) Barplot which shows the RNA editing rate in TCGA (RNA editing events per 1000 bp) by chromosome. The RNA editing rate is adjusted for the chromosome length. Chromosome 19 had the highest RNA editing rate. (C) Barplot which shows the RNA editing rate in GTEx chromosome. The RNA editing rate is adjusted for the chromosome length. Similar to GTEx results, chromosome 19 had the highest RNA editing rate. (D) Scatter plot of overall RNA editing rates by chromosome between TCGA and GTEx by chromosome. (E) Scatter plot of TCGA RNA editing rate and gene density by chromosome. (F) Scatter plot of GTEx RNA editing rate and gene density by chromosome. (G) Scatter plot of TCGA RNA editing rate and lincRNA density by chromosome. (H) Scatter plot of GTEx RNA editing rate and lincRNA density by chromosome.
FIGURE 2Pie charts display the distribution of RNA editing positions by genomic location categories. Most of RNA editing happens in intronic regions. Large proportional differences can be observed between GTEx and TCGA.
FIGURE 3(A) Box plots depicting the average RNA editing levels between paired tumor and normal samples by cancer type. Majority of the cancer types have higher RNA editing levels in tumor than normal. (B) Barplot depicting number of RNA editing events with p < 0.05 between tumor and normal by cancer type and by genomic location categories. (C) Manhattan plot of genome-wide differential RNA editing level results for protein-coding genes. (D) Manhattan plot of genome-wide differential RNA editing level results for lincRNA and pseudogenes.
Differential edited positions by cancer and genomic position categories.
| Cancer | Sample size | downstream | Intergenic | Intronic | ncRNA | Non-synonymous | Splicing | Synonymous | Upstream | UTR3 | UTR5 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| BLCA | 271 | 355 | 360 | 430 | 148 | 10 | 0 | 4 | 20 | 2162 | 8 |
| BRCA | 942 | 2684 | 4077 | 10323 | 1390 | 86 | 8 | 40 | 116 | 14217 | 103 |
| CESC | 199 | 11 | 15 | 21 | 12 | 1 | 0 | 2 | 0 | 75 | 2 |
| HNSC | 468 | 1209 | 1360 | 1645 | 511 | 26 | 1 | 21 | 59 | 7555 | 34 |
| KICH | 91 | 1322 | 1639 | 1559 | 609 | 22 | 1 | 13 | 71 | 6828 | 30 |
| KIRC | 515 | 1949 | 4170 | 16069 | 892 | 46 | 4 | 25 | 229 | 8342 | 66 |
| KIRP | 228 | 1398 | 1764 | 2180 | 602 | 19 | 1 | 12 | 103 | 6894 | 33 |
| LIHC | 250 | 1045 | 1158 | 1842 | 406 | 25 | 2 | 10 | 44 | 5272 | 28 |
| LUAD | 546 | 1516 | 1953 | 7776 | 767 | 54 | 2 | 22 | 103 | 8349 | 53 |
| LUSC | 237 | 657 | 786 | 928 | 249 | 27 | 0 | 11 | 36 | 4146 | 18 |
| PRAD | 426 | 463 | 717 | 1298 | 232 | 13 | 2 | 7 | 32 | 2670 | 14 |
| STAD | 318 | 547 | 1 | 1752 | 244 | 11 | 1 | 2 | 35 | 2124 | 9 |
| THCA | 557 | 1205 | 2121 | 4493 | 627 | 28 | 5 | 16 | 131 | 6087 | 49 |
| UCEC | 320 | 1 | 3 | 4 | 16 | 0 | 0 | 1 | 8 | 271 | 1 |
Hyper RNA editing genes.
| Source | Gene | Gene type | Editing events | Editing events per 1000 bp | Cancer prognostic |
|---|---|---|---|---|---|
| GTEx | GTF2IP23 | Pseudogene | 454 | 199.65 | None |
| IPO5P1 | Pseudogene | 427 | 147.90 | None | |
| LINGO3 | Protein coding | 253 | 112.90 | None | |
| HTR1F | Protein coding | 282 | 89.81 | None | |
| GS1-124K5.11 | lincRNA | 1414 | 85.51 | None | |
| C19orf71 | Protein coding | 380 | 77.92 | Renal↑, pancreatic↑, head and neck↑ | |
| LINC00354 | lincRNA | 89 | 74.66 | None | |
| THRIL | lincRNA | 142 | 71.83 | None | |
| AMZ2 | protein coding | 634 | 66.10 | liver↓, renal↓ | |
| HNRNPA1L2 | Protein coding | 194 | 64.43 | Breast↑, pancreatic↑ | |
| TCGA | TLCD2 | Protein coding | 121 | 45.34 | Renal↓ |
| SPN | Protein coding | 252 | 31.95 | Endometrial↑, breast↑, melanoma↑ | |
| CYCS | Protein coding | 135 | 25.61 | Renal↑, head, and neck↓ | |
| PDDC1 | Protein coding | 230 | 22.40 | Renal↑, liver↓ | |
| NDUFB1 | Protein coding | 118 | 20.36 | Renal↑ | |
| COQ4 | Protein coding | 217 | 18.81 | Endometrial↑, renal↑, cervical↑ | |
| MRI1 | Protein coding | 148 | 15.18 | None | |
| FAM98C | Protein coding | 85 | 14.28 | Ovarian↓, urothelial↑ | |
| KAT8 | Protein coding | 217 | 13.87 | Lung↑ | |
| CCDC84 | Protein coding | 243 | 13.77 | Renal↑ |
FIGURE 4(A–C) Venn diagrams of protein-coding genes, lincRNA and pseudogenes with RNA editing between GTEx and TCGA. (D–F) Boxplots of gene-level RNA editing rates of all protein-coding genes, lincRNA, and pseudogenes between GTEx and TCGA. (G–I) Scatter plots of gene-level RNA editing rates of the common protein-coding genes, lincRNA, and pseudogenes between GTEx and TCGA.
FIGURE 5RNA editing frequency for chromosome 1 measured in both GTEx and TCGA. The RNA editing frequency is computed as number of RNA editing events observed in a 10,000 nucleotide long moving window.