| Literature DB >> 28754146 |
Eddie Park1, Jiguang Guo2, Shihao Shen1, Levon Demirdjian3, Ying Nian Wu3, Lan Lin1, Yi Xing4.
Abstract
BACKGROUND: A-to-I RNA editing is an important step in RNA processing in which specific adenosines in some RNA molecules are post-transcriptionally modified to inosines. RNA editing has emerged as a widespread mechanism for generating transcriptome diversity. However, there remain significant knowledge gaps about the variation and function of RNA editing.Entities:
Keywords: GWAS; Genetic variation; RNA editing; RNA-seq; Transcriptome
Mesh:
Substances:
Year: 2017 PMID: 28754146 PMCID: PMC5532815 DOI: 10.1186/s13059-017-1270-7
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1edQTL analysis to identify cis-regulated RNA editing events. a Distribution of RNA editing levels (Φ) across the 445 human LCLs. Box plots of RNA editing levels for 9094 candidate sites across 445 individuals. Sites are sorted by the mean Φ value on the x-axis. The inner quartile ranges for each box plot are represented in yellow and the medians are in white. b Quantile-quantile plot (qq-plot) testing association of RNA editing levels with cis genetic polymorphisms in five populations. c Relationship between edQTL significance and distance of SNP to editing site in five populations. Note that the apparent spikes at +60 kb and −110 kb are due to multiple RNA editing sites in a single gene (SLC35E2 for +60 kb and HLA-G for −110 kb) with edQTL signals in multiple populations. d Mosaic plot indicating the number of edQTL RNA editing sites shared between five populations. Values in the top rectangles represent population-specific edQTL sites and values in the bottom rectangles represent edQTL sites shared across all five populations. e Example of an edQTL signal in the NDE1 gene. Box plot showing the significant association of rs8048427 with the editing level (Φ) at chr16:15795035 within the CEU population. Each dot represents data from a particular individual and the size of the dot indicates the number of reads covering the RNA editing site in that individual
Fig. 2ASED analysis to identify cis-regulated RNA editing events. a Schematic diagram of ASED analysis. Heterozygous SNPs are used to assign RNA-seq reads to specific alleles. b Example of allele-specific RNA editing in the NDE1 gene. ASED analysis of RNA editing site chr16:15795035 with respect to heterozygous SNP rs8048427. c Cis-regulated RNA editing sites in the CEU population. edQTL and ASED of CEU as well as multiple replicates of GM12878 were used. The three circles outside of the Venn diagram represent RNA editing sites that were not considered in the other two analyses due to preliminary filters and method-specific limitations. d Example of a cis-regulated RNA editing site in ZDHHC20 associated with a rare variant, called with ASED analysis of multiple RNA-seq replicates from one individual, GM12878. Error bars represent likelihood-ratio test-based 95% confidence intervals of RNA editing levels inferred from read counts. Average allelic Φ values are represented in parentheses
Fig. 3Comprehensive ASED analysis in five populations. a Mosaic plot indicating the number of ASED RNA editing sites shared between five populations. Values in the top rectangles represent population-specific ASED sites and values in the bottom rectangles represent ASED sites shared in all five populations. b The number of ASED RNA editing sites shared between five populations. Example of an ASED signal in the SPN gene at RNA editing site chr16:29680268 with respect to SNP rs12932957 in the CEU population (c) and the YRI population (d). Error bars represent likelihood-ratio test-based 95% confidence intervals of RNA editing levels inferred from read counts. Average allelic Φ values are represented in parentheses
List of selected GWAS SNPs that are linked to both edQTL and ASED SNPs
| Gene symbol | Editing site | ASED SNP | edQTL SNP | Linked GWAS SNP(s) | GWAS gene symbol | GWAS disease/trait | Reference (PMID) |
|---|---|---|---|---|---|---|---|
|
| Chr7:56078339 | NA | rs4947534 | rs4947534 | PSPH | Blood metabolite levels | 24816252 [ |
| Chr7:56079087 | rs4947534 | NA | |||||
| Chr7:56079100 | rs4947534 | NA | |||||
|
| Chr11:108236635 | NA | rs12801988 rs227080 rs170546 rs5023001 | rs11212617 | C11orf65 | Response to metformin in type 2 diabetes (glycemic) | 21186350 [ |
| Chr11:108237818 | rs227091 | NA | |||||
| Chr11:108237819 | rs227091 | NA | |||||
| Chr11:108237832 | rs227091 | rs227080 rs227090 | |||||
| Chr11:108237844 | rs227091 | NA | |||||
| Chr11:108237854 | rs227091 | NA | |||||
|
| Chr21:45644472 | rs8127114 | rs8127114 | rs4819388 | ICOSLG | Celiac disease | 20190752 [ |
|
| Chr20:25427805 | rs6037121 rs6050623 | NA | rs7267979 | ABHD12 | Liver enzyme levels (alkaline phosphatase) | 22001757 [ |
| Chr20:25427815 | rs6050626 rs6050623 | rs2258728 | |||||
| Chr20:25428294 | rs1047171 rs6050626 rs6037121 | NA | |||||
| Chr20:25428308 | rs6050626 | NA | |||||
| Chr20:25428320 | rs6050626 rs6050623 | NA | |||||
| Chr20:25428646 | rs1047171 | NA | |||||
| Chr20:25428669 | rs1047171 | NA | |||||
| Chr20:25428724 | rs1047171 | NA | |||||
| Chr20:25428750 | rs6037121 | NA | |||||
|
| Chr5:43380817 | NA | rs7706402 | rs11951515 | CCL28 | Metabolite levels (X-11787) | 23934736 [ |
| Chr5:43381564 | rs7706402 | NA | |||||
|
| Chr1:184761188 | rs492126 rs682331 | rs570441 rs682331 | rs682331 | FAM129A | Obesity-related traits | 23251661 [ |
| Chr1:184762487 | rs526024 | NA | |||||
| Chr1:184762590 | rs492126 | NA |
Fig. 4RNA editing of ATM is genetically associated with response to metformin. a Box plot showing the significant association of SNP rs227091 with editing level (Φ) at chr11:108237832 within the CEU population. Each dot represents data from a particular individual and the size of each dot indicates the number of reads covering the RNA editing site in that individual. b ASED allele-specific editing level (Φ) of chr11:108237832 with respect to SNP rs227091 within the CEU population. Error bars represent likelihood-ratio test-based 95% confidence intervals of RNA editing levels inferred from read counts. Average allelic Φ values are represented in parentheses. c LD plot showing a GWAS signal (response to metformin; green) linked with edQTL (purple) and ASED (orange) SNPs in ATM. d Heatmap of edQTL significance for six cis-regulated RNA editing sites in ATM along with seven cis SNPs. The values in the heatmap represent − log(p value) for the association between a given RNA editing site and a given SNP within the given population
Fig. 5RNA editing of MDM4 is genetically associated with cancer and cognitive performance. a Box plot showing the significant association of SNP rs12038102 with editing level (Φ) at chr1:204525548 within the TSI population. Each dot represents data from a particular individual and the size of each dot indicates the number of reads covering the RNA editing site in that individual. b ASED allele-specific editing level (Φ) of chr1:204526727 with respect to SNP rs1046874 within the TSI population. Error bars represent likelihood-ratio test-based 95% confidence intervals of RNA editing levels inferred from read counts. Average allelic Φ values are represented in parentheses. c LD plot showing GWAS signals (breast cancer, prostate cancer, and cognitive performance; green) linked with edQTL (purple) and ASED (orange) SNPs in MDM4
Fig. 6Impact of edQTL SNPs on RNA secondary structure. a Cumulative distribution plot comparing the absolute value of the distance between SNP–RNA editing site pairs for significant edQTL SNPs and control SNPs within the computationally predicted RNA secondary structure of the IRAlu hairpin. b Cumulative distribution plot comparing the absolute value of the change in the number of paired bases for significant edQTL SNPs and control SNPs. c Cumulative distribution plot comparing the absolute value of the change in free energy of the predicted RNA secondary structure for significant edQTL SNPs and control SNPs. The Kolmogorov–Smirnov test was used for the cumulative distribution plots. Two examples of SNPs that significantly alter RNA editing levels: SNP on the opposite Alu to the RNA editing site in NDE1 (d) and SNP on the same Alu as the RNA editing site in H2AFV (e). Cartoon representation of the IRAlu hairpins and computationally predicted RNA secondary structures (left). Detailed base-pairing structures (right)