| Literature DB >> 26637431 |
Pierre-François Roux1, Laure Frésard2, Morgane Boutin1, Sophie Leroux2, Christophe Klopp3, Anis Djari4, Diane Esquerré5, Pascal G P Martin6, Tatiana Zerjal7, David Gourichon8, Frédérique Pitel9, Sandrine Lagarrigue10.
Abstract
RNA editing is a posttranscriptional process leading to differences between genomic DNA and transcript sequences, potentially enhancing transcriptome diversity. With recent advances in high-throughput sequencing, many efforts have been made to describe mRNA editing at the transcriptome scale, especially in mammals, yielding contradictory conclusions regarding the extent of this phenomenon. We show, by detailed description of the 25 studies focusing so far on mRNA editing at the whole-transcriptome scale, that systematic sequencing artifacts are considered in most studies whereas biological replication is often neglected and multi-alignment not properly evaluated, which ultimately impairs the legitimacy of results. We recently developed a rigorous strategy to identify mRNA editing using mRNA and genomic DNA sequencing, taking into account sequencing and mapping artifacts, and biological replicates. We applied this method to screen for mRNA editing in liver and white adipose tissue from eight chickens and confirm the small extent of mRNA recoding in this species. Among the 25 unique edited sites identified, three events were previously described in mammals, attesting that this phenomenon is conserved throughout evolution. Deeper investigations on five sites revealed the impact of tissular context, genotype, age, feeding conditions, and sex on mRNA editing levels. More specifically, this analysis highlighted that the editing level at the site located on COG3 was strongly regulated by four of these factors. By comprehensively characterizing the mRNA editing landscape in chickens, our results highlight how this phenomenon is limited and suggest regulation of editing levels by various genetic and environmental factors.Entities:
Keywords: DNA-seq; RNA-seq; chicken; liver and adipose tissue; mRNA editing
Mesh:
Substances:
Year: 2015 PMID: 26637431 PMCID: PMC4751552 DOI: 10.1534/g3.115.022251
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Whole transcriptome mRNA editing screening studies in vertebrates
| Study | Species | Cells | Edited Sites (N) | Matched DNA | Replicates | Potential Biases |
|---|---|---|---|---|---|---|
| ( | Immortalized B cells | 1809 | Yes | 2/17 | Splice, homopolymer, strand, extremity | |
| ( | Immortalized B cells | 28,766 | Yes | 2/27 | Splice, homopolymer, multimapping, strand, extremity | |
| (Bahn | Glioblastoma cells | 10,000 | No | 2/2 | Splice, homopolymer, multimapping, strand, extremity | |
| ( | Immortalized B cells | 22,688 | Yes | 1/1 | Splice, homopolymer, extremity | |
| ( | 14 ENCODE cell lines | 5695 | No | 1/1 | Homopolymer, extremity, multimapping | |
| ( | Immortalized B cells | 1503 | Yes | 1/2 | Homopolymer | |
| ( | ENCODE cell lines | 150,865 | No | 1/2 | Strand, multimapping | |
| ( | Spinal cord cells | 15 | Yes (Exome) | 1/1 | Homopolymer, multimapping, strand | |
| (Bazak | 16 tissues | 1,586,270 | No | 1/1 | Homopolymer, multimapping, strand | |
| ( | 7 ENCODE cell lines | 259,385 | No | 1/2 | Multimapping | |
| ( | Liver cells | 20,007 | No | 1/3 | Splice, homopolymer, multimapping, strand, extremity | |
| ( | Prostate cancer cells | 16,194 | Yes | 2/10 | Homopolymer, multimapping, strand, extremity | |
| ( | Brain cells | 19,791 | No | 1/1 | Splice, homopolymer, multimapping, strand, extremity | |
| ( | Immortalized B cells | 5997 | No | 2/2 | Multimapping, strand, extremity | |
| ( | Immortalized B cells | 22,715 | No | 1/1 | Multimapping, extremity | |
| ( | Hepatocellular carcinoma cells | 900 | Yes | 6/6 | Splice, multimapping, strand | |
| ( | Liver cells | 485,684 | Yes | 1/9 | Splice, multimapping, strand | |
| ( | Brain cells | 7389 | No | 2/2 | Homopolymer, multimapping | |
| ( | Liver, adipose, and bone cells | 253 | No | 3/3 | Homopolymer, multimapping, extremity | |
| ( | Liver and adipose cells | 63 and 188 | No | 4/6 | Multimapping | |
| ( | Brain cells | 665 | No | 1/1 | Splice, homopolymer, multimapping, strand, extremity | |
| ( | Intestine and liver cells | 500 | No | 1/1 | Homopolymer, multimapping, extremity | |
| ( | Prefrontal cortex, cerebellum, muscle, kidney, heart, testis, and lung cells | 31,250 | Yes | 1/1 | Homopolymer, multimapping, extremity | |
| Whole embryo | 40 | Yes | 2/8 | — | ||
| Roux | Liver and adipose cells | 11 and 17 | Yes | 3/8 | — |
If “Yes”: individual genomic DNA information is used to account for potential private individual genomic polymorphisms. If “No”: potential private genomic polymorphisms are defined considering either genomic variant databases such as dbSNPs, or strain-specific consensus genomic sequence in the case of studies based on clonal mouse strains.
Ratio between the number of biological replicates considered for reporting a candidate difference between DNA and mRNA as a true mRNA editing event and the total number of biological replicates available in the study for a given cell type.
Figure 1mRNA editing screening studies based on high-throughput sequencing in the literature. This graph describes the numbers of mRNA editing events detected (log10) across RNA-seq-based screening studies as a function of the numbers of biological replicates considered to declare an event as a true positive.
Figure 2Impact of sequencing and mapping biases on mRNA editing discovery. Contribution of random or systematic sequencing biases and mapping artifacts to the false discovery of mRNA editing events using combined mRNA and DNA sequencings are given as a fraction (%) of the intial pool of candidate editing events subject to each source of bias in each tissue. WAT, white adipose tissue.
Figure 3Impact of biological replication on mRNA editing discovery. Distribution (in %) of unbiased mRNA editing events across the 12 classes of substitution according to the number of replicates they are detected in, ranging from N = 1 to N = 3, in white adipose tissue (WAT) and liver. The first two classes (AtoG and TtoC) are associated to ADAR-mediated RNA editing, and the next two (CtoT and GtoA) to APOBEC-meditated RNA editing. At the top-right of each graph, the total number of RNA editing events detected for a given number of replicates is shown. ADAR: Adenosine deaminases acting on RNA. APOBEC: Apolipoprotein B mRNA editing enzyme, catalytic polypetide-like.
mRNA editing screening in adult chicken liver and adipose tissue
| Chromosome | Position | DNA Allele | RNA Allele | Canonical | Validation in Sanger | Validation in PyroMark | Replicates | Gene Names | Localization | |
|---|---|---|---|---|---|---|---|---|---|---|
| WAT | 1 | 79605543 | T | C | yes | 3 | PLA1A, POPDC2 | Downstream gene | ||
| 1 | T | C | yes | 3 | MPZL1, BRP44 | Downstream gene | ||||
| 1 | 103511385 | T | C | yes | 6 | GRIK1 | Intron | |||
| A | G | yes | yes | yes | 3 | |||||
| 1 | 169769193 | A | G | yes | 3 | THSD1 | Downstream gene | |||
| 1 | 187056174 | A | G | yes | 3 | NOX4 | Downstream gene | |||
| 1 | 187056183 | A | G | yes | 3 | NOX4 | Downstream gene | |||
| 1 | 187057565 | A | G | yes | 3 | — | Intergenic | |||
| T | C | yes | yes | yes | 5 | NDUFS6 | Upstream | |||
| 4 | 17996546 | T | C | yes | 4 | — | Intergenic | |||
| 5 | 22958596 | T | C | yes | 3 | DGKZ | Intron | |||
| A | G | yes | 5 | |||||||
| 11 | 19169664 | A | C | no | no | no | 3 | DHODH, IST1 | Upstream, Intron | |
| A | G | yes | yes | yes | 6 | |||||
| 12 | 8910897 | A | G | yes | 5 | FLNB | Intron | |||
| 17 | 4705 | T | C | yes | 3 | — | Intron | |||
| 17 | 94999 | C | T | yes | 6 | — | Intergenic | |||
| 18 | 28378 | T | C | yes | 6 | ZNF302 | Upstream | |||
| 19 | 4812610 | C | T | yes | 4 | CCLI8 | Intron | |||
| Liver | T | C | yes | yes | yes | 7 | POPDC2, PLA1A | Downstream gene | ||
| 1 | T | C | yes | 3 | MPZL1, BRP44 | Downstream gene | ||||
| 1 | 90873188 | T | C | yes | yes | yes | 4 | MPZL1, BRP44 | Downstream gene | |
| A | G | yes | yes | yes | 5 | |||||
| 1 | 193343226 | A | G | yes | 6 | MADPRT, ART7B | Upstream | |||
| T | C | yes | yes | yes | 5 | NDUFS6 | Upstream | |||
| A | G | yes | 3 | |||||||
| A | G | yes | yes | yes | 5 | MYO1B | Intron | |||
| 8 | 25588113 | T | C | yes | 3 | — | Exon (synonymous) | |||
| 28 | 518606 | A | C | no | yes | no | 3 | HNRNPM | Missense | |
| LGE64 | 615592 | T | C | yes | 4 | — | Intron |
In italics, mRNA editing events common to both tissues; in bold, mRNA editing candidate events subjected to Sanger genomic DNA sequencing and cDNA pyrosequencing; Underlined, mRNA editing events annotated as “coding - missense” and located in four different genes.
Number of samples in which the mRNA editing event is detected.
Name of the gene impacted by the mRNA editing event or name of the closest genes (<10 kb) if the mRNA editing event is falling in an intergenic region.
Localization of the mRNA editing event inside genomic features, as predicted by Variant Effect Predictor (McLaren ). If the mRNA editing event is falling inside a coding region, its impact on gene product is given in brackets.
Figure 4Position of mRNA editing events across the chicken genome. WAT: White adipose tissue.
Figure 5Distribution of mRNA editing events across genomic features. Annotations were assessed using Ensembl v71 Variant Effect Predictor (McLaren ).
Figure 6Multispecies protein sequence alignments for coding mRNA editing events. The red stars indicate the position of the amino acid impacted by coding mRNA events. The overall conservation across sequences is depicted below each alignment. The mRNA editing event impacting COG3 was detected in both white adipose tissue (WAT) and liver, while the ones impacting CES1 and FLNB were WAT-specific, and the one impacting KCMA1 was specific to liver.
Figure 7Impact of genetic background, age, feeding conditions, and sex on mRNA editing level. Editing level (in %) at five genomic positions, in white adipose tissue (WAT) and liver according to (A) genetic background, (B) age, (C) feeding conditions, and (D) sex. Each boxplot shows the distribution of editing levels (% of the edited allele) across N = 8 biological replicates. * P <0.05, ** P <0.01, *** P <0.001, unpaired two-tailed Student t-test. F, Females; M, Males; Post-pub, postpuberal animals; Pre-pub, prepuberal animals.