| Literature DB >> 34681956 |
Marine Guilcher1,2, Arnaud Liehrmann1,2,3, Chloé Seyman3, Thomas Blein1,2, Guillem Rigaill1,2,3, Benoit Castandet1,2, Etienne Delannoy1,2.
Abstract
Plastid gene expression involves many post-transcriptional maturation steps resulting in a complex transcriptome composed of multiple isoforms. Although short-read RNA-Seq has considerably improved our understanding of the molecular mechanisms controlling these processes, it is unable to sequence full-length transcripts. This information is crucial, however, when it comes to understanding the interplay between the various steps of plastid gene expression. Here, we describe a protocol to study the plastid transcriptome using nanopore sequencing. In the leaf of Arabidopsis thaliana, with about 1.5 million strand-specific reads mapped to the chloroplast genome, we could recapitulate most of the complexity of the plastid transcriptome (polygenic transcripts, multiple isoforms associated with post-transcriptional processing) using virtual Northern blots. Even if the transcripts longer than about 2500 nucleotides were missing, the study of the co-occurrence of editing and splicing events identified 42 pairs of events that were not occurring independently. This study also highlighted a preferential chronology of maturation events with splicing happening after most sites were edited.Entities:
Keywords: Arabidopsis thaliana; co-maturation; nanopore; plastid; post-transcriptional
Mesh:
Substances:
Year: 2021 PMID: 34681956 PMCID: PMC8537030 DOI: 10.3390/ijms222011297
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1The complexity of the psbB-petD locus. Screenshots of Integrative Genomics Viewer (IGV) displaying nanopore reads mapping to the psbB-petD locus. (A) plastid genomic position. (B) coverage track displaying the number of reads at each nucleotide. (C) screenshot of reads mapping on the Watson strand. Matching bases are shown in red. Split reads are joined by blue lines. (D) Annotation of the locus. Introns are shown as thinner segments. (E) screenshot of reads mapping on the Crick strand. Matching bases are shown in purple. Split reads are joined by blue lines.
Figure 2Virtual Northern blots derived from the nanopore sequencing. Northern blots were emulated from nanopore reads mapping to the sequences of psbN, psbH, the second exon of petB, or the second exon of petD shown in red on the genomic map displayed above. The size (in nt) is shown on the left.
Figure 3Identification of transcripts isoforms. Screenshots of IGV displaying the reads corresponding to various virtual Northern blot isoforms. Matching bases are shown in red. Split reads are joined by blue lines. Other colors indicate mismatches and indels. (A) Reads corresponding to the 2200 nt isoform of the petB and petD virtual Northern blots. (B) Reads corresponding to the 990 nt isoform of the petD virtual Northern blot. (C) Reads corresponding to the 1100–1150 isoform of the psbH virtual Northern blot. The two 5′ ends are shown by black arrows.
Figure 4Examples of coverage biases in plastid transcripts. For each panel, the coverage at single-nucleotide resolution by strand-specific reads overlapping at least partially the genomic regions shown in red is shown. At the top, the genomic positions are shown while the coding sequences associated with these regions are shown below as gray arrows or boxes. Introns are represented as black lines. (A): psaA transcripts. (B): rbcL transcripts. (C): ndhB transcripts. (D): psbE-psbJ transcripts.
Quantification of known editing and splicing events.
| Name | Type | Maturation Rate | Maturation Rate | Maturation Rate (Ruwe et al., 2013) |
|---|---|---|---|---|
|
| splicing | 4% | 4% | NA |
|
| splicing | 89% | 82% | NA |
|
| splicing | 64% | 19% | NA |
|
| splicing | 79% | 42% | NA |
|
| splicing | 63% | 45% | NA |
|
| splicing | 60% | 71% | NA |
|
| splicing | 69% | 62% | NA |
|
| splicing | 91% | 58% | NA |
|
| splicing | 97% | 62% | NA |
|
| splicing | 69% | 12% | NA |
|
| splicing | 66% | 52% | NA |
|
| splicing | 68% | 55% | NA |
|
| splicing | 92% | 81% | NA |
|
| splicing | 68% | 27% | NA |
|
| editing | 53% | 79% | 93% |
|
| editing | 89% | 91% | 95% |
|
| editing | 5% | 3% | 4% |
|
| editing | 33% | 21% | 15% |
|
| editing | 87% | 82% | 85% |
|
| editing | 64% | 83% | 86% |
|
| editing | 69% | 76% | 94% |
|
| editing | 93% | 90% | 95% |
|
| editing | 89% | 93% | 94% |
|
| editing | 92% | 97% | 96% |
|
| editing | 16% | 10% | 12% |
|
| editing | 6% | 3% | 10% |
|
| editing | 4% | 4% | 6% |
|
| editing | 90% | 95% | 99% |
|
| editing | 76% | 75% | 83% |
|
| editing | 90% | 98% | 98% |
|
| editing | 95% | 100% | 100% |
|
| editing | 79% | 91% | 86% |
|
| editing | 3% | 4% | NA |
|
| editing | 21% | 26% | 27% |
|
| editing | 82% | 72% | 81% |
|
| editing | 78% | 76% | 91% |
|
| editing | 34% | 74% | 75% |
|
| editing | 3% | 4% | NA |
|
| editing | 8% | 0% | NA |
|
| editing | 88% | 93% | 94% |
|
| editing | 95% | 98% | 99% |
|
| editing | 87% | 84% | 80% |
|
| editing | 78% | 87% | 81% |
|
| editing | 88% | 91% | 84% |
|
| editing | 75% | 94% | 92% |
|
| editing | 6% | 4% | 6% |
|
| editing | 6% | 3% | 5% |
|
| editing | 90% | 89% | 90% |
|
| editing | 81% | 88% | 82% |
|
| editing | 94% | 94% | 95% |
|
| editing | 85% | 93% | 96% |
|
| editing | 76% | 83% | 92% |
|
| editing | 77% | 84% | 90% |
|
| editing | 88% | 90% | 93% |
|
| editing | 94% | 97% | 98% |
|
| editing | 35% | 33% | 45% |
|
| editing | 69% | 78% | 85% |
NA: Not Analyzed. The genomic position of each site and the corresponding nomenclature of Rüdinger et al. [34] are given in Table S1.
Figure 5Network of splicing and editing coordination. Splicing events are shown in green and editing events in red. Dependent events are joined by an edge. The darkness of the edge is proportional to the adjusted p-value of the Exact Fisher test for the pool of the three replicates.
Figure 6Proposed chronology of maturation events. Exons are shown as grey bars and introns as black lines. The editing sites are indicated by their genomic position. Grey editing sites are processed independently and thus are not included in the chronology. The preferred order of the maturation events is indicated by the numbers above the editing sites or introns.