| Literature DB >> 22645572 |
Anireddy S N Reddy1, Mark F Rogers, Dale N Richardson, Michael Hamilton, Asa Ben-Hur.
Abstract
Extensive alternative splicing (AS) of precursor mRNAs (pre-mRNAs) in multicellular eukaryotes increases the protein-coding capacity of a genome and allows novel ways to regulate gene expression. In flowering plants, up to 48% of intron-containing genes exhibit AS. However, the full extent of AS in plants is not yet known, as only a few high-throughput RNA-Seq studies have been performed. As the cost of obtaining RNA-Seq reads continues to fall, it is anticipated that huge amounts of plant sequence data will accumulate and help in obtaining a more complete picture of AS in plants. Although it is not an onerous task to obtain hundreds of millions of reads using high-throughput sequencing technologies, computational tools to accurately predict and visualize AS are still being developed and refined. This review will discuss the tools to predict and visualize transcriptome-wide AS in plants using short-reads and highlight their limitations. Comparative studies of AS events between plants and animals have revealed that there are major differences in the most prevalent types of AS events, suggesting that plants and animals differ in the way they recognize exons and introns. Extensive studies have been performed in animals to identify cis-elements involved in regulating AS, especially in exon skipping. However, few such studies have been carried out in plants. Here, we review the current state of research on splicing regulatory elements (SREs) and briefly discuss emerging experimental and computational tools to identify cis-elements involved in regulation of AS in plants. The availability of curated alternative splice forms in plants makes it possible to use computational tools to predict SREs involved in AS regulation, which can then be verified experimentally. Such studies will permit identification of plant-specific features involved in AS regulation and contribute to deciphering the splicing code in plants.Entities:
Keywords: Arabidopsis; RNA-Seq; alternative splicing; plants; pre-mRNA splicing; splicing code; splicing regulators; splicing regulatory elements
Year: 2012 PMID: 22645572 PMCID: PMC3355732 DOI: 10.3389/fpls.2012.00018
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Splice forms for the ING2 gene from . A splice graph is a compact representation that shows all the ways in which a gene’s exons may be combined. These plots, generated by SpliceGrapher, use color coding to highlight AS events.
Figure 2Pipelines for prediction and quantification of splice forms from RNA-Seq data. Methods such as trans-ABySS and Trinity perform de novo prediction, and do not require a reference genome (left); these methods do not rely on alignment programs to map reads to a genome. When a reference genome is available, methods such as Cufflinks first map the reads to the genome, followed by a step of assembly of splice forms or their quantification (right). Some of these methods require, or can use annotated isoforms to guide the process.
Tools for predicting isoforms, their expression, and alternative splicing from RNA-Seq data.
| Method | Task | Input data | Notes |
|---|---|---|---|
| Trans-ABySS (Robertson et al., | IP, IE | Requires Abyss contigs | |
| Trinity (Grabherr et al., | IP, IE | ||
| Rnnotator (Martin et al., | IP | ||
| Scripture (Guttman et al., | IP | G | |
| IsoLasso (Li et al., | IP, IE | G | Improved version of IsoInfer (Feng et al., |
| NSMAP (Xia et al., | IP, IE | G | |
| Cufflinks (Trapnell et al., | IP, IE | G, A | Annotated isoforms are optional |
| TAU (Filichkin et al., | IP | G,A | Annotated isoforms are optional; does not scale well with read length |
| SpliceGrapher (Rogers et al., | SG | G, A | |
| IsoEM (Nicolae et al., | IE | G, A | |
| IsoformEX (Kim et al., | IE | G, A | |
| SpliceTrap (Wu et al., | IE | G, A | Only handles exon skipping |
| NEUMA (Lee et al., | IE | G, A | |
| Solas (Richard et al., | IE | G, A | |
| rSeq (Jiang and Wong, | IE | G, A | |
| RSEM (Li et al., | IE | Requires a transcriptome assembler |
The tools vary in the specific task they address; we distinguish between several tasks: isoform prediction (IP), isoform expression (IE) and splice graph prediction (SG). The tools also vary in the input data they require: de novo (no input required except for the RNA-Seq data), a reference genome (G) or annotated isoforms (A).
Figure 3Read coverage for the gene SCL33 in . The top panel shows the annotated gene model; the middle panel shows reads that map across splice junctions, with labels showing the number of reads that aligned across each junction and novel splice junctions highlighted in green. The bottom panel shows the distribution of reads across the gene. Here the read depth ranges from 1 to over 300, demonstrating that read coverage can be highly variable even across known exons (shaded regions on the graphs). This variability can make it difficult to distinguish between weakly expressed splice forms and background noise. This figure was generated by SpliceGrapher.
Figure 4Approach to mutate a predicted . In the first PCR the target gene with two primers sets (F1/R1 and F2/R2). The F1 and R1 primer set amplifies the gene from the initiation codon to the predicted cis-element and F2 and R2 will amplify from the predicted cis-element to the stop codon. Primers F1 and R2, in addition to gene-specific sequence (shown in green), will be tailed with sequences complementary to Gateway vector primers (shown in dark yellow). Similarly, primers R1 and F2, in addition to gene-specific sequence, will be tailed with the changed sequence in the predicted cis-element (shown in blue). In the second PCR, the two gene fragments from the first PCR will be mixed. This overlapping template will be amplified using primers complementary to primers F1 and R2 tailed with the attB1 and attB2 Gateway sequences, which can then be cloned into a Gateway donor vector and into a plant transformation vector with a tag as a fusion to the N-terminus. The wild type gene will be cloned in a similar fashion except that only one PCR will be done with the F1/R2 primer set containing the entire attB1 and attB2 Gateway sequences.