| Literature DB >> 33922891 |
Naima Ahmed Fahmi1,2, Heba Nassereddeen2,3, Jaewoong Chang4, Meeyeon Park4, Hsinsung Yeh4, Jiao Sun1,2, Deliang Fan5, Jeongsik Yong4, Wei Zhang1,2.
Abstract
(1) Background: A simplistic understanding of the central dogma falls short in correlating the number of genes in the genome to the number of proteins in the proteome. Post-transcriptional alternative splicing contributes to the complexity of the proteome and is critical in understanding gene expression. mRNA-sequencing (RNA-seq) has been widely used to study the transcriptome and provides opportunity to detect alternative splicing events among different biological conditions. Despite the popularity of studying transcriptome variants with RNA-seq, few efficient and user-friendly bioinformatics tools have been developed for the genome-wide detection and visualization of alternative splicing events. (2)Entities:
Keywords: RNA-seq; RT-PCR; alternative splicing; transcriptome; visualization
Mesh:
Year: 2021 PMID: 33922891 PMCID: PMC8123109 DOI: 10.3390/ijms22094468
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Five major types of alternative splicing events. The alternative splicing exon(s) in each category is highlighted in yellow.
Comparison among AS-Quant, SUPPA2, rMATS and diffSplice on simulated RNA-seq data. AUC score, sensitivity, specificity of the four methods are reported. The best results across the four methods are bold.
| Method | AUC | Sensitivity | Specificity |
|---|---|---|---|
| AS-Quant |
|
|
|
| SUPPA2 | 0.80 | 0.44 | 0.97 |
| rMATS | 0.65 | 0.22 | 0.49 |
| diffSplice | 0.74 | 0.05 | 0.79 |
Figure 2Simulation experiment to assess the performance of AS-Quant and baseline methods. The receiver operating characteristic (ROC) curves, i.e., true positive rate against false positive rate, are plotted.
Comparison among AS-Quant, SUPPA2, rMATS, and diffSplice on simulated RNA-seq data. AUC score, sensitivity, specificity of the four methods on five different types of splicing events are reported. The best results across the four methods are bold.
| AS Type | Method | AUC | Sensitivity | Specificity |
|---|---|---|---|---|
| AS-Quant |
| 0.60 |
| |
| SE | SUPPA2 | 0.84 | 0.64 | 0.99 |
| rMATS | 0.84 |
| 0.50 | |
| diffSplice | 0.72 | 0.31 | 0.95 | |
| AS-Quant |
|
|
| |
| RI | SUPPA2 | 0.63 | 0.09 |
|
| rMATS | 0.58 | 0.30 | 0.50 | |
| diffSplice | 0.53 | 0.01 | 0.98 | |
| AS-Quant |
|
| 0.82 | |
| MXE | SUPPA2 | 0.66 | 0.37 |
|
| rMATS | 0.76 | 0.69 | 0.50 | |
| diffSplice | 0.46 | 0.03 |
| |
| AS-Quant |
|
|
| |
| A3SS | SUPPA2 | 0.80 | 0.56 | 0.99 |
| rMATS | 0.49 | 0.58 | 0.50 | |
| diffSplice | 0.62 | 0.03 | 0.51 | |
| AS-Quant | 0.71 | 0.50 | 0.97 | |
| A5SS | SUPPA2 |
|
|
|
| rMATS | 0.46 | 0.57 | 0.50 | |
| diffSplice | 0.58 | 0.03 | 0.51 |
Figure 3Simulation experiment to assess the performance of AS-Quant on different read depths. The ROC curves for the results of different RNA-seq read depth are plotted.
Number of alternative splicing events identified by AS-Quant and three baseline methods between MEFs with control and siU2af1. diffSplice cannot separate A3SS and A5SS.
| SE | RI | MXE | A3SS | A5SS | |
|---|---|---|---|---|---|
| AS-Quant | 257 | 5 | 43 | 101 | 30 |
| SUPPA2 | 172 | 46 | 12 | 121 | 117 |
| rMATS | 1128 | 15 | 129 | 51 | 16 |
| diffSplice | 169 | 560 | 0 | 1125 | |
Figure 4Validation of U2AF1-mediated alternative splicing events that are commonly detected by all four tested methods. (a) RNA-seq read coverage plots of the gene Ptbp1 and Ganab in the two samples with accurate isoform annotations. Alternatively spliced exons are marked in yellow. (b) Validation of isoform expression using RT-PCR and agarose gel electrophoresis. Quantitation of gel images was done using ImageQuant software. Exon inclusion and exclusion are color-coded. Total RNAs from MEFs used for RNA-Seq experiments were used for RT-PCR amplification of Ptbp1 or Ganab transcript isoforms. Scrambled RNA interference is control and U2af1 RNA interference is the case. The PCR primers to detect transcript isoforms for Ptbp1 or Ganab were marked by red arrows and their sequences are reported in the Supplementary document. Schematic of alternative spliced isoform structures for each PCR product is shown next to the gel image. Exon numbers and transcript identification numbers in RefSeq annotation are shown. A higher band intensity of PCR products indicates a higher expression of that specific transcript isoform.
Figure 5Validation of U2AF1-mediated alternative splicing events that are only detected by AS-Quant (Camk2g) or by AS-Quant and SUPPA2 (Tpm3). (a) Isoform structures of Camk2g and Tpm3 gene and their RNA-seq read coverage plots. Alternatively spliced exons are marked in yellow. (b) Validation of isoform expressions was conducted using RT-PCR and agarose gel electrophoresis. Quantitation of gel images was done using ImageQuant software. Two biological repeats of experiment were performed. Exon inclusion and exclusion are color-coded. Total RNAs from MEFs used for RNA-Seq experiments were used for RT-PCR amplification of Camk2g or Tpm3 transcript isoforms. Scrambled RNA interference is control and U2af1 RNA interference is the case. The PCR primers to detect transcript isoforms for Camk2g or Tpm3 were marked by red arrows and their sequences are reported in the Supplementary document. Schematic of alternative spliced isoform structures for each PCR product is shown next to the gel image. Exon numbers and transcript identification numbers in RefSeq annotation are shown. A higher band intensity of PCR products indicates a higher expression of that specific transcript isoform.
Figure 6Workflow of AS-Quant. Starting with aligned RNA-seq bam files, AS-Quant consists of three steps (i) read coverage estimation, (ii) splicing events categorization and assessment, (iii) visualization.