| Literature DB >> 24655717 |
Kristoffer Vitting-Seerup, Bo Torben Porse, Albin Sandelin1, Johannes Waage.
Abstract
BACKGROUND: RNA-seq data is currently underutilized, in part because it is difficult to predict the functional impact of alternate transcription events. Recent software improvements in full-length transcript deconvolution prompted us to develop spliceR, an R package for classification of alternative splicing and prediction of coding potential.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24655717 PMCID: PMC3998036 DOI: 10.1186/1471-2105-15-81
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The R code for a standard spliceR analysis. The R code necessary to run a standard spliceR analysis based on Cuffdiff output. A) The R code to generate a spliceRList from Cuffdiff output. B-D) The R code for making a standard spliceR analysis.
Figure 2Number of individual alternative splicing events identified. A schematic structure of each alternative splicing type, along with the associated name, abbreviation and the number of classified events in Usp49 KD RNA seq data.
Frequency of splice site consensus sequences
| 5′ end (GT) | 93.23 | 89.72 | 92.08 |
| 3′ end (AG) | 93.70 | 90.64 | 92.71 |
The two first and last nucleotides, corresponding to the splice site consensus sequences, were extracted from all exons originating from the RNA-seq data, Gencode, and RefSeq. The percentage of dinucleotides identical to the canonical motif was calculated.
Comparison of the analyzed transcripts to RefSeq
| | ||||
|---|---|---|---|---|
| Exon level | 97.90 | 89.20 | 98.00 | 88.70 |
| Intron level | 96.40 | 97.00 | 96.40 | 96.90 |
| Intron chain level | 91.70 | 81.90 | 91.80 | 81.90 |
Cufflinks’ cuffcompare tool (v. 2.1.1.) was used to compare RefSeq to the obtained transcripts and the sensitivity and specificity (as defined [20]) for exons, introns, and intron chains are shown.
Figure 3Relative abundance of transcripts. All NMD + transcripts (bottom) and all transcripts with IR (top) was extracted and the density distributions of the IF values from WT and Usp49 KD were plotted.
Figure 4Example of transcript switching. Screen shot from the UCSC genome browser showing the transcript switch found in the SQSTM1 gene. The two top tracks show transcripts generated by the generateGTF() function for WT (top) and Usp49KD (bottom). Darker transcripts are expressed at higher levels. The two bottom tracks indicate RefSeq genes (top) and protein domains identified via Pfam [21] respectively (bottom).