| Literature DB >> 25111964 |
Michael Fiebig1, Eva Gluenz1, Mark Carrington2, Steven Kelly3.
Abstract
The Kinetoplastida are a diverse and globally distributed class of free-living and parasitic single-celled eukaryotes that collectively cause a significant burden on human health and welfare. In kinetoplastids individual genes do not have promoters, but rather all genes are arranged downstream of a small number of RNA polymerase II transcription initiation sites and are thus transcribed in polycistronic gene clusters. Production of individual mRNAs from this continuous transcript occurs co-transcriptionally by trans-splicing of a ∼39 nucleotide capped RNA and subsequent polyadenylation of the upstream mRNA. SLaP mapper (Spliced-Leader and Polyadenylation mapper) is a fully automated web-service for identification, quantitation and gene-assignment of both spliced-leader and polyadenylation addition sites in Kinetoplastid genomes. SLaP mapper only requires raw read data from paired-end Illumina RNAseq and performs all read processing, mapping, quality control, quantification, and analysis in a fully automated pipeline. To provide usage examples and estimates of the quantity of sequence data required we use RNAseq obtained from two different library preparations from both Trypanosoma brucei and Leishmania mexicana to show the number of expected reads that are obtained from each preparation type. SLaP mapper is an easy to use, platform independent webserver that is freely available for use at http://www.stevekellylab.com/software/slap. Example files are provided on the website.Entities:
Keywords: Kinetoplastida; Leishmania; Polyadenylation; RNA splicing; RNA-Seq; Trypanosoma
Mesh:
Substances:
Year: 2014 PMID: 25111964 PMCID: PMC4222701 DOI: 10.1016/j.molbiopara.2014.07.012
Source DB: PubMed Journal: Mol Biochem Parasitol ISSN: 0166-6851 Impact factor: 1.759
The number of reads observed using different library preparation methods in two different species. L. mexicana based on 3 independent biological replicates. T. brucei based on 2 independent biological replicates. Numbers in brackets indicate one standard deviation.
| Species | Library type | Poly(A) reads per million reads | Trans-splice reads per million reads |
|---|---|---|---|
| T15VN | 34 784 (2100) | 4343 (300) | |
| Random primed | 5673 (730) | 79 773 (10 000) | |
| T15VN | 167 864 (8000) | 4782 (250) | |
| Random primed | 701 (50) | 76 563 (4800) |
Fig. 1(A) Screen shot of SLaP mapper results as visualised on the IGV genome browser. Four data tracks are shown. CDS are the gene models from V6 of the L. mexicana genome. Coverage is the from raw RNAseq reads mapped to the L. mexicana V6 genome (the coloured lines in the coverage plot indicate single nucleotide polymorphisms between the genome reference and the strain used for RNAseq). SAS are the splice acceptor addition sites identified by SLaP mapper. PAS are the polyadenylation addition sites identified by SLaP mapper. (B) The corresponding entries in the SLaP mapper results file for all SAS sites shown in A. The poly(A) results for the 55 sites shown in part A are not listed for space reasons. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)