| Literature DB >> 26046471 |
Mattia D'Antonio, Paolo D'Onorio De Meo, Matteo Pallocca, Ernesto Picardi, Anna Maria D'Erchia, Raffaele A Calogero, Tiziana Castrignanò, Graziano Pesole.
Abstract
BACKGROUND: The study of RNA has been dramatically improved by the introduction of Next Generation Sequencing platforms allowing massive and cheap sequencing of selected RNA fractions, also providing information on strand orientation (RNA-Seq). The complexity of transcriptomes and of their regulative pathways make RNA-Seq one of most complex field of NGS applications, addressing several aspects of the expression process (e.g. identification and quantification of expressed genes and transcripts, alternative splicing and polyadenylation, fusion genes and trans-splicing, post-transcriptional events, etc.).Entities:
Mesh:
Substances:
Year: 2015 PMID: 26046471 PMCID: PMC4461013 DOI: 10.1186/1471-2164-16-S6-S3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Schematic description of RAP workflow. Quality check and filtering step for quality assessment (Steps 1 and 2). High quality reads are aligned to the reference genome using TopHat (Step 3). Alignments are assembled into full-length transcripts and their relative abundances are estimated by Cufflinks to (Step 4) and raw-counted by HTSeq (Step 5). Unspliced reads are filtered out after an ungapped alignment to the genome (Step 6). Remaining reads (potentially spliced) are mapped to a custom built junction library (Step 7). Reads still unmapped are scanned to identify poly(A) tags (Step 8). Cassette exons are identified and quantified by adopting a statistical tool, SpliceTrap (Step 9), and chimeric transcripts are detected by means of ChimeraScan (Step 10). After the completion of the main analysis, several differential analyses can be executed. Cuffdiff at transcript level (Step A), based on expression levels calculated by Cufflinks. DESeq at gene level (Step B), based on gene raw counts calculated by HTSeq. Differential exons (Step C), differential junctions usage (Step D) and differential polyadenylation sites (Step E) can also be calculated.
Figure 2RAP Banner and results examples. From top left corner in counterclockwise direction: gene structure view with alternative isoforms of gene TP53, example of chromosomal distribution of polyadenylation sites, example of results table with detailed information about expressed transcripts and visualization of the query form for data filtering.