| Literature DB >> 27905880 |
Harold Pimentel1, Pascal Sturmfels2, Nicolas Bray3, Páll Melsted4, Lior Pachter5,6.
Abstract
Increased emphasis on reproducibility of published research in the last few years has led to the large-scale archiving of sequencing data. While this data can, in theory, be used to reproduce results in papers, it is difficult to use in practice. We introduce a series of tools for processing and analyzing RNA-Seq data in the Sequence Read Archive, that together have allowed us to build an easily extendable resource for analysis of data underlying published papers. Our system makes the exploration of data easily accessible and usable without technical expertise. Our database and associated tools can be accessed at The Lair: http://pachterlab.github.io/lair .Entities:
Keywords: Exploratory data analysis; Interactive visualization; Kallisto; RNA-Seq; Reanalysis; Reproducibility; Sequence read archive; Shiny; Sleuth
Mesh:
Year: 2016 PMID: 27905880 PMCID: PMC5131447 DOI: 10.1186/s12859-016-1357-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Workflow of The Lair system for distributing analysis of sequence read archive data. The inputs to the system are sets of two files: config.json file that specifies parameters to be used during the processing of each experiment and a design matrix for each experiment that specifies its structure. A master Snakemake workflow organizes a series of computations starting with downloading of data to the sequence read archive and ending with deployment of a sleuth analyses to a Shiny server. Finally, a website generated from information in the config.json files links to objects in the Shiny server thus providing access to the processed experiments
Fig. 2Principal Components Analysis of the Trapnell et al. HOXA1 knockdown RNA-Seq data. The Lair allows for plotting projections with respect to any pair of principal components, and also identifies the transcripts constituting the loadings of each dimension
Fig. 3Transcript abundances for the differential isoform of the TBX3 gene in the Trapnell et al. data. The error bars on each quantification are produced via the bootstrap feature of kallisto, which establishes the inferential variance associated with quantification. The Lair provides an interactive template for viewing such plots for any transcript