| Literature DB >> 27079975 |
Fidel Ramírez1, Devon P Ryan1, Björn Grüning2, Vivek Bhardwaj3, Fabian Kilpert1, Andreas S Richter1, Steffen Heyne1, Friederike Dündar4, Thomas Manke5.
Abstract
We present an update to our Galaxy-based web server for processing and visualizing deeply sequenced data. Its core tool set, deepTools, allows users to perform complete bioinformatic workflows ranging from quality controls and normalizations of aligned reads to integrative analyses, including clustering and visualization approaches. Since we first described our deepTools Galaxy server in 2014, we have implemented new solutions for many requests from the community and our users. Here, we introduce significant enhancements and new tools to further improve data visualization and interpretation. deepTools continue to be open to all users and freely available as a web service at deeptools.ie-freiburg.mpg.de The new deepTools2 suite can be easily deployed within any Galaxy framework via the toolshed repository, and we also provide source code for command line usage under Linux and Mac OS X. A public and documented API for access to deepTools functionality is also available.Entities:
Mesh:
Year: 2016 PMID: 27079975 PMCID: PMC4987876 DOI: 10.1093/nar/gkw257
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Typical applications of deepTools components and a summary of their main inputs and outputs
|
|
Figure 1.deepTools2 has enhanced features for quality control, data reduction, normalization and visualization of deeply sequenced data. (A) deepTools2 comes with two new quality control tools: plotCoverage, which displays sequencing depth distribution; and plotPCA, which plots the results of principal component analysis (PCA) on BAM or bigWig files. (B) Signal visualization by means of normalized bigWig tracks, produced by deepTools’ bamCompare (ChIP-seq samples) and bamCoverage, which can now handle MNase- and strand-specific RNA-seq samples. In the genes track, the gray color represents untranslated regions and the thin lines represent introns. (C) computeMatrix and plotHeatmap were used to summarize and cluster multiple bigWig scores over genomic intervals. The image shows the resulting k-means clustering of ChIP-seq signals of three histone marks around the transcription start site of genes (with k = 2). Subsequently, the clustered regions were used to plot MNase-seq and CAGE read coverages. In the image, each row corresponds to the same genomic region. (D) plotProfile now offers additional means for visualization. The top panel shows the average signal for different samples and different cluster of regions (left and right plot). In the middle, the same profiles are represented as heatmaps. The bottom panel shows summary profiles where colors correspond to the observed frequency of the signal within each cluster. All data are from the Drosophila melanogaster S2 cell line and are publically available (see Section Accession numbers).