| Literature DB >> 29304726 |
Qin Zhu1, Stephen A Fisher2, Hannah Dueck2, Sarah Middleton1, Mugdha Khaladkar2, Junhyong Kim3.
Abstract
BACKGROUND: Many R packages have been developed for transcriptome analysis but their use often requires familiarity with R and integrating results of different packages requires scripts to wrangle the datatypes. Furthermore, exploratory data analyses often generate multiple derived datasets such as data subsets or data transformations, which can be difficult to track.Entities:
Keywords: Exploratory data analysis; Graphical user interface; Interactive visualization; Transcriptomics
Mesh:
Substances:
Year: 2018 PMID: 29304726 PMCID: PMC5756333 DOI: 10.1186/s12859-017-1994-0
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
List of tools currently integrated/implemented in PIVOT
| PIVOT Modules | Tools Integrated |
|---|---|
| Normalization | DESeq, Modified DESeq, TMM, Upper quartile, CPM/RPKM/TPM, RUV, Spike-in regression, Census |
| Feature/Sample Filtering | List based, Expression based and Quality based filters |
| Basic Analysis Modules | Data distribution plots, Dispersion analysis, Rank-frequency plot, Spike-in analysis, Feature heatmap, etc. |
| Differential Expression | DESeq2, edgeR, SCDE, Monocle, Mann-Whitney U test |
| Clustering/Classification | Hierarchical, K-means, SC3, Community detection, Classification with caret, Cell state ordering with Monocle2/Diffusion pseudotime |
| Dimension Reduction | PCA, t-SNE, Metric/Non-Metric MDS, penalized LDA, |
| Correlation Analysis | Pairwise scatter plots, Sample/feature correlation heatmap, |
| Gene Set Enrichment Analysis | KEGG pathway analysis, Gene ontology analysis |
| Network Analysis | STRING protein association network, Regnetwork visualization, Mogrify based trans-differentiation factor prediction |
| Other Utilities | Data map, Gene ID/Name conversion, BioMart gene annotation query, Venn diagram, Report generation, State saving |
Fig. 1Data management with data map. The map shows the history of the data change and the association between analysis and data nodes. Users can hover over edges to see operation details, or click nodes to get analysis reports or switch active subsets
Fig. 2Selected analysis modules in PIVOT. a The table on the left lists basic sample statistics. The selected statistics are plotted below the table, and clicking a sample in the table will plot its count distribution. b Mean-Standard deviation plot (top left, with vsn package), rank frequency plot (top right) and mean variability plot (bottom, with Seurat package). c The t-SNE module plots 1D, 2D and 3D projections (3D not shown due to space). d Feature heatmap with the top 100 differentially expressed genes reported by DESeq2 likelihood ratio test
Fig. 3Network analysis for the identification of potential transdifferentiation factors. a, b Graphs showing the connection between transcription factors differentially expressed between fibroblasts and ES cells. 3a is an undirected graph showing the protein-protein interaction relationship based on the STRING database, and 3b is constructed based on the Regnetwork repository, showing the regulatory relationship. The size of the nodes and the color gradient indicate the log fold change of the genes. The graphs have been zoomed in to only include the genes with large LFC and small p-value. c Predicted transdifferentiation factor lists based on the network score ranking. The table includes information such as the center transcription factor score, the total number of vertices in its direct neighborhood, and the number of activated neighbors with gene score above a user-specified threshold. Clicking entries on the table will plot the local neighborhood network centered on that TF