Literature DB >> 28505270

HTSvis: a web app for exploratory data analysis and visualization of arrayed high-throughput screens.

Christian Scheeder1, Florian Heigwer1, Michael Boutros1.   

Abstract

SUMMARY: Arrayed high-throughput screens (HTS) cover a broad range of applications using RNAi or small molecules as perturbations and specialized software packages for statistical analysis have become available. However, exploratory data analysis and integration of screening results has remained challenging due to the size of the data sets and the lack of user-friendly tools for interpretation and visualization of screening results. Here we present HTSvis, a web application to interactively visualize raw data, perform quality control and assess screening results from single to multi-channel measurements such as image-based screens. Per well aggregated raw and analyzed data of various assay types and scales can be loaded in a generic tabular format.
AVAILABILITY AND IMPLEMENTATION: HTSvis is distributed as an open-source R package, downloadable from https://github.com/boutroslab/HTSvis and can also be accessed at http://htsvis.dkfz.de . CONTACT: m.boutros@dkfz.de. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online .
© The Author(s) 2017. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2017        PMID: 28505270      PMCID: PMC5870698          DOI: 10.1093/bioinformatics/btx319

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Arrayed high-throughput screens (HTS) in high-density multiwell plates are a powerful method for small molecule screening and target discovery (Macarron ; Sundberg, 2000). Automated technologies allow to screen tens of thousands of genetic or chemical perturbations, resulting in very large datasets. HTS experiments can range in complexity from univariate cell viability measurements (Whitehurst ), to multichannel fluorescence-activated cell sorting (FACS) (Björklund ) or multiparametric image based screens (Bray ; Fischer ). A range of statistical analysis methods have been developed for processing, normalization and quality control of HTS data to robustly identify and annotate significant perturbations (Birmingham ). Open-source software for integrated statistical analysis using statistical languages, such as cellHTS has been developed previously (Boutros ; Dutta ; List ). Although commercial desktop software, e.g. TIBCO Spotfire, exist for visualization and exploratory data analysis, few open-source options, in particular for multiparametric screens, are available (Antal ; Dao ). Thus, there is a need for lightweight software packages that are easy to install and use to aid the interpretation and evaluation of HTS data without requiring extensive programming skills.

2 The HTSvis application

We developed HTSvis, an application for the visualization of data from arrayed HTSs. After installation as an R package, data input and all user interactions are controlled via a user interface requiring no programming skills. Input data can be in commonly used formats to store raw- and analyzed data, such as delimited files (.txt, .csv, .xlsx) or RData stores. In addition, we provide a web service to access HTSvis (http://htsvis.dkfz.de). HTSvis accepts data in a generic tabular format, providing flexibility towards the assay type (e.g. multiparametric data) and scale (Fig. 1A). In particular, data that have been statistically analyzed with the R/Bioconductor package cellHTS (Boutros ) can be imported directly into HTSvis for exploratory data analysis.
Fig. 1

Workflow diagram and functionalities for visualization and exploratory data analysis of arrayed HTSs. (A) Raw data or statistically analyzed data of various assay formats, ranging from single-channel readouts to image features in 6- to 384-well plates, in tabular formats can be loaded into the application to facilitate interactive data visualizations. (B) The user can switch between four pages for performing, e.g. the identification of experimental artifacts (see plate viewer), perform quality control checks and the identification of hits by brushing and comparing measurements

Workflow diagram and functionalities for visualization and exploratory data analysis of arrayed HTSs. (A) Raw data or statistically analyzed data of various assay formats, ranging from single-channel readouts to image features in 6- to 384-well plates, in tabular formats can be loaded into the application to facilitate interactive data visualizations. (B) The user can switch between four pages for performing, e.g. the identification of experimental artifacts (see plate viewer), perform quality control checks and the identification of hits by brushing and comparing measurements

2.1 Local installation and data structure

HTSvis can be installed on local computers from GitHub (https://github.com/boutroslab/HTSvis). After loading the package in R, a single command launches the app in any default web browser. Further instructions, also how to deploy HTSvis in a local shiny server, are documented on the GitHub repository. Input data can be in common tabular formats (.txt, .csv, .xlsx) and requires a certain structure and annotation, such as well and plate annotations and measured variables in distinct, named columns. Specifics about input formats are detailed in Supplementary Material. When data were analyzed with cellHTS, the summary table (‘topTable.txt’) provides all required information and can be uploaded directly. The number of parameters per well is not limited. This allows to load multiparametric datasets from various assay types. More detailed help can be found within the application.

2.2 Interactive data exploration

2.2.1 Spatial plate analysis: plate viewer tab

Plate plots show the data in the format it was measured (e.g. 384-well plates, Fig. 1B). By interactively comparing different plates and measurements, spatial distribution of values can be assessed. This allows to interactively browse the dataset and facilitates the identification of experimental artifacts, such as edge effects (Fig. 1B). The color scale for each plate plot can be adapted for comparisons between plates, e.g. biological replicates. A tooltip on each well provides quick information of the numeric value and annotation (e.g. perturbation reagent) per well.

2.2.2 Assessing screening quality: quality control tab

Screen quality and integrity is commonly assessed based on control perturbations, for which a known phenotypic effect is expected (Birmingham ). Up to three control populations (positive, negative and non-targeting) can be defined by selecting wells on a plate map (Fig. 1B). A scatter plot of values vs. plates, a box and a density plot (Kernel-density estimation) of controls are shown. The box and density plot summarize how well controls are separated and allow to estimate effect size and performance of the assay (Z’-factor). The scatter plot adds information about measured values of individual plates over the entire experiment.

2.2.3 Data interpretation: scatter plot tab

The scatter plot tab is a visual tool for quality control and exploratory data analysis. To evaluate the correlation between replicates and to judge the experiment reproducibility, two experiments are plotted against each other. Experiment and measured variable are chosen ad hoc. Users can also brush data points by box selection. Brushed data points can be assigned to a subpopulation with a user-defined name and color (Fig. 1B). Multiple populations can be created and compared. Hypotheses, e.g. how measurements of interest behave in different experimental conditions can be tested accordingly. Brushing of data points is linked to the well and plate position, hence is persistent when measured variable or experiment is changed. This way differential effects between conditions (e.g. between control and drug treatment) can be identified.

2.3 Conclusions

HTSvis is a locally deployable web application to explore and visualize data of arrayed screens with various readouts and scales. Interactive plots and tables provide an advantage compared to the handling of individual files and programming scripts, e.g. one for each plate or plot. Ease-of-use from installation to data input and visualization via the user interface is the main characteristic of HTSvis. Reactive data representations that can be readily accessed provide a versatile tool for exploratory data analysis filling a yet unmet need in the HTS community. Click here for additional data file.
  12 in total

Review 1.  High-throughput and ultra-high-throughput screening: solution- and cell-based approaches.

Authors:  S A Sundberg
Journal:  Curr Opin Biotechnol       Date:  2000-02       Impact factor: 9.740

2.  Synthetic lethal screen identification of chemosensitizer loci in cancer cells.

Authors:  Angelique W Whitehurst; Brian O Bodemann; Jessica Cardenas; Deborah Ferguson; Luc Girard; Michael Peyton; John D Minna; Carolyn Michnoff; Weihua Hao; Michael G Roth; Xian-Jin Xie; Michael A White
Journal:  Nature       Date:  2007-04-12       Impact factor: 49.962

Review 3.  Statistical methods for analysis of high-throughput RNA interference screens.

Authors:  Amanda Birmingham; Laura M Selfors; Thorsten Forster; David Wrobel; Caleb J Kennedy; Emma Shanks; Javier Santoyo-Lopez; Dara J Dunican; Aideen Long; Dermot Kelleher; Queta Smith; Roderick L Beijersbergen; Peter Ghazal; Caroline E Shamu
Journal:  Nat Methods       Date:  2009-08       Impact factor: 28.547

Review 4.  Impact of high-throughput screening in biomedical research.

Authors:  Ricardo Macarron; Martyn N Banks; Dejan Bojanic; David J Burns; Dragan A Cirovic; Tina Garyantes; Darren V S Green; Robert P Hertzberg; William P Janzen; Jeff W Paslay; Ulrich Schopfer; G Sitta Sittampalam
Journal:  Nat Rev Drug Discov       Date:  2011-03       Impact factor: 84.694

5.  Cell Painting, a high-content image-based assay for morphological profiling using multiplexed fluorescent dyes.

Authors:  Mark-Anthony Bray; Shantanu Singh; Han Han; Chadwick T Davis; Blake Borgeson; Cathy Hartland; Maria Kost-Alimova; Sigrun M Gustafsdottir; Christopher C Gibson; Anne E Carpenter
Journal:  Nat Protoc       Date:  2016-08-25       Impact factor: 13.491

6.  Identification of pathways regulating cell size and cell-cycle progression by RNAi.

Authors:  Mikael Björklund; Minna Taipale; Markku Varjosalo; Juha Saharinen; Juhani Lahdenperä; Jussi Taipale
Journal:  Nature       Date:  2006-02-23       Impact factor: 49.962

7.  Comprehensive analysis of high-throughput screens with HiTSeekR.

Authors:  Markus List; Steffen Schmidt; Helle Christiansen; Marc Rehmsmeier; Qihua Tan; Jan Mollenhauer; Jan Baumbach
Journal:  Nucleic Acids Res       Date:  2016-06-21       Impact factor: 16.971

8.  A map of directional genetic interactions in a metazoan cell.

Authors:  Bernd Fischer; Thomas Sandmann; Thomas Horn; Maximilian Billmann; Varun Chaudhary; Wolfgang Huber; Michael Boutros
Journal:  Elife       Date:  2015-03-06       Impact factor: 8.140

9.  CellProfiler Analyst: interactive data exploration, analysis and classification of large biological image sets.

Authors:  David Dao; Adam N Fraser; Jane Hung; Vebjorn Ljosa; Shantanu Singh; Anne E Carpenter
Journal:  Bioinformatics       Date:  2016-06-26       Impact factor: 6.937

10.  Mineotaur: a tool for high-content microscopy screen sharing and visual analytics.

Authors:  Bálint Antal; Anatole Chessel; Rafael E Carazo Salas
Journal:  Genome Biol       Date:  2015-12-17       Impact factor: 13.583

View more
  2 in total

1.  RNA Interference (RNAi) Screening in Drosophila.

Authors:  Florian Heigwer; Fillip Port; Michael Boutros
Journal:  Genetics       Date:  2018-03       Impact factor: 4.562

2.  iSEE: Interactive SummarizedExperiment Explorer.

Authors:  Kevin Rue-Albrecht; Federico Marini; Charlotte Soneson; Aaron T L Lun
Journal:  F1000Res       Date:  2018-06-14
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.