Literature DB >> 20385013

web cellHTS2: a web-application for the analysis of high-throughput screening data.

Oliver Pelz1, Moritz Gilsdorf, Michael Boutros.   

Abstract

BACKGROUND: The analysis of high-throughput screening data sets is an expanding field in bioinformatics. High-throughput screens by RNAi generate large primary data sets which need to be analyzed and annotated to identify relevant phenotypic hits. Large-scale RNAi screens are frequently used to identify novel factors that influence a broad range of cellular processes, including signaling pathway activity, cell proliferation, and host cell infection. Here, we present a web-based application utility for the end-to-end analysis of large cell-based screening experiments by cellHTS2.
RESULTS: The software guides the user through the configuration steps that are required for the analysis of single or multi-channel experiments. The web-application provides options for various standardization and normalization methods, annotation of data sets and a comprehensive HTML report of the screening data analysis, including a ranked hit list. Sessions can be saved and restored for later re-analysis. The web frontend for the cellHTS2 R/Bioconductor package interacts with it through an R-server implementation that enables highly parallel analysis of screening data sets. web cellHTS2 further provides a file import and configuration module for common file formats.
CONCLUSIONS: The implemented web-application facilitates the analysis of high-throughput data sets and provides a user-friendly interface. web cellHTS2 is accessible online at http://web-cellHTS2.dkfz.de. A standalone version as a virtual appliance and source code for platforms supporting Java 1.5.0 can be downloaded from the web cellHTS2 page. web cellHTS2 is freely distributed under GPL.

Entities:  

Mesh:

Year:  2010        PMID: 20385013      PMCID: PMC3098057          DOI: 10.1186/1471-2105-11-185

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


Background

High-throughput cell-based screens have become an important experimental tool for the analysis of many cellular processes. Whole genome sequences and methods for gene silencing by RNA interference (RNAi) have enabled loss-of-function analysis in ex vivo and in vivo, opening new avenues for functional analysis that were previously unfeasible [1,2]. Different experimental methods to assess phenotypic changes are being used, from single-channel homogenous readouts to multi-channel cytometry and imaging, producing large data sets that need to be analyzed to extract phenotypically relevant information. RNAi screening has found a broad user-base as a genetic method to dissect many different cellular processes, such as cell survival, signaling pathways and other cellular phenotypes in a high-throughput manner [3-6]. High-throughput screens are mostly performed using 96- to 384-well plates and produce large data sets that need to be normalized, summarized and ranked to generate a list of significant phenotypic modifiers. Large-scale RNAi screens can easily exceed more than 100,000 data points per screening experiment and specialized statistical approaches have been developed for their analysis [7-10]. Quality control assessments of assays and screening data are performed to provide benchmarks for the overall performance, such as experiment-wide performance of controls, reproducibility between replicate experiments, as well as other statistical quality control measures [10-13]. We have previously described cellHTS as an analysis toolbox for cell-based high-throughput screens [7]. cellHTS is implemented in R/Bioconductor [12] as a command-line utility that provides a workflow for the analysis of high-throughput data sets. cellHTS and cellHTS2 have become widely used in the community as they provide an end-to-end solution for the analysis of high-throughput screening data sets, while retaining the flexibility to incorporate further functions for statistical analysis as the field matures. However, an obstacle for general use in the laboratory was the lack of an integrated and easy-to-use solution for the configuration of screening plates, choice of controls and analysis methods. Here, we present a web-based application that guides the user through all steps required for the analysis of high-throughput screens, including configuration of plates and controls, normalization and statistical quality control (Figure 1). The software allows the user to upload a variety of file formats, select different methods for summarization and normalization, and returns a complete analysis report by E-mail or as a download. Analysis workflows can be saved as templates for re-analysis, or possible submission as supplementary material for publications. The software can be accessed at http://web-cellHTS2.dkfz.de or downloaded as a fully functional Java application (for platforms supporting Java 1.5.0) or VirtualBox appliance.
Figure 1

Schematic representation of the workflow for the analysis of high-throughput screening data sets.

Schematic representation of the workflow for the analysis of high-throughput screening data sets.

Implementation

Data files needed for the analysis are generated through the graphical user interface or can be provided through upload forms. web cellHTS2 also provides an import module that supports upload of a spectrum of different file formats. web cellHTS2 implements error detection mechanisms for each data file or website input, checking for common input errors prior to running cellHTS2. Once the configuration of a screening experiment is completed, the analysis project, containing information on the complete session including all input files and processing parameters, can be saved for re-use. This function allows for rapid re-processing of similar datasets and generation of a full documentation of the analysis. The results of the analysis can be streamed to the web browser or can be sent via E-mail directly. web cellHTS2 was implemented based on a Java Server Pages infrastructure using the Tapestry5 [14] open-source web-framework, which facilitates maintenance and extension of the web application. The frontend has been designed to run remotely on a Tomcat5 webserver [15] but can also be installed locally using an integrated Jetty [16] Java web-server. AJAX (Asynchronous JavaScript and XML) technology is used to improve the interactivity of web cellHTS2. Interaction between the cellHTS2 R/Bioconductor software and the Java based web application is achieved using R-serve, which can be run on an independent webserver. This separates the web application logic from the statistical calculations, thereby reducing the computational load for accessing the webserver and allowing the analysis of several high-throughput screening data sets in parallel (Figure 2). The workload for computational calculations can be scaled at runtime by setting the maximum number of parallel analyses.
Figure 2

Implementation of . The user interacts with the application that resides on an internal or external server using a web-browser interface. The software architecture includes Tapestry as extensible framework and interacts with R using a scalable R-server implementation. The software is available as open-source and can be downloaded from http://web-cellhts2.dkfz.de.

Implementation of . The user interacts with the application that resides on an internal or external server using a web-browser interface. The software architecture includes Tapestry as extensible framework and interacts with R using a scalable R-server implementation. The software is available as open-source and can be downloaded from http://web-cellhts2.dkfz.de.

Results

web cellHTS2 facilitates the analysis of high-throughput screening data by providing an easy to use web-application. It has been developed with a view towards large-scale RNAi screens but can also be employed for the analysis of small molecule screens. A particular focus has been to provide a user-friendly interface to select analysis parameters and to generate "re-usable" analysis workflows. Furthermore, error-checking procedures of raw data and annotation files, and automated pre-processing of uploaded data have been implemented. web cellHTS2 can be accessed online or downloaded for local installation. web cellHTS can also be downloaded as a virtual appliance to run web cellHTS2 in a contained environment [17]. The web application implements three steps in the analysis workflow. In the first step, the user starts a new analysis by choosing the type of experiment (current options are single or dual-channel experiments) or can upload a previously saved workflow (Figure 3a). Uploaded workflows can be modified, e.g. by altering analysis parameters or replacing data files. In the next step, raw data files from high-throughput screening experiments are uploaded. The web application recognizes plate, replicate and channel files, or file assignments can be manually annotated. This feature allows rapid upload of large data sets that can easily entail several hundred data files (such as generated by multi-mode platereaders). Also, previously generated "plate list" files can be uploaded (Figure 3b). An import module can also be used to upload more complex data files. Step three involves configuration of the layout of the multi-well plates which were used in the screening experiment. The graphical user interface allows the user to indicate which wells contain negative, positive and other controls (Figure 3c). Alternatively, previously generated configuration files with existing plate layouts can be uploaded. At this stage the user can select among multiple options regarding how the data is processed and different channels summarized. Table 1 shows a list of options that are currently implemented in cellHTS2, which include both sample and control-based normalization methods, as well as procedures such as B-score normalization [18] to remove spatial artefacts. In the last step, annotation and description files are uploaded or manually described using an edit form. The workflow can then be saved for future analysis or reference. The analysis report is either streamed as a compressed file via HTML or sent by E-mail. A tutorial on the analysis of high-throughput screens with web cellHTS2, including a sample data set, is provided on the website.
Figure 3

Screenshots of the analysis workflow of high-throughput screens by web cellHTS2. (a) The user can start a new analysis or upload previous analysis templates. (b) The data file upload form with parameter editor. (c) Graphical plate configuration editor.

Table 1

Examples of normalization options

Normalization optionDescription
MedianMeasurements are divided by the median of all sample wells in the plate

ShorthThe midpoint of the 'shorth' of the distribution of all sample wells is used for normalization

MeanMeasurements are divided by the mean of all sample wells in the plate

NegativesMeasurements are divided by the median of negative controls in the plate

Percent controlMeasurements are divided by the mean of the plate's positive control

Normalized percent controlMeasurements are divides by the difference of the plates positive and negative controls

B-scoreA two-way (row and column) median polish is applied to each plate

Robust local fit regressionSpatial effects are normalized by fitting a bivariate local regression

Loess regressionSpatial effects are normalized using Loess regression
Screenshots of the analysis workflow of high-throughput screens by web cellHTS2. (a) The user can start a new analysis or upload previous analysis templates. (b) The data file upload form with parameter editor. (c) Graphical plate configuration editor. Examples of normalization options

Discussion

The application presented here is a web or stand-alone program to facilitate the analysis of high-throughput screening data. High-throughput screening experiments are of increasing importance, both for basic science and drug discovery. Such data sets easily exceed the complexity of transcriptome experiments, however there are still comparably much fewer tools available that enable an easy-to-use analysis. cellHTS and other software packages [9] have started to address this issue by enabling an end-to-end analysis of high-throughput screening data sets and have become widely used in the community. Here, we provide a web application as a front-end for cellHTS2 to increase its accessibility and accelerate the analysis of high-throughput screening data sets. The web application can be used both for RNAi and compound screening experiments and can be extended to meet future needs. In contrast to commercial packages, we provide an open-source and extensible solution for online and offline usage.

Conclusions and future directions

web cellHTS2 provides an intuitive interface for the analysis of high-throughput screens. The user can choose among different options for the analysis of screening data sets. Statistical analysis options will be expanded as new methods become available and broadly used [9,18]. The graphical user interface for the configuration of screening experiments and the option to save "re-usable" session templates make it convenient to use in the laboratory. Future developments of the application will be to provide direct links to phenotype databases [19], e.g. to compare hit lists, to annotate hit list with additional information from public databases e.g. through BioMart and to extend the analysis by functional annotation data such GO enrichment analysis. It is also planned to provide diagnostic plots "on-the-fly" to allow the user to compare different normalization strategies.

Availability and requirements

Project name: web cellHTS2 Project home page: http://web-cellHTS2.dkfz.de Operating system(s): Platform independent Programming language: e.g. Java Other requirements: Java 1.5.0 Downloadable Version: R 2.10.0, cellHTS2 2.11.1 and Rserve 0.6.0 Virtual appliance: Open source software Virtual box http://www.virtualbox.org License: GNU GPL Any restrictions to use by non-academics: none

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

OP developed the software. MG provided advice in the design and development of the software package. MB conceived the concept and methodology and supervised the project. MB and OP wrote the manuscript. All authors read and approved the final manuscript.
  15 in total

1.  A Simple Statistical Parameter for Use in Evaluation and Validation of High Throughput Screening Assays.

Authors: 
Journal:  J Biomol Screen       Date:  1999

2.  Improved statistical methods for hit selection in high-throughput screening.

Authors:  Christine Brideau; Bert Gunter; Bill Pikounis; Andy Liaw
Journal:  J Biomol Screen       Date:  2003-12

Review 3.  The art and design of genetic screens: RNA interference.

Authors:  Michael Boutros; Julie Ahringer
Journal:  Nat Rev Genet       Date:  2008-06-03       Impact factor: 53.242

4.  RNAither, an automated pipeline for the statistical analysis of high-throughput RNAi screens.

Authors:  Nora Rieber; Bettina Knapp; Roland Eils; Lars Kaderali
Journal:  Bioinformatics       Date:  2009-01-25       Impact factor: 6.937

Review 5.  Statistical methods for analysis of high-throughput RNA interference screens.

Authors:  Amanda Birmingham; Laura M Selfors; Thorsten Forster; David Wrobel; Caleb J Kennedy; Emma Shanks; Javier Santoyo-Lopez; Dara J Dunican; Aideen Long; Dermot Kelleher; Queta Smith; Roderick L Beijersbergen; Peter Ghazal; Caroline E Shamu
Journal:  Nat Methods       Date:  2009-08       Impact factor: 28.547

6.  Functional genomics reveals genes involved in protein secretion and Golgi organization.

Authors:  Frederic Bard; Laetitia Casano; Arrate Mallabiabarrena; Erin Wallace; Kota Saito; Hitoshi Kitayama; Gianni Guizzunti; Yue Hu; Franz Wendler; Ramanuj Dasgupta; Norbert Perrimon; Vivek Malhotra
Journal:  Nature       Date:  2006-02-02       Impact factor: 49.962

7.  Bioconductor: open software development for computational biology and bioinformatics.

Authors:  Robert C Gentleman; Vincent J Carey; Douglas M Bates; Ben Bolstad; Marcel Dettling; Sandrine Dudoit; Byron Ellis; Laurent Gautier; Yongchao Ge; Jeff Gentry; Kurt Hornik; Torsten Hothorn; Wolfgang Huber; Stefano Iacus; Rafael Irizarry; Friedrich Leisch; Cheng Li; Martin Maechler; Anthony J Rossini; Gunther Sawitzki; Colin Smith; Gordon Smyth; Luke Tierney; Jean Y H Yang; Jianhua Zhang
Journal:  Genome Biol       Date:  2004-09-15       Impact factor: 13.583

8.  ZNF423 is critically required for retinoic acid-induced differentiation and is a marker of neuroblastoma outcome.

Authors:  Sidong Huang; Jamila Laoukili; Mirjam T Epping; Jan Koster; Michael Hölzel; Bart A Westerman; Wouter Nijkamp; Akiko Hata; Shahab Asgharzadeh; Robert C Seeger; Rogier Versteeg; Roderick L Beijersbergen; René Bernards
Journal:  Cancer Cell       Date:  2009-04-07       Impact factor: 31.743

9.  FlyRNAi: the Drosophila RNAi screening center database.

Authors:  Ian Flockhart; Matthew Booker; Amy Kiger; Michael Boutros; Susan Armknecht; Nadire Ramadan; Kris Richardson; Andrew Xu; Norbert Perrimon; Bernard Mathey-Prevot
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  GenomeRNAi: a database for cell-based RNAi phenotypes.

Authors:  Thomas Horn; Zeynep Arziman; Juerg Berger; Michael Boutros
Journal:  Nucleic Acids Res       Date:  2006-11-28       Impact factor: 16.971

View more
  26 in total

Review 1.  Systems biology in immunology: a computational modeling perspective.

Authors:  Ronald N Germain; Martin Meier-Schellersheim; Aleksandra Nita-Lazar; Iain D C Fraser
Journal:  Annu Rev Immunol       Date:  2011       Impact factor: 28.527

2.  Dual myxovirus screen identifies a small-molecule agonist of the host antiviral response.

Authors:  Dan Yan; Stefanie A Krumm; Aiming Sun; David A Steinhauer; Ming Luo; Martin L Moore; Richard K Plemper
Journal:  J Virol       Date:  2013-08-07       Impact factor: 5.103

3.  Sclerotiamide: The First Non-Peptide-Based Natural Product Activator of Bacterial Caseinolytic Protease P.

Authors:  Nathan P Lavey; Jesse A Coker; Eliza A Ruben; Adam S Duerfeldt
Journal:  J Nat Prod       Date:  2016-03-11       Impact factor: 4.050

4.  HTSanalyzeR: an R/Bioconductor package for integrated network analysis of high-throughput screens.

Authors:  Xin Wang; Camille Terfve; John C Rose; Florian Markowetz
Journal:  Bioinformatics       Date:  2011-01-22       Impact factor: 6.937

5.  Cross-resistance mechanism of respiratory syncytial virus against structurally diverse entry inhibitors.

Authors:  Dan Yan; Sujin Lee; Vidhi D Thakkar; Ming Luo; Martin L Moore; Richard Karl Plemper
Journal:  Proc Natl Acad Sci U S A       Date:  2014-08-04       Impact factor: 11.205

6.  The Retinoid Agonist Tazarotene Promotes Angiogenesis and Wound Healing.

Authors:  Ayman Al Haj Zen; Dorota A Nawrot; Alison Howarth; Andrea Caporali; Daniel Ebner; Aude Vernet; Jurgen E Schneider; Shoumo Bhattacharya
Journal:  Mol Ther       Date:  2016-08-02       Impact factor: 11.454

7.  siMacro: A Fast and Easy Data Processing Tool for Cell-Based Genomewide siRNA Screens.

Authors:  Nitin Kumar Singh; Bo Yeun Seo; Mathukumalli Vidyasagar; Michael A White; Hyun Seok Kim
Journal:  Genomics Inform       Date:  2013-03-31

8.  GUItars: a GUI tool for analysis of high-throughput RNA interference screening data.

Authors:  Asli N Goktug; Su Sien Ong; Taosheng Chen
Journal:  PLoS One       Date:  2012-11-20       Impact factor: 3.240

9.  RNAi-based screening identifies kinases interfering with dioxin-mediated up-regulation of CYP1A1 activity.

Authors:  David Gilot; Nolwenn Le Meur; Fanny Giudicelli; Marc Le Vée; Dominique Lagadic-Gossmann; Nathalie Théret; Olivier Fardel
Journal:  PLoS One       Date:  2011-03-29       Impact factor: 3.240

10.  High throughput screening for small molecule enhancers of the interferon signaling pathway to drive next-generation antiviral drug discovery.

Authors:  Dhara A Patel; Anand C Patel; William C Nolan; Yong Zhang; Michael J Holtzman
Journal:  PLoS One       Date:  2012-05-04       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.