Literature DB >> 27354701

CellProfiler Analyst: interactive data exploration, analysis and classification of large biological image sets.

David Dao1, Adam N Fraser2, Jane Hung3, Vebjorn Ljosa2, Shantanu Singh2, Anne E Carpenter2.   

Abstract

CellProfiler Analyst allows the exploration and visualization of image-based data, together with the classification of complex biological phenotypes, via an interactive user interface designed for biologists and data scientists. CellProfiler Analyst 2.0, completely rewritten in Python, builds on these features and adds enhanced supervised machine learning capabilities (Classifier), as well as visualization tools to overview an experiment (Plate Viewer and Image Gallery).
AVAILABILITY AND IMPLEMENTATION: CellProfiler Analyst 2.0 is free and open source, available at http://www.cellprofiler.org and from GitHub (https://github.com/CellProfiler/CellProfiler-Analyst) under the BSD license. It is available as a packaged application for Mac OS X and Microsoft Windows and can be compiled for Linux. We implemented an automatic build process that supports nightly updates and regular release cycles for the software. CONTACT: anne@broadinstitute.orgSupplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2016        PMID: 27354701      PMCID: PMC5048071          DOI: 10.1093/bioinformatics/btw390

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

CellProfiler Analyst is open-source software for biological image-based classification, data exploration and visualization with an interactive user interface designed for biologists and data scientists. Using data from feature extraction software such as CellProfiler (Kamentsky ), CellProfiler Analyst offers easy-to-use tools for exploration and mining of image data, which is being generated in ever increasing amounts, particularly in high-content screens (HCS). Its tools can help identify complex and subtle phenotypes, improve quality control and provide single-cell and population-level information from experiments. Some distinctive and critical features of CellProfiler Analyst are its user-friendly object-based machine learning interface, its ability to handle the tremendous scale of HCS experiments (millions of cell images), its gating capabilities that allow observing relationships among different data displays, and its exploration tools which enable interactively viewing connections between cell-level data and well-level data, and among raw images, processed/segmented images, extracted features and sample metadata. Compared to other commonly-cited open-source biological image classification software like Ilastik (Sommer ), CellCognition (Held ) and WND-CHARM (Orlov ), CellProfiler Analyst has the advantage of containing companion visualization tools, being suitable for high-throughput datasets, having multiple classifier options, and allowing both cell and field-of-view classification. Advanced Cell Classifier (Horvath ) shares many of the classification features of CellProfiler Analyst, but it lacks HCS data exploration and visualization tools. Compared to command-line-based data exploration software like cellHTS (Boutros ) and imageHTS (Pau ) and the web tool web CellHTS2 (Pelz ), CellProfiler Analyst provides interactive object classification and image viewing. Several other software tools (e.g. the HCDC set of modules for KNIME (Berthold )) are no longer available/maintained. Here, we present major improvements to CellProfiler Analyst. Since its original publication (Jones ), CellProfiler Analyst has been rewritten in Python (vs. its original language, Java) with significant enhancements. While keeping the original functionality allowing researchers to visualize data through histograms, scatter plots and density plots and to explore and score phenotypes by sequential gating, the key new features include: multiple machine learning algorithms that can be trained to identify multiple phenotypes in single cells or whole fields of view, by simple drag and drop more efficient handling of large scale, high-dimensional data a gallery view to explore images in an experiment, and cells in individual images and a plate layout view to explore aggregated cell measurements or image thumbnails for single or multiple plates.

2 New features in CellProfiler Analyst 2.0

Classifier: CellProfiler Analyst 1.0 allowed researchers to train a single classifier (Gentle Boosting) to recognize a single phenotype (two-class) in individual cell images (rather than whole fields-of-view) (Jones ). In CellProfiler Analyst 2.0 (Fig. 1) , Classifier can perform cell and field-of-view-level classification of multiple phenotypes (multi-class) using popular models like Random Forest, SVM and AdaBoost from the high performance machine learning library scikit-learn (Pedregosa ), which yields a ∼200-fold improvement in speed (Supplementary Data 1). First, cell- or whole-image samples from the experiment are fetched and sorted by drag and drop into researcher-defined classes, making up the annotated training set. Fetching can be random, based on filters, based on per-class predictions of an already-trained classifier, or based on active learning. The new active learning option speeds annotation by presenting uncertain cases. In addition, researchers can view full images of each sample and drag and drop cells from the image for annotation. Next, a classifier is trained on this set. After training on the annotated set, a model’s performance can be evaluated by cross validation in the form of a confusion matrix and precision, recall and F1 score per class. The model can then be used to quantify cell phenotypes or whole-image phenotypes.
Fig. 1.

User Interface of CellProfiler Analyst. (A) Main Toolbar and Image Gallery; (B) Classifier; (C) Plate Viewer (Color version of this figure is available at Bioinformatics online.)

User Interface of CellProfiler Analyst. (A) Main Toolbar and Image Gallery; (B) Classifier; (C) Plate Viewer (Color version of this figure is available at Bioinformatics online.) Image Gallery: CellProfiler Analyst 2.0 offers a convenient new Image Gallery tool (Fig. 1A), in addition to the existing visualization/exploration tools with standard plotting and gating capabilities in version 1.0 (Jones ). Image Gallery provides a convenient grid view allowing an overview of images. A variety of options are provided to filter images based on experiment-specific metadata, e.g. gene name, compound treatments, etc. Multiple filters can be combined to refine the search. Images can be displayed as a custom-sized thumbnail or in full resolution, and the color assigned to each channel in the image can be customized to highlight structures of interest. Individual segmented cells can be viewed for each image, and can be dragged and dropped into the Classifier window. Plate Viewer : Many large-scale imaging experiments take place in multi-well plate format. Researchers are often interested in seeing their data overlaid on this format, to check for systematic sample quality issues, or to see results from controls placed in particular locations, at a glance. The Plate Viewer tool (Fig. 1C) displays aggregated and/or filtered measurements (according to customizable color maps) or a thumbnail image for each well. Automatically imported annotations can be viewed, and individual annotations can be manually added or deleted for each well. Additional features : Additional features added to CellProfiler Analyst vs. version 1.0 have been described elsewhere, such as Tracer, a tool that complements the object tracking functionality of CellProfiler, including visualization and editing of tracks (Bray and Carpenter, 2015), as well as workspaces for saving progress and display settings across sessions (Bray ). The website, manual and tutorials have been redesigned and updated to the new version.

3 Future directions

The redesigned CellProfiler Analyst contains useful classification and visualization features in an interactive interface that facilitates data analysis and exploration of biological images. Its code base forms a solid foundation for integrating new classifiers into the tool, potentially including deep learning architectures. We also intend to integrate methods for constructing per-sample ‘profiles’ from raw morphological measurements to support morphological profiling applications (Caicedo ; Bray ). Click here for additional data file.
  12 in total

1.  Workflow and metrics for image quality control in large-scale high-content screens.

Authors:  Mark-Anthony Bray; Adam N Fraser; Thomas P Hasaka; Anne E Carpenter
Journal:  J Biomol Screen       Date:  2011-09-28

2.  WND-CHARM: Multi-purpose image classification using compound image transforms.

Authors:  Nikita Orlov; Lior Shamir; Tomasz Macura; Josiah Johnston; D Mark Eckley; Ilya G Goldberg
Journal:  Pattern Recognit Lett       Date:  2008-01       Impact factor: 3.756

3.  Improved structure, function and compatibility for CellProfiler: modular high-throughput image analysis software.

Authors:  Lee Kamentsky; Thouis R Jones; Adam Fraser; Mark-Anthony Bray; David J Logan; Katherine L Madden; Vebjorn Ljosa; Curtis Rueden; Kevin W Eliceiri; Anne E Carpenter
Journal:  Bioinformatics       Date:  2011-02-23       Impact factor: 6.937

4.  Machine learning improves the precision and robustness of high-content screens: using nonlinear multiparametric methods to analyze screening results.

Authors:  Peter Horvath; Thomas Wild; Ulrike Kutay; Gabor Csucs
Journal:  J Biomol Screen       Date:  2011-08-01

Review 5.  Applications in image-based profiling of perturbations.

Authors:  Juan C Caicedo; Shantanu Singh; Anne E Carpenter
Journal:  Curr Opin Biotechnol       Date:  2016-04-17       Impact factor: 9.740

6.  Cell Painting, a high-content image-based assay for morphological profiling using multiplexed fluorescent dyes.

Authors:  Mark-Anthony Bray; Shantanu Singh; Han Han; Chadwick T Davis; Blake Borgeson; Cathy Hartland; Maria Kost-Alimova; Sigrun M Gustafsdottir; Christopher C Gibson; Anne E Carpenter
Journal:  Nat Protoc       Date:  2016-08-25       Impact factor: 13.491

7.  web cellHTS2: a web-application for the analysis of high-throughput screening data.

Authors:  Oliver Pelz; Moritz Gilsdorf; Michael Boutros
Journal:  BMC Bioinformatics       Date:  2010-04-12       Impact factor: 3.169

8.  Analysis of cell-based RNAi screens.

Authors:  Michael Boutros; Lígia P Brás; Wolfgang Huber
Journal:  Genome Biol       Date:  2006       Impact factor: 13.583

9.  CellProfiler Analyst: data exploration and analysis software for complex image-based screens.

Authors:  Thouis R Jones; In Han Kang; Douglas B Wheeler; Robert A Lindquist; Adam Papallo; David M Sabatini; Polina Golland; Anne E Carpenter
Journal:  BMC Bioinformatics       Date:  2008-11-15       Impact factor: 3.169

10.  CellProfiler Tracer: exploring and validating high-throughput, time-lapse microscopy image data.

Authors:  Mark-Anthony Bray; Anne E Carpenter
Journal:  BMC Bioinformatics       Date:  2015-11-04       Impact factor: 3.169

View more
  41 in total

1.  Chrysalis: A New Method for High-Throughput Histo-Cytometry Analysis of Images and Movies.

Authors:  Dmitri I Kotov; Thomas Pengo; Jason S Mitchell; Matthew J Gastinger; Marc K Jenkins
Journal:  J Immunol       Date:  2018-12-03       Impact factor: 5.422

2.  Persistent increases of PKMζ in memory-activated neurons trace LTP maintenance during spatial long-term memory storage.

Authors:  Changchi Hsieh; Panayiotis Tsokas; Alejandro Grau-Perales; Edith Lesburguères; Joseph Bukai; Kunal Khanna; Joelle Chorny; Ain Chung; Claudia Jou; Nesha S Burghardt; Christine A Denny; Rafael E Flores-Obando; Benjamin Rush Hartley; Laura Melissa Rodríguez Valencia; A Iván Hernández; Peter J Bergold; James E Cottrell; Juan Marcos Alarcon; André Antonio Fenton; Todd Charlton Sacktor
Journal:  Eur J Neurosci       Date:  2021-02-04       Impact factor: 3.386

3.  Macrophage Cytological Profiling and Anti-Inflammatory Drug Discovery.

Authors:  Tannia A Lau; Walter M Bray; R Scott Lokey
Journal:  Assay Drug Dev Technol       Date:  2019-01       Impact factor: 1.738

4.  Robust Cell Detection and Segmentation for Image Cytometry Reveal Th17 Cell Heterogeneity.

Authors:  Takahiro Tsujikawa; Guillaume Thibault; Vahid Azimi; Sam Sivagnanam; Grace Banik; Casey Means; Rie Kawashima; Daniel R Clayburgh; Joe W Gray; Lisa M Coussens; Young Hwan Chang
Journal:  Cytometry A       Date:  2019-02-04       Impact factor: 4.355

Review 5.  Towards systems tissue engineering: Elucidating the dynamics, spatial coordination, and individual cells driving emergent behaviors.

Authors:  Matthew S Hall; Joseph T Decker; Lonnie D Shea
Journal:  Biomaterials       Date:  2020-06-14       Impact factor: 12.479

6.  Biomedical Image Processing with Containers and Deep Learning: An Automated Analysis Pipeline: Data architecture, artificial intelligence, automated processing, containerization, and clusters orchestration ease the transition from data acquisition to insights in medium-to-large datasets.

Authors:  Germán González; Conor L Evans
Journal:  Bioessays       Date:  2019-05-16       Impact factor: 4.345

7.  Enhanced method to select human oogonial stem cells for fertility research.

Authors:  Russel C Sequeira; Sivanandane Sittadjody; Tracy Criswell; Anthony Atala; John D Jackson; James J Yoo
Journal:  Cell Tissue Res       Date:  2021-08-20       Impact factor: 5.249

Review 8.  Towards a Quantitative Understanding of Cell Identity.

Authors:  Zi Ye; Casim A Sarkar
Journal:  Trends Cell Biol       Date:  2018-10-08       Impact factor: 20.808

9.  Learning deep features for dead and living breast cancer cell classification without staining.

Authors:  Gisela Pattarone; Laura Acion; Marina Simian; Roland Mertelsmann; Marie Follo; Emmanuel Iarussi
Journal:  Sci Rep       Date:  2021-05-13       Impact factor: 4.379

Review 10.  Immune monitoring using mass cytometry and related high-dimensional imaging approaches.

Authors:  Felix J Hartmann; Sean C Bendall
Journal:  Nat Rev Rheumatol       Date:  2019-12-31       Impact factor: 20.543

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.