Literature DB >> 20338898

EBImage--an R package for image processing with applications to cellular phenotypes.

Grégoire Pau1, Florian Fuchs, Oleg Sklyar, Michael Boutros, Wolfgang Huber.   

Abstract

SUMMARY: EBImage provides general purpose functionality for reading, writing, processing and analysis of images. Furthermore, in the context of microscopy-based cellular assays, EBImage offers tools to segment cells and extract quantitative cellular descriptors. This allows the automation of such tasks using the R programming language and use of existing tools in the R environment for signal processing, statistical modeling, machine learning and data visualization. AVAILABILITY: EBImage is free and open source, released under the LGPL license and available from the Bioconductor project (http://www.bioconductor.org/packages/release/bioc/html/EBImage.html).

Entities:  

Mesh:

Year:  2010        PMID: 20338898      PMCID: PMC2844988          DOI: 10.1093/bioinformatics/btq046

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Imaging cells labeled with specific markers is a powerful method to localize cellular structures and proteins, and to characterize cell morphological changes during population progression or induced by perturbing agents. Automated phenotyping from such images generates quantitative descriptors of cellular phenotypes, which are computationally analysed to infer biological roles or functional relationships. Recent examples include characterization of genes involved in cell division by analysing time-lapse image sequences of human cells (Neumann et al., 2006), estimation of drug effects based on phenotypic changes measured in human HeLa cells (Loo et al., 2007) and identification of genes involved in cell morphology in Drosophila using RNA interference (RNAi) (Kiger et al., 2003). Cell segmentation and feature extraction are well-established steps, realized by dedicated software such as CellProfiler (Carpenter et al., 2006) or generic image processing platforms like Matlab, Labview or ImageJ. However, the analysis and interpretation of multi-parametric cellular descriptors is a more challenging task. It requires powerful statistical and machine learning methods and can be facilitated by the possibility of producing visualizations of intermediate results, by the automation of complex workflows such as cross-validation or parameter searches, and by easy access to biological metadata and genomic databases. These points motivate the use of Bioconductor (Gentleman et al., 2004), a software project based on the feature-rich R programming language, providing tools for the analysis and comprehension of genomic data. Several R packages provide some level of functionality for processing and analysing images. Rimage offers diverse filtering functions but supports only the JPEG format and cannot save images. The package ripa is dedicated to the analysis of hyperspectral images but does not provide for image segmentation. biOps offers a wide range of image filters, but only supports the JPEG and TIFF formats and lacks a fast interactive display interface. The recent package RImageJ provides R bindings to ImageJ, but does not allow easy access to the image data by R. EBImage is an image processing toolbox for R, which has been developed over the past 4 years (Sklyar and Huber, 2006). The current release 3.0 is a major redesign whose features include multi-dimensional image processing, a range of fast image processing functions, support of more than 80 image formats, fast interactive image display, seamless integration with R's native array data structures and coherence of the user interface.

2 DESCRIPTION

Images are represented in EBImage as multi-dimensional arrays containing pixel intensity values. The two first dimensions are typically meant to be spatial, while the other ones are unspecified and can contain, e.g. colour channels, z-slices, replicates, time points or combinations of different conditions. Image representation is dissociated from rendering, and multi-dimensional arrays can be displayed as animated sequences of images in greyscale or colour mode. The interactive display interface is powered by GTK+ and supports animation, zoom and pan. As matrices, images can be manipulated in R with algebraic operators such as sum, product, comparison or convolution. These elementary operators allow a broad range of image transformations. For example, if we denote by x an image, α+βxγ is an enhanced image where the parameter α controls the brightness, β the contrast and γ the γ-factor of the transformed image. Another example includes adaptive thresholding, performed by x>x⋆m+μ, where ⋆ is the fast convolution product, m a neighbourhood mask and μ an offset parameter. R also offers statistical tools to model images with natural 2D splines and provides Fourier analysis tools to detect regular patterns and deconvolute noisy images using Wiener filters. EBImage uses ImageMagick to read and save images, and supports more than 80 image formats, including JPEG, TIFF, TGA, GIF and PNG. The package also supports standard geometric transformations such as rotation, reflection, cropping, translation and resizing. Classical image processing tools are available: linear filtering, morphological erosion and dilation, fast distance map computation, contour delineation and area filling. Object segmentation can be performed with global or adaptive thresholding followed by connected set labeling. Specific algorithms such as watershed transform or Voronoi segmentation (Jones et al., 2005) are provided to segment touching objects. Computation of geometric and texture features (image moments, Haralick features, Zernike moments) from segmented objects is supported.

3 ANALYSIS OF CELLULAR PHENOTYPES

RNAi is a powerful method to study the role of genes in loss-of-function phenotypes. We measured the effects of two RNAi reagents on human HeLa cells by fluorescence microscopy. One cell population was transfected by a negative control, siRluc, a small interfering RNA (siRNA) targeting the Renilla firefly luciferase gene that is not present in the HeLa genome. The other population was treated with siCLSPN, an siRNA targeting the CLSPN mRNA, whose protein is involved in DNA damage response mediation. Cells were grown for 48 h, stained with immunofluorescent markers and imaged (Fig. 1).
Fig. 1.

From microscope images to cellular phenotypes. (a) Fluorescent microscopy images from three channels of the same population of HeLa cells perturbed by siRluc. (b) A false colour image combining the actin (red), the tubulin (green) and the DNA (blue) channels. (c) Nuclei boundaries (yellow) were segmented with adaptive thresholding followed by connected set labeling. (d) Cell membranes (magenta) were determined by Voronoi segmentation. (e) Distribution of the cell sizes compared to a population of HeLa cells perturbed by siCLSPN. Cells treated with siCLSPN were significantly enlarged compared to those perturbed with siRluc (Wilcoxon rank sum test, P<10−15).

From microscope images to cellular phenotypes. (a) Fluorescent microscopy images from three channels of the same population of HeLa cells perturbed by siRluc. (b) A false colour image combining the actin (red), the tubulin (green) and the DNA (blue) channels. (c) Nuclei boundaries (yellow) were segmented with adaptive thresholding followed by connected set labeling. (d) Cell membranes (magenta) were determined by Voronoi segmentation. (e) Distribution of the cell sizes compared to a population of HeLa cells perturbed by siCLSPN. Cells treated with siCLSPN were significantly enlarged compared to those perturbed with siRluc (Wilcoxon rank sum test, P<10−15). For visualization, the three channels were combined (Fig. 1a) into a colour image (Fig. 1b). Nuclei were segmented by adaptive thresholding, morphological opening and connected set labeling (Fig. 1c). Cell boundaries were determined by Voronoi segmentation, using nuclei as seeds and propagating the boundaries using a Riemann metric based on the image gradient (Fig. 1d; Jones et al., 2005). Quantitative descriptors were extracted from the cell shapes and fluorescence distributions. Negative control cells treated with siRluc showed a median cell size of 1024 μm2, while targeting CLSPN led to a population of significantly enlarged cells with a median cell size of 1577 μm2 (Wilcoxon rank sum test, P<10−15) (Fig. 1e). The median Zernike (4,4) actin moment descriptor, capturing high-frequency radial structures, was also strongly discriminating between the two cell populations and can serve to characterize the actin stress fibers displayed by the siCLSPN perturbed cells.

4 CONCLUSION

Automated phenotyping of cells promises to be useful in many fields of biology. It relies on imaging, cell segmentation, feature extraction and statistical analysis. EBImage offers the essential functionality for performing these tasks. Future improvements are expected to include handling large multi-dimensional objects using NetCDF format, 3D objects (segmentation and reconstruction) and time-lapse image sequences (cell tracking and extraction of spatio-temporal features).
  5 in total

1.  High-throughput RNAi screening by time-lapse imaging of live human cells.

Authors:  Beate Neumann; Michael Held; Urban Liebel; Holger Erfle; Phill Rogers; Rainer Pepperkok; Jan Ellenberg
Journal:  Nat Methods       Date:  2006-05       Impact factor: 28.547

2.  Image-based multivariate profiling of drug responses from single cells.

Authors:  Lit-Hsin Loo; Lani F Wu; Steven J Altschuler
Journal:  Nat Methods       Date:  2007-04-01       Impact factor: 28.547

3.  Bioconductor: open software development for computational biology and bioinformatics.

Authors:  Robert C Gentleman; Vincent J Carey; Douglas M Bates; Ben Bolstad; Marcel Dettling; Sandrine Dudoit; Byron Ellis; Laurent Gautier; Yongchao Ge; Jeff Gentry; Kurt Hornik; Torsten Hothorn; Wolfgang Huber; Stefano Iacus; Rafael Irizarry; Friedrich Leisch; Cheng Li; Martin Maechler; Anthony J Rossini; Gunther Sawitzki; Colin Smith; Gordon Smyth; Luke Tierney; Jean Y H Yang; Jianhua Zhang
Journal:  Genome Biol       Date:  2004-09-15       Impact factor: 13.583

4.  CellProfiler: image analysis software for identifying and quantifying cell phenotypes.

Authors:  Anne E Carpenter; Thouis R Jones; Michael R Lamprecht; Colin Clarke; In Han Kang; Ola Friman; David A Guertin; Joo Han Chang; Robert A Lindquist; Jason Moffat; Polina Golland; David M Sabatini
Journal:  Genome Biol       Date:  2006-10-31       Impact factor: 13.583

5.  A functional genomic analysis of cell morphology using RNA interference.

Authors:  A A Kiger; B Baum; S Jones; M R Jones; A Coulson; C Echeverri; N Perrimon
Journal:  J Biol       Date:  2003-10-01
  5 in total
  173 in total

1.  Clonal structure of carcinogen-induced intestinal tumors in mice.

Authors:  Andrew T Thliveris; Linda Clipson; Alanna White; Jesse Waggoner; Lauren Plesh; Bridget L Skinner; Christopher D Zahm; Ruth Sullivan; William F Dove; Michael A Newton; Richard B Halberg
Journal:  Cancer Prev Res (Phila)       Date:  2011-06

2.  Tandem fluorescent protein timers for in vivo analysis of protein dynamics.

Authors:  Anton Khmelinskii; Philipp J Keller; Anna Bartosik; Matthias Meurer; Joseph D Barry; Balca R Mardin; Andreas Kaufmann; Susanne Trautmann; Malte Wachsmuth; Gislene Pereira; Wolfgang Huber; Elmar Schiebel; Michael Knop
Journal:  Nat Biotechnol       Date:  2012-06-24       Impact factor: 54.908

3.  Myocardial NF-κB activation is essential for zebrafish heart regeneration.

Authors:  Ravi Karra; Anne K Knecht; Kazu Kikuchi; Kenneth D Poss
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-15       Impact factor: 11.205

4.  The hepatokine FGF21 is crucial for peroxisome proliferator-activated receptor-α agonist-induced amelioration of metabolic disorders in obese mice.

Authors:  Tsuyoshi Goto; Mariko Hirata; Yumeko Aoki; Mari Iwase; Haruya Takahashi; Minji Kim; Yongjia Li; Huei-Fen Jheng; Wataru Nomura; Nobuyuki Takahashi; Chu-Sook Kim; Rina Yu; Shigeto Seno; Hideo Matsuda; Megumi Aizawa-Abe; Ken Ebihara; Nobuyuki Itoh; Teruo Kawada
Journal:  J Biol Chem       Date:  2017-04-12       Impact factor: 5.157

5.  High-throughput and quantitative approaches for measuring circadian rhythms in cyanobacteria using bioluminescence.

Authors:  Ryan K Shultzaberger; Mark L Paddock; Takeo Katsuki; Ralph J Greenspan; Susan S Golden
Journal:  Methods Enzymol       Date:  2014-12-26       Impact factor: 1.600

6.  Directional tissue migration through a self-generated chemokine gradient.

Authors:  Erika Donà; Joseph D Barry; Guillaume Valentin; Charlotte Quirin; Anton Khmelinskii; Andreas Kunze; Sevi Durdu; Lionel R Newton; Ana Fernandez-Minan; Wolfgang Huber; Michael Knop; Darren Gilmour
Journal:  Nature       Date:  2013-09-25       Impact factor: 49.962

7.  Post-Translational Tubulin Modifications in Human Astrocyte Cultures.

Authors:  V Bleu Knight; Elba E Serrano
Journal:  Neurochem Res       Date:  2017-05-17       Impact factor: 3.996

8.  Automated Microscopic Analysis of Metal Sulfide Colonization by Acidophilic Microorganisms.

Authors:  Sören Bellenberg; Antoine Buetti-Dinh; Vanni Galli; Olga Ilie; Malte Herold; Stephan Christel; Mariia Boretska; Igor V Pivkin; Paul Wilmes; Wolfgang Sand; Mario Vera; Mark Dopson
Journal:  Appl Environ Microbiol       Date:  2018-10-01       Impact factor: 4.792

9.  Physiological and pathophysiological characteristics of ataxin-3 isoforms.

Authors:  Daniel Weishäupl; Juliane Schneider; Barbara Peixoto Pinheiro; Corinna Ruess; Sandra Maria Dold; Felix von Zweydorf; Christian Johannes Gloeckner; Jana Schmidt; Olaf Riess; Thorsten Schmidt
Journal:  J Biol Chem       Date:  2018-11-19       Impact factor: 5.157

10.  Image Segmentation, Registration and Characterization in R with SimpleITK.

Authors:  Richard Beare; Bradley Lowekamp; Ziv Yaniv
Journal:  J Stat Softw       Date:  2018-09-04       Impact factor: 6.440

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.