Literature DB >> 15980577

Data mining tools for the Saccharomyces cerevisiae morphological database.

Taro L Saito1, Jun Sese, Yoichiro Nakatani, Fumi Sano, Masashi Yukawa, Yoshikazu Ohya, Shinichi Morishita.   

Abstract

For comprehensive understanding of precise morphological changes resulting from loss-of-function mutagenesis, a large collection of 1,899,247 cell images was assembled from 91,71 micrographs of 4782 budding yeast disruptants of non-lethal genes. All the cell images were processed computationally to measure approximately 500 morphological parameters in individual mutants. We have recently made this morphological quantitative data available to the public through the Saccharomyces cerevisiae Morphological Database (SCMD). Inspecting the significance of morphological discrepancies between the wild type and the mutants is expected to provide clues to uncover genes that are relevant to the biological processes producing a particular morphology. To facilitate such intensive data mining, a suite of new software tools for visualizing parameter value distributions was developed to present mutants with significant changes in easily understandable forms. In addition, for a given group of mutants associated with a particular function, the system automatically identifies a combination of multiple morphological parameters that discriminates a mutant group from others significantly, thereby characterizing the function effectively. These data mining functions are available through the World Wide Web at http://scmd.gi.k.u-tokyo.ac.jp/.

Entities:  

Mesh:

Year:  2005        PMID: 15980577      PMCID: PMC1160212          DOI: 10.1093/nar/gki451

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


MORPHOLOGICAL DATABASE

To study the global regulation of cell morphological characteristics, a number of groups have recently reported genome-wide screening data for yeast mutants with abnormal morphology (1–5). Despite the relatively simple ellipsoidal shape of yeast cells, in the past, cell morphology researchers processed information on cells manually. These time consuming, entirely subjective tasks motivated us to develop image-processing software called CalMorph (6), which automatically extracts yeast cells from micrographs and processes them to measure morphological characteristics such as cell size, roundness, bud neck position angle, nuclear position and actin localization. Using our software, we have retrieved 1 899 247 cells from 91 271 micrographs of 4782 mutants, which cover almost all of the yeast non-essential mutants cultured from the deleted strains available from EUROSCARF. All cell images, micrographs and quantitative values of morphological parameters are freely available from the SCMD database (7), which presents information that is complementary to the existing sequence and gene-expression databases (8–12).

CELL IMAGE PROCESSING

Our software processes micrographs of cells stained with fluorescein isothiocyanate–Concanavalin A (FITC-ConA) for cell wall identification, with DAPI to localize nuclei and with Rh-ph to visualize the actin distribution. The photos in Figure 1A show three images stained with the respective dyes. Figure 1B presents the result of combining three photos by superimposing images of the cell wall, nuclei and actin for individual cells.
Figure 1

Workflow of image processing and data mining. (A) Input photos of cells strained with FITC–ConA, DAPI and Rh-ph to visualize the cell wall, nuclei and actin distribution, respectively. (B) Superimposition of three micrographs for individual cells. (C) Image-processing results. (D) Several examples of ∼500 morphological parameters. (E) Data mining processes.

Figure 1C displays image-processing results. Our image-processing software first identifies the cell wall, attempts to fit an ellipse to each mother cell or bud and colors the cell wall green. The yellow lines show the long and short axes of the fitted ellipses. Bud necks that separate mother cells and buds are illustrated by using two red bullets. Identifying the cell wall makes it easier to determine information on the localization of nuclei and actin patches relative to the cell wall. In Figure 1C, nuclei and actin patches are represented using yellow and light blue bullets, respectively. Figure 1D shows the primary morphological parameters of cells. The quantitative values of these parameters may change slightly from cell to cell. To perform rigorous statistical analysis of the significance of morphological changes, we need to know the distribution of morphological parameter values for individual cells; this requires that we collect an ample number of image-processed cells and their parameter values. More than 200 image-processed cells were collected for each mutant using a sufficient number of micrographs. Then, ∼500 morphological parameters were calculated for the mutants.

DATA MINING

Since there are so many parameters and mutants, some tools for assisting with data mining tasks will help users.

Search

Morphological data should be useful for identifying the morphological changes in particular mutants. Users can query a yeast mutant of interest using its open reading frame name or its gene name. They can also browse average shapes of the mutant, average morphological parameter values, raw and processed micrographs and lists of individual cells associated with morphological parameter values. Users can also provide a typical morphological shape or a particular mutant as a query and ask the system to search for mutants that are similar in shape to the query. This function is called ‘morphology search’ (7).

Teardrop view—juxtaposition of morphological parameter distributions

In order to understand which morphological parameters of a particular mutant are abnormal, the system displays the distribution of all mutants for each parameter and highlights the focal mutant value in pink (see Figure 2). The system juxtaposes the distributions of all parameters in parallel, making it easy for users to comprehend the overview of distributions and abnormal parameters at a glance. Parameters are colored blue or pink if their changes are statistically significant in terms of their distributions.
Figure 2

Teardrop view juxtaposes the morphological parameter distributions of all parameters for all mutants and the wild-type HIS3 (YOR202w). For each morphological parameter, the distribution of all mutants and that of the wild type are displayed back-to-back in the upper and lower halves, respectively. The thin central line in each distribution represents the average. The pink dots in the distributions show the data for the focal mutant. Since some wild-type distributions are abnormal and are difficult to fit to any established statistical distribution, the statistical significance of a particular parameter value for a mutant is not assessed in terms of the P-value but is estimated using the SD-score (or Z-score), the difference between the parameter value and the average of the wild type divided by the standard deviation of the wild-type distribution. The degree of each SD-score is represented by its color.

Mutant classification in terms of morphological parameters

Another promising application of morphological parameters is to use them to predict gene functions. For instance, suppose that one is interested in finding a group of genes involved in a particular biological process such as DNA repairs and cell wall construction. You can ask the system to look for a combination of multiple morphological parameters that discriminate disruptants of genes that are known to be relevant to the biological process of interest (see Figure 3). These morphological parameters allow us to define distances between disruptants. If we identify disruptants that are not known to be related to any particular biological process but are closer to disruptants that are relevant to the focal biological process, these disrupted genes are potentially involved in the biological process.
Figure 3

Mutant classification in terms of morphological parameters. (A) Select a group of mutants such that the disrupted genes are involved in a biological process of interest. In the example, CAP1 (YKL007w) and CAP2 (YIL034c), capping protein and its beta subunit, are selected. (B) The system returns two morphological parameters that best discriminate CAP1 and CAP2, which are represented by two pink dots in the 2D plot. Light blue spots represent mutants, while blue spots are instances of the wild type. Each parameter dimension is associated with the Teardrop view of the morphological parameter distributions.

CUSTOMIZATION AND DATA AVAILABILITY

To facilitate customization according to users' interests for the ease of browsing, a dialog-based interface for the parameter selection page helps users choose parameters displayed in datasheets and are memorized in the system. The system also allows users to download the list of selected parameter values for selected mutants in the XML format or in tabular form. Users can also select particular mutants of interest so that they are always shown in Teardrop View and 2D plot.

UPDATES AND FUTURE DIRECTIONS

The web server currently presents morphological parameter values of disruptants of non-essential genes, but mutants of lethal genes will be processed and available in the future.
  12 in total

1.  Development of image processing program for yeast cell morphology.

Authors:  Miwaka Ohtani; Ayaka Saka; Fumi Sano; Yoshikazu Ohya; Shinichi Morishita
Journal:  J Bioinform Comput Biol       Date:  2004-01       Impact factor: 1.122

2.  Genomic scale mutant hunt identifies cell size homeostasis genes in S. cerevisiae.

Authors:  Jian Zhang; Colette Schneider; Lisa Ottmers; Robert Rodriguez; Audra Day; Jody Markwardt; Brandt L Schneider
Journal:  Curr Biol       Date:  2002-12-10       Impact factor: 10.834

3.  Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis.

Authors:  E A Winzeler; D D Shoemaker; A Astromoff; H Liang; K Anderson; B Andre; R Bangham; R Benito; J D Boeke; H Bussey; A M Chu; C Connelly; K Davis; F Dietrich; S W Dow; M El Bakkoury; F Foury; S H Friend; E Gentalen; G Giaever; J H Hegemann; T Jones; M Laub; H Liao; N Liebundguth; D J Lockhart; A Lucau-Danila; M Lussier; N M'Rabet; P Menard; M Mittmann; C Pai; C Rebischung; J L Revuelta; L Riles; C J Roberts; P Ross-MacDonald; B Scherens; M Snyder; S Sookhai-Mahadeo; R K Storms; S Véronneau; M Voet; G Volckaert; T R Ward; R Wysocki; G S Yen; K Yu; K Zimmermann; P Philippsen; M Johnston; R W Davis
Journal:  Science       Date:  1999-08-06       Impact factor: 47.728

4.  A genomic study of the bipolar bud site selection pattern in Saccharomyces cerevisiae.

Authors:  L Ni; M Snyder
Journal:  Mol Biol Cell       Date:  2001-07       Impact factor: 4.138

5.  yMGV: a cross-species expression data mining tool.

Authors:  Gaëlle Lelandais; Stéphane Le Crom; Frédéric Devaux; Stéphane Vialette; George M Church; Claude Jacq; Philippe Marc
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

6.  MIPS: analysis and annotation of proteins from whole genomes.

Authors:  H W Mewes; C Amid; R Arnold; D Frishman; U Güldener; G Mannhaupt; M Münsterkötter; P Pagel; N Strack; V Stümpflen; J Warfsmann; A Ruepp
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

7.  Functional profiling of the Saccharomyces cerevisiae genome.

Authors:  Guri Giaever; Angela M Chu; Li Ni; Carla Connelly; Linda Riles; Steeve Véronneau; Sally Dow; Ankuta Lucau-Danila; Keith Anderson; Bruno André; Adam P Arkin; Anna Astromoff; Mohamed El-Bakkoury; Rhonda Bangham; Rocio Benito; Sophie Brachat; Stefano Campanaro; Matt Curtiss; Karen Davis; Adam Deutschbauer; Karl-Dieter Entian; Patrick Flaherty; Francoise Foury; David J Garfinkel; Mark Gerstein; Deanna Gotte; Ulrich Güldener; Johannes H Hegemann; Svenja Hempel; Zelek Herman; Daniel F Jaramillo; Diane E Kelly; Steven L Kelly; Peter Kötter; Darlene LaBonte; David C Lamb; Ning Lan; Hong Liang; Hong Liao; Lucy Liu; Chuanyun Luo; Marc Lussier; Rong Mao; Patrice Menard; Siew Loon Ooi; Jose L Revuelta; Christopher J Roberts; Matthias Rose; Petra Ross-Macdonald; Bart Scherens; Greg Schimmack; Brenda Shafer; Daniel D Shoemaker; Sharon Sookhai-Mahadeo; Reginald K Storms; Jeffrey N Strathern; Giorgio Valle; Marleen Voet; Guido Volckaert; Ching-yun Wang; Teresa R Ward; Julie Wilhelmy; Elizabeth A Winzeler; Yonghong Yang; Grace Yen; Elaine Youngman; Kexin Yu; Howard Bussey; Jef D Boeke; Michael Snyder; Peter Philippsen; Ronald W Davis; Mark Johnston
Journal:  Nature       Date:  2002-07-25       Impact factor: 49.962

8.  Systematic identification of pathways that couple cell growth and division in yeast.

Authors:  Paul Jorgensen; Joy L Nishikawa; Bobby-Joe Breitkreutz; Mike Tyers
Journal:  Science       Date:  2002-06-27       Impact factor: 47.728

9.  The Yeast Resource Center Public Data Repository.

Authors:  Michael Riffle; Lars Malmström; Trisha N Davis
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

10.  CYGD: the Comprehensive Yeast Genome Database.

Authors:  U Güldener; M Münsterkötter; G Kastenmüller; N Strack; J van Helden; C Lemer; J Richelles; S J Wodak; J García-Martínez; J E Pérez-Ortín; H Michael; A Kaps; E Talla; B Dujon; B André; J L Souciet; J De Montigny; E Bon; C Gaillardin; H W Mewes
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more
  5 in total

1.  Evaluation of image processing programs for accurate measurement of budding and fission yeast morphology.

Authors:  Genjiro Suzuki; Hiroshi Sawai; Miwaka Ohtani; Satoru Nogami; Fumi Sano-Kumagai; Ayaka Saka; Masashi Yukawa; Taro L Saito; Jun Sese; Dai Hirata; Shinichi Morishita; Yoshikazu Ohya
Journal:  Curr Genet       Date:  2006-01-06       Impact factor: 3.886

2.  High-dimensional and large-scale phenotyping of yeast mutants.

Authors:  Yoshikazu Ohya; Jun Sese; Masashi Yukawa; Fumi Sano; Yoichiro Nakatani; Taro L Saito; Ayaka Saka; Tomoyuki Fukuda; Satoru Ishihara; Satomi Oka; Genjiro Suzuki; Machika Watanabe; Aiko Hirata; Miwaka Ohtani; Hiroshi Sawai; Nicolas Fraysse; Jean-Paul Latgé; Jean M François; Markus Aebi; Seiji Tanaka; Sachiko Muramatsu; Hiroyuki Araki; Kintake Sonoike; Satoru Nogami; Shinichi Morishita
Journal:  Proc Natl Acad Sci U S A       Date:  2005-12-19       Impact factor: 11.205

3.  Yeast Rrp14p is a nucleolar protein involved in both ribosome biogenesis and cell polarity.

Authors:  Hiroko Yamada; Chihiro Horigome; Takafumi Okada; Chiharu Shirai; Keiko Mizuta
Journal:  RNA       Date:  2007-09-05       Impact factor: 4.942

4.  Efficient framework for automated classification of subcellular patterns in budding yeast.

Authors:  Seungil Huh; Donghun Lee; Robert F Murphy
Journal:  Cytometry A       Date:  2009-11       Impact factor: 4.355

5.  An algorithm to automate yeast segmentation and tracking.

Authors:  Andreas Doncic; Umut Eser; Oguzhan Atay; Jan M Skotheim
Journal:  PLoS One       Date:  2013-03-08       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.