| Literature DB >> 15980577 |
Taro L Saito1, Jun Sese, Yoichiro Nakatani, Fumi Sano, Masashi Yukawa, Yoshikazu Ohya, Shinichi Morishita.
Abstract
For comprehensive understanding of precise morphological changes resulting from loss-of-function mutagenesis, a large collection of 1,899,247 cell images was assembled from 91,71 micrographs of 4782 budding yeast disruptants of non-lethal genes. All the cell images were processed computationally to measure approximately 500 morphological parameters in individual mutants. We have recently made this morphological quantitative data available to the public through the Saccharomyces cerevisiae Morphological Database (SCMD). Inspecting the significance of morphological discrepancies between the wild type and the mutants is expected to provide clues to uncover genes that are relevant to the biological processes producing a particular morphology. To facilitate such intensive data mining, a suite of new software tools for visualizing parameter value distributions was developed to present mutants with significant changes in easily understandable forms. In addition, for a given group of mutants associated with a particular function, the system automatically identifies a combination of multiple morphological parameters that discriminates a mutant group from others significantly, thereby characterizing the function effectively. These data mining functions are available through the World Wide Web at http://scmd.gi.k.u-tokyo.ac.jp/.Entities:
Mesh:
Year: 2005 PMID: 15980577 PMCID: PMC1160212 DOI: 10.1093/nar/gki451
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Workflow of image processing and data mining. (A) Input photos of cells strained with FITC–ConA, DAPI and Rh-ph to visualize the cell wall, nuclei and actin distribution, respectively. (B) Superimposition of three micrographs for individual cells. (C) Image-processing results. (D) Several examples of ∼500 morphological parameters. (E) Data mining processes.
Figure 2Teardrop view juxtaposes the morphological parameter distributions of all parameters for all mutants and the wild-type HIS3 (YOR202w). For each morphological parameter, the distribution of all mutants and that of the wild type are displayed back-to-back in the upper and lower halves, respectively. The thin central line in each distribution represents the average. The pink dots in the distributions show the data for the focal mutant. Since some wild-type distributions are abnormal and are difficult to fit to any established statistical distribution, the statistical significance of a particular parameter value for a mutant is not assessed in terms of the P-value but is estimated using the SD-score (or Z-score), the difference between the parameter value and the average of the wild type divided by the standard deviation of the wild-type distribution. The degree of each SD-score is represented by its color.
Figure 3Mutant classification in terms of morphological parameters. (A) Select a group of mutants such that the disrupted genes are involved in a biological process of interest. In the example, CAP1 (YKL007w) and CAP2 (YIL034c), capping protein and its beta subunit, are selected. (B) The system returns two morphological parameters that best discriminate CAP1 and CAP2, which are represented by two pink dots in the 2D plot. Light blue spots represent mutants, while blue spots are instances of the wild type. Each parameter dimension is associated with the Teardrop view of the morphological parameter distributions.