M V Boland1, R F Murphy. 1. Center for Light Microscope Imaging and Biotechnology, Biomedical and Health Engineering Program, Carnegie Mellon University, 4400 Fifth Ave., Pittsburgh, PA 15213, USA.
Abstract
MOTIVATION: Assessment of protein subcellular location is crucial to proteomics efforts since localization information provides a context for a protein's sequence, structure, and function. The work described below is the first to address the subcellular localization of proteins in a quantitative, comprehensive manner. RESULTS: Images for ten different subcellular patterns (including all major organelles) were collected using fluorescence microscopy. The patterns were described using a variety of numeric features, including Zernike moments, Haralick texture features, and a set of new features developed specifically for this purpose. To test the usefulness of these features, they were used to train a neural network classifier. The classifier was able to correctly recognize an average of 83% of previously unseen cells showing one of the ten patterns. The same classifier was then used to recognize previously unseen sets of homogeneously prepared cells with 98% accuracy. AVAILABILITY: Algorithms were implemented using the commercial products Matlab, S-Plus, and SAS, as well as some functions written in C. The scripts and source code generated for this work are available at http://murphylab.web.cmu.edu/software. CONTACT: murphy@cmu.edu
MOTIVATION: Assessment of protein subcellular location is crucial to proteomics efforts since localization information provides a context for a protein's sequence, structure, and function. The work described below is the first to address the subcellular localization of proteins in a quantitative, comprehensive manner. RESULTS: Images for ten different subcellular patterns (including all major organelles) were collected using fluorescence microscopy. The patterns were described using a variety of numeric features, including Zernike moments, Haralick texture features, and a set of new features developed specifically for this purpose. To test the usefulness of these features, they were used to train a neural network classifier. The classifier was able to correctly recognize an average of 83% of previously unseen cells showing one of the ten patterns. The same classifier was then used to recognize previously unseen sets of homogeneously prepared cells with 98% accuracy. AVAILABILITY: Algorithms were implemented using the commercial products Matlab, S-Plus, and SAS, as well as some functions written in C. The scripts and source code generated for this work are available at http://murphylab.web.cmu.edu/software. CONTACT: murphy@cmu.edu
Authors: Qing Zhong; Alberto Giovanni Busetto; Juan P Fededa; Joachim M Buhmann; Daniel W Gerlich Journal: Nat Methods Date: 2012-05-27 Impact factor: 28.547
Authors: Michael Held; Michael H A Schmitz; Bernd Fischer; Thomas Walter; Beate Neumann; Michael H Olma; Matthias Peter; Jan Ellenberg; Daniel W Gerlich Journal: Nat Methods Date: 2010-08-08 Impact factor: 28.547
Authors: Lior Shamir; Nikita Orlov; David Mark Eckley; Tomasz J Macura; Ilya G Goldberg Journal: Med Biol Eng Comput Date: 2008-07-31 Impact factor: 2.602