Literature DB >> 17956879

Boosting multiclass learning with repeating codes and weak detectors for protein subcellular localization.

Chung-Chih Lin1, Yuh-Show Tsai, Yu-Shi Lin, Tai-Yu Chiu, Chia-Cheng Hsiung, May-I Lee, Jeremy C Simpson, Chun-Nan Hsu.   

Abstract

MOTIVATION: Determining locations of protein expression is essential to understand protein function. Advances in green fluorescence protein (GFP) fusion proteins and automated fluorescence microscopy allow for rapid acquisition of large collections of protein localization images. Recognition of these cell images requires an automated image analysis system. Approaches taken by previous work concentrated on designing a set of optimal features and then applying standard machine-learning algorithms. In fact, trends of recent advances in machine learning and computer vision can be applied to improve the performance. One trend is the advances in multiclass learning with error-correcting output codes (ECOC). Another trend is the use of a large number of weak detectors with boosting for detecting objects in images of real-world scenes.
RESULTS: We take advantage of these advances to propose a new learning algorithm, AdaBoost.ERC, coupled with weak and strong detectors, to improve the performance of automatic recognition of protein subcellular locations in cell images. We prepared two image data sets of CHO and Vero cells and downloaded a HeLa cell image data set in the public domain to evaluate our new method. We show that AdaBoost.ERC outperforms other AdaBoost extensions. We demonstrate the benefit of weak detectors by showing significant performance improvements over classifiers using only strong detectors. We also empirically test our method's capability of generalizing to heterogeneous image collections. Compared with previous work, our method performs reasonably well for the HeLa cell images. AVAILABILITY: CHO and Vero cell images, their corresponding feature sets (SSLF and WSLF), our new learning algorithm, AdaBoost.ERC, and Supplementary Material are available at http://aiia.iis.sinica.edu.tw/

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17956879     DOI: 10.1093/bioinformatics/btm497

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  Determining the subcellular location of new proteins from microscope images using local features.

Authors:  Luis Pedro Coelho; Joshua D Kangas; Armaghan W Naik; Elvira Osuna-Highley; Estelle Glory-Afshar; Margaret Fuhrman; Ramanuja Simha; Peter B Berget; Jonathan W Jarvik; Robert F Murphy
Journal:  Bioinformatics       Date:  2013-07-08       Impact factor: 6.937

2.  A spectral graph theoretic approach to quantification and calibration of collective morphological differences in cell images.

Authors:  Yu-Shi Lin; Chung-Chih Lin; Yuh-Show Tsai; Tien-Chuan Ku; Yi-Hung Huang; Chun-Nan Hsu
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

3.  Screening cellular feature measurements for image-based assay development.

Authors:  David J Logan; Anne E Carpenter
Journal:  J Biomol Screen       Date:  2010-06-01

4.  Ranking of multidimensional drug profiling data by fractional-adjusted bi-partitional scores.

Authors:  Dorit S Hochbaum; Chun-Nan Hsu; Yan T Yang
Journal:  Bioinformatics       Date:  2012-06-15       Impact factor: 6.937

5.  Isolation and Characterization of an LBD Transcription Factor CsLBD39 from Tea Plant (Camellia sinensis) and Its Roles in Modulating Nitrate Content by Regulating Nitrate-Metabolism-Related Genes.

Authors:  Rui-Min Teng; Ni Yang; Jing-Wen Li; Chun-Fang Liu; Yi Chen; Tong Li; Ya-Hui Wang; Ai-Sheng Xiong; Jing Zhuang
Journal:  Int J Mol Sci       Date:  2022-08-18       Impact factor: 6.208

6.  Introduction to the quantitative analysis of two-dimensional fluorescence microscopy images for cell-based screening.

Authors:  Vebjorn Ljosa; Anne E Carpenter
Journal:  PLoS Comput Biol       Date:  2009-12-24       Impact factor: 4.475

7.  MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy.

Authors:  Fan Yang; Yang Liu; Yanbin Wang; Zhijian Yin; Zhen Yang
Journal:  BMC Bioinformatics       Date:  2019-10-26       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.