Incorporating organelle correlations into semi-supervised learning for protein subcellular localization prediction.

Ying-Ying Xu, Fan Yang, Hong-Bin Shen.

Abstract

MOTIVATION: Bioimages of subcellular protein distribution, as a new data source, have attracted much attention in the field of automated protein subcellular localization prediction. The performance of existing systems is significantly limited by the small number of high-quality images with explicit annotations, which results in a small-sample-size learning problem. This limitation is more serious for multi-location proteins that co-exist in two or more organelles, because such proteins are difficult to annotate accurately by biological experiments or automated systems.
RESULTS: In this study, we designed a new protein subcellular localization prediction pipeline to address the small-sample-size learning and multi-location protein annotation problems. Five semi-supervised algorithms that can make use of lower-quality data were integrated, and a new multi-label classification approach that incorporates the correlations among different organelles in cells was proposed. The organelle correlations were modeled by a Bayesian network, and the topology of the correlation graph was used to guide the order in which the binary classifiers are trained in the multi-label classification, so that the order reflects the label dependence relationships. The proposed protocol was applied to both immunohistochemistry and immunofluorescence images, and our experimental results demonstrated its efficiency.
AVAILABILITY AND IMPLEMENTATION: The datasets and code are available at: www.csbio.sjtu.edu.cn/bioinf/CorrASemiB
CONTACT: hbshen@sjtu.edu.cn
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
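
The RESULTS paragraph describes using the topology of an organelle correlation graph to decide the order in which per-organelle binary classifiers are trained. The following is a minimal sketch of that general idea, not the authors' implementation. It assumes the correlation graph is already available as a directed acyclic graph (the abstract says it is modeled by a Bayesian network, which is not reproduced here). The organelle names, the edges in `parents`, the toy 1-nearest-neighbour classifier, and all function names are hypothetical and chosen only for illustration.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical correlation DAG: each label maps to the set of labels it
# depends on. These edges are illustrative assumptions, not taken from the paper.
parents = {
    "nucleus": set(),
    "cytoplasm": {"nucleus"},
    "golgi": {"cytoplasm"},
    "vesicles": {"golgi", "cytoplasm"},
}

def chain_order(parents):
    """Return the labels ordered so that every label follows its parents."""
    return list(TopologicalSorter(parents).static_order())

def fit_1nn(rows, targets):
    """Toy 1-nearest-neighbour binary classifier, a stand-in for a real model."""
    def predict(x):
        dists = [sum((a - b) ** 2 for a, b in zip(x, r)) for r in rows]
        return targets[dists.index(min(dists))]
    return predict

def train_chain(X, Y, parents):
    """Train one binary classifier per label, in graph order.

    Each classifier sees the image features plus the true labels of its
    parent organelles, so label dependence is passed down the graph.
    """
    order = chain_order(parents)
    models = {}
    for label in order:
        ps = sorted(parents[label])
        rows = [x + [y[p] for p in ps] for x, y in zip(X, Y)]
        models[label] = fit_1nn(rows, [y[label] for y in Y])
    return order, models

def predict_chain(x, order, models, parents):
    """Predict labels in graph order, feeding parent predictions forward."""
    pred = {}
    for label in order:
        ps = sorted(parents[label])
        pred[label] = models[label](x + [pred[p] for p in ps])
    return pred

# Two made-up training images, each with two features and four labels.
X = [[0.0, 0.0], [1.0, 1.0]]
Y = [
    {"nucleus": 1, "cytoplasm": 0, "golgi": 0, "vesicles": 0},
    {"nucleus": 0, "cytoplasm": 1, "golgi": 1, "vesicles": 1},
]
order, models = train_chain(X, Y, parents)
result = predict_chain([0.1, 0.0], order, models, parents)
```

Because each classifier receives its parents' outputs as extra inputs, a protein predicted to be in one organelle can raise or lower the predicted chance of a correlated organelle, which is the label dependence the abstract refers to. In practice the toy classifier would be replaced by a stronger model, and the graph would be learned from annotated data rather than written by hand.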

Year:  2016        PMID: 27153655     DOI: 10.1093/bioinformatics/btw219

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  9 in total

1.  eccCL: parallelized GPU implementation of Ensemble Classifier Chains.

Authors:  Mona Riemenschneider; Alexander Herbst; Ari Rasch; Sergei Gorlatch; Dominik Heider
Journal:  BMC Bioinformatics       Date:  2017-08-17       Impact factor: 3.169

2.  PlasmoSEP: Predicting surface-exposed proteins on the malaria parasite using semisupervised self-training and expert-annotated data.

Authors:  Yasser El-Manzalawy; Elyse E Munoz; Scott E Lindner; Vasant Honavar
Journal:  Proteomics       Date:  2016-11-21       Impact factor: 3.984

3.  Positive-unlabelled learning of glycosylation sites in the human proteome.

Authors:  Fuyi Li; Yang Zhang; Anthony W Purcell; Geoffrey I Webb; Kuo-Chen Chou; Trevor Lithgow; Chen Li; Jiangning Song
Journal:  BMC Bioinformatics       Date:  2019-03-06       Impact factor: 3.169

4.  Identify RNA-associated subcellular localizations based on multi-label learning using Chou's 5-steps rule.

Authors:  Hao Wang; Yijie Ding; Jijun Tang; Quan Zou; Fei Guo
Journal:  BMC Genomics       Date:  2021-01-15       Impact factor: 3.969

5.  Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information.

Authors:  Ge Wang; Yu-Jia Zhai; Zhen-Zhen Xue; Ying-Ying Xu
Journal:  Biomolecules       Date:  2021-10-29

6.  Protein subnuclear localization based on a new effective representation and intelligent kernel linear discriminant analysis by dichotomous greedy genetic algorithm.

Authors:  Shunfang Wang; Yaoting Yue
Journal:  PLoS One       Date:  2018-04-12       Impact factor: 3.240

7.  PSIONplusm Server for Accurate Multi-Label Prediction of Ion Channels and Their Types.

Authors:  Jianzhao Gao; Hong Wei; Alberto Cano; Lukasz Kurgan
Journal:  Biomolecules       Date:  2020-06-07

8.  A reference library for assigning protein subcellular localizations by image-based machine learning.

Authors:  Wiebke Schormann; Santosh Hariharan; David W Andrews
Journal:  J Cell Biol       Date:  2020-03-02       Impact factor: 10.539

9.  MIC_Locator: a novel image-based protein subcellular location multi-label prediction model based on multi-scale monogenic signal representation and intensity encoding strategy.

Authors:  Fan Yang; Yang Liu; Yanbin Wang; Zhijian Yin; Zhen Yang
Journal:  BMC Bioinformatics       Date:  2019-10-26       Impact factor: 3.169

