Literature DB >> 25449328

mPLR-Loc: an adaptive decision multi-label classifier based on penalized logistic regression for protein subcellular localization prediction.

Shibiao Wan1, Man-Wai Mak2, Sun-Yuan Kung3.   

Abstract

Proteins located in appropriate cellular compartments are of paramount importance to exert their biological functions. Prediction of protein subcellular localization by computational methods is required in the post-genomic era. Recent studies have been focusing on predicting not only single-location proteins but also multi-location proteins. However, most of the existing predictors are far from effective for tackling the challenges of multi-label proteins. This article proposes an efficient multi-label predictor, namely mPLR-Loc, based on penalized logistic regression and adaptive decisions for predicting both single- and multi-location proteins. Specifically, for each query protein, mPLR-Loc exploits the information from the Gene Ontology (GO) database by using its accession number (AC) or the ACs of its homologs obtained via BLAST. The frequencies of GO occurrences are used to construct feature vectors, which are then classified by an adaptive decision-based multi-label penalized logistic regression classifier. Experimental results based on two recent stringent benchmark datasets (virus and plant) show that mPLR-Loc remarkably outperforms existing state-of-the-art multi-label predictors. In addition to being able to rapidly and accurately predict subcellular localization of single- and multi-label proteins, mPLR-Loc can also provide probabilistic confidence scores for the prediction decisions. For readers' convenience, the mPLR-Loc server is available online (http://bioinfo.eie.polyu.edu.hk/mPLRLocServer).
Copyright © 2014 Elsevier Inc. All rights reserved.

Keywords:  Adaptive decision; Logistic regression; Multi-label classification; Multi-location proteins; Protein subcellular localization

Mesh:

Substances:

Year:  2014        PMID: 25449328     DOI: 10.1016/j.ab.2014.10.014

Source DB:  PubMed          Journal:  Anal Biochem        ISSN: 0003-2697            Impact factor:   3.365


  11 in total

1.  Subcellular location prediction of apoptosis proteins using two novel feature extraction methods based on evolutionary information and LDA.

Authors:  Lei Du; Qingfang Meng; Yuehui Chen; Peng Wu
Journal:  BMC Bioinformatics       Date:  2020-05-24       Impact factor: 3.169

2.  Sparse regressions for predicting and interpreting subcellular localization of multi-label proteins.

Authors:  Shibiao Wan; Man-Wai Mak; Sun-Yuan Kung
Journal:  BMC Bioinformatics       Date:  2016-02-24       Impact factor: 3.169

3.  Prediction of subcellular location of apoptosis proteins by incorporating PsePSSM and DCCA coefficient based on LFDA dimensionality reduction.

Authors:  Bin Yu; Shan Li; Wenying Qiu; Minghui Wang; Junwei Du; Yusen Zhang; Xing Chen
Journal:  BMC Genomics       Date:  2018-06-19       Impact factor: 3.969

4.  Use of Chou's 5-steps rule to predict the subcellular localization of gram-negative and gram-positive bacterial proteins by multi-label learning based on gene ontology annotation and profile alignment.

Authors:  Hafida Bouziane; Abdallah Chouarfia
Journal:  J Integr Bioinform       Date:  2020-06-29

5.  BERT-m7G: A Transformer Architecture Based on BERT and Stacking Ensemble to Identify RNA N7-Methylguanosine Sites from Sequence Information.

Authors:  Lu Zhang; Xinyi Qin; Min Liu; Guangzhong Liu; Yuxiao Ren
Journal:  Comput Math Methods Med       Date:  2021-08-25       Impact factor: 2.238

6.  Predicting the multi-label protein subcellular localization through multi-information fusion and MLSI dimensionality reduction based on MLFE classifier.

Authors:  Yushuang Liu; Shuping Jin; Hongli Gao; Xue Wang; Congjing Wang; Weifeng Zhou; Bin Yu
Journal:  Bioinformatics       Date:  2021-12-02       Impact factor: 6.937

7.  Benchmark data for identifying multi-functional types of membrane proteins.

Authors:  Shibiao Wan; Man-Wai Mak; Sun-Yuan Kung
Journal:  Data Brief       Date:  2016-05-21

8.  Protein sequence information extraction and subcellular localization prediction with gapped k-Mer method.

Authors:  Yu-Hua Yao; Ya-Ping Lv; Ling Li; Hui-Min Xu; Bin-Bin Ji; Jing Chen; Chun Li; Bo Liao; Xu-Ying Nan
Journal:  BMC Bioinformatics       Date:  2019-12-30       Impact factor: 3.169

9.  Metabolic pathway inference using multi-label classification with rich pathway features.

Authors:  Abdur Rahman M A Basher; Ryan J McLaughlin; Steven J Hallam
Journal:  PLoS Comput Biol       Date:  2020-10-01       Impact factor: 4.475

10.  HumDLoc: Human Protein Subcellular Localization Prediction Using Deep Neural Network.

Authors:  Rahul Semwal; Pritish Kumar Varadwaj
Journal:  Curr Genomics       Date:  2020-11       Impact factor: 2.236

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.