Literature DB >> 26163693

Application of learning to rank to protein remote homology detection.

Bin Liu1, Junjie Chen2, Xiaolong Wang3.   

Abstract

MOTIVATION: Protein remote homology detection is one of the fundamental problems in computational biology, aiming to find protein sequences in a database of known structures that are evolutionarily related to a given query protein. Some computational methods treat this problem as a ranking problem and achieve the state-of-the-art performance, such as PSI-BLAST, HHblits and ProtEmbed. This raises the possibility to combine these methods to improve the predictive performance. In this regard, we are to propose a new computational method called ProtDec-LTR for protein remote homology detection, which is able to combine various ranking methods in a supervised manner via using the Learning to Rank (LTR) algorithm derived from natural language processing.
RESULTS: Experimental results on a widely used benchmark dataset showed that ProtDec-LTR can achieve an ROC1 score of 0.8442 and an ROC50 score of 0.9023 outperforming all the individual predictors and some state-of-the-art methods. These results indicate that it is correct to treat protein remote homology detection as a ranking problem, and predictive performance improvement can be achieved by combining different ranking approaches in a supervised manner via using LTR.
AVAILABILITY AND IMPLEMENTATION: For users' convenience, the software tools of three basic ranking predictors and Learning to Rank algorithm were provided at http://bioinformatics.hitsz.edu.cn/ProtDec-LTR/home/ CONTACT: bliu@insun.hit.edu.cn SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2015        PMID: 26163693     DOI: 10.1093/bioinformatics/btv413

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  31 in total

1.  Identification of Bacterial Cell Wall Lyases via Pseudo Amino Acid Composition.

Authors:  Xin-Xin Chen; Hua Tang; Wen-Chao Li; Hao Wu; Wei Chen; Hui Ding; Hao Lin
Journal:  Biomed Res Int       Date:  2016-06-29       Impact factor: 3.411

2.  DNA binding protein identification by combining pseudo amino acid composition and profile-based protein representation.

Authors:  Bin Liu; Shanyi Wang; Xiaolong Wang
Journal:  Sci Rep       Date:  2015-10-20       Impact factor: 4.379

3.  Research on B Cell Algorithm for Learning to Rank Method Based on Parallel Strategy.

Authors:  Yuling Tian; Hongxian Zhang
Journal:  PLoS One       Date:  2016-08-03       Impact factor: 3.240

4.  Predicting cancerlectins by the optimal g-gap dipeptides.

Authors:  Hao Lin; Wei-Xin Liu; Jiao He; Xin-Hui Liu; Hui Ding; Wei Chen
Journal:  Sci Rep       Date:  2015-12-09       Impact factor: 4.379

5.  Using the Relevance Vector Machine Model Combined with Local Phase Quantization to Predict Protein-Protein Interactions from Protein Sequences.

Authors:  Ji-Yong An; Fan-Rong Meng; Zhu-Hong You; Yu-Hong Fang; Yu-Jun Zhao; Ming Zhang
Journal:  Biomed Res Int       Date:  2016-05-23       Impact factor: 3.411

6.  DephosSite: a machine learning approach for discovering phosphotase-specific dephosphorylation sites.

Authors:  Xiaofeng Wang; Renxiang Yan; Jiangning Song
Journal:  Sci Rep       Date:  2016-03-22       Impact factor: 4.379

7.  Identification of Multi-Functional Enzyme with Multi-Label Classifier.

Authors:  Yuxin Che; Ying Ju; Ping Xuan; Ren Long; Fei Xing
Journal:  PLoS One       Date:  2016-04-14       Impact factor: 3.240

8.  Improving classification of mature microRNA by solving class imbalance problem.

Authors:  Ying Wang; Xiaoye Li; Bairui Tao
Journal:  Sci Rep       Date:  2016-05-16       Impact factor: 4.379

9.  AnABlast: a new in silico strategy for the genome-wide search of novel genes and fossil regions.

Authors:  Juan Jimenez; Caia D S Duncan; María Gallardo; Juan Mata; Antonio J Perez-Pulido
Journal:  DNA Res       Date:  2015-10-21       Impact factor: 4.458

10.  Recombination spot identification Based on gapped k-mers.

Authors:  Rong Wang; Yong Xu; Bin Liu
Journal:  Sci Rep       Date:  2016-03-31       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.