Literature DB >> 19706744

A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation.

Qiwen Dong1, Shuigeng Zhou, Jihong Guan.   

Abstract

MOTIVATION: Fold recognition is an important step in protein structure and function prediction. Traditional sequence comparison methods fail to identify reliable homologies with low sequence identity, while the taxonomic methods are effective alternatives, but their prediction accuracies are around 70%, which are still relatively low for practical usage.
RESULTS: In this study, a simple and powerful method is presented for taxonomic fold recognition, which combines support vector machine (SVM) with autocross-covariance (ACC) transformation. The evolutionary information represented in the form of position-specific score matrices is converted into a series of fixed-length vectors by ACC transformation and these vectors are then input to a SVM classifier for fold recognition. The sequence-order effect can be effectively captured by this scheme. Experiments are performed on the widely used D-B dataset and the corresponding extended dataset, respectively. The proposed method, called ACCFold, gets an overall accuracy of 70.1% on the D-B dataset, which is higher than major existing taxonomic methods by 2-14%. Furthermore, the method achieves an overall accuracy of 87.6% on the extended dataset, which surpasses major existing taxonomic methods by 9-17%. Additionally, our method obtains an overall accuracy of 80.9% for 86-folds and 77.2% for 199-folds. These results demonstrate that the ACCFold method provides the state-of-the-art performance for taxonomic fold recognition. AVAILABILITY: The source code for ACC transformation is freely available at http://www.iipl.fudan.edu.cn/demo/accpkg.html.

Mesh:

Substances:

Year:  2009        PMID: 19706744     DOI: 10.1093/bioinformatics/btp500

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  51 in total

1.  Accurate prediction of bacterial type IV secreted effectors using amino acid composition and PSSM profiles.

Authors:  Lingyun Zou; Chonghan Nan; Fuquan Hu
Journal:  Bioinformatics       Date:  2013-09-23       Impact factor: 6.937

2.  MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters.

Authors:  Meng Zhang; Fuyi Li; Tatiana T Marquez-Lago; André Leier; Cunshuo Fan; Chee Keong Kwoh; Kuo-Chen Chou; Jiangning Song; Cangzhi Jia
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.937

3.  Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches.

Authors:  Jiawei Wang; Bingjiao Yang; Yi An; Tatiana Marquez-Lago; André Leier; Jonathan Wilksch; Qingyang Hong; Yang Zhang; Morihiro Hayashida; Tatsuya Akutsu; Geoffrey I Webb; Richard A Strugnell; Jiangning Song; Trevor Lithgow
Journal:  Brief Bioinform       Date:  2019-05-21       Impact factor: 11.622

4.  HLPI-Ensemble: Prediction of human lncRNA-protein interactions based on ensemble strategy.

Authors:  Huan Hu; Li Zhang; Haixin Ai; Hui Zhang; Yetian Fan; Qi Zhao; Hongsheng Liu
Journal:  RNA Biol       Date:  2018-06-06       Impact factor: 4.652

5.  PaCRISPR: a server for predicting and visualizing anti-CRISPR proteins.

Authors:  Jiawei Wang; Wei Dai; Jiahui Li; Ruopeng Xie; Rhys A Dunstan; Christopher Stubenrauch; Yanju Zhang; Trevor Lithgow
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

6.  iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Authors:  Zhen Chen; Pei Zhao; Chen Li; Fuyi Li; Dongxu Xiang; Yong-Zi Chen; Tatsuya Akutsu; Roger J Daly; Geoffrey I Webb; Quanzhi Zhao; Lukasz Kurgan; Jiangning Song
Journal:  Nucleic Acids Res       Date:  2021-06-04       Impact factor: 16.971

7.  Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework.

Authors:  Yanju Zhang; Ruopeng Xie; Jiawei Wang; André Leier; Tatiana T Marquez-Lago; Tatsuya Akutsu; Geoffrey I Webb; Kuo-Chen Chou; Jiangning Song
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

8.  FaaPred: a SVM-based prediction method for fungal adhesins and adhesin-like proteins.

Authors:  Jayashree Ramana; Dinesh Gupta
Journal:  PLoS One       Date:  2010-03-15       Impact factor: 3.240

9.  A novel fusion based on the evolutionary features for protein fold recognition using support vector machines.

Authors:  Mohammad Saleh Refahi; A Mir; Jalal A Nasiri
Journal:  Sci Rep       Date:  2020-09-01       Impact factor: 4.379

10.  Prediction of protein-protein interaction sites using an ensemble method.

Authors:  Lei Deng; Jihong Guan; Qiwen Dong; Shuigeng Zhou
Journal:  BMC Bioinformatics       Date:  2009-12-16       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.