Literature DB >> 17942444

Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure.

Jiangning Song1, Zheng Yuan, Hao Tan, Thomas Huber, Kevin Burrage.   

Abstract

MOTIVATION: Disulfide bonds are primary covalent crosslinks between two cysteine residues in proteins that play critical roles in stabilizing the protein structures and are commonly found in extracy-toplasmatic or secreted proteins. In protein folding prediction, the localization of disulfide bonds can greatly reduce the search in conformational space. Therefore, there is a great need to develop computational methods capable of accurately predicting disulfide connectivity patterns in proteins that could have potentially important applications.
RESULTS: We have developed a novel method to predict disulfide connectivity patterns from protein primary sequence, using a support vector regression (SVR) approach based on multiple sequence feature vectors and predicted secondary structure by the PSIPRED program. The results indicate that our method could achieve a prediction accuracy of 74.4% and 77.9%, respectively, when averaged on proteins with two to five disulfide bridges using 4-fold cross-validation, measured on the protein and cysteine pair on a well-defined non-homologous dataset. We assessed the effects of different sequence encoding schemes on the prediction performance of disulfide connectivity. It has been shown that the sequence encoding scheme based on multiple sequence feature vectors coupled with predicted secondary structure can significantly improve the prediction accuracy, thus enabling our method to outperform most of other currently available predictors. Our work provides a complementary approach to the current algorithms that should be useful in computationally assigning disulfide connectivity patterns and helps in the annotation of protein sequences generated by large-scale whole-genome projects. AVAILABILITY: The prediction web server and Supplementary Material are accessible at http://foo.maths.uq.edu.au/~huber/disulfide

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17942444     DOI: 10.1093/bioinformatics/btm505

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  21 in total

1.  Accurate disulfide-bonding network predictions improve ab initio structure prediction of cysteine-rich proteins.

Authors:  Jing Yang; Bao-Ji He; Richard Jang; Yang Zhang; Hong-Bin Shen
Journal:  Bioinformatics       Date:  2015-08-07       Impact factor: 6.937

2.  Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches.

Authors:  Jiawei Wang; Bingjiao Yang; Yi An; Tatiana Marquez-Lago; André Leier; Jonathan Wilksch; Qingyang Hong; Yang Zhang; Morihiro Hayashida; Tatsuya Akutsu; Geoffrey I Webb; Richard A Strugnell; Jiangning Song; Trevor Lithgow
Journal:  Brief Bioinform       Date:  2019-05-21       Impact factor: 11.622

3.  Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework.

Authors:  Yanju Zhang; Ruopeng Xie; Jiawei Wang; André Leier; Tatiana T Marquez-Lago; Tatsuya Akutsu; Geoffrey I Webb; Kuo-Chen Chou; Jiangning Song
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

4.  DBCP: a web server for disulfide bonding connectivity pattern prediction without the prior knowledge of the bonding state of cysteines.

Authors:  Hsuan-Hung Lin; Lin-Yu Tseng
Journal:  Nucleic Acids Res       Date:  2010-06-08       Impact factor: 16.971

5.  APIS: accurate prediction of hot spots in protein interfaces by combining protrusion index with solvent accessibility.

Authors:  Jun-Feng Xia; Xing-Ming Zhao; Jiangning Song; De-Shuang Huang
Journal:  BMC Bioinformatics       Date:  2010-04-08       Impact factor: 3.169

6.  Learning gene regulatory networks from only positive and unlabeled data.

Authors:  Luigi Cerulo; Charles Elkan; Michele Ceccarelli
Journal:  BMC Bioinformatics       Date:  2010-05-05       Impact factor: 3.169

7.  An integrative computational framework based on a two-step random forest algorithm improves prediction of zinc-binding sites in proteins.

Authors:  Cheng Zheng; Mingjun Wang; Kazuhiro Takemoto; Tatsuya Akutsu; Ziding Zhang; Jiangning Song
Journal:  PLoS One       Date:  2012-11-14       Impact factor: 3.240

8.  FunSAV: predicting the functional effect of single amino acid variants using a two-stage random forest model.

Authors:  Mingjun Wang; Xing-Ming Zhao; Kazuhiro Takemoto; Haisong Xu; Yuan Li; Tatsuya Akutsu; Jiangning Song
Journal:  PLoS One       Date:  2012-08-24       Impact factor: 3.240

9.  Prodepth: predict residue depth by support vector regression approach from protein sequences only.

Authors:  Jiangning Song; Hao Tan; Khalid Mahmood; Ruby H P Law; Ashley M Buckle; Geoffrey I Webb; Tatsuya Akutsu; James C Whisstock
Journal:  PLoS One       Date:  2009-09-17       Impact factor: 3.240

10.  Prediction of protein binding sites in protein structures using hidden Markov support vector machine.

Authors:  Bin Liu; Xiaolong Wang; Lei Lin; Buzhou Tang; Qiwen Dong; Xuan Wang
Journal:  BMC Bioinformatics       Date:  2009-11-20       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.