Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure.

Literature DB >> 17942444

Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure.

Jiangning Song¹, Zheng Yuan, Hao Tan, Thomas Huber, Kevin Burrage.

Abstract

MOTIVATION: Disulfide bonds are primary covalent crosslinks between two cysteine residues in proteins that play critical roles in stabilizing the protein structures and are commonly found in extracy-toplasmatic or secreted proteins. In protein folding prediction, the localization of disulfide bonds can greatly reduce the search in conformational space. Therefore, there is a great need to develop computational methods capable of accurately predicting disulfide connectivity patterns in proteins that could have potentially important applications.
RESULTS: We have developed a novel method to predict disulfide connectivity patterns from protein primary sequence, using a support vector regression (SVR) approach based on multiple sequence feature vectors and predicted secondary structure by the PSIPRED program. The results indicate that our method could achieve a prediction accuracy of 74.4% and 77.9%, respectively, when averaged on proteins with two to five disulfide bridges using 4-fold cross-validation, measured on the protein and cysteine pair on a well-defined non-homologous dataset. We assessed the effects of different sequence encoding schemes on the prediction performance of disulfide connectivity. It has been shown that the sequence encoding scheme based on multiple sequence feature vectors coupled with predicted secondary structure can significantly improve the prediction accuracy, thus enabling our method to outperform most of other currently available predictors. Our work provides a complementary approach to the current algorithms that should be useful in computationally assigning disulfide connectivity patterns and helps in the annotation of protein sequences generated by large-scale whole-genome projects. AVAILABILITY: The prediction web server and Supplementary Material are accessible at http://foo.maths.uq.edu.au/~huber/disulfide

Entities: Chemical

Mesh：

Substances：
Disulfides

Year: 2007 PMID： 17942444 DOI： 10.1093/bioinformatics/btm505

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

21 in total

1. Accurate disulfide-bonding network predictions improve ab initio structure prediction of cysteine-rich proteins.

Authors: Jing Yang; Bao-Ji He; Richard Jang; Yang Zhang; Hong-Bin Shen
Journal: Bioinformatics Date: 2015-08-07 Impact factor: 6.937

2. Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches.

Authors: Jiawei Wang; Bingjiao Yang; Yi An; Tatiana Marquez-Lago; André Leier; Jonathan Wilksch; Qingyang Hong; Yang Zhang; Morihiro Hayashida; Tatsuya Akutsu; Geoffrey I Webb; Richard A Strugnell; Jiangning Song; Trevor Lithgow
Journal: Brief Bioinform Date: 2019-05-21 Impact factor: 11.622

3. Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework.

Authors: Yanju Zhang; Ruopeng Xie; Jiawei Wang; André Leier; Tatiana T Marquez-Lago; Tatsuya Akutsu; Geoffrey I Webb; Kuo-Chen Chou; Jiangning Song
Journal: Brief Bioinform Date: 2019-11-27 Impact factor: 11.622

4. DBCP: a web server for disulfide bonding connectivity pattern prediction without the prior knowledge of the bonding state of cysteines.

Authors: Hsuan-Hung Lin; Lin-Yu Tseng
Journal: Nucleic Acids Res Date: 2010-06-08 Impact factor: 16.971

5. APIS: accurate prediction of hot spots in protein interfaces by combining protrusion index with solvent accessibility.

Authors: Jun-Feng Xia; Xing-Ming Zhao; Jiangning Song; De-Shuang Huang
Journal: BMC Bioinformatics Date: 2010-04-08 Impact factor: 3.169

6. Learning gene regulatory networks from only positive and unlabeled data.

Authors: Luigi Cerulo; Charles Elkan; Michele Ceccarelli
Journal: BMC Bioinformatics Date: 2010-05-05 Impact factor: 3.169

7. An integrative computational framework based on a two-step random forest algorithm improves prediction of zinc-binding sites in proteins.

Authors: Cheng Zheng; Mingjun Wang; Kazuhiro Takemoto; Tatsuya Akutsu; Ziding Zhang; Jiangning Song
Journal: PLoS One Date: 2012-11-14 Impact factor: 3.240