Literature DB >> 20127740

Improving the accuracy of predicting disulfide connectivity by feature selection.

Lin Zhu1, Jie Yang, Jiang-Ning Song, Kuo-Chen Chou, Hong-Bin Shen.   

Abstract

Disulfide bonds are primary covalent cross-links formed between two cysteine residues in the same or different protein polypeptide chains, which play important roles in the folding and stability of proteins. However, computational prediction of disulfide connectivity directly from protein primary sequences is challenging due to the nonlocal nature of disulfide bonds in the context of sequences, and the number of possible disulfide patterns grows exponentially when the number of cysteine residues increases. In the previous studies, disulfide connectivity prediction was usually performed in high-dimensional feature space, which can cause a variety of problems in statistical learning, such as the dimension disaster, overfitting, and feature redundancy. In this study, we propose an efficient feature selection technique for analyzing the importance of each feature component. On the basis of this approach, we selected the most important features for predicting the connectivity pattern of intra-chain disulfide bonds. Our results have shown that the high-dimensional features contain redundant information, and the prediction performance can be further improved when these high-dimensional features are reduced to a lower but more compact dimensional space. Our results also indicate that the global protein features contribute little to the formation and prediction of disulfide bonds, while the local sequential and structural information play important roles. All these findings provide important insights for structural studies of disulfide-rich proteins. Copyright 2010 Wiley Periodicals, Inc.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20127740     DOI: 10.1002/jcc.21433

Source DB:  PubMed          Journal:  J Comput Chem        ISSN: 0192-8651            Impact factor:   3.376


  16 in total

1.  Accurate disulfide-bonding network predictions improve ab initio structure prediction of cysteine-rich proteins.

Authors:  Jing Yang; Bao-Ji He; Richard Jang; Yang Zhang; Hong-Bin Shen
Journal:  Bioinformatics       Date:  2015-08-07       Impact factor: 6.937

2.  Development of a high-dynamic range, GFP-based FRET probe sensitive to oxidative microenvironments.

Authors:  Vladimir L Kolossov; Bryan Q Spring; Robert M Clegg; Jennifer J Henry; Anna Sokolowski; Paul J A Kenis; H Rex Gaskins
Journal:  Exp Biol Med (Maywood)       Date:  2011-05-23

3.  IDDLncLoc: Subcellular Localization of LncRNAs Based on a Framework for Imbalanced Data Distributions.

Authors:  Yan Wang; Xiaopeng Zhu; Lili Yang; Xuemei Hu; Kai He; Cuinan Yu; Shaoqing Jiao; Jiali Chen; Rui Guo; Sen Yang
Journal:  Interdiscip Sci       Date:  2022-02-22       Impact factor: 2.233

4.  An integrative computational framework based on a two-step random forest algorithm improves prediction of zinc-binding sites in proteins.

Authors:  Cheng Zheng; Mingjun Wang; Kazuhiro Takemoto; Tatsuya Akutsu; Ziding Zhang; Jiangning Song
Journal:  PLoS One       Date:  2012-11-14       Impact factor: 3.240

5.  Prediction of bioluminescent proteins using auto covariance transformation of evolutional profiles.

Authors:  Xiaowei Zhao; Jiakui Li; Yanxin Huang; Zhiqiang Ma; Minghao Yin
Journal:  Int J Mol Sci       Date:  2012-03-19       Impact factor: 6.208

6.  Improved local ternary patterns for automatic target recognition in infrared imagery.

Authors:  Xiaosheng Wu; Junding Sun; Guoliang Fan; Zhiheng Wang
Journal:  Sensors (Basel)       Date:  2015-03-16       Impact factor: 3.576

Review 7.  Briefing in application of machine learning methods in ion channel prediction.

Authors:  Hao Lin; Wei Chen
Journal:  ScientificWorldJournal       Date:  2015-04-16

8.  Mining Chemical Activity Status from High-Throughput Screening Assays.

Authors:  Othman Soufan; Wail Ba-alawi; Moataz Afeef; Magbubah Essack; Valentin Rodionov; Panos Kalnis; Vladimir B Bajic
Journal:  PLoS One       Date:  2015-12-14       Impact factor: 3.240

9.  A new avenue for classification and prediction of olive cultivars using supervised and unsupervised algorithms.

Authors:  Amir H Beiki; Saba Saboor; Mansour Ebrahimi
Journal:  PLoS One       Date:  2012-09-05       Impact factor: 3.240

10.  PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites.

Authors:  Jiangning Song; Hao Tan; Andrew J Perry; Tatsuya Akutsu; Geoffrey I Webb; James C Whisstock; Robert N Pike
Journal:  PLoS One       Date:  2012-11-29       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.