Literature DB >> 25758094

Application of data mining tools for classification of protein structural class from residue based averaged NMR chemical shifts.

Arun V Kumar1, Rehana F M Ali1, Yu Cao1, V V Krishnan2.   

Abstract

The number of protein sequences deriving from genome sequencing projects is outpacing our knowledge about the function of these proteins. With the gap between experimentally characterized and uncharacterized proteins continuing to widen, it is necessary to develop new computational methods and tools for protein structural information that is directly related to function. Nuclear magnetic resonance (NMR) provides powerful means to determine three-dimensional structures of proteins in the solution state. However, translation of the NMR spectral parameters to even low-resolution structural information such as protein class requires multiple time consuming steps. In this paper, we present an unorthodox method to predict the protein structural class directly by using the residue's averaged chemical shifts (ACS) based on machine learning algorithms. Experimental chemical shift information from 1491 proteins obtained from Biological Magnetic Resonance Bank (BMRB) and their respective protein structural classes derived from structural classification of proteins (SCOP) were used to construct a data set with 119 attributes and 5 different classes. Twenty four different classification schemes were evaluated using several performance measures. Overall the residue based ACS values can predict the protein structural classes with 80% accuracy measured by Matthew correlation coefficient. Specifically protein classes defined by mixed αβ or small proteins are classified with >90% correlation. Our results indicate that this NMR-based method can be utilized as a low-resolution tool for protein structural class identification without any prior chemical shift assignments.
Copyright © 2015 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Chemical shift; Data mining; NMR; Protein structural class

Mesh:

Substances:

Year:  2015        PMID: 25758094      PMCID: PMC4547871          DOI: 10.1016/j.bbapap.2015.02.016

Source DB:  PubMed          Journal:  Biochim Biophys Acta        ISSN: 0006-3002


  62 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The predictive value of microbiologic diagnostic tests if asymptomatic carriers are present.

Authors:  Ronny K Gunnarsson; Jan Lanke
Journal:  Stat Med       Date:  2002-06-30       Impact factor: 2.373

3.  RefDB: a database of uniformly referenced protein chemical shifts.

Authors:  Haiyan Zhang; Stephen Neal; David S Wishart
Journal:  J Biomol NMR       Date:  2003-03       Impact factor: 2.835

4.  Estimation of protein secondary structure content directly from NMR spectra using an improved empirical correlation with averaged chemical shift.

Authors:  S P Mielke; V V Krishnan
Journal:  J Struct Funct Genomics       Date:  2005-11-09

5.  Does secondary structure determine tertiary structure in proteins?

Authors:  Haipeng Gong; George D Rose
Journal:  Proteins       Date:  2005-11-01

6.  AutoSCOP: automated prediction of SCOP classifications using unique pattern-class mappings.

Authors:  Jan E Gewehr; Volker Hintermair; Ralf Zimmer
Journal:  Bioinformatics       Date:  2007-03-22       Impact factor: 6.937

7.  Protein fold recognition using sequence-derived predictions.

Authors:  D Fischer; D Eisenberg
Journal:  Protein Sci       Date:  1996-05       Impact factor: 6.725

8.  Incorporating secondary structural features into sequence information for predicting protein structural class.

Authors:  Bo Liao; Ting Peng; Haowen Chen; Yaping Lin
Journal:  Protein Pept Lett       Date:  2013-10       Impact factor: 1.890

9.  Protein backbone angle restraints from searching a database for chemical shift and sequence homology.

Authors:  G Cornilescu; F Delaglio; A Bax
Journal:  J Biomol NMR       Date:  1999-03       Impact factor: 2.835

10.  2DCSi: identification of protein secondary structure and redox state using 2D cluster analysis of NMR chemical shifts.

Authors:  Ching-Cheng Wang; Jui-Hung Chen; Wen-Chung Lai; Woei-Jer Chuang
Journal:  J Biomol NMR       Date:  2007-02-27       Impact factor: 2.582

View more
  3 in total

1.  Prediction of protein structural classes by different feature expressions based on 2-D wavelet denoising and fusion.

Authors:  Shunfang Wang; Xiaoheng Wang
Journal:  BMC Bioinformatics       Date:  2019-12-24       Impact factor: 3.169

2.  Using Recursive Feature Selection with Random Forest to Improve Protein Structural Class Prediction for Low-Similarity Sequences.

Authors:  Yaoxin Wang; Yingjie Xu; Zhenyu Yang; Xiaoqing Liu; Qi Dai
Journal:  Comput Math Methods Med       Date:  2021-05-07       Impact factor: 2.238

3.  Comparative Study on Feature Selection in Protein Structure and Function Prediction.

Authors:  Wenjing Yi; Ao Sun; Manman Liu; Xiaoqing Liu; Wei Zhang; Qi Dai
Journal:  Comput Math Methods Med       Date:  2022-10-11       Impact factor: 2.809

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.