Literature DB >> 27559156

KScons: a Bayesian approach for protein residue contact prediction using the knob-socket model of protein tertiary structure.

Qiwei Li1, David B Dahl2, Marina Vannucci1, Hyun Joo3, Jerry W Tsai3.   

Abstract

MOTIVATION: By simplifying the many-bodied complexity of residue packing into patterns of simple pairwise secondary structure interactions between a single knob residue with a three-residue socket, the knob-socket construct allows a more direct incorporation of structural information into the prediction of residue contacts. By modeling the preferences between the amino acid composition of a socket and knob, we undertake an investigation of the knob-socket construct's ability to improve the prediction of residue contacts. The statistical model considers three priors and two posterior estimations to better understand how the input data affects predictions. This produces six implementations of KScons that are tested on three sets: PSICOV, CASP10 and CASP11. We compare against the current leading contact prediction methods.
RESULTS: The results demonstrate the usefulness as well as the limits of knob-socket based structural modeling of protein contacts. The construct is able to extract good predictions from known structural homologs, while its performance degrades when no homologs exist. Among our six implementations, KScons MST-MP (which uses the multiple structure alignment prior and marginal posterior incorporating structural homolog information) performs the best in all three prediction sets. An analysis of recall and precision finds that KScons MST-MP improves accuracy not only by improving identification of true positives, but also by decreasing the number of false positives. Over the CASP10 and CASP11 sets, KScons MST-MP performs better than the leading methods using only evolutionary coupling data, but not quite as well as the supervised learning methods of MetaPSICOV and CoinDCA-NN that incorporate a large set of structural features. CONTACT: qiwei.li@rice.eduSupplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27559156      PMCID: PMC6078168          DOI: 10.1093/bioinformatics/btw553

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  33 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  New encouraging developments in contact prediction: Assessment of the CASP11 results.

Authors:  Bohdan Monastyrskyy; Daniel D'Andrea; Krzysztof Fidelis; Anna Tramontano; Andriy Kryshtafovych
Journal:  Proteins       Date:  2015-11-17

3.  Two crystal structures of the B1 immunoglobulin-binding domain of streptococcal protein G and comparison with NMR.

Authors:  T Gallagher; P Alexander; P Bryan; G L Gilliland
Journal:  Biochemistry       Date:  1994-04-19       Impact factor: 3.162

4.  An amino acid code to define a protein's tertiary packing surface.

Authors:  Keith J Fraga; Hyun Joo; Jerry Tsai
Journal:  Proteins       Date:  2015-12-22

5.  An amino acid code for β-sheet packing structure.

Authors:  Hyun Joo; Jerry Tsai
Journal:  Proteins       Date:  2014-04-16

6.  Three-dimensional structures of membrane proteins from genomic sequencing.

Authors:  Thomas A Hopf; Lucy J Colwell; Robert Sheridan; Burkhard Rost; Chris Sander; Debora S Marks
Journal:  Cell       Date:  2012-05-10       Impact factor: 41.582

7.  Improved residue contact prediction using support vector machines and a large feature set.

Authors:  Jianlin Cheng; Pierre Baldi
Journal:  BMC Bioinformatics       Date:  2007-04-02       Impact factor: 3.169

8.  MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.

Authors:  David T Jones; Tanya Singh; Tomasz Kosciolek; Stuart Tetchner
Journal:  Bioinformatics       Date:  2014-11-26       Impact factor: 6.937

9.  Accurate contact predictions using covariation techniques and machine learning.

Authors:  Tomasz Kosciolek; David T Jones
Journal:  Proteins       Date:  2015-08-14

10.  MUSCLE: a multiple sequence alignment method with reduced time and space complexity.

Authors:  Robert C Edgar
Journal:  BMC Bioinformatics       Date:  2004-08-19       Impact factor: 3.169

View more
  1 in total

1.  DeepECA: an end-to-end learning framework for protein contact prediction from a multiple sequence alignment.

Authors:  Hiroyuki Fukuda; Kentaro Tomii
Journal:  BMC Bioinformatics       Date:  2020-01-09       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.