Literature DB >> 21031154

A DIRICHLET PROCESS MIXTURE OF HIDDEN MARKOV MODELS FOR PROTEIN STRUCTURE PREDICTION.

Kristin P Lennox1, David B Dahl, Marina Vannucci, Ryan Day, Jerry W Tsai.   

Abstract

By providing new insights into the distribution of a protein's torsion angles, recent statistical models for this data have pointed the way to more efficient methods for protein structure prediction. Most current approaches have concentrated on bivariate models at a single sequence position. There is, however, considerable value in simultaneously modeling angle pairs at multiple sequence positions in a protein. One area of application for such models is in structure prediction for the highly variable loop and turn regions. Such modeling is difficult due to the fact that the number of known protein structures available to estimate these torsion angle distributions is typically small. Furthermore, the data is "sparse" in that not all proteins have angle pairs at each sequence position. We propose a new semiparametric model for the joint distributions of angle pairs at multiple sequence positions. Our model accommodates sparse data by leveraging known information about the behavior of protein secondary structure. We demonstrate our technique by predicting the torsion angles in a loop from the globin fold family. Our results show that a template-based approach can now be successfully extended to modeling the notoriously difficult loop and turn regions.

Entities:  

Year:  2010        PMID: 21031154      PMCID: PMC2964143          DOI: 10.1214/09-AOAS296

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  17 in total

Review 1.  Ab initio protein folding.

Authors:  D J Osguthorpe
Journal:  Curr Opin Struct Biol       Date:  2000-04       Impact factor: 6.809

2.  The PSIPRED protein structure prediction server.

Authors:  L J McGuffin; K Bryson; D T Jones
Journal:  Bioinformatics       Date:  2000-04       Impact factor: 6.937

3.  Protein structure prediction and structural genomics.

Authors:  D Baker; A Sali
Journal:  Science       Date:  2001-10-05       Impact factor: 47.728

4.  Revisiting the Ramachandran plot: hard-sphere repulsion, electrostatics, and H-bonding in the alpha-helix.

Authors:  Bosco K Ho; Annick Thomas; Robert Brasseur
Journal:  Protein Sci       Date:  2003-11       Impact factor: 6.725

5.  Structure validation by Calpha geometry: phi,psi and Cbeta deviation.

Authors:  Simon C Lovell; Ian W Davis; W Bryan Arendall; Paul I W de Bakker; J Michael Word; Michael G Prisant; Jane S Richardson; David C Richardson
Journal:  Proteins       Date:  2003-02-15

6.  MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2004-03-19       Impact factor: 16.971

7.  Protein imperfections: separating intrinsic from extrinsic variation of torsion angles.

Authors:  Glenn L Butterfoss; Jane S Richardson; Jan Hermans
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2004-12-17

8.  Predicting protein structure using hidden Markov models.

Authors:  K Karplus; K Sjölander; C Barrett; M Cline; D Haussler; R Hughey; L Holm; C Sander
Journal:  Proteins       Date:  1997

9.  Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.

Authors:  W Kabsch; C Sander
Journal:  Biopolymers       Date:  1983-12       Impact factor: 2.505

10.  Density Estimation for Protein Conformation Angles Using a Bivariate von Mises Distribution and Bayesian Nonparametrics.

Authors:  Kristin P Lennox; David B Dahl; Marina Vannucci; Jerry W Tsai
Journal:  J Am Stat Assoc       Date:  2009-06-01       Impact factor: 5.033

View more
  5 in total

1.  Assessing protein conformational sampling methods based on bivariate lag-distributions of backbone angles.

Authors:  Mehdi Maadooliat; Xin Gao; Jianhua Z Huang
Journal:  Brief Bioinform       Date:  2012-08-27       Impact factor: 11.622

2.  Understanding the general packing rearrangements required for successful template based modeling of protein structure from a CASP experiment.

Authors:  Ryan Day; Hyun Joo; Archana C Chavan; Kristin P Lennox; Y Ann Chen; David B Dahl; Marina Vannucci; Jerry W Tsai
Journal:  Comput Biol Chem       Date:  2012-11-23       Impact factor: 2.877

3.  Learning sparse models for a dynamic Bayesian network classifier of protein secondary structure.

Authors:  Zafer Aydin; Ajit Singh; Jeff Bilmes; William S Noble
Journal:  BMC Bioinformatics       Date:  2011-05-13       Impact factor: 3.169

4.  Near-native protein loop sampling using nonparametric density estimation accommodating sparcity.

Authors:  Hyun Joo; Archana G Chavan; Ryan Day; Kristin P Lennox; Paul Sukhanov; David B Dahl; Marina Vannucci; Jerry Tsai
Journal:  PLoS Comput Biol       Date:  2011-10-20       Impact factor: 4.475

5.  Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions.

Authors:  Seyed Morteza Najibi; Mehdi Maadooliat; Lan Zhou; Jianhua Z Huang; Xin Gao
Journal:  Comput Struct Biotechnol J       Date:  2017-02-08       Impact factor: 7.271

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.