Literature DB >> 17660202

Consensus Data Mining (CDM) Protein Secondary Structure Prediction Server: combining GOR V and Fragment Database Mining (FDM).

Haitao Cheng1, Taner Z Sen, Robert L Jernigan, Andrzej Kloczkowski.   

Abstract

One of the challenges in protein secondary structure prediction is to overcome the cross-validated 80% prediction accuracy barrier. Here, we propose a novel approach to surpass this barrier. Instead of using a single algorithm that relies on a limited data set for training, we combine two complementary methods having different strengths: Fragment Database Mining (FDM) and GOR V. FDM harnesses the availability of the known protein structures in the Protein Data Bank and provides highly accurate secondary structure predictions when sequentially similar structural fragments are identified. In contrast, the GOR V algorithm is based on information theory, Bayesian statistics, and PSI-BLAST multiple sequence alignments to predict the secondary structure of residues inside a sliding window along a protein chain. A combination of these two different methods benefits from the large number of structures in the PDB and significantly improves the secondary structure prediction accuracy, resulting in Q3 ranging from 67.5 to 93.2%, depending on the availability of highly similar fragments in the Protein Data Bank.

Mesh:

Substances:

Year:  2007        PMID: 17660202      PMCID: PMC2553684          DOI: 10.1093/bioinformatics/btm379

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  21 in total

1.  Protein secondary structure prediction based on position-specific scoring matrices.

Authors:  D T Jones
Journal:  J Mol Biol       Date:  1999-09-17       Impact factor: 5.469

2.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  Coupled prediction of protein secondary and tertiary structure.

Authors:  Jens Meiler; David Baker
Journal:  Proc Natl Acad Sci U S A       Date:  2003-10-03       Impact factor: 11.205

4.  Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions.

Authors:  E Krissinel; K Henrick
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2004-11-26

5.  The effect of long-range interactions on the secondary structure formation of proteins.

Authors:  Daisuke Kihara
Journal:  Protein Sci       Date:  2005-06-29       Impact factor: 6.725

6.  Distinct structural elements in the first membrane-spanning segment of the epithelial sodium channel.

Authors:  Ossama B Kashlan; Ahmad B Maarouf; Cassandra Kussius; Robert M Denshaw; Kenneth M Blumenthal; Thomas R Kleyman
Journal:  J Biol Chem       Date:  2006-08-14       Impact factor: 5.157

7.  A Consensus Data Mining secondary structure prediction by combining GOR V and Fragment Database Mining.

Authors:  Taner Z Sen; Haitao Cheng; Andrzej Kloczkowski; Robert L Jernigan
Journal:  Protein Sci       Date:  2006-09-25       Impact factor: 6.725

8.  Prediction of protein secondary structure by mining structural fragment database.

Authors:  Haitao Cheng; Taner Z Sen; Andrzej Kloczkowski; Dimitris Margaritis; Robert L Jernigan
Journal:  Polymer (Guildf)       Date:  2005-05-26       Impact factor: 4.430

9.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions.

Authors:  K T Simons; C Kooperberg; E Huang; D Baker
Journal:  J Mol Biol       Date:  1997-04-25       Impact factor: 5.469

10.  Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins.

Authors:  J Garnier; D J Osguthorpe; B Robson
Journal:  J Mol Biol       Date:  1978-03-25       Impact factor: 5.469

View more
  11 in total

Review 1.  From local structure to a global framework: recognition of protein folds.

Authors:  Agnel Praveen Joseph; Alexandre G de Brevern
Journal:  J R Soc Interface       Date:  2014-04-16       Impact factor: 4.118

2.  SPINE X: improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles.

Authors:  Eshel Faraggi; Tuo Zhang; Yuedong Yang; Lukasz Kurgan; Yaoqi Zhou
Journal:  J Comput Chem       Date:  2011-11-02       Impact factor: 3.376

3.  Distributions of amino acids suggest that certain residue types more effectively determine protein secondary structure.

Authors:  S Saraswathi; J L Fernández-Martínez; A Koliński; R L Jernigan; A Kloczkowski
Journal:  J Mol Model       Date:  2013-08-02       Impact factor: 1.810

Review 4.  Template-based protein modeling: recent methodological advances.

Authors:  Pankaj R Daga; Ronak Y Patel; Robert J Doerksen
Journal:  Curr Top Med Chem       Date:  2010       Impact factor: 3.295

5.  Platelet adhesion to decorin but not collagen I correlates with the integrin α2 dimorphism E534K, the basis of the human platelet alloantigen (HPA)-5 system.

Authors:  Thomas J Kunicki; Shirley A Williams; Daniel Diaz; Richard W Farndale; Diane J Nugent
Journal:  Haematologica       Date:  2011-12-01       Impact factor: 9.941

6.  Evolutionary history of tissue kallikreins.

Authors:  Athanasia Pavlopoulou; Georgios Pampalakis; Ioannis Michalopoulos; Georgia Sotiropoulou
Journal:  PLoS One       Date:  2010-11-01       Impact factor: 3.240

7.  Improving prediction of secondary structure, local backbone angles, and solvent accessible surface area of proteins by iterative deep learning.

Authors:  Rhys Heffernan; Kuldip Paliwal; James Lyons; Abdollah Dehzangi; Alok Sharma; Jihua Wang; Abdul Sattar; Yuedong Yang; Yaoqi Zhou
Journal:  Sci Rep       Date:  2015-06-22       Impact factor: 4.379

8.  Sixty-five years of the long march in protein secondary structure prediction: the final stretch?

Authors:  Yuedong Yang; Jianzhao Gao; Jihua Wang; Rhys Heffernan; Jack Hanson; Kuldip Paliwal; Yaoqi Zhou
Journal:  Brief Bioinform       Date:  2018-05-01       Impact factor: 11.622

9.  Knowledge-based prediction of protein backbone conformation using a structural alphabet.

Authors:  Iyanar Vetrivel; Swapnil Mahajan; Manoj Tyagi; Lionel Hoffmann; Yves-Henri Sanejouand; Narayanaswamy Srinivasan; Alexandre G de Brevern; Frédéric Cadet; Bernard Offmann
Journal:  PLoS One       Date:  2017-11-21       Impact factor: 3.240

10.  A role of SCN9A in human epilepsies, as a cause of febrile seizures and as a potential modifier of Dravet syndrome.

Authors:  Nanda A Singh; Chris Pappas; E Jill Dahle; Lieve R F Claes; Timothy H Pruess; Peter De Jonghe; Joel Thompson; Missy Dixon; Christina Gurnett; Andy Peiffer; H Steve White; Francis Filloux; Mark F Leppert
Journal:  PLoS Genet       Date:  2009-09-18       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.