Literature DB >> 18707536

Support vector training of protein alignment models.

Chun-Nam John Yu1, Thorsten Joachims, Ron Elber, Jaroslaw Pillardy.   

Abstract

Sequence to structure alignment is an important step in homology modeling of protein structures. Incorporation of features such as secondary structure, solvent accessibility, or evolutionary information improve sequence to structure alignment accuracy, but conventional generative estimation techniques for alignment models impose independence assumptions that make these features difficult to include in a principled way. In this paper, we overcome this problem using a Support Vector Machine (SVM) method that provides a well-founded way of estimating complex alignment models with hundred of thousands of parameters. Furthermore, we show that the method can be trained using a variety of loss functions. In a rigorous empirical evaluation, the SVM algorithm outperforms the generative alignment method SSALN, a highly accurate generative alignment model that incorporates structural information. The alignment model learned by the SVM aligns 50% of the residues correctly and aligns over 70% of the residues within a shift of four positions.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18707536      PMCID: PMC2612564          DOI: 10.1089/cmb.2007.0152

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  10 in total

1.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

2.  Accurate prediction of solvent accessibility using neural networks-based regression.

Authors:  Rafał Adamczak; Aleksey Porollo; Jarosław Meller
Journal:  Proteins       Date:  2004-09-01

3.  Parametric inference for biological sequence analysis.

Authors:  Lior Pachter; Bernd Sturmfels
Journal:  Proc Natl Acad Sci U S A       Date:  2004-11-08       Impact factor: 11.205

4.  SSALN: an alignment algorithm using structure-dependent substitution matrices and gap penalties learned from structurally aligned protein pairs.

Authors:  Jian Qiu; Ron Elber
Journal:  Proteins       Date:  2006-03-01

5.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path.

Authors:  I N Shindyalov; P E Bourne
Journal:  Protein Eng       Date:  1998-09

6.  Parametric and inverse-parametric sequence alignment with XPARAL.

Authors:  D Gusfield; P Stelling
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

Review 7.  A sequence similarity search algorithm based on a probabilistic interpretation of an alignment scoring system.

Authors:  P Bucher; K Hofmann
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1996

8.  Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.

Authors:  W Kabsch; C Sander
Journal:  Biopolymers       Date:  1983-12       Impact factor: 2.505

9.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

10.  TM-align: a protein structure alignment algorithm based on the TM-score.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  Nucleic Acids Res       Date:  2005-04-22       Impact factor: 16.971

  10 in total
  2 in total

1.  Energy design for protein-protein interactions.

Authors:  D V S Ravikant; Ron Elber
Journal:  J Chem Phys       Date:  2011-08-14       Impact factor: 3.488

2.  Boosting Protein Threading Accuracy.

Authors:  Jian Peng; Jinbo Xu
Journal:  Res Comput Mol Biol       Date:  2009
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.