| Literature DB >> 18707536 |
Chun-Nam John Yu1, Thorsten Joachims, Ron Elber, Jaroslaw Pillardy.
Abstract
Sequence to structure alignment is an important step in homology modeling of protein structures. Incorporation of features such as secondary structure, solvent accessibility, or evolutionary information improve sequence to structure alignment accuracy, but conventional generative estimation techniques for alignment models impose independence assumptions that make these features difficult to include in a principled way. In this paper, we overcome this problem using a Support Vector Machine (SVM) method that provides a well-founded way of estimating complex alignment models with hundred of thousands of parameters. Furthermore, we show that the method can be trained using a variety of loss functions. In a rigorous empirical evaluation, the SVM algorithm outperforms the generative alignment method SSALN, a highly accurate generative alignment model that incorporates structural information. The alignment model learned by the SVM aligns 50% of the residues correctly and aligns over 70% of the residues within a shift of four positions.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18707536 PMCID: PMC2612564 DOI: 10.1089/cmb.2007.0152
Source DB: PubMed Journal: J Comput Biol ISSN: 1066-5277 Impact factor: 1.479