Literature DB >> 12946352

Potential for dramatic improvement in sequence alignment against structures of remote homologous proteins by extracting structural information from multiple structure alignment.

Ziding Zhang1, Mats Lindstam, Johan Unge, Carsten Peterson, Guoguang Lu.   

Abstract

A novel method has been developed for acquiring the correct alignment of a query sequence against remotely homologous proteins by extracting structural information from profiles of multiple structure alignment. A systematic search algorithm combined with a group of score functions based on sequence information and structural information has been introduced in this procedure. A limited number of top solutions (15,000) with high scores were selected as candidates for further examination. On a test-set comprising 301 proteins from 75 protein families with sequence identity less than 30%, the proportion of proteins with completely correct alignment as first candidate was improved to 39.8% by our method, whereas the typical performance of existing sequence-based alignment methods was only between 16.1% and 22.7%. Furthermore, multiple candidates for possible alignment were provided in our approach, which dramatically increased the possibility of finding correct alignment, such that completely correct alignments were found amongst the top-ranked 1000 candidates in 88.3% of the proteins. With the assistance of a sequence database, completely correct alignment solutions were achieved amongst the top 1000 candidates in 94.3% of the proteins. From such a limited number of candidates, it would become possible to identify more correct alignment using a more time-consuming but more powerful method with more detailed structural information, such as side-chain packing and energy minimization, etc. The results indicate that the novel alignment strategy could be helpful for extending the application of highly reliable methods for fold identification and homology modeling to a huge number of homologous proteins of low sequence similarity. Details of the methods, together with the results and implications for future development are presented.

Mesh:

Substances:

Year:  2003        PMID: 12946352     DOI: 10.1016/s0022-2836(03)00858-1

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  8 in total

1.  Structure of the dimerization domain of DiGeorge critical region 8.

Authors:  Rachel Senturia; Michael Faller; Sheng Yin; Joseph A Loo; Duilio Cascio; Michael R Sawaya; Daniel Hwang; Robert T Clubb; Feng Guo
Journal:  Protein Sci       Date:  2010-07       Impact factor: 6.725

2.  Apo and ligand-bound structures of ModA from the archaeon Methanosarcina acetivorans.

Authors:  Sum Chan; Iulia Giuroiu; Irina Chernishof; Michael R Sawaya; Janet Chiang; Robert P Gunsalus; Mark A Arbing; L Jeanne Perry
Journal:  Acta Crystallogr Sect F Struct Biol Cryst Commun       Date:  2010-02-23

3.  Structural and functional analysis of a new subfamily of glycosyltransferases required for glycosylation of serine-rich streptococcal adhesins.

Authors:  Fan Zhu; Heidi Erlandsen; Lei Ding; Jingzhi Li; Ying Huang; Meixian Zhou; Xiaobo Liang; Jinbiao Ma; Hui Wu
Journal:  J Biol Chem       Date:  2011-06-07       Impact factor: 5.157

4.  Parallel evolution of non-homologous isofunctional enzymes in methionine biosynthesis.

Authors:  Karine Bastard; Alain Perret; Aline Mariage; Thomas Bessonnet; Agnès Pinet-Turpault; Jean-Louis Petit; Ekaterina Darii; Pascal Bazire; Carine Vergne-Vaxelaire; Clémence Brewee; Adrien Debard; Virginie Pellouin; Marielle Besnard-Gonnet; François Artiguenave; Claudine Médigue; David Vallenet; Antoine Danchin; Anne Zaparucha; Jean Weissenbach; Marcel Salanoubat; Véronique de Berardinis
Journal:  Nat Chem Biol       Date:  2017-06-05       Impact factor: 15.040

5.  An Improved Strategy for Task Scheduling in the Parallel Computational Alignment of Multiple Sequences.

Authors:  Muhammad Ishaq; Asfandyar Khan; Mazliham Mohd Su'ud; Muhammad Mansoor Alam; Javed Iqbal Bangash; Abdullah Khan
Journal:  Comput Math Methods Med       Date:  2022-01-28       Impact factor: 2.238

6.  Dimerization and heme binding are conserved in amphibian and starfish homologues of the microRNA processing protein DGCR8.

Authors:  Rachel Senturia; Arthur Laganowsky; Ian Barr; Brooke D Scheidemantle; Feng Guo
Journal:  PLoS One       Date:  2012-07-02       Impact factor: 3.240

7.  Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee.

Authors:  Fabrice Armougom; Sébastien Moretti; Olivier Poirot; Stéphane Audic; Pierre Dumas; Basile Schaeli; Vladimir Keduas; Cedric Notredame
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

8.  Molecular function prediction for a family exhibiting evolutionary tendencies toward substrate specificity swapping: recurrence of tyrosine aminotransferase activity in the Iα subfamily.

Authors:  Kathryn E Muratore; Barbara E Engelhardt; John R Srouji; Michael I Jordan; Steven E Brenner; Jack F Kirsch
Journal:  Proteins       Date:  2013-06-17
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.