Literature DB >> 11524371

AL2CO: calculation of positional conservation in a protein sequence alignment.

J Pei1, N V Grishin.   

Abstract

MOTIVATION: Amino acid sequence alignments are widely used in the analysis of protein structure, function and evolutionary relationships. Proteins within a superfamily usually share the same fold and possess related functions. These structural and functional constraints are reflected in the alignment conservation patterns. Positions of functional and/or structural importance tend to be more conserved. Conserved positions are usually clustered in distinct motifs surrounded by sequence segments of low conservation. Poorly conserved regions might also arise from the imperfections in multiple alignment algorithms and thus indicate possible alignment errors. Quantification of conservation by attributing a conservation index to each aligned position makes motif detection more convenient. Mapping these conservation indices onto a protein spatial structure helps to visualize spatial conservation features of the molecule and to predict functionally and/or structurally important sites. Analysis of conservation indices could be a useful tool in detection of potentially misaligned regions and will aid in improvement of multiple alignments.
RESULTS: We developed a program to calculate a conservation index at each position in a multiple sequence alignment using several methods. Namely, amino acid frequencies at each position are estimated and the conservation index is calculated from these frequencies. We utilize both unweighted frequencies and frequencies weighted using two different strategies. Three conceptually different approaches (entropy-based, variance-based and matrix score-based) are implemented in the algorithm to define the conservation index. Calculating conservation indices for 35522 positions in 284 alignments from SMART database we demonstrate that different methods result in highly correlated (correlation coefficient more than 0.85) conservation indices. Conservation indices show statistically significant correlation between sequentially adjacent positions i and i + j, where j < 13, and averaging of the indices over the window of three positions is optimal for motif detection. Positions with gaps display substantially lower conservation properties. We compare conservation properties of the SMART alignments or FSSP structural alignments to those of the ClustalW alignments. The results suggest that conservation indices should be a valuable tool of alignment quality assessment and might be used as an objective function for refinement of multiple alignments. AVAILABILITY: The C code of the AL2CO program and its pre-compiled versions for several platforms as well as the details of the analysis are freely available at ftp://iole.swmed.edu/pub/al2co/.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11524371     DOI: 10.1093/bioinformatics/17.8.700

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  198 in total

1.  Peptidase family U34 belongs to the superfamily of N-terminal nucleophile hydrolases.

Authors:  Jimin Pei; Nick V Grishin
Journal:  Protein Sci       Date:  2003-05       Impact factor: 6.725

2.  The 2.3-angstrom structure of porcine circovirus 2.

Authors:  Reza Khayat; Nicholas Brunn; Jeffrey A Speir; John M Hardham; Robert G Ankenbauer; Anette Schneemann; John E Johnson
Journal:  J Virol       Date:  2011-06-01       Impact factor: 5.103

3.  ArchDB: automated protein loop classification as a tool for structural genomics.

Authors:  Jordi Espadaler; Narcis Fernandez-Fuentes; Antonio Hermoso; Enrique Querol; Francesc X Aviles; Michael J E Sternberg; Baldomero Oliva
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

Review 4.  Structural genomics: computational methods for structure analysis.

Authors:  Sharon Goldsmith-Fischman; Barry Honig
Journal:  Protein Sci       Date:  2003-09       Impact factor: 6.725

Review 5.  The IUBMB-endorsed transporter classification system.

Authors:  Wolfgang Busch; Milton H Saier
Journal:  Mol Biotechnol       Date:  2004-07       Impact factor: 2.695

6.  PIN domain of Nob1p is required for D-site cleavage in 20S pre-rRNA.

Authors:  Alessandro Fatica; David Tollervey; Mensur Dlakić
Journal:  RNA       Date:  2004-09-23       Impact factor: 4.942

7.  LEON: multiple aLignment Evaluation Of Neighbours.

Authors:  Julie D Thompson; Véronique Prigent; Olivier Poch
Journal:  Nucleic Acids Res       Date:  2004-02-24       Impact factor: 16.971

Review 8.  Bioinformatics for personal genome interpretation.

Authors:  Emidio Capriotti; Nathan L Nehrt; Maricel G Kann; Yana Bromberg
Journal:  Brief Bioinform       Date:  2012-01-13       Impact factor: 11.622

9.  A Structure-Based Strategy for Engineering Selective Ubiquitin Variant Inhibitors of Skp1-Cul1-F-Box Ubiquitin Ligases.

Authors:  Maryna Gorelik; Noah Manczyk; Alevtina Pavlenco; Igor Kurinov; Sachdev S Sidhu; Frank Sicheri
Journal:  Structure       Date:  2018-07-19       Impact factor: 5.006

10.  How natalizumab binds and antagonizes α4 integrins.

Authors:  Yamei Yu; Thomas Schürpf; Timothy A Springer
Journal:  J Biol Chem       Date:  2013-09-18       Impact factor: 5.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.