
LEON: multiple aLignment Evaluation Of Neighbours.

Julie D Thompson, Véronique Prigent, Olivier Poch.

Abstract

Sequence alignments are fundamental to a wide range of applications, including database searching, functional residue identification and structure prediction techniques. These applications predict or propagate structural, functional or evolutionary information based on a presumed homology between the aligned sequences. If the initial hypothesis of homology is wrong, no subsequent application, however sophisticated, can be expected to yield accurate results. Here we present a novel method, LEON, to predict homology between proteins based on a multiple alignment of complete sequences (MACS). In a MACS, weak signals from distantly related proteins can be considered in the overall context of the family. Intermediate sequences and the combination of individual weak matches are used to increase the significance of low-scoring regions. Residue composition is also taken into account by incorporating several existing methods for the detection of compositionally biased sequence segments. The accuracy and reliability of the predictions are demonstrated in large-scale comparisons with structural and sequence family databases, where the specificity was shown to be >99% and the sensitivity was estimated to be approximately 76%. LEON can thus be used to reliably identify the complex relationships between large multidomain proteins and should be useful for automatic high-throughput genome annotation, 2D/3D structure prediction and protein-protein interaction prediction, among other applications.
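As a reading aid for the specificity and sensitivity figures quoted in the abstract, the following minimal sketch computes both measures from paired homology calls. It assumes the standard definitions, sensitivity = TP/(TP+FN) and specificity = TN/(TN+FP); the abstract does not state which definitions the authors used. The function name and the toy labels are hypothetical and are not taken from the paper or from LEON itself.

```python
def sensitivity_specificity(predicted, actual):
    """Return (sensitivity, specificity) for paired boolean homology calls.

    predicted[i] is True if a method calls pair i homologous;
    actual[i] is True if a reference database says it is.
    Standard definitions are assumed (not confirmed by the abstract).
    """
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")

    tp = tn = fp = fn = 0
    for p, a in zip(predicted, actual):
        if p and a:
            tp += 1          # homolog correctly detected
        elif p and not a:
            fp += 1          # non-homolog wrongly called homologous
        elif not p and a:
            fn += 1          # true homolog missed
        else:
            tn += 1          # non-homolog correctly rejected

    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Toy labels for illustration only (not data from the paper).
actual = [True, True, True, True, False, False, False, False]
predicted = [True, True, True, False, False, False, False, False]
sens, spec = sensitivity_specificity(predicted, actual)
print(sens, spec)  # 0.75 1.0
```

With these definitions, a high specificity means that few unrelated sequences are wrongly reported as homologous, while a lower sensitivity means that some true homologs are missed.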


Year:  2004        PMID: 14982955      PMCID: PMC390283          DOI: 10.1093/nar/gkh294

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


