Massive sequence comparisons as a help in annotating genomic sequences.

A Louis, E Ollivier, J C Aude, J L Risler.

Abstract

An all-by-all comparison of all the publicly available protein sequences from plants has been performed, followed by a clustering process. Within each of the 1064 resulting clusters, which contain sequences that are orthologous as well as paralogous, the sequences have been submitted to a pyramidal classification and their domains delineated by an automated procedure à la. This process provides a means of easily checking for apparent inconsistencies in a cluster, for example, whether one sequence is shorter or longer than the others or whether a domain is missing. In such cases, aligning the DNA sequence of the gene with the sequence of a closely homologous protein often reveals (in 10% of the clusters) probable sequencing errors (leading to frameshifts) or probable wrong intron/exon predictions. The composition of the clusters, their pyramidal classifications, and domain decomposition, as well as our comments when appropriate, are available from http://chlora.infobiogen.fr:1234/PHYTOPROT.
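The two steps described above, grouping sequences from pairwise comparison scores and then checking each cluster for members of unusual length, can be illustrated with a minimal sketch. The abstract does not specify the clustering algorithm, the score cutoff, or the length criterion, so everything here is an assumption made for illustration: single-linkage clustering via union-find, the function names `cluster_by_similarity` and `flag_length_outliers`, and the `threshold` and `tolerance` parameters are all hypothetical and are not the authors' procedure.

```python
from statistics import median


def cluster_by_similarity(pairs, threshold):
    """Group sequence IDs by single linkage (an assumed method, not the paper's).

    pairs: iterable of (id_a, id_b, score) from an all-by-all comparison.
    Two sequences end up in the same cluster if a chain of pairs with
    score >= threshold connects them.
    """
    parent = {}

    def find(x):
        # Register unseen IDs, then walk to the root with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b, score in pairs:
        root_a, root_b = find(a), find(b)
        if score >= threshold and root_a != root_b:
            parent[root_a] = root_b

    clusters = {}
    for seq_id in list(parent):
        clusters.setdefault(find(seq_id), set()).add(seq_id)
    return list(clusters.values())


def flag_length_outliers(cluster, lengths, tolerance=0.3):
    """Return members whose length differs from the cluster median
    by more than `tolerance` (a fraction of the median)."""
    med = median(lengths[s] for s in cluster)
    return sorted(s for s in cluster if abs(lengths[s] - med) > tolerance * med)


# Made-up scores and lengths, for illustration only.
pairs = [("A", "B", 120.0), ("B", "C", 95.0), ("C", "D", 10.0), ("D", "E", 150.0)]
lengths = {"A": 300, "B": 310, "C": 150, "D": 500, "E": 505}

for cluster in cluster_by_similarity(pairs, threshold=50.0):
    print(sorted(cluster), "suspicious:", flag_length_outliers(cluster, lengths))
```

A sequence flagged this way would only be a candidate for closer inspection, for instance by aligning its gene's DNA sequence against a close homolog as the abstract describes.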

Year: 2001; PMID: 11435413; PMCID: PMC311131; DOI: 10.1101/gr.gr-1776r

Source DB: PubMed; Journal: Genome Res; ISSN: 1088-9051; Impact factor: 9.043


