
Visualizing sequence similarity of protein families.

Vamsi Veeramachaneni, Wojciech Makałowski

Abstract

Classification of proteins into families is one of the main goals of functional analysis. Proteins are usually assigned to a family on the basis of the presence of family-specific patterns, domains, or structural elements. Whereas proteins belonging to the same family are generally similar to each other, the extent of similarity varies widely across families. Some families are characterized by short, well-defined motifs, whereas others contain longer, less-specific motifs. We present a simple method for visualizing such differences. We applied our method to the Arabidopsis thaliana families listed at The Arabidopsis Information Resource (TAIR) Web site; for 76% of the nontrivial families (families with more than one member), it identifies simple similarity measures that are necessary and sufficient to cluster members of the family together. Our visualization method can be used as part of an annotation pipeline to identify potentially incorrectly defined families. We also describe how our method can be extended to identify novel families and to assign unclassified proteins to known families.

Copyright 2004 Cold Spring Harbor Laboratory Press

Year:  2004        PMID: 15140831      PMCID: PMC419794          DOI: 10.1101/gr.2079204

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


