Modular arrangement of proteins as inferred from analysis of homology.

E L Sonnhammer, D Kahn.

Abstract

The structure of many proteins consists of a combination of discrete modules that have been shuffled during evolution. Such modules can frequently be recognized from the analysis of homology. Here we present a systematic analysis of the modular organization of all sequenced proteins. To achieve this we have developed an automatic method to identify protein domains from sequence comparisons. Homologous domains can then be clustered into consistent families. The method was applied to all 21,098 nonfragment protein sequences in SWISS-PROT 21.0, which was automatically reorganized into a comprehensive protein domain database, ProDom. We have constructed multiple sequence alignments for each domain family in ProDom, from which consensus sequences were generated. These nonredundant domain consensuses are useful for fast homology searches. Domain organization in ProDom is exemplified for proteins of the phosphoenolpyruvate:sugar phosphotransferase system (PEP:PTS) and for bacterial 2-component regulators. We provide 2 examples of previously unrecognized domain arrangements discovered with the help of ProDom.

Year:  1994        PMID: 8019419      PMCID: PMC2142695          DOI: 10.1002/pro.5560030314

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  34 in total

1.  Proposed uniform nomenclature for the proteins and protein domains of the bacterial phosphoenolpyruvate: sugar phosphotransferase system.

Authors:  M H Saier; J Reizer
Journal:  J Bacteriol       Date:  1992-03       Impact factor: 3.490

Review 2.  Protein modules.

Authors:  M Baron; D G Norman; I D Campbell
Journal:  Trends Biochem Sci       Date:  1991-01       Impact factor: 13.807

3.  Sequence ordinations: a multivariate analysis approach to analysing large sequence data sets.

Authors:  D G Higgins
Journal:  Comput Appl Biosci       Date:  1992-02

4.  PROSITE: a dictionary of sites and patterns in proteins.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  1992-05-11       Impact factor: 16.971

5.  RNA recognition: towards identifying determinants of specificity.

Authors:  D J Kenan; C C Query; J D Keene
Journal:  Trends Biochem Sci       Date:  1991-06       Impact factor: 13.807

6.  Automated assembly of protein blocks for database searching.

Authors:  S Henikoff; J G Henikoff
Journal:  Nucleic Acids Res       Date:  1991-12-11       Impact factor: 16.971

7.  Modular structure of FixJ: homology of the transcriptional activator domain with the -35 binding domain of sigma factors.

Authors:  D Kahn; G Ditta
Journal:  Mol Microbiol       Date:  1991-04       Impact factor: 3.501

Review 8.  Statistical methods and insights for protein and DNA sequences.

Authors:  S Karlin; P Bucher; V Brendel; S F Altschul
Journal:  Annu Rev Biophys Biophys Chem       Date:  1991

9.  A new family of powerful multivariate statistical sequence analysis techniques.

Authors:  M van Heel
Journal:  J Mol Biol       Date:  1991-08-20       Impact factor: 5.469

10.  Structural and genetic analysis of the bvg locus in Bordetella species.

Authors:  B Aricò; V Scarlato; D M Monack; S Falkow; R Rappuoli
Journal:  Mol Microbiol       Date:  1991-10       Impact factor: 3.501

  57 in total

1.  ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisons.

Authors:  F Corpet; F Servant; J Gouzy; D Kahn
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The Pfam protein families database.

Authors:  A Bateman; E Birney; R Durbin; S R Eddy; K L Howe; E L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  Automated search of natively folded protein fragments for high-throughput structure determination in structural genomics.

Authors:  Y Kuroda; K Tani; Y Matsuo; S Yokoyama
Journal:  Protein Sci       Date:  2000-12       Impact factor: 6.725

4.  Visual management of large scale data mining projects.

Authors:  I Shah; L Hunter
Journal:  Pac Symp Biocomput       Date:  2000

5.  Massive sequence comparisons as a help in annotating genomic sequences.

Authors:  A Louis; E Ollivier; J C Aude; J L Risler
Journal:  Genome Res       Date:  2001-07       Impact factor: 9.043

6.  The complete sequence of the 1,683-kb pSymB megaplasmid from the N2-fixing endosymbiont Sinorhizobium meliloti.

Authors:  T M Finan; S Weidner; K Wong; J Buhrmester; P Chain; F J Vorhölter; I Hernandez-Lucas; A Becker; A Cowie; J Gouzy; B Golding; A Pühler
Journal:  Proc Natl Acad Sci U S A       Date:  2001-07-31       Impact factor: 11.205

7.  A semiautomated approach to gene discovery through expressed sequence tag data mining: discovery of new human transporter genes.

Authors:  Shoshana Brown; Jean L Chang; Wolfgang Sadée; Patricia C Babbitt
Journal:  AAPS PharmSci       Date:  2003

Review 8.  Synthetic biology with surgical precision: targeted reengineering of signaling proteins.

Authors:  Vsevolod V Gurevich; Eugenia V Gurevich
Journal:  Cell Signal       Date:  2012-06-01       Impact factor: 4.315

9.  Characterization and prediction of linker sequences of multi-domain proteins by a neural network.

Authors:  Satoshi Miyazaki; Yutaka Kuroda; Shigeyuki Yokoyama
Journal:  J Struct Funct Genomics       Date:  2002

10.  Dictionary-driven protein annotation.

Authors:  Isidore Rigoutsos; Tien Huynh; Aris Floratos; Laxmi Parida; Daniel Platt
Journal:  Nucleic Acids Res       Date:  2002-09-01       Impact factor: 16.971
