Modular arrangement of proteins as inferred from analysis of homology.

Abstract

The structure of many proteins consists of a combination of discrete modules that have been shuffled during evolution. Such modules can frequently be recognized from the analysis of homology. Here we present a systematic analysis of the modular organization of all sequenced proteins. To achieve this we have developed an automatic method to identify protein domains from sequence comparisons. Homologous domains can then be clustered into consistent families. The method was applied to all 21,098 nonfragment protein sequences in SWISS-PROT 21.0, which was automatically reorganized into a comprehensive protein domain database, ProDom. We have constructed multiple sequence alignments for each domain family in ProDom, from which consensus sequences were generated. These nonredundant domain consensuses are useful for fast homology searches. Domain organization in ProDom is exemplified for proteins of the phosphoenolpyruvate:sugar phosphotransferase system (PEP:PTS) and for bacterial 2-component regulators. We provide 2 examples of previously unrecognized domain arrangements discovered with the help of ProDom.
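The two steps the abstract names, clustering homologous domains into families and deriving a consensus from each family's alignment, can be illustrated with a minimal Python sketch. This is not the ProDom procedure, which the abstract does not detail. It assumes that pairwise homology calls and aligned sequences are already available, and the function names `cluster_families` and `consensus`, the single-linkage grouping, and the majority-rule consensus with a half-gap cutoff are all illustrative choices.

```python
from collections import Counter, defaultdict


def cluster_families(domains, homologous_pairs):
    """Group domain ids into families (hypothetical helper).

    Assumption: a family is a connected component of the homology graph,
    i.e. single-linkage clustering, implemented here with union-find.
    """
    parent = {d: d for d in domains}

    def find(x):
        # Follow parent links to the root, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in homologous_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    families = defaultdict(list)
    for d in domains:
        families[find(d)].append(d)
    return sorted(sorted(members) for members in families.values())


def consensus(aligned, gap="-"):
    """Majority-rule consensus of equal-length aligned sequences (hypothetical helper).

    Assumption: columns where gaps outnumber residues are dropped, and
    ties are broken alphabetically so the result is deterministic.
    """
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("need at least one sequence, all of equal length")
    out = []
    for column in zip(*aligned):
        residues = [c for c in column if c != gap]
        if len(residues) * 2 < len(column):
            continue  # mostly gaps: leave this column out of the consensus
        counts = Counter(residues)
        # max() keeps the first maximum, so sorting the keys breaks ties alphabetically.
        out.append(max(sorted(counts), key=counts.get))
    return "".join(out)


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    print(cluster_families(["d1", "d2", "d3", "d4"], [("d1", "d2"), ("d2", "d3")]))
    print(consensus(["MKV-LA", "MKVQLA", "MRV-LG"]))
```

Searching against one consensus per family instead of every member sequence is what makes a nonredundant set faster to scan, at the cost of some sensitivity to divergent members.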

Year: 1994 PMID: 8019419 PMCID: PMC2142695 DOI: 10.1002/pro.5560030314

Source DB: PubMed Journal: Protein Sci ISSN: 0961-8368 Impact factor: 6.725
