Literature DB >> 9843945

Structural assignments to the Mycoplasma genitalium proteins show extensive gene duplications and domain rearrangements.

S A Teichmann1, J Park, C Chothia.   

Abstract

The parasitic bacterium Mycoplasma genitalium has a small, reduced genome with close to a basic set of genes. As a first step toward determining the families of protein domains that form the products of these genes, we have used the multiple sequence programs PSI-BLAST and GEANFAMMER to match the sequences of the 467 gene products of M. genitalium to the sequences of the domains that form proteins of known structure [Protein Data Bank (PDB) sequences]. PDB sequences (274) match all of 106 M. genitalium sequences and some parts of another 85; thus, 41% of its total sequences are matched in all or part. The evolutionary relationships of the PDB domains that match M. genitalium are described in the structural classification of proteins (SCOP) database. Using this information, we show that the domains in the matched M. genitalium sequences come from 114 superfamilies and that 58% of them have arisen by gene duplication. This level of duplication is more than twice that found by using pairwise sequence comparisons. The PDB domain matches also describe the domain structure of the matched sequences: just over a quarter contain one domain and the rest have combinations of two or more domains.

Entities:  

Mesh:

Substances:

Year:  1998        PMID: 9843945      PMCID: PMC24505          DOI: 10.1073/pnas.95.25.14658

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  31 in total

1.  Proteins. One thousand families for the molecular biologist.

Authors:  C Chothia
Journal:  Nature       Date:  1992-06-18       Impact factor: 49.962

Review 2.  Hidden Markov models.

Authors:  S R Eddy
Journal:  Curr Opin Struct Biol       Date:  1996-06       Impact factor: 6.809

3.  Complete sequence analysis of the genome of the bacterium Mycoplasma pneumoniae.

Authors:  R Himmelreich; H Hilbert; H Plagens; E Pirkl; B C Li; R Herrmann
Journal:  Nucleic Acids Res       Date:  1996-11-15       Impact factor: 16.971

4.  Protein evolution viewed through Escherichia coli protein sequences: introducing the notion of a structural segment of homology, the module.

Authors:  M Riley; B Labedan
Journal:  J Mol Biol       Date:  1997-05-23       Impact factor: 5.469

5.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

6.  The minimal gene complement of Mycoplasma genitalium.

Authors:  C M Fraser; J D Gocayne; O White; M D Adams; R A Clayton; R D Fleischmann; C J Bult; A R Kerlavage; G Sutton; J M Kelley; R D Fritchman; J F Weidman; K V Small; M Sandusky; J Fuhrmann; D Nguyen; T R Utterback; D M Saudek; C A Phillips; J M Merrick; J F Tomb; B A Dougherty; K F Bott; P C Hu; T S Lucier; S N Peterson; H O Smith; C A Hutchison; J C Venter
Journal:  Science       Date:  1995-10-20       Impact factor: 47.728

7.  Gene duplications in H. influenzae.

Authors:  S E Brenner; T Hubbard; A Murzin; C Chothia
Journal:  Nature       Date:  1995-11-09       Impact factor: 49.962

8.  SCOP: a structural classification of proteins database for the investigation of sequences and structures.

Authors:  A G Murzin; S E Brenner; T Hubbard; C Chothia
Journal:  J Mol Biol       Date:  1995-04-07       Impact factor: 5.469

9.  Hidden Markov models in computational biology. Applications to protein modeling.

Authors:  A Krogh; M Brown; I S Mian; K Sjölander; D Haussler
Journal:  J Mol Biol       Date:  1994-02-04       Impact factor: 5.469

10.  Structure of the dsRNA binding domain of E. coli RNase III.

Authors:  A Kharrat; M J Macias; T J Gibson; M Nilges; A Pastore
Journal:  EMBO J       Date:  1995-07-17       Impact factor: 11.598

View more
  49 in total

1.  Detection of protein fold similarity based on correlation of amino acid properties.

Authors:  I V Grigoriev; S H Kim
Journal:  Proc Natl Acad Sci U S A       Date:  1999-12-07       Impact factor: 11.205

2.  MODBASE, a database of annotated comparative protein structure models.

Authors:  R Sánchez; U Pieper; N Mirković; P I de Bakker; E Wittenstein; A Sali
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  The ASTRAL compendium for protein structure and sequence analysis.

Authors:  S E Brenner; P Koehl; M Levitt
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

4.  PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information.

Authors:  J Qian; B Stenger; C A Wilson; J Lin; R Jansen; S A Teichmann; J Park; W G Krebs; H Yu; V Alexandrov; N Echols; M Gerstein
Journal:  Nucleic Acids Res       Date:  2001-04-15       Impact factor: 16.971

5.  Estimating the probability for a protein to have a new fold: A statistical computational model.

Authors:  E Portugaly; M Linial
Journal:  Proc Natl Acad Sci U S A       Date:  2000-05-09       Impact factor: 11.205

6.  Analysis of the yeast transcriptome with structural and functional categories: characterizing highly expressed proteins.

Authors:  R Jansen; M Gerstein
Journal:  Nucleic Acids Res       Date:  2000-03-15       Impact factor: 16.971

7.  MODBASE, a database of annotated comparative protein structure models.

Authors:  Ursula Pieper; Narayanan Eswar; Ashley C Stuart; Valentin A Ilyin; Andrej Sali
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

8.  Fold recognition without folds.

Authors:  Kristin K Koretke; Robert B Russell; Andrei N Lupas
Journal:  Protein Sci       Date:  2002-06       Impact factor: 6.725

9.  Structural characterization of the human proteome.

Authors:  Arne Müller; Robert M MacCallum; Michael J E Sternberg
Journal:  Genome Res       Date:  2002-11       Impact factor: 9.043

10.  Proteomics of Mycoplasma genitalium: identification and characterization of unannotated and atypical proteins in a small model genome.

Authors:  S Balasubramanian; T Schneider; M Gerstein; L Regan
Journal:  Nucleic Acids Res       Date:  2000-08-15       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.