Literature DB >> 9342339

Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium.

D Fischer1, D Eisenberg.   

Abstract

A crucial step in exploiting the information inherent in genome sequences is to assign to each protein sequence its three-dimensional fold and biological function. Here we describe fold assignment for the proteins encoded by the small genome of Mycoplasma genitalium. The assignment was carried out by our computer server (http://www.doe-mbi.ucla.edu/people/frsvr/ frsvr. html), which assigns folds to amino acid sequences by comparing sequence-derived predictions with known structures. Of the total of 468 protein ORFs, 103 (22%) can be assigned a known protein fold with high confidence, as cross-validated with tests on known structures. Of these sequences, 75 (16%) show enough sequence similarity to proteins of known structure that they can also be detected by traditional sequence-sequence comparison methods. That is, the difference of 28 sequences (6%) are assignable by the sequence-structure method of the server but not by current sequence-sequence methods. Of the remaining 78% of sequences in the genome, 18% belong to membrane proteins and the remaining 60% cannot be assigned either because these sequences correspond to no presently known fold or because of insensitivity of the method. At the current rate of determination of new folds by x-ray and NMR methods, extrapolation suggests that folds will be assigned to most soluble proteins in the next decade.

Entities:  

Mesh:

Substances:

Year:  1997        PMID: 9342339      PMCID: PMC23659          DOI: 10.1073/pnas.94.22.11929

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  18 in total

1.  The alpha/beta hydrolase fold.

Authors:  D L Ollis; E Cheah; M Cygler; B Dijkstra; F Frolow; S M Franken; M Harel; S J Remington; I Silman; J Schrag
Journal:  Protein Eng       Date:  1992-04

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  PROSITE: a dictionary of sites and patterns in proteins.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  1992-05-11       Impact factor: 16.971

4.  Exhaustive matching of the entire protein sequence database.

Authors:  G H Gonnet; M A Cohen; S A Benner
Journal:  Science       Date:  1992-06-05       Impact factor: 47.728

5.  A method to identify protein sequences that fold into a known three-dimensional structure.

Authors:  J U Bowie; R Lüthy; D Eisenberg
Journal:  Science       Date:  1991-07-12       Impact factor: 47.728

6.  Fold assignments for amino acid sequences of the CASP2 experiment.

Authors:  D W Rice; D Fischer; R Weiss; D Eisenberg
Journal:  Proteins       Date:  1997

7.  The Protein Data Bank: a computer-based archival file for macromolecular structures.

Authors:  F C Bernstein; T F Koetzle; G J Williams; E F Meyer; M D Brice; J R Rodgers; O Kennard; T Shimanouchi; M Tasumi
Journal:  J Mol Biol       Date:  1977-05-25       Impact factor: 5.469

8.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

9.  Prediction of protein secondary structure at better than 70% accuracy.

Authors:  B Rost; C Sander
Journal:  J Mol Biol       Date:  1993-07-20       Impact factor: 5.469

10.  Analysis of membrane and surface protein sequences with the hydrophobic moment plot.

Authors:  D Eisenberg; E Schwarz; M Komaromy; R Wall
Journal:  J Mol Biol       Date:  1984-10-15       Impact factor: 5.469

View more
  25 in total

1.  MODBASE, a database of annotated comparative protein structure models.

Authors:  R Sánchez; U Pieper; N Mirković; P I de Bakker; E Wittenstein; A Sali
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  LiveBench-1: continuous benchmarking of protein structure prediction servers.

Authors:  J M Bujnicki; A Elofsson; D Fischer; L Rychlewski
Journal:  Protein Sci       Date:  2001-02       Impact factor: 6.725

3.  Gene content phylogeny of herpesviruses.

Authors:  M G Montague; C A Hutchison
Journal:  Proc Natl Acad Sci U S A       Date:  2000-05-09       Impact factor: 11.205

4.  Genome analysis: Assigning protein coding regions to three-dimensional structures.

Authors:  A A Salamov; M Suwa; C A Orengo; M B Swindells
Journal:  Protein Sci       Date:  1999-04       Impact factor: 6.725

5.  MODBASE, a database of annotated comparative protein structure models.

Authors:  Ursula Pieper; Narayanan Eswar; Ashley C Stuart; Valentin A Ilyin; Andrej Sali
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

6.  Ab initio protein structure prediction on a genomic scale: application to the Mycoplasma genitalium genome.

Authors:  Daisuke Kihara; Yang Zhang; Hui Lu; Andrzej Kolinski; Jeffrey Skolnick
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-16       Impact factor: 11.205

7.  Feasibility in the inverse protein folding protocol.

Authors:  M Ota; K Nishikawa
Journal:  Protein Sci       Date:  1999-05       Impact factor: 6.725

8.  Proteomics of Mycoplasma genitalium: identification and characterization of unannotated and atypical proteins in a small model genome.

Authors:  S Balasubramanian; T Schneider; M Gerstein; L Regan
Journal:  Nucleic Acids Res       Date:  2000-08-15       Impact factor: 16.971

9.  Refinement of homology-based protein structures by molecular dynamics simulation techniques.

Authors:  Hao Fan; Alan E Mark
Journal:  Protein Sci       Date:  2004-01       Impact factor: 6.725

10.  Automated structure prediction of weakly homologous proteins on a genomic scale.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  Proc Natl Acad Sci U S A       Date:  2004-05-04       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.