Literature DB >> 10706641

Selecting protein targets for structural genomics of Pyrobaculum aerophilum: validating automated fold assignment methods by using binary hypothesis testing.

P Mallick1, K E Goodwill, S Fitz-Gibbon, J H Miller, D Eisenberg.   

Abstract

Three-dimensional protein folds were assigned to all ORFs of the recently sequenced genome of the hyperthermophilic archaeon Pyrobaculum aerophilum. Binary hypothesis testing was used to estimate a confidence level for each assignment. A separate test was conducted to assign a probability for whether each sequence has a novel fold-i.e., one that is not yet represented in the experimental database of known structures. Of the 2,130 predicted nontransmembrane proteins in this organism, 916 matched a fold at a cumulative 90% confidence level, and 245 could be assigned at a 99% confidence level. Likewise, 286 proteins were predicted to have a previously unobserved fold with a 90% confidence level, and 14 at a 99% confidence level. These statistically based tools are combined with homology searches against the Online Mendelian Inheritance in Man (OMIM) human genetics database and other protein databases for the selection of attractive targets for crystallographic or NMR structure determination. Results of these studies have been collated and placed at http://www.doe-mbi.ucla.edu/people/parag/P A_HOME/, the University of California, Los Angeles-Department of Energy Pyrobaculum aerophilum web site.

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 10706641      PMCID: PMC15949          DOI: 10.1073/pnas.050589297

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  28 in total

1.  Exhaustive matching of the entire protein sequence database.

Authors:  G H Gonnet; M A Cohen; S A Benner
Journal:  Science       Date:  1992-06-05       Impact factor: 47.728

2.  SCOP: a structural classification of proteins database for the investigation of sequences and structures.

Authors:  A G Murzin; S E Brenner; T Hubbard; C Chothia
Journal:  J Mol Biol       Date:  1995-04-07       Impact factor: 5.469

3.  PHD--an automatic mail server for protein secondary structure prediction.

Authors:  B Rost; C Sander; R Schneider
Journal:  Comput Appl Biosci       Date:  1994-02

4.  Rapid and accurate estimates of statistical significance for sequence data base searches.

Authors:  M S Waterman; M Vingron
Journal:  Proc Natl Acad Sci U S A       Date:  1994-05-24       Impact factor: 11.205

5.  Dali: a network tool for protein structure comparison.

Authors:  L Holm; C Sander
Journal:  Trends Biochem Sci       Date:  1995-11       Impact factor: 13.807

6.  The FSSP database: fold classification based on structure-structure alignment of proteins.

Authors:  L Holm; C Sander
Journal:  Nucleic Acids Res       Date:  1996-01-01       Impact factor: 16.971

7.  Identification of protein coding regions by database similarity search.

Authors:  W Gish; D J States
Journal:  Nat Genet       Date:  1993-03       Impact factor: 38.330

8.  Enlarged representative set of protein structures.

Authors:  U Hobohm; C Sander
Journal:  Protein Sci       Date:  1994-03       Impact factor: 6.725

9.  Three-dimensional structure of myosin subfragment-1: a molecular motor.

Authors:  I Rayment; W R Rypniewski; K Schmidt-Bäse; R Smith; D R Tomchick; M M Benning; D A Winkelmann; G Wesenberg; H M Holden
Journal:  Science       Date:  1993-07-02       Impact factor: 47.728

10.  Crystal structure of the heterodimeric bZIP transcription factor c-Fos-c-Jun bound to DNA.

Authors:  J N Glover; S C Harrison
Journal:  Nature       Date:  1995-01-19       Impact factor: 49.962

View more
  11 in total

1.  Automated search of natively folded protein fragments for high-throughput structure determination in structural genomics.

Authors:  Y Kuroda; K Tani; Y Matsuo; S Yokoyama
Journal:  Protein Sci       Date:  2000-12       Impact factor: 6.725

2.  Links from genome proteins to known 3-D structures.

Authors:  Y Wang; S Bryant; R Tatusov; T Tatusova
Journal:  Genome Res       Date:  2000-10       Impact factor: 9.043

3.  PFIT and PFRIT: bioinformatic algorithms for detecting glycosidase function from structure and sequence.

Authors:  Gary Kleiger; Ekaterina M Panina; Parag Mallick; David Eisenberg
Journal:  Protein Sci       Date:  2004-01       Impact factor: 6.725

4.  Crystal structure of a major secreted protein of Mycobacterium tuberculosis-MPT63 at 1.5-A resolution.

Authors:  Celia W Goulding; Angineh Parseghian; Michael R Sawaya; Duilio Cascio; Marcin I Apostol; Maria Laura Gennaro; David Eisenberg
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

Review 5.  The impact of extremophiles on structural genomics (and vice versa).

Authors:  Francis E Jenney; Michael W W Adams
Journal:  Extremophiles       Date:  2007-06-13       Impact factor: 2.395

6.  Interactions of peptide mimics of hyaluronic acid with the receptor for hyaluronan mediated motility (RHAMM).

Authors:  Michael R Ziebell; Glenn D Prestwich
Journal:  J Comput Aided Mol Des       Date:  2004-10       Impact factor: 3.686

7.  Genomic evidence that the intracellular proteins of archaeal microbes contain disulfide bonds.

Authors:  Parag Mallick; Daniel R Boutz; David Eisenberg; Todd O Yeates
Journal:  Proc Natl Acad Sci U S A       Date:  2002-07-09       Impact factor: 11.205

8.  GDAP: a web tool for genome-wide protein disulfide bond prediction.

Authors:  Brian D O'Connor; Todd O Yeates
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

9.  Identification of putative domain linkers by a neural network - application to a large sequence database.

Authors:  Satoshi Miyazaki; Yutaka Kuroda; Shigeyuki Yokoyama
Journal:  BMC Bioinformatics       Date:  2006-06-27       Impact factor: 3.169

10.  Fishing with (Proto)Net-a principled approach to protein target selection.

Authors:  Michal Linial
Journal:  Comp Funct Genomics       Date:  2003
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.