Literature DB >> 14649288

Coverage of protein sequence space by current structural genomics targets.

Nicholas O'Toole1, Stéphane Raymond, Miroslaw Cygler.   

Abstract

By its purest definition the ultimate goal of structural genomics (SG) is the determination of the structures of all proteins encoded by genomes. Most of these will be obtained by homology modeling using the structures of a set of target proteins for experimental determination. Thanks to the open exchange of SG target information, we are able to analyze the sequences of the current target list to evaluate the extent of its coverage of protein sequence space. The presence of homologous sequences currently either in the Protein Data Bank (PDB) or among SG targets has been determined for each of the protein sequences in several organisms. In this way we are able to evaluate the coverage by existing or targeted structural data for the non-membranous parts of entire proteomes. For small bacterial proteomes such as that of H. influenzae almost all proteins have homologous sequences among SG targets or in the PDB. There is significantly lower coverage for more complex organisms, such as C. elegans. We have mapped the SG target list onto the ProtoMap clustering of protein sequences. Clusters occupied by SG targets represent over 150,000 protein sequences, which is approximately 44% of the total protein sequences classified by ProtoMap. The mapping of SG targets also enables an evaluation of the degree of overlap within the target list. An SG target typically occupies a ProtoMap cluster with more than six other homologous targets.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 14649288     DOI: 10.1023/a:1026156025612

Source DB:  PubMed          Journal:  J Struct Funct Genomics        ISSN: 1345-711X


  16 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  The PROSITE database, its status in 2002.

Authors:  Laurent Falquet; Marco Pagni; Philipp Bucher; Nicolas Hulo; Christian J A Sigrist; Kay Hofmann; Amos Bairoch
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

3.  GenBank.

Authors:  Dennis A Benson; Ilene Karsch-Mizrachi; David J Lipman; James Ostell; Barbara A Rapp; David L Wheeler
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

4.  Completeness in structural genomics.

Authors:  D Vitkup; E Melamud; J Moult; C Sander
Journal:  Nat Struct Biol       Date:  2001-06

5.  Target space for structural genomics revisited.

Authors:  Jinfeng Liu; Burkhard Rost
Journal:  Bioinformatics       Date:  2002-07       Impact factor: 6.937

6.  Crystal structure of D-ribose-5-phosphate isomerase (RpiA) from Escherichia coli.

Authors:  Erumbi S Rangarajan; J Sivaraman; Allan Matte; Miroslaw Cygler
Journal:  Proteins       Date:  2002-09-01

7.  Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins.

Authors:  A Bateman; E Birney; R Durbin; S R Eddy; R D Finn; E L Sonnhammer
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

Review 8.  Shining a light on structural genomics.

Authors:  S H Kim
Journal:  Nat Struct Biol       Date:  1998-08

9.  Structural genomics: keystone for a Human Proteome Project.

Authors:  G T Montelione; S Anderson
Journal:  Nat Struct Biol       Date:  1999-01

Review 10.  Structural genomics: beyond the human genome project.

Authors:  S K Burley; S C Almo; J B Bonanno; M Capel; M R Chance; T Gaasterland; D Lin; A Sali; F W Studier; S Swaminathan
Journal:  Nat Genet       Date:  1999-10       Impact factor: 38.330

View more
  6 in total

Review 1.  Structural genomics: keeping up with expanding knowledge of the protein universe.

Authors:  Marek Grabowski; Andrzej Joachimiak; Zbyszek Otwinowski; Wladek Minor
Journal:  Curr Opin Struct Biol       Date:  2007-06-22       Impact factor: 6.809

Review 2.  Chemical genomics: a challenge for de novo drug design.

Authors:  P M Dean
Journal:  Mol Biotechnol       Date:  2007-06-30       Impact factor: 2.695

3.  Coverage of whole proteome by structural genomics observed through protein homology modeling database.

Authors:  Kei Yura; Akihiro Yamaguchi; Mitiko Go
Journal:  J Struct Funct Genomics       Date:  2006-12-05

4.  The utility of geometrical and chemical restraint information extracted from predicted ligand-binding sites in protein structure refinement.

Authors:  Michal Brylinski; Seung Yup Lee; Hongyi Zhou; Jeffrey Skolnick
Journal:  J Struct Biol       Date:  2010-09-17       Impact factor: 2.867

5.  Identification of putative domain linkers by a neural network - application to a large sequence database.

Authors:  Satoshi Miyazaki; Yutaka Kuroda; Shigeyuki Yokoyama
Journal:  BMC Bioinformatics       Date:  2006-06-27       Impact factor: 3.169

6.  Preservation of protein clefts in comparative models.

Authors:  David Piedra; Sergi Lois; Xavier de la Cruz
Journal:  BMC Struct Biol       Date:  2008-01-16
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.