Literature DB >> 18629007

Fishing with (Proto)Net-a principled approach to protein target selection.

Michal Linial1.   

Abstract

Structural genomics strives to represent the entire protein space. The first step towards achieving this goal is by rationally selecting proteins whose structures have not been determined, but that represent an as yet unknown structural superfamily or fold. Once such a structure is solved, it can be used as a template for modelling homologous proteins. This will aid in unveiling the structural diversity of the protein space. Currently, no reliable method for accurate 3D structural prediction is available when a sequence or a structure homologue is not available. Here we present a systematic methodology for selecting target proteins whose structure is likely to adopt a new, as yet unknown superfamily or fold. Our method takes advantage of a global classification of the sequence space as presented by ProtoNet-3D, which is a hierarchical agglomerative clustering of the proteins of interest (the proteins in Swiss-Prot) along with all solved structures (taken from the PDB). By navigating in the scaffold of ProtoNet-3D, we yield a prioritized list of proteins that are not yet structurally solved, along with the probability of each of the proteins belonging to a new superfamily or fold. The sorted list has been self-validated against real structural data that was not available when the predictions were made. The practical application of using our computational-statistical method to determine novel superfamilies for structural genomics projects is also discussed.

Year:  2003        PMID: 18629007      PMCID: PMC2447289          DOI: 10.1002/cfg.328

Source DB:  PubMed          Journal:  Comp Funct Genomics        ISSN: 1531-6912


  32 in total

1.  A rapid classification protocol for the CATH Domain Database to support structural genomics.

Authors:  F M Pearl; N Martin; J E Bray; D W Buchan; A P Harrison; D Lee; G A Reeves; A J Shepherd; I Sillitoe; A E Todd; J M Thornton; C A Orengo
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

Review 2.  Protein fold recognition using sequence profiles and its application in structural genomics.

Authors:  E V Koonin; Y I Wolf; L Aravind
Journal:  Adv Protein Chem       Date:  2000

Review 3.  Macromolecular electron microscopy in the era of structural genomics.

Authors:  W Baumeister; A C Steven
Journal:  Trends Biochem Sci       Date:  2000-12       Impact factor: 13.807

Review 4.  Target selection for structural genomics.

Authors:  S E Brenner
Journal:  Nat Struct Biol       Date:  2000-11

5.  The Protein Data Bank and the challenge of structural genomics.

Authors:  H M Berman; T N Bhat; P E Bourne; Z Feng; G Gilliland; H Weissig; J Westbrook
Journal:  Nat Struct Biol       Date:  2000-11

6.  An overview of structural genomics.

Authors:  S K Burley
Journal:  Nat Struct Biol       Date:  2000-11

Review 7.  Methodologies for target selection in structural genomics.

Authors:  M Linial; G Yona
Journal:  Prog Biophys Mol Biol       Date:  2000       Impact factor: 3.667

Review 8.  Towards a covering set of protein family profiles.

Authors:  A Heger; L Holm
Journal:  Prog Biophys Mol Biol       Date:  2000       Impact factor: 3.667

Review 9.  Comparative protein structure modeling of genes and genomes.

Authors:  M A Martí-Renom; A C Stuart; A Fiser; R Sánchez; F Melo; A Sali
Journal:  Annu Rev Biophys Biomol Struct       Date:  2000

10.  Proteomics of Mycoplasma genitalium: identification and characterization of unannotated and atypical proteins in a small model genome.

Authors:  S Balasubramanian; T Schneider; M Gerstein; L Regan
Journal:  Nucleic Acids Res       Date:  2000-08-15       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.