Literature DB >> 12117789

Target space for structural genomics revisited.

Jinfeng Liu1, Burkhard Rost.   

Abstract

MOTIVATION: Structural genomics eventually aims at determining structures for all proteins. However, in the beginning experimentalists are likely to focus on globular proteins to achieve a rapid basic coverage of protein sequence space. How many proteins will structural genomics have to target? How many proteins will be excluded since we already have structural information for these or since they are not globular? We have to answer these questions in the context of our target selection for the North-East Structural Genomics Consortium (NESG).
RESULTS: We estimated that structural information is available for about 6-38% of all proteins; 6% if we require high accuracy in comparative modelling, 38% if we are satisfied with having a rough idea about the fold. Excluding all regions that are not globular, we found that structural genomics may have to target about 48% of all proteins. This corresponded to a similar percentage of residues of the entire proteomes (52%). We explored a number of different strategies to cluster protein space in order to find the number of families representing these 48% of structurally unknown proteins. For the subset of all entirely sequenced eukaryotes, we found over 18 000 fragment clusters each of which may be a suitable target for structural genomics. AVAILABILITY: All data are available from the authors, most results are summarized at: http://cubic.bioc.columbia.edu/genomes/RES/2002_bioinformatics/

Mesh:

Substances:

Year:  2002        PMID: 12117789     DOI: 10.1093/bioinformatics/18.7.922

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  19 in total

1.  A novel member of the split betaalphabeta fold: Solution structure of the hypothetical protein YML108W from Saccharomyces cerevisiae.

Authors:  Antonio Pineda-Lucena; Jack C C Liao; John R Cort; Adelinda Yee; Michael A Kennedy; Aled M Edwards; Cheryl H Arrowsmith
Journal:  Protein Sci       Date:  2003-05       Impact factor: 6.725

2.  Coverage of protein sequence space by current structural genomics targets.

Authors:  Nicholas O'Toole; Stéphane Raymond; Miroslaw Cygler
Journal:  J Struct Funct Genomics       Date:  2003

3.  SPINE 2: a system for collaborative structural proteomics within a federated database framework.

Authors:  Chern-Sing Goh; Ning Lan; Nathaniel Echols; Shawn M Douglas; Duncan Milburn; Paul Bertone; Rong Xiao; Li-Chung Ma; Deyou Zheng; Zeba Wunderlich; Tom Acton; Gaetano T Montelione; Mark Gerstein
Journal:  Nucleic Acids Res       Date:  2003-06-01       Impact factor: 16.971

4.  Long membrane helices and short loops predicted less accurately.

Authors:  Chien Peter Chen; Burkhard Rost
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

5.  NORSp: Predictions of long regions without regular secondary structure.

Authors:  Jinfeng Liu; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

6.  Sequence-based prediction of protein domains.

Authors:  Jinfeng Liu; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2004-07-07       Impact factor: 16.971

7.  High-quality homology models derived from NMR and X-ray structures of E. coli proteins YgdK and Suf E suggest that all members of the YgdK/Suf E protein family are enhancers of cysteine desulfurases.

Authors:  Gaohua Liu; Zhaohui Li; Yiwen Chiang; Thomas Acton; Gaetano T Montelione; Diana Murray; Thomas Szyperski
Journal:  Protein Sci       Date:  2005-06       Impact factor: 6.725

8.  PEP: Predictions for Entire Proteomes.

Authors:  Phil Carter; Jinfeng Liu; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

9.  Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space.

Authors:  Russell L Marsden; David Lee; Michael Maibaum; Corin Yeats; Christine A Orengo
Journal:  Nucleic Acids Res       Date:  2006-02-15       Impact factor: 16.971

10.  Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint.

Authors:  Russell L Marsden; Tony A Lewis; Christine A Orengo
Journal:  BMC Bioinformatics       Date:  2007-03-09       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.