Literature DB >> 17154423

Strategies for high-throughput comparative modeling: applications to leverage analysis in structural genomics and protein family organization.

Nebojsa Mirkovic1, Zhaohui Li, Andrew Parnassa, Diana Murray.   

Abstract

The technological breakthroughs in structural genomics were designed to facilitate the solution of a sufficient number of structures, so that as many protein sequences as possible can be structurally characterized with the aid of comparative modeling. The leverage of a solved structure is the number and quality of the models that can be produced using the structure as a template for modeling and may be viewed as the "currency" with which the success of a structural genomics endeavor can be measured. Moreover, the models obtained in this way should be valuable to all biologists. To this end, at the Northeast Structural Genomics Consortium (NESG), a modular computational pipeline for automated high-throughput leverage analysis was devised and used to assess the leverage of the 186 unique NESG structures solved during the first phase of the Protein Structure Initiative (January 2000 to July 2005). Here, the results of this analysis are presented. The number of sequences in the nonredundant protein sequence database covered by quality models produced by the pipeline is approximately 39,000, so that the average leverage is approximately 210 models per structure. Interestingly, only 7900 of these models fulfill the stringent modeling criterion of being at least 30% sequence-identical to the corresponding NESG structures. This study shows how high-throughput modeling increases the efficiency of structure determination efforts by providing enhanced coverage of protein structure space. In addition, the approach is useful in refining the boundaries of structural domains within larger protein sequences, subclassifying sequence diverse protein families, and defining structure-based strategies specific to a particular family. (c) 2006 Wiley-Liss, Inc.

Mesh:

Substances:

Year:  2007        PMID: 17154423     DOI: 10.1002/prot.21191

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  13 in total

1.  Internal organization of large protein families: relationship between the sequence, structure, and function-based clustering.

Authors:  Xiao-Hui Cai; Lukasz Jaroszewski; John Wooley; Adam Godzik
Journal:  Proteins       Date:  2011-05-31

2.  Toward prediction of functional protein pockets using blind docking and pocket search algorithms.

Authors:  Csaba Hetényi; David van der Spoel
Journal:  Protein Sci       Date:  2011-03-30       Impact factor: 6.725

3.  High-throughput computational structure-based characterization of protein families: START domains and implications for structural genomics.

Authors:  Hunjoong Lee; Zhaohui Li; Antonina Silkov; Markus Fischer; Donald Petrey; Barry Honig; Diana Murray
Journal:  J Struct Funct Genomics       Date:  2010-04-11

4.  Genome-wide structural analysis reveals novel membrane binding properties of AP180 N-terminal homology (ANTH) domains.

Authors:  Antonina Silkov; Youngdae Yoon; Hunjoong Lee; Nikhil Gokhale; Emmanuel Adu-Gyamfi; Robert V Stahelin; Wonhwa Cho; Diana Murray
Journal:  J Biol Chem       Date:  2011-08-02       Impact factor: 5.157

5.  Structure and function of Pseudomonas aeruginosa protein PA1324 (21-170).

Authors:  Kelly A Mercier; John R Cort; Michael A Kennedy; Erin E Lockert; Shuisong Ni; Matthew D Shortridge; Robert Powers
Journal:  Protein Sci       Date:  2009-03       Impact factor: 6.725

6.  Structure-based prediction of protein-protein interactions on a genome-wide scale.

Authors:  Qiangfeng Cliff Zhang; Donald Petrey; Lei Deng; Li Qiang; Yu Shi; Chan Aye Thu; Brygida Bisikirska; Celine Lefebvre; Domenico Accili; Tony Hunter; Tom Maniatis; Andrea Califano; Barry Honig
Journal:  Nature       Date:  2012-09-30       Impact factor: 49.962

7.  Identification and structural characterization of FYVE domain-containing proteins of Arabidopsis thaliana.

Authors:  Ewa Wywial; Shaneen M Singh
Journal:  BMC Plant Biol       Date:  2010-08-02       Impact factor: 4.215

8.  Unique opportunities for NMR methods in structural genomics.

Authors:  Gaetano T Montelione; Cheryl Arrowsmith; Mark E Girvin; Michael A Kennedy; John L Markley; Robert Powers; James H Prestegard; Thomas Szyperski
Journal:  J Struct Funct Genomics       Date:  2009-03-15

9.  The Protein Model Portal.

Authors:  Konstantin Arnold; Florian Kiefer; Jürgen Kopp; James N D Battey; Michael Podvinec; John D Westbrook; Helen M Berman; Lorenza Bordoli; Torsten Schwede
Journal:  J Struct Funct Genomics       Date:  2008-11-27

10.  PrePPI: a structure-informed database of protein-protein interactions.

Authors:  Qiangfeng Cliff Zhang; Donald Petrey; José Ignacio Garzón; Lei Deng; Barry Honig
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.