Literature DB >> 11292848

PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information.

J Qian1, B Stenger, C A Wilson, J Lin, R Jansen, S A Teichmann, J Park, W G Krebs, H Yu, V Alexandrov, N Echols, M Gerstein.   

Abstract

As the number of protein folds is quite limited, a mode of analysis that will be increasingly common in the future, especially with the advent of structural genomics, is to survey and re-survey the finite parts list of folds from an expanding number of perspectives. We have developed a new resource, called PartsList, that lets one dynamically perform these comparative fold surveys. It is available on the web at http://bioinfo.mbb.yale.edu/partslist and http://www.partslist.org. The system is based on the existing fold classifications and functions as a form of companion annotation for them, providing 'global views' of many already completed fold surveys. The central idea in the system is that of comparison through ranking; PartsList will rank the approximately 420 folds based on more than 180 attributes. These include: (i) occurrence in a number of completely sequenced genomes (e.g. it will show the most common folds in the worm versus yeast); (ii) occurrence in the structure databank (e.g. most common folds in the PDB); (iii) both absolute and relative gene expression information (e.g. most changing folds in expression over the cell cycle); (iv) protein-protein interactions, based on experimental data in yeast and comprehensive PDB surveys (e.g. most interacting fold); (v) sensitivity to inserted transposons; (vi) the number of functions associated with the fold (e.g. most multi-functional folds); (vii) amino acid composition (e.g. most Cys-rich folds); (viii) protein motions (e.g. most mobile folds); and (ix) the level of similarity based on a comprehensive set of structural alignments (e.g. most structurally variable folds). The integration of whole-genome expression and protein-protein interaction data with structural information is a particularly novel feature of our system. We provide three ways of visualizing the rankings: a profiler emphasizing the progression of high and low ranks across many pre-selected attributes, a dynamic comparer for custom comparisons and a numerical rankings correlator. These allow one to directly compare very different attributes of a fold (e.g. expression level, genome occurrence and maximum motion) in the uniform numerical format of ranks. This uniform framework, in turn, highlights the way that the frequency of many of the attributes falls off with approximate power-law behavior (i.e. according to V(-b), for attribute value V and constant exponent b), with a few folds having large values and most having small values.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11292848      PMCID: PMC31319          DOI: 10.1093/nar/29.8.1750

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  71 in total

1.  DIP: the database of interacting proteins.

Authors:  I Xenarios; D W Rice; L Salwinski; M K Baron; E M Marcotte; D Eisenberg
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The ASTRAL compendium for protein structure and sequence analysis.

Authors:  S E Brenner; P Koehl; M Levitt
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  MMDB: 3D structure data in Entrez.

Authors:  Y Wang; K J Addess; L Geer; T Madej; A Marchler-Bauer; D Zimmerman; S H Bryant
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

4.  Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins.

Authors:  A Bateman; E Birney; R Durbin; S R Eddy; R D Finn; E L Sonnhammer
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

5.  Patterns of protein-fold usage in eight microbial genomes: a comprehensive structural census.

Authors:  M Gerstein
Journal:  Proteins       Date:  1998-12-01

6.  The FlyBase database of the Drosophila Genome Projects and community literature.

Authors: 
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

Review 7.  Iterated profile searches with PSI-BLAST--a tool for discovery in protein databases.

Authors:  S F Altschul; E V Koonin
Journal:  Trends Biochem Sci       Date:  1998-11       Impact factor: 13.807

8.  Structural assignments to the Mycoplasma genitalium proteins show extensive gene duplications and domain rearrangements.

Authors:  S A Teichmann; J Park; C Chothia
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

9.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors:  P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal:  Mol Biol Cell       Date:  1998-12       Impact factor: 4.138

Review 10.  Genome sequence of the nematode C. elegans: a platform for investigating biology.

Authors: 
Journal:  Science       Date:  1998-12-11       Impact factor: 47.728

View more
  16 in total

1.  SPINE: an integrated tracking database and data mining approach for identifying feasible targets in high-throughput structural proteomics.

Authors:  P Bertone; Y Kluger; N Lan; D Zheng; D Christendat; A Yee; A M Edwards; C H Arrowsmith; G T Montelione; M Gerstein
Journal:  Nucleic Acids Res       Date:  2001-07-01       Impact factor: 16.971

2.  SCOP database in 2002: refinements accommodate structural genomics.

Authors:  Loredana Lo Conte; Steven E Brenner; Tim J P Hubbard; Cyrus Chothia; Alexey G Murzin
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

3.  Annotation transfer for genomics: measuring functional divergence in multi-domain proteins.

Authors:  H Hegyi; M Gerstein
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

4.  Comprehensive analysis of amino acid and nucleotide composition in eukaryotic genomes, comparing genes and pseudogenes.

Authors:  Nathaniel Echols; Paul Harrison; Suganthi Balasubramanian; Nicholas M Luscombe; Paul Bertone; Zhaolei Zhang; Mark Gerstein
Journal:  Nucleic Acids Res       Date:  2002-06-01       Impact factor: 16.971

5.  Structural characterization of the human proteome.

Authors:  Arne Müller; Robert M MacCallum; Michael J E Sternberg
Journal:  Genome Res       Date:  2002-11       Impact factor: 9.043

6.  GeneCensus: genome comparisons in terms of metabolic pathway activity and protein family sharing.

Authors:  J Lin; J Qian; D Greenbaum; P Bertone; R Das; N Echols; A Senes; B Stenger; M Gerstein
Journal:  Nucleic Acids Res       Date:  2002-10-15       Impact factor: 16.971

7.  SPINE 2: a system for collaborative structural proteomics within a federated database framework.

Authors:  Chern-Sing Goh; Ning Lan; Nathaniel Echols; Shawn M Douglas; Duncan Milburn; Paul Bertone; Rong Xiao; Li-Chung Ma; Deyou Zheng; Zeba Wunderlich; Tom Acton; Gaetano T Montelione; Mark Gerstein
Journal:  Nucleic Acids Res       Date:  2003-06-01       Impact factor: 16.971

8.  MolMovDB: analysis and visualization of conformational change and structural flexibility.

Authors:  Nathaniel Echols; Duncan Milburn; Mark Gerstein
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

9.  Integration of genomic datasets to predict protein complexes in yeast.

Authors:  Ronald Jansen; Ning Lan; Jiang Qian; Mark Gerstein
Journal:  J Struct Funct Genomics       Date:  2002

10.  Wavelet-based functional clustering for patterns of high-dimensional dynamic gene expression.

Authors:  Bong-Rae Kim; Timothy McMurry; Wei Zhao; Rongling Wu; Arthur Berg
Journal:  J Comput Biol       Date:  2010-08       Impact factor: 1.479

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.