Literature DB >> 15103630

CHOP proteins into structural domain-like fragments.

Jinfeng Liu1, Burkhard Rost.   

Abstract

We developed a method CHOP dissecting proteins into domain-like fragments. The basic idea was to cut proteins beginning from very reliable experimental information (PDB), proceeding to expert annotations of domain-like regions (Pfam-A), and completing through cuts based on termini of known proteins. In this way, CHOP dissected more than two thirds of all proteins from 62 proteomes. Analysis of our structural domain-like fragments revealed four surprising results. First, >70% of all dissected proteins contained more than one fragment. Second, most domains spanned on average over approximately 100 residues. This average was similar for eukaryotic and prokaryotic proteins, and it is also valid-although previously not described-for all proteins in the PDB. Third, single-domain proteins were significant longer than most domains in multidomain proteins. Fourth, three fourths of all domains appeared shorter than 210 residues. We believe that our CHOP fragments constituted an important resource for functional and structural genomics. Nevertheless, our main motivation to develop CHOP was that the single-linkage clustering method failed to adequately group full-length proteins. In contrast, CLUP-the simple clustering scheme CLUP introduced here-succeeded largely to group the CHOP fragments from 62 proteomes such that all members of one cluster shared a basic structural core. CLUP found >63,000 multi- and >118,000 single-member clusters. Although most fragments were restricted to a particular cluster, approximately 24% of the fragments were duplicated in at least two clusters. Our thresholds for grouping two fragments into the same cluster were rather conservative. Nevertheless, our results suggested that structural genomics initiatives have to target >30,000 fragments to at least cover the multimember clusters in 62 proteomes. Copyright 2004 Wiley-Liss, Inc.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15103630     DOI: 10.1002/prot.20095

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  27 in total

1.  CHOP: parsing proteins into structural domains.

Authors:  Jinfeng Liu; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

2.  Predicting transmembrane beta-barrels in proteomes.

Authors:  Henry R Bigelow; Donald S Petrey; Jinfeng Liu; Dariusz Przybylski; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2004-05-11       Impact factor: 16.971

3.  Sequence-based prediction of protein domains.

Authors:  Jinfeng Liu; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2004-07-07       Impact factor: 16.971

4.  Modeling the evolution of protein domain architectures using maximum parsimony.

Authors:  Jessica H Fong; Lewis Y Geer; Anna R Panchenko; Stephen H Bryant
Journal:  J Mol Biol       Date:  2006-11-10       Impact factor: 5.469

5.  Domain mobility in proteins: functional and evolutionary implications.

Authors:  Malay Kumar Basu; Eugenia Poliakov; Igor B Rogozin
Journal:  Brief Bioinform       Date:  2009-01-16       Impact factor: 11.622

6.  DeepDom: Predicting protein domain boundary from sequence alone using stacked bidirectional LSTM.

Authors:  Yuexu Jiang; Duolin Wang; Dong Xu
Journal:  Pac Symp Biocomput       Date:  2019

7.  A protein domain-based interactome network for C. elegans early embryogenesis.

Authors:  Mike Boxem; Zoltan Maliga; Niels Klitgord; Na Li; Irma Lemmens; Miyeko Mana; Lorenzo de Lichtervelde; Joram D Mul; Diederik van de Peut; Maxime Devos; Nicolas Simonis; Muhammed A Yildirim; Murat Cokol; Huey-Ling Kao; Anne-Sophie de Smet; Haidong Wang; Anne-Lore Schlaitz; Tong Hao; Stuart Milstein; Changyu Fan; Mike Tipsword; Kevin Drew; Matilde Galli; Kahn Rhrissorrakrai; David Drechsel; Daphne Koller; Frederick P Roth; Lilia M Iakoucheva; A Keith Dunker; Richard Bonneau; Kristin C Gunsalus; David E Hill; Fabio Piano; Jan Tavernier; Sander van den Heuvel; Anthony A Hyman; Marc Vidal
Journal:  Cell       Date:  2008-08-08       Impact factor: 41.582

8.  Structural genomics target selection for the New York consortium on membrane protein structure.

Authors:  Marco Punta; James Love; Samuel Handelman; John F Hunt; Lawrence Shapiro; Wayne A Hendrickson; Burkhard Rost
Journal:  J Struct Funct Genomics       Date:  2009-10-27

9.  Protein domain boundary predictions: a structural biology perspective.

Authors:  Svetlana Kirillova; Suresh Kumar; Oliviero Carugo
Journal:  Open Biochem J       Date:  2009-01-21

10.  Huntingtin facilitates polycomb repressive complex 2.

Authors:  Ihn Sik Seong; Juliana M Woda; Ji-Joon Song; Alejandro Lloret; Priyanka D Abeyrathne; Caroline J Woo; Gillian Gregory; Jong-Min Lee; Vanessa C Wheeler; Thomas Walz; Robert E Kingston; James F Gusella; Ronald A Conlon; Marcy E MacDonald
Journal:  Hum Mol Genet       Date:  2009-11-23       Impact factor: 6.150

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.