Literature DB >> 12421749

Structural characterization of the human proteome.

Arne Müller1, Robert M MacCallum, Michael J E Sternberg.   

Abstract

This paper reports an analysis of the encoded proteins (the proteome) of the genomes of human, fly, worm, yeast, and representatives of bacteria and archaea in terms of the three-dimensional structures of their globular domains together with a general sequence-based study. We show that 39% of the human proteome can be assigned to known structures. We estimate that for 77% of the proteome, there is some functional annotation, but only 26% of the proteome can be assigned to standard sequence motifs that characterize function. Of the human protein sequences, 13% are transmembrane proteins, but only 3% of the residues in the proteome form membrane-spanning regions. There are substantial differences in the composition of globular domains of transmembrane proteins between the proteomes we have analyzed. Commonly occurring structural superfamilies are identified within the proteome. The frequencies of these superfamilies enable us to estimate that 98% of the human proteome evolved by domain duplication, with four of the 10 most duplicated superfamilies specific for multicellular organisms. The zinc-finger superfamily is massively duplicated in human compared to fly and worm, and occurrence of domains in repeats is more common in metazoa than in single cellular organisms. Structural superfamilies over- and underrepresented in human disease genes have been identified. Data and results can be downloaded and analyzed via web-based applications at http://www.sbg.bio.ic.ac.uk.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 12421749      PMCID: PMC187559          DOI: 10.1101/gr.221202

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  42 in total

1.  SCOP: a structural classification of proteins database.

Authors:  L Lo Conte; B Ailey; T J Hubbard; S E Brenner; A G Murzin; C Chothia
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  The Pfam protein families database.

Authors:  A Bateman; E Birney; R Durbin; S R Eddy; K L Howe; E L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

4.  Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores.

Authors:  C A Wilson; J Kreychman; M Gerstein
Journal:  J Mol Biol       Date:  2000-03-17       Impact factor: 5.469

5.  IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices.

Authors:  A A Schäffer; Y I Wolf; C P Ponting; E V Koonin; L Aravind; S F Altschul
Journal:  Bioinformatics       Date:  1999-12       Impact factor: 6.937

6.  OMIM passes the 1,000-disease-gene mark.

Authors:  S E Antonarakis; V A McKusick
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

7.  Twilight zone of protein sequence alignments.

Authors:  B Rost
Journal:  Protein Eng       Date:  1999-02

8.  How representative are the known structures of the proteins in a complete genome? A comprehensive structural census.

Authors:  M Gerstein
Journal:  Fold Des       Date:  1998

Review 9.  Profile hidden Markov models.

Authors:  S R Eddy
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

10.  Model building by comparison at CASP3: using expert knowledge and computer automation.

Authors:  P A Bates; M J Sternberg
Journal:  Proteins       Date:  1999
View more
  29 in total

1.  3D-GENOMICS: a database to compare structural and functional annotations of proteins between sequenced genomes.

Authors:  Keiran Fleming; Arne Müller; Robert M MacCallum; Michael J E Sternberg
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

Review 2.  The proteome: structure, function and evolution.

Authors:  Keiran Fleming; Lawrence A Kelley; Suhail A Islam; Robert M MacCallum; Arne Muller; Florencio Pazos; Michael J E Sternberg
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2006-03-29       Impact factor: 6.237

3.  Mutations in TMEM260 Cause a Pediatric Neurodevelopmental, Cardiac, and Renal Syndrome.

Authors:  Asaf Ta-Shma; Tahir N Khan; Asaf Vivante; Jason R Willer; Pavle Matak; Chaim Jalas; Ben Pode-Shakked; Yishay Salem; Yair Anikster; Friedhelm Hildebrandt; Nicholas Katsanis; Orly Elpeleg; Erica E Davis
Journal:  Am J Hum Genet       Date:  2017-03-16       Impact factor: 11.025

Review 4.  Zinc biochemistry: from a single zinc enzyme to a key element of life.

Authors:  Wolfgang Maret
Journal:  Adv Nutr       Date:  2013-01-01       Impact factor: 8.701

Review 5.  From local structure to a global framework: recognition of protein folds.

Authors:  Agnel Praveen Joseph; Alexandre G de Brevern
Journal:  J R Soc Interface       Date:  2014-04-16       Impact factor: 4.118

6.  Statistical removal of background signals from high-throughput (1)H NMR line-broadening ligand-affinity screens.

Authors:  Bradley Worley; Nicholas J Sisco; Robert Powers
Journal:  J Biomol NMR       Date:  2015-07-09       Impact factor: 2.835

7.  Dynamic interaction between Arf GAP and PH domains of ASAP1 in the regulation of GAP activity.

Authors:  Ruibai Luo; Lisa M Miller Jenkins; Paul A Randazzo; James Gruschus
Journal:  Cell Signal       Date:  2008-07-11       Impact factor: 4.315

8.  Representative transcript sets for evaluating a translational initiation sites predictor.

Authors:  Jia Zeng; Reda Alhajj; Douglas J Demetrick
Journal:  BMC Bioinformatics       Date:  2009-07-02       Impact factor: 3.169

9.  Stratification of co-evolving genomic groups using ranked phylogenetic profiles.

Authors:  Shiri Freilich; Leon Goldovsky; Assaf Gottlieb; Eric Blanc; Sophia Tsoka; Christos A Ouzounis
Journal:  BMC Bioinformatics       Date:  2009-10-27       Impact factor: 3.169

10.  Protein domain organisation: adding order.

Authors:  Sarah K Kummerfeld; Sarah A Teichmann
Journal:  BMC Bioinformatics       Date:  2009-01-29       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.