Literature DB >> 25014353

Analyzing the first drafts of the human proteome.

Iakes Ezkurdia, Jesús Vázquez, Alfonso Valencia1, Michael Tress1.   

Abstract

This letter analyzes two large-scale proteomics studies published in the same issue of Nature. At the time of the release, both studies were portrayed as draft maps of the human proteome and great advances in the field. As with the initial publication of the human genome, these papers have broad appeal and will no doubt lead to a great deal of further analysis by the scientific community. However, we were intrigued by the number of protein-coding genes detected by the two studies, numbers that far exceeded what has been reported for the multinational Human Proteome Project effort. We carried out a simple quality test on the data using the olfactory receptor family. A high-quality proteomics experiment that does not specifically analyze nasal tissues should not expect to detect many peptides for olfactory receptors. Neither of the studies carried out experiments on nasal tissues, yet we found peptide evidence for more than 100 olfactory receptors in the two studies. These results suggest that the two studies are substantially overestimating the number of protein coding genes they identify. We conclude that the experimental data from these two studies should be used with caution.

Entities:  

Keywords:  Nature; human proteome; olfactory receptors; protein coding genes; proteomics

Mesh:

Substances:

Year:  2014        PMID: 25014353      PMCID: PMC4334283          DOI: 10.1021/pr500572z

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


We read with great interest the recent cover of Nature (The Human Proteome). The issue contains two large-scale proteomics analyses based around publicly available databases, ProteomicsDB and Human Proteome Map. Bernhard Kuster and coworkers[1] describe ProteomicsDB as a “mass-spectrometry-based draft of the human proteome”, while the Human Proteome Map, developed by Akhilesh Pandey and colleagues,[2] offers a “draft map of the human proteome”. The studies have been portrayed as a great advance in the field. As with the initial publication of the human genome, the papers are of broad appeal and will no doubt lead to a great deal of further analysis by the scientific community. We were particularly intrigued by the number of genes detected by the two studies, numbers that far exceed what has been reported for the multinational Human Proteome Project effort.[3] These numbers were reached in part by combining spectra from multiple experiments. Although combining spectra from multiple experiments may increase coverage, the advantage of using very large data sets has been shown to come at the expense of higher false-positive protein rates.[4] Given this, we were concerned about the quality of the peptide identifications in these two studies. Data quality is especially important in large-scale proteomics experiments because researchers cannot carry out individual follow-up studies on peptides identified on a genome-wide scale. We decided to carry out a simple quality test on the data using the olfactory receptor family. Olfactory receptors are seven transmembrane helix receptors that trigger the olfactory signal transduction pathway. These receptors first appeared in vertebrates and have duplicated to such an extent that mammalian species possess many hundreds of these genes. From the point of view of proteomics analysis, this family is highly interesting. The functional specificity of these genes indicates that expression is predominantly limited to a single tissue, although the mouse orthologue of OR51E2 has been convincingly shown to have a function in the kidney,[5] and the Human Protein Atlas records limited RNA evidence of the expression of olfactory receptors outside of the nose (primarily in testes[6]). Olfactory receptors have very little transcript expression and should be particularly difficult to detect in proteomics experiments because they are transmembrane proteins. A high-quality proteomics experiment that does not include a specific analysis of nasal tissues should not expect to detect much evidence of peptide expression for these genes. For example, PeptideAtlas,[7] known for having high stringency criteria, identifies just two discriminating olfactory receptor peptides. As far as we know, neither of the studies carried out experiments on nasal tissues. We found peptide evidence of 108 of these olfactory receptors in the Human Proteome Map database, and another 200 olfactory receptors are recorded in ProteomicsDB. There are at least three reasons for the high numbers of olfactory receptors in the two studies. First, neither experiment properly distinguishes between discriminating and nondiscriminating peptides, so olfactory receptors are identified by peptides that map to more than one gene. (40 of the olfactory receptors detected in the Pandey study were identified solely by nondiscriminatory peptides.) Second, a number of peptides were wrongly identified as having a glutamine to pyroglutamic acid modification in non N-terminal positions. Third, both studies include very many low-quality spectra (Supporting Information). Most of the peptides that map to the remaining 68 olfactory receptors in the Pandey study were identified using poor spectra, and we were unable to find even one peptide that could provide unequivocal evidence of the presence of the protein. A similar in-depth study was not possible with the Kuster data, but we did look at the spectra for many olfactory receptors and found the same pattern. For example, the olfactory receptor with the most evidence in the Kuster study was OR6J1 with eight peptides. Despite what should be overwhelming evidence, the spectral evidence of the existence of each one of these peptides was inconclusive. The results of our analysis show that both studies are substantially overestimating the number of protein coding and noncoding genes they find. We suggest that the experimental data from these two should be used with great caution, and we feel that these two unique draft maps of the human proteome should be put on hold until they can be carefully analyzed.
  7 in total

1.  Towards a knowledge-based Human Protein Atlas.

Authors:  Mathias Uhlen; Per Oksvold; Linn Fagerberg; Emma Lundberg; Kalle Jonasson; Mattias Forsberg; Martin Zwahlen; Caroline Kampf; Kenneth Wester; Sophia Hober; Henrik Wernerus; Lisa Björling; Fredrik Ponten
Journal:  Nat Biotechnol       Date:  2010-12       Impact factor: 54.908

2.  Protein identification false discovery rates for very large proteomics data sets generated by tandem mass spectrometry.

Authors:  Lukas Reiter; Manfred Claassen; Sabine P Schrimpf; Marko Jovanovic; Alexander Schmidt; Joachim M Buhmann; Michael O Hengartner; Ruedi Aebersold
Journal:  Mol Cell Proteomics       Date:  2009-07-16       Impact factor: 5.911

3.  Mass-spectrometry-based draft of the human proteome.

Authors:  Mathias Wilhelm; Judith Schlegl; Hannes Hahne; Amin Moghaddas Gholami; Marcus Lieberenz; Mikhail M Savitski; Emanuel Ziegler; Lars Butzmann; Siegfried Gessulat; Harald Marx; Toby Mathieson; Simone Lemeer; Karsten Schnatbaum; Ulf Reimer; Holger Wenschuh; Martin Mollenhauer; Julia Slotta-Huspenina; Joos-Hendrik Boese; Marcus Bantscheff; Anja Gerstmair; Franz Faerber; Bernhard Kuster
Journal:  Nature       Date:  2014-05-29       Impact factor: 49.962

4.  Metrics for the Human Proteome Project 2013-2014 and strategies for finding missing proteins.

Authors:  Lydie Lane; Amos Bairoch; Ronald C Beavis; Eric W Deutsch; Pascale Gaudet; Emma Lundberg; Gilbert S Omenn
Journal:  J Proteome Res       Date:  2013-12-23       Impact factor: 4.466

5.  Olfactory receptor responding to gut microbiota-derived signals plays a role in renin secretion and blood pressure regulation.

Authors:  Jennifer L Pluznick; Ryan J Protzko; Haykanush Gevorgyan; Zita Peterlin; Arnold Sipos; Jinah Han; Isabelle Brunet; La-Xiang Wan; Federico Rey; Tong Wang; Stuart J Firestein; Masashi Yanagisawa; Jeffrey I Gordon; Anne Eichmann; Janos Peti-Peterdi; Michael J Caplan
Journal:  Proc Natl Acad Sci U S A       Date:  2013-02-11       Impact factor: 11.205

6.  The PeptideAtlas project.

Authors:  Frank Desiere; Eric W Deutsch; Nichole L King; Alexey I Nesvizhskii; Parag Mallick; Jimmy Eng; Sharon Chen; James Eddes; Sandra N Loevenich; Ruedi Aebersold
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

7.  A draft map of the human proteome.

Authors:  Min-Sik Kim; Sneha M Pinto; Derese Getnet; Raja Sekhar Nirujogi; Srikanth S Manda; Raghothama Chaerkady; Anil K Madugundu; Dhanashree S Kelkar; Ruth Isserlin; Shobhit Jain; Joji K Thomas; Babylakshmi Muthusamy; Pamela Leal-Rojas; Praveen Kumar; Nandini A Sahasrabuddhe; Lavanya Balakrishnan; Jayshree Advani; Bijesh George; Santosh Renuse; Lakshmi Dhevi N Selvan; Arun H Patil; Vishalakshi Nanjappa; Aneesha Radhakrishnan; Samarjeet Prasad; Tejaswini Subbannayya; Rajesh Raju; Manish Kumar; Sreelakshmi K Sreenivasamurthy; Arivusudar Marimuthu; Gajanan J Sathe; Sandip Chavan; Keshava K Datta; Yashwanth Subbannayya; Apeksha Sahu; Soujanya D Yelamanchi; Savita Jayaram; Pavithra Rajagopalan; Jyoti Sharma; Krishna R Murthy; Nazia Syed; Renu Goel; Aafaque A Khan; Sartaj Ahmad; Gourav Dey; Keshav Mudgal; Aditi Chatterjee; Tai-Chung Huang; Jun Zhong; Xinyan Wu; Patrick G Shaw; Donald Freed; Muhammad S Zahari; Kanchan K Mukherjee; Subramanian Shankar; Anita Mahadevan; Henry Lam; Christopher J Mitchell; Susarla Krishna Shankar; Parthasarathy Satishchandra; John T Schroeder; Ravi Sirdeshmukh; Anirban Maitra; Steven D Leach; Charles G Drake; Marc K Halushka; T S Keshava Prasad; Ralph H Hruban; Candace L Kerr; Gary D Bader; Christine A Iacobuzio-Donahue; Harsha Gowda; Akhilesh Pandey
Journal:  Nature       Date:  2014-05-29       Impact factor: 49.962

  7 in total
  43 in total

1.  Proteomics: High-protein research.

Authors:  Neil Savage
Journal:  Nature       Date:  2015-11-05       Impact factor: 49.962

2.  Metrics for the Human Proteome Project 2015: Progress on the Human Proteome and Guidelines for High-Confidence Protein Identification.

Authors:  Gilbert S Omenn; Lydie Lane; Emma K Lundberg; Ronald C Beavis; Alexey I Nesvizhskii; Eric W Deutsch
Journal:  J Proteome Res       Date:  2015-07-30       Impact factor: 4.466

Review 3.  Protein biomarkers for subtyping breast cancer and implications for future research.

Authors:  Claudius Mueller; Amanda Haymond; Justin B Davis; Alexa Williams; Virginia Espina
Journal:  Expert Rev Proteomics       Date:  2018-01-03       Impact factor: 3.940

4.  Systematic Errors in Peptide and Protein Identification and Quantification by Modified Peptides.

Authors:  Boris Bogdanow; Henrik Zauber; Matthias Selbach
Journal:  Mol Cell Proteomics       Date:  2016-05-23       Impact factor: 5.911

Review 5.  Mass-spectrometric exploration of proteome structure and function.

Authors:  Ruedi Aebersold; Matthias Mann
Journal:  Nature       Date:  2016-09-15       Impact factor: 49.962

6.  Human Proteome Project Mass Spectrometry Data Interpretation Guidelines 3.0.

Authors:  Eric W Deutsch; Lydie Lane; Christopher M Overall; Nuno Bandeira; Mark S Baker; Charles Pineau; Robert L Moritz; Fernando Corrales; Sandra Orchard; Jennifer E Van Eyk; Young-Ki Paik; Susan T Weintraub; Yves Vandenbrouck; Gilbert S Omenn
Journal:  J Proteome Res       Date:  2019-10-21       Impact factor: 4.466

Review 7.  Advances in the Chromosome-Centric Human Proteome Project: looking to the future.

Authors:  Young-Ki Paik; Gilbert S Omenn; William S Hancock; Lydie Lane; Christopher M Overall
Journal:  Expert Rev Proteomics       Date:  2017-11-10       Impact factor: 3.940

Review 8.  The state of play in higher eukaryote gene annotation.

Authors:  Jonathan M Mudge; Jennifer Harrow
Journal:  Nat Rev Genet       Date:  2016-10-24       Impact factor: 53.242

9.  State of the Human Proteome in 2014/2015 As Viewed through PeptideAtlas: Enhancing Accuracy and Coverage through the AtlasProphet.

Authors:  Eric W Deutsch; Zhi Sun; David Campbell; Ulrike Kusebauch; Caroline S Chu; Luis Mendoza; David Shteynberg; Gilbert S Omenn; Robert L Moritz
Journal:  J Proteome Res       Date:  2015-07-24       Impact factor: 4.466

10.  Devising a Consensus Framework for Validation of Novel Human Coding Loci.

Authors:  Elspeth A Bruford; Lydie Lane; Jennifer Harrow
Journal:  J Proteome Res       Date:  2015-10-07       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.