Literature DB >> 18047271

Analyzing large-scale proteomics projects with latent semantic indexing.

Sebastian Klie1, Lennart Martens, Juan Antonio Vizcaíno, Richard Côté, Phil Jones, Rolf Apweiler, Alexander Hinneburg, Henning Hermjakob.   

Abstract

Since the advent of public data repositories for proteomics data, readily accessible results from high-throughput experiments have been accumulating steadily. Several large-scale projects in particular have contributed substantially to the amount of identifications available to the community. Despite the considerable body of information amassed, very few successful analyses have been performed and published on this data, leveling off the ultimate value of these projects far below their potential. A prominent reason published proteomics data is seldom reanalyzed lies in the heterogeneous nature of the original sample collection and the subsequent data recording and processing. To illustrate that at least part of this heterogeneity can be compensated for, we here apply a latent semantic analysis to the data contributed by the Human Proteome Organization's Plasma Proteome Project (HUPO PPP). Interestingly, despite the broad spectrum of instruments and methodologies applied in the HUPO PPP, our analysis reveals several obvious patterns that can be used to formulate concrete recommendations for optimizing proteomics project planning as well as the choice of technologies used in future experiments. It is clear from these results that the analysis of large bodies of publicly available proteomics data by noise-tolerant algorithms such as the latent semantic analysis holds great promise and is currently underexploited.

Entities:  

Mesh:

Year:  2007        PMID: 18047271     DOI: 10.1021/pr070461k

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  13 in total

1.  Computational framework for analysis of prey-prey associations in interaction proteomics identifies novel human protein-protein interactions and networks.

Authors:  Sudipto Saha; Jean-Eudes Dazard; Hua Xu; Rob M Ewing
Journal:  J Proteome Res       Date:  2012-08-21       Impact factor: 4.466

Review 2.  Quality assessment for clinical proteomics.

Authors:  David L Tabb
Journal:  Clin Biochem       Date:  2012-12-12       Impact factor: 3.281

Review 3.  Proteomics data repositories: providing a safe haven for your data and acting as a springboard for further research.

Authors:  Juan Antonio Vizcaíno; Joseph M Foster; Lennart Martens
Journal:  J Proteomics       Date:  2010-07-06       Impact factor: 4.044

4.  A guide to the Proteomics Identifications Database proteomics data repository.

Authors:  Juan Antonio Vizcaíno; Richard Côté; Florian Reisinger; Joseph M Foster; Michael Mueller; Jonathan Rameseder; Henning Hermjakob; Lennart Martens
Journal:  Proteomics       Date:  2009-09       Impact factor: 3.984

5.  Effective use of latent semantic indexing and computational linguistics in biological and biomedical applications.

Authors:  Hongyu Chen; Bronwen Martin; Caitlin M Daimon; Stuart Maudsley
Journal:  Front Physiol       Date:  2013-01-30       Impact factor: 4.566

Review 6.  Making proteomics data accessible and reusable: current state of proteomics databases and repositories.

Authors:  Yasset Perez-Riverol; Emanuele Alpi; Rui Wang; Henning Hermjakob; Juan Antonio Vizcaíno
Journal:  Proteomics       Date:  2015-03       Impact factor: 3.984

7.  A HUPO test sample study reveals common problems in mass spectrometry-based proteomics.

Authors:  Alexander W Bell; Eric W Deutsch; Catherine E Au; Robert E Kearney; Ron Beavis; Salvatore Sechi; Tommy Nilsson; John J M Bergeron
Journal:  Nat Methods       Date:  2009-06       Impact factor: 28.547

8.  The Proteomics Identifications database: 2010 update.

Authors:  Juan Antonio Vizcaíno; Richard Côté; Florian Reisinger; Harald Barsnes; Joseph M Foster; Jonathan Rameseder; Henning Hermjakob; Lennart Martens
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971

9.  The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013.

Authors:  Juan Antonio Vizcaíno; Richard G Côté; Attila Csordas; José A Dianes; Antonio Fabregat; Joseph M Foster; Johannes Griss; Emanuele Alpi; Melih Birim; Javier Contell; Gavin O'Kelly; Andreas Schoenegger; David Ovelleiro; Yasset Pérez-Riverol; Florian Reisinger; Daniel Ríos; Rui Wang; Henning Hermjakob
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

10.  QC metrics from CPTAC raw LC-MS/MS data interpreted through multivariate statistics.

Authors:  Xia Wang; Matthew C Chambers; Lorenzo J Vega-Montoto; David M Bunk; Stephen E Stein; David L Tabb
Journal:  Anal Chem       Date:  2014-02-17       Impact factor: 6.986

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.