
A posteriori quality control for the curation and reuse of public proteomics data.

Joseph M Foster, Sven Degroeve, Laurent Gatto, Matthieu Visser, Rui Wang, Johannes Griss, Rolf Apweiler, Lennart Martens.

Abstract

Proteomics is a rapidly expanding field encompassing a multitude of complex techniques and data types. To date, much effort has been devoted to achieving the highest possible coverage of proteomes, with the aim of informing future developments in basic biology as well as in clinical settings. As a result, growing amounts of data have been deposited in publicly available proteomics databases. These data are in turn increasingly reused for orthogonal downstream purposes such as data mining and machine learning. These downstream uses, however, need ways to validate a posteriori whether a particular data set is suitable for the envisioned purpose. Furthermore, the (semi-)automatic curation of repository data depends on analyses that can highlight misannotation and edge conditions in data sets. Such curation is an important prerequisite for efficient proteomics data reuse in the life sciences in general. We therefore present here a selection of quality control metrics and approaches for the a posteriori detection of potential issues encountered in typical proteomics data sets. We illustrate our metrics using publicly available data from the Proteomics Identifications Database (PRIDE), and simultaneously show the usefulness of the large body of PRIDE data as a means of deriving empirical background distributions for relevant metrics.
Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.


Year:  2011        PMID: 21538885     DOI: 10.1002/pmic.201000602

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  8 in total

1.  Improving links between literature and biological data with text mining: a case study with GEO, PDB and MEDLINE.

Authors:  Aurélie Névéol; W John Wilbur; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2012-06-08       Impact factor: 3.451

Review 2.  Making proteomics data accessible and reusable: current state of proteomics databases and repositories.

Authors:  Yasset Perez-Riverol; Emanuele Alpi; Rui Wang; Henning Hermjakob; Juan Antonio Vizcaíno
Journal:  Proteomics       Date:  2015-03       Impact factor: 3.984

Review 3.  Visualization of proteomics data using R and bioconductor.

Authors:  Laurent Gatto; Lisa M Breckels; Thomas Naake; Sebastian Gibb
Journal:  Proteomics       Date:  2015-04       Impact factor: 3.984

4.  qcML: an exchange format for quality control metrics from mass spectrometry experiments.

Authors:  Mathias Walzer; Lucia Espona Pernas; Sara Nasso; Wout Bittremieux; Sven Nahnsen; Pieter Kelchtermans; Peter Pichler; Henk W P van den Toorn; An Staes; Jonathan Vandenbussche; Michael Mazanek; Thomas Taus; Richard A Scheltema; Christian D Kelstrup; Laurent Gatto; Bas van Breukelen; Stephan Aiche; Dirk Valkenborg; Kris Laukens; Kathryn S Lilley; Jesper V Olsen; Albert J R Heck; Karl Mechtler; Ruedi Aebersold; Kris Gevaert; Juan Antonio Vizcaíno; Henning Hermjakob; Oliver Kohlbacher; Lennart Martens
Journal:  Mol Cell Proteomics       Date:  2014-04-23       Impact factor: 5.911

5.  Pride-asap: automatic fragment ion annotation of identified PRIDE spectra.

Authors:  Niels Hulstaert; Florian Reisinger; Jonathan Rameseder; Harald Barsnes; Juan Antonio Vizcaíno; Lennart Martens
Journal:  J Proteomics       Date:  2013-04-17       Impact factor: 4.044

Review 6.  A Golden Age for Working with Public Proteomics Data.

Authors:  Lennart Martens; Juan Antonio Vizcaíno
Journal:  Trends Biochem Sci       Date:  2017-01-22       Impact factor: 13.807

7.  The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013.

Authors:  Juan Antonio Vizcaíno; Richard G Côté; Attila Csordas; José A Dianes; Antonio Fabregat; Joseph M Foster; Johannes Griss; Emanuele Alpi; Melih Birim; Javier Contell; Gavin O'Kelly; Andreas Schoenegger; David Ovelleiro; Yasset Pérez-Riverol; Florian Reisinger; Daniel Ríos; Rui Wang; Henning Hermjakob
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

8.  Public proteomics data: How the field has evolved from sceptical inquiry to the promise of in silico proteomics.

Authors:  Lennart Martens
Journal:  EuPA Open Proteom       Date:  2016-03-28
