Toward a Sample Metadata Standard in Public Proteomics Repositories.

Yasset Perez-Riverol

Abstract

Metadata is essential in proteomics data repositories and is crucial for interpreting and reanalyzing the deposited data sets. For every proteomics data set, we should capture at least three levels of metadata: (i) the data set description, (ii) the information relating samples to data files, and (iii) standard data file formats (e.g., mzIdentML, mzML, or mzTab). While the data set description and standard data file formats are supported by all ProteomeXchange partners, the information relating samples to data files is mostly missing. Recently, members of the European Bioinformatics Community for Mass Spectrometry (EuBIC) have created an open-source project called Sample to Data file format for Proteomics (https://github.com/bigbio/proteomics-metadata-standard/) to enable the standardization of sample metadata of public proteomics data sets. Here, the project is presented to the proteomics community, and we call for contributors, including researchers, journals, and consortia, to provide feedback about the format. We believe this work will improve reproducibility and facilitate the development of new tools dedicated to proteomics data analysis.
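
To make the second metadata level concrete, the following is a minimal sketch of what a sample-to-data-file mapping could look like and how a reanalysis tool might read it. The tab-delimited layout, the column names (sample, organism, data_file), and the file names are illustrative assumptions and are not taken from the abstract or from the project's specification.

```python
import csv
import io
from collections import defaultdict

# Hypothetical tab-delimited sample-to-data-file table. The column names
# and values are illustrative assumptions, not the project's actual format.
EXAMPLE_TABLE = (
    "sample\torganism\tdata_file\n"
    "sample_1\tHomo sapiens\trun_01.mzML\n"
    "sample_1\tHomo sapiens\trun_02.mzML\n"
    "sample_2\tHomo sapiens\trun_03.mzML\n"
)


def files_by_sample(table_text):
    """Group data file names by the sample they were acquired from."""
    reader = csv.DictReader(io.StringIO(table_text), delimiter="\t")
    mapping = defaultdict(list)
    for row in reader:
        mapping[row["sample"]].append(row["data_file"])
    return dict(mapping)


sample_files = files_by_sample(EXAMPLE_TABLE)
print(sample_files)
```

With such a table available alongside the data set description and the standard data files, a tool can recover which files belong to which sample without consulting the original publication.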

Keywords:  bioinformatics; data reanalysis; data repositories; experimental design; multiomics; open data; proteomeXchange; proteomics; reproducibility; sample metadata; standards

Year:  2020        PMID: 32786688      PMCID: PMC7116434          DOI: 10.1021/acs.jproteome.0c00376

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


