Proteomics Standards Initiative Extended FASTA Format.

Pierre-Alain Binz, Jim Shofstahl, Juan Antonio Vizcaíno, Harald Barsnes, Robert J Chalkley, Gerben Menschaert, Emanuele Alpi, Karl Clauser, Jimmy K Eng, Lydie Lane, Sean L Seymour, Luis Francisco Hernández Sánchez, Gerhard Mayer, Martin Eisenacher, Yasset Perez-Riverol, Eugene A Kapp, Luis Mendoza, Peter R Baker, Andrew Collins, Tim Van Den Bossche, Eric W Deutsch.

Abstract

Mass-spectrometry-based proteomics enables the high-throughput identification and quantification of proteins, including sequence variants and post-translational modifications (PTMs), in biological samples. However, most workflows require that such variations be included in the search space used to analyze the data, and doing so remains challenging with most analysis tools. To facilitate the search for known sequence variants and PTMs, the Proteomics Standards Initiative (PSI) has designed and implemented the PSI extended FASTA format (PEFF). PEFF is based on the widely used FASTA format but adds a uniform mechanism for encoding substantially more metadata about the sequence collection as well as individual entries, including support for encoding known sequence variants, PTMs, and proteoforms. The format is very nearly backward compatible: existing FASTA parsers will require few or no changes to read PEFF files as FASTA files, although they will not support any of the extra capabilities of PEFF. PEFF is defined by a full specification document, controlled vocabulary terms, a set of example files, software libraries, and a file validator. Popular software and resources are starting to support PEFF, including the sequence search engine Comet and the knowledge bases neXtProt and UniProtKB. Widespread implementation of PEFF is expected to further enable proteogenomics and top-down proteomics applications by providing a standardized mechanism for encoding protein sequences and their known variations. All the related documentation, including the detailed file format specification and example files, is available at http://www.psidev.info/peff.
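
The near backward compatibility described above can be illustrated with a minimal sketch. It assumes, from our reading of the PEFF specification rather than from this abstract, that file-level metadata sits on lines starting with "#" and that per-entry metadata follows the identifier as "\Key=Value" pairs; the sample entries, accessions, and function names are hypothetical, and the exact syntax should be checked against the specification at http://www.psidev.info/peff.

```python
import re

# Hypothetical PEFF-style text. The layout ('#' header lines, '\Key=Value'
# pairs on the description line) is an assumption to verify against the
# PEFF specification; the entries themselves are made up.
SAMPLE = """\
# PEFF 1.0
# DbName=ExampleDB
# //
>ex:P00001 \\PName=Example protein \\VariantSimple=(3|A)
MKTLLV
AGHE
>ex:P00002 \\PName=Second example
MSSQ
"""


def read_as_fasta(text):
    """Yield (description, sequence) pairs, skipping '#' header lines.

    This is an ordinary FASTA reader plus one extra rule (ignore lines
    that start with '#'), which is the kind of small change the abstract
    alludes to.
    """
    description, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        if line.startswith(">"):
            if description is not None:
                yield description, "".join(chunks)
            description, chunks = line[1:], []
        elif description is not None:
            chunks.append(line)
    if description is not None:
        yield description, "".join(chunks)


def split_description(description):
    """Split 'id \\Key=Value ...' into (id, {key: value}).

    Optional step: a reader that wants the extra metadata can split on
    whitespace followed by a backslash.
    """
    identifier, *pairs = re.split(r"\s+\\", description)
    annotations = dict(pair.split("=", 1) for pair in pairs)
    return identifier, annotations


entries = list(read_as_fasta(SAMPLE))
```

A FASTA-only consumer would use just `read_as_fasta` and treat the whole description as an opaque string; `split_description` shows how the same line could additionally expose annotations such as a variant, without interpreting their values.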

Keywords:  FASTA; PEFF; PSI; Proteomics Standards Initiative; file formats; mass spectrometry; proteogenomics; proteomics; standards

Year:  2019        PMID: 31081335      PMCID: PMC6642660          DOI: 10.1021/acs.jproteome.9b00064

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  8 in total

1.  BpForms and BcForms: a toolkit for concretely describing non-canonical polymers and complexes to facilitate global biochemical networks.

Authors:  Paul F Lang; Yassmine Chebaro; Xiaoyue Zheng; John A P Sekar; Bilal Shaikh; Darren A Natale; Jonathan R Karr
Journal:  Genome Biol       Date:  2020-05-18       Impact factor: 13.583

2.  Extending Comet for Global Amino Acid Variant and Post-Translational Modification Analysis Using the PSI Extended FASTA Format.

Authors:  Jimmy K Eng; Eric W Deutsch
Journal:  Proteomics       Date:  2020-04-02       Impact factor: 3.984

3.  Assessing Protein Sequence Database Suitability Using De Novo Sequencing.

Authors:  Richard S Johnson; Brian C Searle; Brook L Nunn; Jason M Gilmore; Molly Phillips; Chris T Amemiya; Michelle Heck; Michael J MacCoss
Journal:  Mol Cell Proteomics       Date:  2019-11-15       Impact factor: 5.911

Review 4.  Identification of tumor antigens with immunopeptidomics.

Authors:  Chloe Chong; George Coukos; Michal Bassani-Sternberg
Journal:  Nat Biotechnol       Date:  2021-10-11       Impact factor: 54.908

5.  Proteomics Standards Initiative's ProForma 2.0: Unifying the Encoding of Proteoforms and Peptidoforms.

Authors:  Richard D LeDuc; Eric W Deutsch; Pierre-Alain Binz; Ryan T Fellers; Anthony J Cesnik; Joshua A Klein; Tim Van Den Bossche; Ralf Gabriels; Arshika Yalavarthi; Yasset Perez-Riverol; Jeremy Carver; Wout Bittremieux; Shin Kawano; Benjamin Pullman; Nuno Bandeira; Neil L Kelleher; Paul M Thomas; Juan Antonio Vizcaíno
Journal:  J Proteome Res       Date:  2022-03-15       Impact factor: 4.466

6.  The neXtProt knowledgebase in 2020: data, tools and usability improvements.

Authors:  Monique Zahn-Zabal; Pierre-André Michel; Alain Gateau; Frédéric Nikitin; Mathieu Schaeffer; Estelle Audot; Pascale Gaudet; Paula D Duek; Daniel Teixeira; Valentine Rech de Laval; Kasun Samarasinghe; Amos Bairoch; Lydie Lane
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

7.  UniProt: the universal protein knowledgebase in 2021.

Authors: 
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

8.  Bioinformatic Prediction and Characterization of Proteins in Porphyra dentata by Shotgun Proteomics.

Authors:  Mingchang Yang; Lizhen Ma; Xianqing Yang; Laihao Li; Shengjun Chen; Bo Qi; Yueqi Wang; Chunsheng Li; Shaoling Yang; Yongqiang Zhao
Journal:  Front Nutr       Date:  2022-07-07