Literature DB >> 30295851

The jPOST environment: an integrated proteomics data repository and database.

Yuki Moriya¹, Shin Kawano¹, Shujiro Okuda², Yu Watanabe², Masaki Matsumoto³, Tomoyo Takami³, Daiki Kobayashi⁴, Yoshinori Yamanouchi^4,5, Norie Araki⁴, Akiyasu C Yoshizawa⁶, Tsuyoshi Tabata^6,7, Mio Iwasaki⁷, Naoyuki Sugiyama⁶, Satoshi Tanaka⁸, Susumu Goto¹, Yasushi Ishihama⁶.

Abstract

Rapid progress is being made in mass spectrometry (MS)-based proteomics, yielding an increasing number of larger datasets with higher quality and higher throughput. To integrate proteomics datasets generated from various projects and institutions, we launched a project named jPOST (Japan ProteOme STandard Repository/Database, https://jpostdb.org/) in 2015. Its proteomics data repository, jPOSTrepo, began operations in 2016 and has accepted more than 10 TB of MS-based proteomics datasets in the past two years. In addition, we have developed a new proteomics database named jPOSTdb in which the published raw datasets in jPOSTrepo are reanalyzed using standardized protocol. jPOSTdb provides viewers showing the frequency of detected post-translational modifications, the co-occurrence of phosphorylation sites on a peptide and peptide sharing among proteoforms. jPOSTdb also provides basic statistical analysis tools to compare proteomics datasets.

Entities: Chemical Disease Species

Mesh：

Substances：
Proteome

Year: 2019 PMID： 30295851 PMCID： PMC6324006 DOI： 10.1093/nar/gky899

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

Proteomics approaches such as mass spectrometry (MS), gel electrophoresis and antibody-based ones are important for identifying proteins with accompanying information such as isoforms, subcellular localizations, tissue specificity, protein interactions, abundances and post-translational modifications (PTMs), many of which cannot be obtained by genomic and transcriptomic approaches. Recent advances in MS technology have significantly improved the coverage, resolution and speed in the measurements and resulted in increasing amounts of larger proteomics data with higher quality and higher throughput. These substantial proteomics data are useful for finding biomarkers in the fields of life and medical sciences, and should be accumulated, published and shared for further analyses (1,2). In fact, proteomics data including MS raw data, peak lists, peptide and protein identification results, and metadata about experiments should be deposited in a public repository when submitting papers describing and using the data, the same as the situation for nucleotide sequence data. In 2016, we launched a proteomics data repository in Japan, named jPOSTrepo (https://repository.jpostdb.org/) (3), which conforms to the international standards provided by the ProteomeXchange (PX) consortium (4). Currently, jPOSTrepo accepts MS-based proteomics data from all over the world as an official member of the PX consortium, together with PRIDE in Europe (5), MassIVE (https://massive.ucsd.edu/), PASSEL (6) and Panorama Public (7) in U.S.A. and iProX (http://www.iprox.org/) in China. jPOSTrepo has unique features such as a high-speed file upload system and user-friendly interface with open-source libraries; all of the submission operations can be completed within a web browser (3). Meanwhile, since 2010, the Human Proteome Project of the Human Proteome Organization has been constructing a human proteome database to integrate and organise information on all human proteins and their functions, such as isoforms, temporal variations, localizations, and modifications, although this has yet to be completed (8,9). Although the Human Proteome Map (10) and the ProteomicsDB (11,12) were published as MS-based human proteome databases in 2014, it has been pointed out that they contained many false positives because they detected proteins by combining spectra from multiple experiments containing many low-quality spectra (13). As such, we should develop a high-quality proteome database to obtain highly accurate information of proteoforms with few false positives from MS raw files. Here, we present the current status of the jPOST environment; updates of jPOSTrepo in the past two years and a newly developed proteome database, named jPOSTdb (https://globe.jpostdb.org/), containing standardized proteomics data reanalyzed from the published raw data in the jPOSTrepo with a common protocol and under the same confidence criteria (14). These data are also well curated with experimental metadata based on controlled selections of biological ontologies. jPOSTdb provides graphical data visualization interfaces for identified proteins, including protein annotations, identified peptide mapping, frequencies of detected PTMs, co-occurrence of phosphorylation sites on a peptide and peptide sharing among isoforms and among proteins. To the best of our knowledge, the interface for visualizing quantitative data and co-occurrence information of PTMs and peptide sharing is a unique feature of jPOSTdb.

CURRENT STATUS AND UPDATES OF REPOSITORY

jPOSTrepo was launched in May 2016 and joined the PX consortium in July, 2016. Currently, jPOSTrepo has 320 projects (170 public) with more than 10TB of data; this number of projects is threefold compared with that of 2 years ago, whereas the amount of data has increased 7-fold. The datasets have been submitted from 20 countries, including not only those in Asia and Oceania, as originally expected, but also other countries in Europe, North and South America and elsewhere. One of the key features of jPOSTrepo is that it allows the rapid uploading of files. Although it depends on the local network environment, the average file upload speed is 9.60MB/s from Japan, 3.86MB/s from elsewhere in Asia and Oceania, and 4.22MB/s from other areas (Figure 1). This means that it takes only 1 h at most to upload a 10GB file on an average.

Figure 1.

File upload speed to jPOSTrepo. Each square represents the mean upload speed at every 200 km from each place to jPSOTrepo server installed at Database Center for Life Science in Mishima, Japan and the corresponding bar shows maximum speed and minimum speed. In addition to MS data, jPOSTrepo now accepts gel electrophoresis-based proteomics data (such as those from 2D-PAGE and 2D-DIGE) and antibody-based proteomics data, covering almost all types of proteomics data. On the submission page, users can select which type of dataset to submit. Because the PX consortium accepts only MS data, datasets of other data types cannot be assigned a PX identifier. However, jPOSTrepo issues a jPOST identifier for non-MS datasets; thus, users are able to use a jPOST identifier instead of a PX identifier.

OVERVIEW OF DATABASE

The back-end system of jPOSTdb supports not only human but also a wide range of species such as model animals, plants, yeasts, and bacteria, unlike existing databases targeted only to human. The current version of the database contains reanalyzed datasets from samples of human, mouse, rat and bacteria selected by checking metadata and raw data quality. For the metadata, information on the sample source, sample preparation method and condition of mass spectrometry acquisition is required. Each piece of metadata is curated manually for each dataset based on various ontologies of the life science field, such as National Center for Biotechnology Information taxonomy (15), Cell Line Ontology (16), Braunschweig Enzyme Database Tissue and Enzyme Source Ontology (17), Human Disease Ontology (18), National Cancer Institute Thesaurus (19) and the Human Proteome Organization Proteomics Standards Initiative-Mass Spectrometry controlled vocabulary (20). We inspected the raw data quality based on the following factors: (i) the peptide-spectrum match (%) should be >10%, (ii) LC chromatograms should not have irregular peak profiles and (iii) the distribution of delta mass for precursor ions should follow the Gaussian pattern. jPOSTdb stores assigned peptide-spectrum matches (PSMs), identified peptides, inferred proteins and dataset information hierarchically. For peptide identification, jPOSTdb uses the UniProt ‘proteomes’, which are the protein sequence set thought to be expressed by an organism whose genome has been completely sequenced (21). Proteins are inferred based on identified peptides by using the method proposed by Nesvizhskii and Aebersold (22). In jPOSTdb, the entire dataset as a single unit is named a ‘Globe’, and datasets selected from the Globe by users with their own filters is named a ‘Slice’. jPOSTdb provides the following functions: (i) datasets filtering by metadata, to create a ‘Slice’; (ii) browsing identified peptides, PTMs and other data and (iii) basic statistical analysis and visualization of data in specified ‘Slices’. Figure 2 shows an overview of the jPOSTdb web interface. First, users can filter a ‘Globe’ by a faceted search based on a combination of dataset metadata such as species, sample type, cell line, organ, disease, modifications and MS instruments, as well as a simple keyword search. Then, users can save the filtered dataset to a ‘Slice’, which means a personalized sub-dataset of a ‘Globe’, from the checkboxes of the results table and ‘Add to Slice’ button. Because the ‘Slice’ data are stored in the web storage of the user's browser, a saved ‘Slice’ is not sent to the server or any other location at all, and the different browsers present on a computer do not share data of web storage. Users thus need to export and import the ‘Slice’ to observe the same datasets in different locations. After the dataset filtering, users can access the detected peptides, PTMs, inferred proteins and other proteomics information. Additionally, users can perform comparative analyses between ‘Slices’ by using the jPOSTdb which implements basic statistical and functional analyses, such as differential expression analysis and enrichment analysis. The details of this are described below.

Figure 2.

Overview of the jPOSTdb web interface. ‘Globe’ is the whole data set of jPOSTdb, whereas ‘Slice’ is datasets filtered by metadata. Details of the example views on the right are described in Figures 3–5.

Figure 3.

Figure 5.

Example of comparison between ‘Slices’. (A) Result example of differential expression analysis. (B) Network graph of enrichment analysis (in the KEGG pathway category).

INTERFACES OF DATABASE

jPOSTdb is organized by the following three types of entry: dataset entry, protein entry and peptide entry. Each entry page contains a data summary and lists of corresponding proteins, peptides and PSMs. In the dataset and protein entry pages, jPOSTdb provides graphical data visualization interfaces (Figures 3 and 4). In addition, jPOSTdb also provides a ‘Slice’ page with graphical interfaces as the dataset page. Users can download the data tables of PSMs and inferred proteins through the corresponding dataset entry page and the jPOST FTP site (ftp://jpost.pharm.kyoto-u.ac.jp/database/).

Figure 4.

Example of protein entry page. (A) Metadata and statistics of a protein. (B) Viewer of protein annotations. (C) Viewer of peptide sharing among isoforms and proteins. Peptide bars with the same color indicate the same peptides.

Example of dataset entry page. (A) Metadata and statistics of dataset. (B) Histogram of proteins per chromosome. Chromosome annotations are based on UniProt, where ‘unplaced’ means chromosome information is not available. (C) Pie chart of protein existence. (D) Input form for KEGG pathway mapping. (E) Example of KEGG pathway mapping. The protein box color varies from red to blue, corresponding to high and low expression, respectively, based on their spectral counts. Example of protein entry page. (A) Metadata and statistics of a protein. (B) Viewer of protein annotations. (C) Viewer of peptide sharing among isoforms and proteins. Peptide bars with the same color indicate the same peptides.

Dataset entry page

Chromosome info

The ‘Chromosome info’ section shows a histogram of detected proteins (blue) and the total number of proteins (grey) per chromosome (Figure 3B), mitochondria and plasmids listed on the dataset entry page and slice page. The protein count is calculated based on the protein entry number in UniProt (15). Therefore, the total number of proteins does not refer to the exact number of coding genes in each chromosome. In cases in which the species of a dataset/slice is human, the protein count is based on the count of neXtProt entries (23).

Protein existence

The ‘Protein existence’ section displays a pie chart that shows the evidence types that support the existence of proteins (Figure 3C), described in the neXtProt database for human and the UniProt database for other organisms. There are five types of evidence for the existence of a protein as follows: PE1) experimental evidence at the protein level, PE2) experimental evidence at the transcript level, PE3) protein inferred from homology, PE4) protein predicted and PE5) protein uncertain, in that there is uncertainty over whether the protein actually exists. In case of human, proteins in PE2–4 categories are called ‘missing proteins’, indicating that they are unconfirmed sequences for which protein products have not yet been detected; the detection of these proteins has been the focus of chromosome-centric human proteome project (9).

KEGG pathway mapping

Proteins with KEGG Orthology (KO) annotation can be mapped to KEGG PATHWAY (24) (Figure 3D). The protein box color on KEGG pathway maps varies from red to blue, corresponding to high and low expression, respectively, based on their spectral counts.

Protein entry page

Protein browser

The ‘Protein browser’ section is a viewer of protein structural annotations (Figure 4B). Users can add annotations of interest into the viewer panel from the ‘Add view’ pull-down menu. The ‘Peptide’ panel shows detected peptides mapped to the protein sequence. The color of peptide bars reflects the number of PSMs, and varies from red to grey (red represents a high number and grey represents a low one). The ‘PTM’ panel displays actual modification names such as ‘Phospho’ in the viewer, and shows detected PTMs on the protein sequence. The vertical bar length above the PTM site reflects the count of PTM detection. Here ‘norm’. represents the normalized length by spectral counts included in the site, and ‘count’ shows real counts. The ‘P-site linkage’ panel shows the co-occurrence of phosphorylation sites on a peptide. To the best of our knowledge, the comprehensive PTM counting and PTM co-occurrence are unique features of jPOSTdb. The ‘UniProt annotation’ panel shows PTM sites and single amino acid variations described in UniProt.

Peptide sharing

The ‘Peptide sharing’ section shows peptides with the same amino acid sequence mapping to multiple UniProt isoforms and multiple UniProt entries, separately (Figure 4C). Peptide bars with the same color across multiple sequences indicate the same peptides. The numerical values overlapped with peptide bars refer to the number of PSMs. In MS-based shotgun proteomics, instead of full-length proteins, the peptides digested by enzymes such as trypsin are detected. Therefore, it is important to clarify which protein is more likely to be the origin of each peptide for the inference of protein expression. On this interface, users can visualize which proteins share peptides with each other. Users can also recognize whether the peptide of interest has tryptic cleavage sites at both termini.

COMPARISON OF SELECTED DATASETS

Because jPOSTdb provides basic statistical analysis tools, users can compare the statistics and expression of proteins between ‘Slices’.

Differential expression analysis

Users can compare the expression level of proteins between two ‘Slices’ by empirical Bayes estimate, Wilcoxon rank sum test and fold change of the mean from the pull-down menu. This quantification is based on spectral counting normalized by the total count of spectra from each dataset. The former two methods are implemented by using the R programming language library. The Wilcoxon rank sum test is a statistical test commonly used in differential expression analyses when the number of datasets is large enough. The empirical Bayes estimation is applicable even when the number of datasets is relatively small. In the volcano plot of results, users can change thresholds of the fold change and P-value by moving the triangular marker on the x- and y-axes (Figure 5A). The fold change of the mean expression level in a protein is shown as a histogram-like plot. It is not subjected to any statistical test, so the P-value is not calculated; therefore, the y-axis shows an arbitrary scale based on frequency. Example of comparison between ‘Slices’. (A) Result example of differential expression analysis. (B) Network graph of enrichment analysis (in the KEGG pathway category).

Enrichment analysis

jPOSTdb provides protein set enrichment analysis targeting the KEGG pathway category and the three main categories of Gene Ontology (GO) (biological process, molecular function, and cellular component) (25), for selected proteins in the differential expression analysis. The results are displayed by a network graph (Figure 5B) and a table. Nodes in the network show categories of the KEGG pathway or the GO, and the node color of enriched categories varies from yellow to red (P-value < 0.05, yellow shows high and red shows low). The blue node shows the root category of the network. Each node's size reflects the number of selected proteins in the differential expression analysis. However, the size of white nodes and the root node is limited to the maximum size of enriched nodes that are colored from yellow to red, to make the network layout clearer. When the target is a KEGG pathway, users can map proteins onto the KEGG pathway map. The boxes of pathway maps are colored from blue to red (blue shows that the expression level is decreased and red shows that it is increased). When a box in pathway maps corresponds to multiple proteins, the box is colored arbitrarily by any one of these proteins (due to a limitation of the KEGG mapper).

DISCUSSION

Increasing number of projects has been continuously deposited in jPOSTrepo from all over the world. In addition, the average data size per project is increasing, whereas the number of files per project is constant, indicating that the average size of an MS raw file is increasing (Figure 6). This would be because of higher resolving power of mass analyzers and higher frequencies to acquire MS/MS spectra in newly launched MS instruments, resulting in higher coverage of proteomes with higher throughput than earlier. In addition, ultra-large-scale proteomics projects using human patient samples have been launched, such as the international cancer proteogenome consortium (https://icpc.cancer.gov/) and the Human Diabetes Proteome Project (26). Hence, it is imperative for the public repository to equip itself with a highly efficient system to upload/download the data, which has been achieved in jPOSTrepo (Figure 1). Furthermore, jPOSTrepo continuously receives the MS-based proteomics data of non-human organisms, as well as the gel-based and the antibody-based proteomics data. Taken together, a wide variety of proteome datasets has been successfully accumulated in jPOSTrepo and is converted into jPOSTdb using the standardized data analysis protocol.

Figure 6.

The growing of submitted data size to jPOSTrepo. Blue line shows the average file size (left y-axis), and red line shows the average data size per project (right y-axis).

The growing of submitted data size to jPOSTrepo. Blue line shows the average file size (left y-axis), and red line shows the average data size per project (right y-axis). Currently, jPOSTdb automatically calculates spectral counts for quantitative analyses, and uses other quantitative approaches, such as stable isotope labelling for relative and absolute quantitation, including SILAC, dimethyl labelling, iTRAQ and TMT, and label-free quantitation based on extracted ion chromatograms, emPAI (27) and iBAQ (28) as a next step. jPOSTdb is constructed based on the Semantic Web technology that facilitates integration of various databases containing big data (29). The protein sequence and annotation databases, such as UniProt and neXtProt, have released data formatted by the Resource Description Framework (RDF) data model, which supports the Semantic Web technology (21,30). In addition, not only the proteome data, but also other omics and life science-related data have been converted to RDF data and released in the NBDC RDF portal (https://integbio.jp/rdf/) and EBI RDF platform (31) and have been incorporated into various life science databases such as TogoGenome, a genomic database based on RefSeq data (http://togogenome.org/), TogoVar, a database of human genome variants/variations (https://togovar.biosciencedbc.jp/) and GlyTouCan, a glycan structure repository (32). Based on the RDF format, therefore, data integration as well as database integration will be more accelerated in future, and the high-quality proteome data generated from the jPOST environment would be one of its core elements because proteins are lethally essential biomolecules for the functioning of cells.

29 in total

1. Exponentially modified protein abundance index (emPAI) for estimation of absolute protein amount in proteomics by the number of sequenced peptides per protein.

Authors: Yasushi Ishihama; Yoshiya Oda; Tsuyoshi Tabata; Toshitaka Sato; Takeshi Nagasu; Juri Rappsilber; Matthias Mann
Journal: Mol Cell Proteomics Date: 2005-06-14 Impact factor: 5.911

2. Interpretation of shotgun proteomic data: the protein inference problem.

Authors: Alexey I Nesvizhskii; Ruedi Aebersold
Journal: Mol Cell Proteomics Date: 2005-07-11 Impact factor: 5.911

3. PASSEL: the PeptideAtlas SRMexperiment library.

Authors: Terry Farrah; Eric W Deutsch; Richard Kreisberg; Zhi Sun; David S Campbell; Luis Mendoza; Ulrike Kusebauch; Mi-Youn Brusniak; Ruth Hüttenhain; Ralph Schiess; Nathalie Selevsek; Ruedi Aebersold; Robert L Moritz
Journal: Proteomics Date: 2012-04 Impact factor: 3.984

4. Representing the NCI Thesaurus in OWL DL: Modeling tools help modeling languages.

Authors: Natalya F Noy; Sherri de Coronado; Harold Solbrig; Gilberto Fragoso; Frank W Hartel; Mark A Musen
Journal: Appl Ontol Date: 2008-01-01 Impact factor: 1.115

5. Global quantification of mammalian gene expression control.

Authors: Björn Schwanhäusser; Dorothea Busse; Na Li; Gunnar Dittmar; Johannes Schuchhardt; Jana Wolf; Wei Chen; Matthias Selbach
Journal: Nature Date: 2011-05-19 Impact factor: 49.962

6. The human proteome project: current state and future direction.

Authors: Pierre Legrain; Ruedi Aebersold; Alexander Archakov; Amos Bairoch; Kumar Bala; Laura Beretta; John Bergeron; Christoph H Borchers; Garry L Corthals; Catherine E Costello; Eric W Deutsch; Bruno Domon; William Hancock; Fuchu He; Denis Hochstrasser; György Marko-Varga; Ghasem Hosseini Salekdeh; Salvatore Sechi; Michael Snyder; Sudhir Srivastava; Mathias Uhlén; Cathy H Wu; Tadashi Yamamoto; Young-Ki Paik; Gilbert S Omenn
Journal: Mol Cell Proteomics Date: 2011-07 Impact factor: 5.911

7. The HUPO proteomics standards initiative- mass spectrometry controlled vocabulary.

Authors: Gerhard Mayer; Luisa Montecchi-Palazzi; David Ovelleiro; Andrew R Jones; Pierre-Alain Binz; Eric W Deutsch; Matthew Chambers; Marius Kallhardt; Fredrik Levander; James Shofstahl; Sandra Orchard; Juan Antonio Vizcaíno; Henning Hermjakob; Christian Stephan; Helmut E Meyer; Martin Eisenacher
Journal: Database (Oxford) Date: 2013-03-12 Impact factor: 3.451

8. The BRENDA Tissue Ontology (BTO): the first all-integrating ontology of all organisms for enzyme sources.

Authors: Marion Gremse; Antje Chang; Ida Schomburg; Andreas Grote; Maurice Scheer; Christian Ebeling; Dietmar Schomburg
Journal: Nucleic Acids Res Date: 2010-10-28 Impact factor: 16.971

9. The NCBI Taxonomy database.

Authors: Scott Federhen
Journal: Nucleic Acids Res Date: 2011-12-01 Impact factor: 16.971

10. The EBI RDF platform: linked open data for the life sciences.

Authors: Simon Jupp; James Malone; Jerven Bolleman; Marco Brandizi; Mark Davies; Leyla Garcia; Anna Gaulton; Sebastien Gehant; Camille Laibe; Nicole Redaschi; Sarala M Wimalaratne; Maria Martin; Nicolas Le Novère; Helen Parkinson; Ewan Birney; Andrew M Jenkinson
Journal: Bioinformatics Date: 2014-01-11 Impact factor: 6.937

24 in total

1. Mass Spectrometry-Based Plasma Proteomics: Considerations from Sample Collection to Achieving Translational Data.

Authors: Vera Ignjatovic; Philipp E Geyer; Krishnan K Palaniappan; Jessica E Chaaban; Gilbert S Omenn; Mark S Baker; Eric W Deutsch; Jochen M Schwenk
Journal: J Proteome Res Date: 2019-10-11 Impact factor: 4.466

2. Human Proteome Project Mass Spectrometry Data Interpretation Guidelines 3.0.

Authors: Eric W Deutsch; Lydie Lane; Christopher M Overall; Nuno Bandeira; Mark S Baker; Charles Pineau; Robert L Moritz; Fernando Corrales; Sandra Orchard; Jennifer E Van Eyk; Young-Ki Paik; Susan T Weintraub; Yves Vandenbrouck; Gilbert S Omenn
Journal: J Proteome Res Date: 2019-10-21 Impact factor: 4.466

3. Progress on Identifying and Characterizing the Human Proteome: 2019 Metrics from the HUPO Human Proteome Project.

Authors: Gilbert S Omenn; Lydie Lane; Christopher M Overall; Fernando J Corrales; Jochen M Schwenk; Young-Ki Paik; Jennifer E Van Eyk; Siqi Liu; Stephen Pennington; Michael P Snyder; Mark S Baker; Eric W Deutsch
Journal: J Proteome Res Date: 2019-09-13 Impact factor: 4.466

4. Research on the Human Proteome Reaches a Major Milestone: >90% of Predicted Human Proteins Now Credibly Detected, According to the HUPO Human Proteome Project.

Authors: Gilbert S Omenn; Lydie Lane; Christopher M Overall; Ileana M Cristea; Fernando J Corrales; Cecilia Lindskog; Young-Ki Paik; Jennifer E Van Eyk; Siqi Liu; Stephen R Pennington; Michael P Snyder; Mark S Baker; Nuno Bandeira; Ruedi Aebersold; Robert L Moritz; Eric W Deutsch
Journal: J Proteome Res Date: 2020-10-19 Impact factor: 4.466

5. The Archaeal Proteome Project advances knowledge about archaeal cell biology through comprehensive proteomics.

Authors: Stefan Schulze; Zachary Adams; Micaela Cerletti; Rosana De Castro; Sébastien Ferreira-Cerca; Christian Fufezan; María Inés Giménez; Michael Hippler; Zivojin Jevtic; Robert Knüppel; Georgio Legerme; Christof Lenz; Anita Marchfelder; Julie Maupin-Furlow; Roberto A Paggi; Friedhelm Pfeiffer; Ansgar Poetsch; Henning Urlaub; Mechthild Pohlschroder
Journal: Nat Commun Date: 2020-06-19 Impact factor: 14.919

6. The 26th annual Nucleic Acids Research database issue and Molecular Biology Database Collection.

Authors: Daniel J Rigden; Xosé M Fernández
Journal: Nucleic Acids Res Date: 2019-01-08 Impact factor: 16.971

7. The ProteomeXchange consortium in 2020: enabling 'big data' approaches in proteomics.

Authors: Eric W Deutsch; Nuno Bandeira; Vagisha Sharma; Yasset Perez-Riverol; Jeremy J Carver; Deepti J Kundu; David García-Seisdedos; Andrew F Jarnuczak; Suresh Hewapathirana; Benjamin S Pullman; Julie Wertz; Zhi Sun; Shin Kawano; Shujiro Okuda; Yu Watanabe; Henning Hermjakob; Brendan MacLean; Michael J MacCoss; Yunping Zhu; Yasushi Ishihama; Juan A Vizcaíno
Journal: Nucleic Acids Res Date: 2020-01-08 Impact factor: 16.971

Review 8. A high-stringency blueprint of the human proteome.

Authors: Subash Adhikari; Edouard C Nice; Eric W Deutsch; Lydie Lane; Gilbert S Omenn; Stephen R Pennington; Young-Ki Paik; Christopher M Overall; Fernando J Corrales; Ileana M Cristea; Jennifer E Van Eyk; Mathias Uhlén; Cecilia Lindskog; Daniel W Chan; Amos Bairoch; James C Waddington; Joshua L Justice; Joshua LaBaer; Henry Rodriguez; Fuchu He; Markus Kostrzewa; Peipei Ping; Rebekah L Gundry; Peter Stewart; Sanjeeva Srivastava; Sudhir Srivastava; Fabio C S Nogueira; Gilberto B Domont; Yves Vandenbrouck; Maggie P Y Lam; Sara Wennersten; Juan Antonio Vizcaino; Marc Wilkins; Jochen M Schwenk; Emma Lundberg; Nuno Bandeira; Gyorgy Marko-Varga; Susan T Weintraub; Charles Pineau; Ulrike Kusebauch; Robert L Moritz; Seong Beom Ahn; Magnus Palmblad; Michael P Snyder; Ruedi Aebersold; Mark S Baker
Journal: Nat Commun Date: 2020-10-16 Impact factor: 14.919

9. Effect of Phosphorylation on the Collision Cross Sections of Peptide Ions in Ion Mobility Spectrometry.

Authors: Kosuke Ogata; Chih-Hsiang Chang; Yasushi Ishihama
Journal: Mass Spectrom (Tokyo) Date: 2021-01-30

10. UniProt: the universal protein knowledgebase in 2021.

Authors:
Journal: Nucleic Acids Res Date: 2021-01-08 Impact factor: 16.971