| Literature DB >> 30295851 |
Yuki Moriya1, Shin Kawano1, Shujiro Okuda2, Yu Watanabe2, Masaki Matsumoto3, Tomoyo Takami3, Daiki Kobayashi4, Yoshinori Yamanouchi4,5, Norie Araki4, Akiyasu C Yoshizawa6, Tsuyoshi Tabata6,7, Mio Iwasaki7, Naoyuki Sugiyama6, Satoshi Tanaka8, Susumu Goto1, Yasushi Ishihama6.
Abstract
Rapid progress is being made in mass spectrometry (MS)-based proteomics, yielding an increasing number of larger datasets with higher quality and higher throughput. To integrate proteomics datasets generated from various projects and institutions, we launched a project named jPOST (Japan ProteOme STandard Repository/Database, https://jpostdb.org/) in 2015. Its proteomics data repository, jPOSTrepo, began operations in 2016 and has accepted more than 10 TB of MS-based proteomics datasets in the past two years. In addition, we have developed a new proteomics database named jPOSTdb in which the published raw datasets in jPOSTrepo are reanalyzed using standardized protocol. jPOSTdb provides viewers showing the frequency of detected post-translational modifications, the co-occurrence of phosphorylation sites on a peptide and peptide sharing among proteoforms. jPOSTdb also provides basic statistical analysis tools to compare proteomics datasets.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30295851 PMCID: PMC6324006 DOI: 10.1093/nar/gky899
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.File upload speed to jPOSTrepo. Each square represents the mean upload speed at every 200 km from each place to jPSOTrepo server installed at Database Center for Life Science in Mishima, Japan and the corresponding bar shows maximum speed and minimum speed.
Figure 2.Overview of the jPOSTdb web interface. ‘Globe’ is the whole data set of jPOSTdb, whereas ‘Slice’ is datasets filtered by metadata. Details of the example views on the right are described in Figures 3–5.
Figure 3.Example of dataset entry page. (A) Metadata and statistics of dataset. (B) Histogram of proteins per chromosome. Chromosome annotations are based on UniProt, where ‘unplaced’ means chromosome information is not available. (C) Pie chart of protein existence. (D) Input form for KEGG pathway mapping. (E) Example of KEGG pathway mapping. The protein box color varies from red to blue, corresponding to high and low expression, respectively, based on their spectral counts.
Figure 5.Example of comparison between ‘Slices’. (A) Result example of differential expression analysis. (B) Network graph of enrichment analysis (in the KEGG pathway category).
Figure 4.Example of protein entry page. (A) Metadata and statistics of a protein. (B) Viewer of protein annotations. (C) Viewer of peptide sharing among isoforms and proteins. Peptide bars with the same color indicate the same peptides.
Figure 6.The growing of submitted data size to jPOSTrepo. Blue line shows the average file size (left y-axis), and red line shows the average data size per project (right y-axis).