| Literature DB >> 28763045 |
Scott B Ficarro1,2, William M Alexander3,4, Jarrod A Marto5,6,7.
Abstract
Although not yet truly 'comprehensive', modern mass spectrometry-based experiments can generate quantitative data for a meaningful fraction of the human proteome. Importantly for large-scale protein expression analysis, robust data pipelines are in place for identification of un-modified peptide sequences and aggregation of these data to protein-level quantification. However, interoperable software tools that enable scientists to computationally explore and document novel hypotheses for peptide sequence, modification status, or fragmentation behavior are not well-developed. Here, we introduce mzStudio, an open-source Python module built on our multiplierz project. This desktop application provides a highly-interactive graphical user interface (GUI) through which scientists can examine and annotate spectral features, re-search existing PSMs to test different modifications or new spectral matching algorithms, share results with colleagues, integrate other domain-specific software tools, and finally create publication-quality graphics. mzStudio leverages our common application programming interface (mzAPI) for access to native data files from multiple instrument platforms, including ion trap, quadrupole time-of-flight, Orbitrap, matrix-assisted laser desorption ionization, and triple quadrupole mass spectrometers and is compatible with several popular search engines including Mascot, Proteome Discoverer, X!Tandem, and Comet. The mzStudio toolkit enables researchers to create a digital provenance of data analytics and other evidence that support specific peptide sequence assignments.Entities:
Keywords: API; SQLite; application programming interface; bioinformatics software; mass spectrometry; quantification; results distribution
Year: 2017 PMID: 28763045 PMCID: PMC5620537 DOI: 10.3390/proteomes5030020
Source DB: PubMed Journal: Proteomes ISSN: 2227-7382
Figure 1The main user interface of mzStudio supports direct access to native mass spectrometry data files from different instrument manufacturers, and can import database search results from Mascot, X!Tandem, Comet, and Proteome Discoverer. An interactive analysis window enables rapid SQLite filtering of data, while a peptide calculator toolbar displays theoretical fragment ion masses. Additional tools provide for feature detection, custom spectral processing, and launching database searches.
Figure 2mzStudio enables custom spectral processing and direct database search of processed spectra. After opening a raw file and importing search results , users can click individual PSMs to view annotated peaks . This particular MS/MS spectrum was obtained during analysis of peptides modified with the CDK7 kinase inhibitor THZ1 and yields a low-confidence Mascot score due to the presence of several inhibitor-related ions and non-canonical fragmentation pathways. Using mzStudio’s built-in spectral processing tools , users can easily experiment with different processing algorithms (i.e., filter inhibitor related ions, reduce charge of highly charged species, and score ions from inhibitor related fragmentation pathways), and assess the impact on peptide sequence scores through the integrated search tool .
Figure 3SpecStylus enables text- and graphic-annotation of mass spectra, processed spectra, extracted ion chromatograms, and search results/parameters. Collectively these figures and notes provide digital provenance for data analytic workflows and ideas supporting peptide sequence assignment. Spectra and chromatograms can be exported as .svg, .pdf, .ppt, or .png for integration into presentation material, while underlying peak lists are compatible with NIST library search tools.