Literature DB >> 30084957

VisioProt-MS: interactive 2D maps from intact protein mass spectrometry.

Marie Locard-Paulet1, Julien Parra1, Renaud Albigot1, Emmanuelle Mouton-Barbosa1, Laurent Bardi1, Odile Burlet-Schiltz1, Julien Marcoux1.   

Abstract

SUMMARY: VisioProt-MS is designed to summarize and analyze intact protein and top-down proteomics data. It plots the molecular weights of eluting proteins as a function of their retention time, thereby allowing inspection of runs from liquid chromatography coupled to mass spectrometry (LC-MS). It also overlays MS/MS identification results. VisioProt-MS is compatible with outputs from many different top-down dedicated software. To our knowledge, this is the only open source standalone application that allows the dynamic comparison of several MS files, a prerequisite for comparative analysis of different biological conditions. With its dynamic rendering, this user-friendly web application facilitates inspection, comparison and export of publication quality 2 D maps from deconvoluted LC-MS run(s) and top-down proteomics data.
AVAILABILITY AND IMPLEMENTATION: The Shiny-based web application VisioProt-MS is suitable for non-R users. It can be found at https://masstools.ipbs.fr/mstools/visioprot-ms/ and the corresponding scripts are downloadable at https://github.com/mlocardpaulet/VisioProt-MS. It is governed by the CeCILL license (http://www.cecill.info).
© The Author(s) 2018. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30084957      PMCID: PMC6378940          DOI: 10.1093/bioinformatics/bty680

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Top-down proteomics consists in the analysis of intact proteins using liquid chromatography coupled to mass spectrometry (LC-MS), followed by their identification by MS/MS. The main advantage of this pipeline over classical bottom-up proteomics (analysis of peptides generated by protein enzymatic digestion) is to directly inform on the presence of potential combinations of post-translational modifications, splicing events and/or mutations, thereby providing in-depth characterization of proteoforms (Smith ). In recent years, top-down proteomics has gained momentum (Chen ), becoming a more high-throughput (Tran ) and quantitative technique (Durbin ; Ntai ). This was allowed by the development of high-resolution mass spectrometers and algorithms allowing signal deconvolution of MS spectra along the chromatographic runs (Durbin ; Gersch ; Lee , 2009), together with software suites dedicated to proteoform identification (Guner ; Kou ; LeDuc ; Liu ; Park ; Tran ). Visualization of deconvoluted LC-MS data remains a key process in top-down data inspection, and the direct comparison of several LC-MS runs reveals differences in protein footprints between samples/experimental conditions. An increasing number of top-down MS articles can be found in the literature. However, only a handful include LC-MS 2 D maps produced with in-house tools. These include publications from the groups of Kelleher (Durbin ; Tran ), Patrie (Plymire ), and Smith (Lee ). We published several papers presenting such 2 D maps using RoWinPro deconvolution (Gersch ) plotted with Excel (Carel ; Parra ), but this workflow is very low throughput and does not facilitate dynamic data inspection. For these reasons, downloadable software suites were developed, such as ICR-2TLS, which generates LC-MS 2 D maps (Lourette ; Sharma ; Tian , 2012; Wu ). Since last year, the software suite Informed-Proteomics produces similar 2 D maps (Park ; Zhou ). These provide many functionalities applied to top-down proteomics, but their visualization functions do not allow the direct comparison of several LC-MS runs on the same map. Thus, we developed a standalone tool to visualize, inspect and compare the molecular weights (MWs) of eluting proteoforms against their retention times (RT). VisioProt-MS is a user-friendly and highly compatible open source web application that plots and overlays interactive 2 D maps from deconvoluted LC-MS run(s). It is designed for dynamic data inspection as well as for creating publication quality figures. VisioProt-MS is compatible with the following bioinformatics tools: RoWinPro (Gersch ), Intact Protein Analysis (BioPharma FinderTM 3.0, Thermo Fisher Scientific), DataAnalysisTM 4.2 (Bruker) (Schmit ), TopFD (TopPIC Suite), (Kou ) and ProMex (Informed-Proteomics) (Park ) for deconvolution of LC-MS data; and Prosight PD (Proteome Discoverer, Thermo), TopPIC (TopPIC Suite, (Kou ) and MSPathFinder (Informed-Proteomics, (Park )) for LC-MS/MS data. VisioProt-MS quickly provides an overview of all the detected MWs, reflecting data quality and reproducibility in terms of observed MWs, intensities and RT. It allows comparison of not only multiple LC-MS runs (including from different deconvolution suites), but also LC-MS and LC-MS/MS runs of the same sample. Its dynamic features enable to pinpoint potential new proteoforms, quickly reject wrongly assigned Protein Spectral Matches (PSM) and spot intense proteoforms that remain unassigned.

2 Materials and methods

VisioProt-MS is a Shiny-based web application implemented in R (3.4.2), using the RStudio environment. It depends on the shiny (1.0.3); ggplot2 (2.2.1); Plotly (4.7.1); dplyr (0.7.2); data table (1.10.4-3); shinyBS (0.61) and RcolorBrewer (1.1-2) packages. R and Shiny (1.5.5.872) are hosted by a virtual Linux platform running on Ubuntu (16.04.3).

3 Results

VisioProt-MS enables users with no programming skills to generate, visualize and analyze 2 D maps of intact protein LC-MS runs, and to overlay the corresponding identifications from top-down analysis. It is organized in two inter-dependent panels that are presented in Figure 1: the right panel displays the output plot generated in real-time from the parameters defined in the left panel. The user can navigate between two visualization modes: “MS” to inspect one or several LC-MS file(s) for comparison of biological conditions/deconvolution parameters, or for batch-to-batch comparison; and “MS/MS” to visualize results from top-down analysis. In the following paragraphs, we will go through the specific features of these two modes, and then describe their common functionalities.
Fig. 1.

Screenshot of the VisioProt-MS web application in “MS” mode. One file is uploaded (the color scale corresponds to the log10-transformed intensity), and an example of a 2-files upload is presented in the central insert (the original file and the overlaid file are in red and blue, respectively)

Screenshot of the VisioProt-MS web application in “MS” mode. One file is uploaded (the color scale corresponds to the log10-transformed intensity), and an example of a 2-files upload is presented in the central insert (the original file and the overlaid file are in red and blue, respectively)

3.1  MS

We provide examples of compatible LC-MS input formats in the help document available by clicking on the help button (“?”). When available, the entire profile of eluted species are plotted point by point to get a “chromatogram-like” figure (like in Fig. 1). Since ProMex and BioPharma FinderTM only output the start/end elution times, VisioProt-MS outputs the range of each eluting proteoform (see help document for output examples). The application will also be compatible with outputs from any upcoming sliding window deconvolution tool, provided that the input table corresponds to a format described in the help document. Selecting one file renders a plot with color-coded intensity whereas multiple files selection (files from the same folder) allows the comparison of several MS runs. Inspection of these protein footprints enables the direct visualization of co-eluting modified or truncated species. Additionally, overlapping different LC-MS acquisitions permits comparison between different sample conditions (e.g. treated versus non-treated samples), batch-to-batch comparisons, or quality control assessment of the LC-MS system, which are facilitated by the color-coding and zooming options.

3.2 MS/MS

In the “MS/MS” mode, VisioProt-MS plots a 2 D map of the MS/MS spectra acquired during top-down analysis. The proportion of assigned versus unassigned MS/MS directly informs the user of the overall quality of the data. Furthermore, the RT and MWs distributions of the assigned species can highlight eventual issues with the MS/MS acquisition parameters, search parameters or chromatographic methods. Besides quality control, this mode allows the thorough inspection of top-down proteomics data. The user can hide unassigned MS/MS spectra with the option “Hide MS/MS without ID”, and localize on the map the identified proteins by selecting them in the box “Select the ID to highlight”. It is possible to overlay MS/MS data with the LC-MS trace of the same sample (from the same LC-MS/MS run or a separate LC-MS acquisition) in order to spot false positive MS/MS hits that do not overlap with a truly deconvoluted protein (i.e. for several seconds and not just an artefact). The user can also identify the proteoforms clearly visible in LC-MS but not identified in the LC-MS/MS analysis, which can be due to the absence of the corresponding sequence from the database, a narrow error window, poor selection or fragmentation. All these parameters can then be optimized to improve identification results.

3.3 Common features

VisioProt-MS makes it quick and simple to inspect intact protein LC-MS and top-down proteomics data. Test files are provided for both MS and MS/MS modes and can be loaded by clicking on the “Using test mode” checkboxes. It is easy to zoom-in using mouse selection, and to change the plot parameters with the options provided on the left panel. A thorough inspection of the deconvoluted data can be conducted when “Show data labels” is checked. Then, dynamic labels appear on mouse hovering, returning the following values from the input tables: RT (min); molecular weight of the protein (Da); signal intensity; protein identity and file name when applicable. These can be of great use to identify peaks of interest in the 2D map. Because dynamic labelling significantly slows down the application, we kept it as an option, and we advise the user to check it only after optimization of the rendering parameters (colors, point size, threshold and zooming). When comparing several LC-MS files, the “Show data labels” mode allows the user to isolate or mask files by clicking on their names in the figure legend, thereby facilitating data inspection. To quicken plot rendering, we decided to plot only 20% of the data points with the highest intensity. This parameter can be defined manually in the dedicated field. Modifying this threshold results in changes in the axis and color ranges due to the appearance/disappearance of low intensity points. A selection of color scales offers the choice between diverging or sequential palettes (see the help document to visualize the color-scales). When several files are plotted together, the colors on the plot are file-dependent, whereas when only one file is plotted, the colors correspond to the log10-transformed intensity values from the input table. Export of the displayed map can be done by a simple click on the button “Download.pdf”, “Download.png” or “Download.svg”. By default, this output file is named “VisioProt-MS_” followed by the name of the input file and the date. It is saved in the “Downloads” folder of the user computer, and can be used as it is for publication purposes. All the functionalities of VisioProt-MS are described in details in the help webpage available by clicking on the help button (“?”).
  24 in total

1.  Direct mass spectrometric analysis of intact proteins of the yeast large ribosomal subunit using capillary LC/FTICR.

Authors:  Sang-Won Lee; Scott J Berger; Suzana Martinović; Ljiljana Pasa-Tolić; Gordon A Anderson; Yufeng Shen; Rui Zhao; Richard D Smith
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-30       Impact factor: 11.205

2.  ProSight PTM: an integrated environment for protein identification and characterization by top-down mass spectrometry.

Authors:  Richard D LeDuc; Gregory K Taylor; Yong-Bin Kim; Thomas E Januszyk; Lee H Bynum; Joseph V Sola; John S Garavelli; Neil L Kelleher
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

3.  Two-dimensional liquid chromatography system for online top-down mass spectrometry.

Authors:  Zhixin Tian; Rui Zhao; Nikola Tolić; Ronald J Moore; David L Stenoien; Errol W Robinson; Richard D Smith; Ljiljana Paša-Tolić
Journal:  Proteomics       Date:  2010-10       Impact factor: 3.984

4.  A mass spectrometry platform for a streamlined investigation of proteasome integrity, posttranslational modifications, and inhibitor binding.

Authors:  Malte Gersch; Mathias W Hackl; Christian Dubiella; Alexander Dobrinevski; Michael Groll; Stephan A Sieber
Journal:  Chem Biol       Date:  2015-02-26

5.  Proteomic profiling of intact proteins using WAX-RPLC 2-D separations and FTICR mass spectrometry.

Authors:  Seema Sharma; David C Simpson; Nikola Tolić; Navdeep Jaitly; Anoop M Mayampurath; Richard D Smith; Ljiljana Pasa-Tolić
Journal:  J Proteome Res       Date:  2007-02       Impact factor: 4.466

6.  A top-down LC-FTICR MS-based strategy for characterizing oxidized calmodulin in activated macrophages.

Authors:  Natacha Lourette; Heather Smallwood; Si Wu; Errol W Robinson; Thomas C Squier; Richard D Smith; Ljiljana Pasa-Tolić
Journal:  J Am Soc Mass Spectrom       Date:  2010-03-22       Impact factor: 3.109

Review 7.  Top-Down Proteomics: Ready for Prime Time?

Authors:  Bifan Chen; Kyle A Brown; Ziqing Lin; Ying Ge
Journal:  Anal Chem       Date:  2017-12-15       Impact factor: 6.986

8.  An integrated top-down and bottom-up strategy for broadly characterizing protein isoforms and modifications.

Authors:  Si Wu; Natacha M Lourette; Nikola Tolić; Rui Zhao; Errol W Robinson; Aleksey V Tolmachev; Richard D Smith; Ljiljana Pasa-Tolić
Journal:  J Proteome Res       Date:  2009-03       Impact factor: 4.466

9.  A robust two-dimensional separation for top-down tandem mass spectrometry of the low-mass proteome.

Authors:  Ji Eun Lee; John F Kellie; John C Tran; Jeremiah D Tipton; Adam D Catherman; Haylee M Thomas; Dorothy R Ahlf; Kenneth R Durbin; Adaikkalam Vellaichamy; Ioanna Ntai; Alan G Marshall; Neil L Kelleher
Journal:  J Am Soc Mass Spectrom       Date:  2009-08-12       Impact factor: 3.109

10.  Scrutiny of Mycobacterium tuberculosis 19 kDa antigen proteoforms provides new insights in the lipoglycoprotein biogenesis paradigm.

Authors:  Julien Parra; Julien Marcoux; Isabelle Poncin; Stéphane Canaan; Jean Louis Herrmann; Jérôme Nigou; Odile Burlet-Schiltz; Michel Rivière
Journal:  Sci Rep       Date:  2017-03-08       Impact factor: 4.379

View more
  2 in total

1.  Proteasome complexes experience profound structural and functional rearrangements throughout mammalian spermatogenesis.

Authors:  Dušan Živković; Angelique Sanchez Dafun; Thomas Menneteau; Adrien Schahl; Sandrine Lise; Christine Kervarrec; Ana Toste Rêgo; Paula C A da Fonseca; Matthieu Chavent; Charles Pineau; Odile Burlet-Schiltz; Julien Marcoux; Marie-Pierre Bousquet
Journal:  Proc Natl Acad Sci U S A       Date:  2022-04-04       Impact factor: 12.779

Review 2.  Middle-down approach: a choice to sequence and characterize proteins/proteomes by mass spectrometry.

Authors:  P Boomathi Pandeswari; Varatharajan Sabareesh
Journal:  RSC Adv       Date:  2019-01-02       Impact factor: 4.036

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.