Literature DB >> 29896482

A new SWATH ion library for mouse adult hippocampal neural stem cells.

Clarissa Braccia1, Meritxell Pons Espinal2, Mattia Pini1, Davide De Pietri Tonelli2, Andrea Armirotti1.   

Abstract

Over the last years, the SWATH data-independent acquisition protocol (Sequential Window acquisition of All THeoretical mass spectra) has become a cornerstone for the worldwide proteomics community (Collins et al., 2017) [1]. In this approach, a high-resolution quadrupole-ToF mass spectrometer acquires thousands of MS/MS data by selecting not just a single precursor at a time, but by allowing a broad m/z range to be fragmented. This acquisition window is then sequentially moved from the lowest to the highest mass selection range. This technique enables the acquisition of thousands of high-resolution MS/MS spectra per minute in a standard LC-MS run. In the subsequent data analysis phase, the corresponding dataset is searched in a "triple quadrupole-like" mode, thus not considering the whole MS/MS scan spectrum, but by searching for several precursor to fragment transitions that identify and quantify the corresponding peptide. This search is made possible with the use of an ion library, previously acquired in a classical data dependent, full-spectrum mode (Fabre et al., 2017; Wu et al., 2017) [2], [3]. The SWATH protocol, combining the protein identification power of high-resolution MS/MS spectra with the robustness and accuracy in analyte quantification of triple-quad targeted workflows, has become very popular in proteomics research. The major drawback lies in the ion library itself, which is normally demanding and time-consuming to build. Conversely, through the realignment of chromatographic retention times, an ion library of a given proteome can relatively easily be tailored upon "any" proteomics experiment done on the same proteome. We are thus hereby sharing with the worldwide proteomics community our newly acquired ion library of mouse adult hippocampal neural stem cells. Given the growing effort in neuroscience research involving proteomics experiments (Pons-Espinal et al., 2017; Sarnyai and Guest, 2017; Sethi et al., 2015; Bramini et al., 2016) [4,[5], [6], [7], we believe that this data might be of great help for the neuroscience community. All the here reported data (RAW files, results and ion library) can be freely downloaded from the SWATHATLAS (Deutsch et al., 2008) [8] website (http://www.peptideatlas.org/PASS/PASS01110).

Entities:  

Keywords:  ACN, Acetonitrile; DDA, Data dependent acquisition; DTT, Dithiothreitol; EGF, Epidermal growth factor; FA, Formic acid; FGF, Fibroblast growth factor; IAA, Iodoacetamide; Neural stem cells; Neuroscience; PDL, Poly-D-Lysine; PSM, Peptide spectrum match; PTMs, Post translational modifications; Proteomics; SWATH; TEA, Triethylamine

Year:  2018        PMID: 29896482      PMCID: PMC5995750          DOI: 10.1016/j.dib.2018.02.062

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data We here report on the building of a new SWATH ion library of mouse adult hippocampal neural stem cells. The whole process from sample preparation to data acquisition to data analysis is carefully described. All the data here described are made available to the worldwide community and can be freely downloaded from the SWATH Atlas website (www.swathatlas.org). This dataset might represent a ready-made tool for the growing community of SWATH proteomics and it can use to identify and quantify proteins from similar biological contexts

Data

The data we are hereby sharing is a ready-made ion library to be used for SWATH proteomics experiments by the worldwide proteomics community working on neural stem cells. The library (along with all the corresponding RAW and results files that were used to build it) is openly shared through the SWATHATLAS initiative [8].

Experimental design, materials and methods

All the chemicals and solvent were purchased from Sigma Aldrich (Milano, Italy), unless otherwise indicated. The NanoAcquity LC system, trapping column and Oasis SPE columns were purchased from Waters (Milford, MA,USA). The 5600+ Triple tof system, ProteinPilot and MarkerView softwares and the PepCalMix reference standard were purchased from SCIEX (Ontario, Canada).

Sample preparation: aNSC growth, cell lysis and protein digestion

Mouse adult hippocampal neural stem cells (aNSCs) were obtained as previously described [4]. Briefly, adult dentate gyrus (DG) was isolated from 8–10 C57Bl6/J mice at the age of 6–8 weeks. After dissection in Hanks Balanced Salt Solution (HBSS, Gibco) medium, the tissue was enzymatically dissociated with papain (2,5 U/ml), dispase (1 U/ml) and DNaseI (250 U/ml) for 20 min at 37 °C. During incubation, the tissue was repeatedly triturated with a fire polished Pasteur pipette. The cell suspension was centrifuged at 130 g for 5 min and the pellet was re-suspended in a buffered solution (1× HBSS, 30 mM Glucose, 2 mM HEPES pH 7,4, 26 mM NaHCO3) followed by a centrifugation at 130 g for 5 min. aNSCs were isolated using 22% Percoll gradient solution. After further centrifugation for 5 min at 130 g the cell pellet was re-suspended in 2 ml of culture medium containing Neurobasal (Invitrogen), Glutamax (Invitrogen), 1% penicillin and streptomycin (Invitrogen), B27 without retinoic acid (Invitrogen), FGF (20 ng/ml; PeproTech) and EGF (20 ng/ml; PeproTech). The dissociated DG tissue was plated into PDL/Laminin (Sigma/Roche) coated wells and incubated at 37 °C with 5% CO2. The growth medium was exchanged 24 h later to further remove excess debris. Every 2 days half of the growth medium was exchanged with fresh medium to replenish the growth factors. aNSCs were passaged once they reached 80% confluence. aNSCs cell pellets were homogenized using lysis RIPA buffer containing phosphatase and protease inhibitors (Complete mini EDTA-free, Roche) with the following composition: 150 mM NaCl, 1.0% Triton X-100, 0.5% sodium deoxycholate, 0.1% SDS (sodium dodecyl sulphate), 50 mM Tris, pH 8.0 and 1 mM Na3VO4. Cellular debris were separated at 13,000 rpm for 30 min at 4 °C. The total protein content in the lysate was quantified using BCA assay following the manufacturer instructions (Pierce TM BCA Protein Assay Kit; Thermo Scientific). The volume corresponding to 200 µg of proteins from aNSC was incubated with 10 µl of 100 mM DTT in 50 mM NH4HCO3, pH 8 for 30 min at room temperature to reduce protein cysteine groups. After reduction, 30 µl of 100 mM IAA in 50 mM NH4HCO3, pH 8 were added and the sample was incubated for 20 min in the dark. The protein content was then precipitated for 5 h at − 20 °C by adding 1 ml of cold (− 20 °C) acetone. After centrifugation at 14,000 g for 15 min, the surnatant was discarded and the protein pellet was dissolved in 200 µl of 50 mM NH4HCO3, pH 8. The sample was then digested overnight at 37 °C by adding 4 µg of trypsin (proteomics grade, from Sigma).

Desalting and fractionation of peptides

In order to increase the number of identified proteins and their sequence coverage, the resulting tryptic peptides were fractionated offline using the reversed phase high pH/low pH strategy [9]. High pH fractionation was performed using Oasis solid-phase extraction (SPE) columns. After the activation (with 500 µl of ACN) and equilibration (with 500 µl of 3% ACN 0.1% FA) of the column resin, the peptides were loaded on the column. The peptides retained on the column were first desalted with 500 µl of washing solution (3% ACN with 0.1% FA). A total of 8 fractions of peptides were collected by eluting sample with 500 µl of 0.1% TEA solution with increasing acetonitrile concentration (5%, 7.5%, 10%, 12.5%, 15%, 17.5%, 20% and 50%).

LC–MS/MS analysis

Each fraction was dried in a vacuum centrifuge and then redissolved in 45 µl of 3% ACN added with 0.1% FA. 5 µl of PepCalMix solution, corresponding to 250 pmoles, were added to each fraction as internal standard for retention time recalibration. Five microliters of sample (roughly corresponding to 2.5 μg of peptides) were loaded on a NanoAcquity chromatographic system and connected to a TripleToF 5600+ mass spectrometer equipped with a NanoSpray III ion source. The peptides were first trapped on a 180 µm × 20 mm Acquity C18 column for 4.0 min at 4.0 µl/min flow rate. Eluent composition was kept at 1% ACN + 0.1% FA. Peptides were then moved on a PicoFrit C18 column (75 μm × 25 cm, from NewObjective Inc., Woburn, MA, USA) and eluted at 300 nL/min with a 2 h gradient of acetonitrile in water (3–45%, both eluents were added with % FA). ACN content was then ramped at 90% in 5 min and kept for additional 5 min. The system was then reconditioned to 3% for 18 min. Eluted peptides were analyzed in positive ion mode. Ion spray voltage was set to 2400 V, spray gas 1 and curtain gas values were set to 4 and 30 respectively. Source temperature was kept at 75 °C and declustering potential was set to 80 V. Up to 40 precursor ions were selected for MS/MS experiment using the following criteria: charge state +2 to +5 with intensity above 150 counts. Scan range was set to 400–1200 m/z for the MS survey scan and to 100–1600 m/z for each MS/MS scan. Collision energy values recommended by SCIEX were used for peptide fragmentation. The total ion chromatograms for the eight fractions is represented in Fig. 1.
Fig. 1

Overlapped LC-MS traces of the eight fractions from high-pH fractionation.

Overlapped LC-MS traces of the eight fractions from high-pH fractionation.

Protein identification and creation of the ion library

A total of 313362 MS/MS spectra were collected in our experiment (an average of 43 MS/MS spectra per second, considering the whole gradient time). Raw data files from all the fractions were analyzed using ProteinPilot and MASCOT (Matrix science Ltd, London, UK) software. In both cases, MS/MS spectra were searched against the reference Mus Musculus proteome downloaded September the 27, 2017 from UNIPROT database (http://www.uniprot.org/proteomes/UP000000589) and consisting of 52015 proteins. Carbamidomethylation was set as fixed modification. The purpose of this effort was to ensure the highest quality to our ion library, by only retaining proteins at maximum 1% false discovery rate (FDR). FDR analysis is normally performed by searching MS/MS data against a given target database and against a corresponding decoy database [10], where the target sequence is randomized (same aminoacid composition, but randomized sequence) or semi-randomized (same aminoacid composition, correct first and last residues, randomized remaining sequence). The ratio between the number of protein IDs found against the decoy database and the number of protein IDs found against the target database returns the FDR estimate for that dataset (Global FDR) [11]. ProteinPilot, working with the Paragon algorithm [12], uses a confidence score to assign identification likeliness at peptide spectrum match level (PSM FDR) and it provides a calculation of the corresponding FDR value at protein level by using the method published by Tang [13]. A preliminary search of our data with ProteinPilot, using a minimal set of PTMs (Rapid search, consisting of methionine oxidation, asparagine and glutamine deamidation and conversion of glutamine and glutamate to pyroglutammic acid) and setting a 10% confidence score threshold (default search parameters recommended by SCIEX for optimal FDR calculation) returned 4394 IDs having a calculated maximum FDR of 1%. The algorithm also estimated that a protein FDR of 1% would correspond to a confidence score of 79.6%. Based on this estimate, a dataset obtained with 80% confidence threshold should already consist of proteins having 1% maximum FDR. We then refined our searches using ProteinPilot at increasing confidence scores (20–99.9%) and we plotted the corresponding number of identified proteins. These numbers of decoy proteins were found by ProteinPilot at each different confidence score: 20%: 879, 60%: 165, 80%: 90, 90%: 24, 99%: 1, 99.9%: 0. As a comparative test, we then searched our data using MASCOT software with only one PTMs (methionine oxidation), in order to reduce the overall chance of false hits. MASCOT allows an explicit control of FDR at protein level and it returned 3779 protein hits at 1% FDR protein level. The results of the search are shown in Fig. 2.
Fig. 2

Number of protein IDs at different ProteinPilot confidence scores (black) compared with MASCOT protein IDs at 1% FDR (grey).

Number of protein IDs at different ProteinPilot confidence scores (black) compared with MASCOT protein IDs at 1% FDR (grey). A ProteinPilot search done with a confidence score of 99% returns slightly less protein IDs than a MASCOT search with a controlled protein 1% FDR (3727). We then checked the degree of overlapping of the two sets of results, by plotting the corresponding Venn diagrams at the level of both individual proteins (Fig. 3) and corresponding genes (Fig. 4)
Fig. 3

Venn diagram showing the overlapping of the two dataset at individual protein level.

Fig. 4

Venn diagram showing the overlapping of the two dataset at gene level.

Venn diagram showing the overlapping of the two dataset at individual protein level. Venn diagram showing the overlapping of the two dataset at gene level. As demonstrated by the plots, the overlapping degree of the two dataset is not particularly high at individual proteins level, with roughly 65% ProteinPilot IDs shared with MASCOT. The degree of overlapping is much higher at the level of the corresponding genes, with roughly 90% of the genes represented by ProteinPilot protein IDs shared with MASCOT. The conversion of the protein IDs into their corresponding genes was done using the ID mapping tool on the Uniprot website (http://www.uniprot.org/uploadlists/). We also investigated the role of the two algorithms (Paragon and MASCOT) in the total number of MS/MS spectra positively assigned to protein hits (Fig. 5).
Fig. 5

Percentage of MS/MS spectra positively assigned to protein hits by MASCOT (grey) and ProteinPilot at different confidence scores (black).

Percentage of MS/MS spectra positively assigned to protein hits by MASCOT (grey) and ProteinPilot at different confidence scores (black). The number of MS/MS spectra assigned by MASCOT to protein hits, compared to the total number of acquired ones (313362) is markedly lower (28%) than those assigned by ProteinPilot (51–53%). Besides the differences between the two search algorithms, this difference is also due to lower number of PTMs used for MASCOT (1) compared to ProteinPilot (5). The latter software does not allow the selection of individual PTMs in the search other than those indicated above for the “Rapid” search effort. This discrepancy in PTMs might also partially explain the differences with MASCOT in the overlapping of individual proteins compared to the corresponding genes (Fig. 3, Fig. 4).

Creation of the ion library and performance test

We built the SWATH ion library applying the most stringent quality criteria we could. The.group file resulting from Protein Pilot search at 99% confidence was loaded on the SWATH microapp tool embedded in PeakView algorithm. No peptides shared by more than one protein were imported. We then removed from the ion library all the peptides carrying PTMs other than CAM, thus allowing only this fixed modification. The reference data (transitions and expected RT values) for the PepCalMix were manually added the ion library using a text editor. Table 1 reports a summary of the features of the final ion library
Table 1

Features incorporated in the aNSC ion library.

MRM Assays594922
Peptides25394
Proteins (Max 1% FDR)3673
Features incorporated in the aNSC ion library. For each entry, the library reports the Q1 and Q3 m/z values (precursor and fragment respectively), the observed retention time (in minutes), the relative intensity of the transition, the associated protein name and peptide sequence, the precursor charge state and the fragment type, plus several other flags. The above described ion library has been uploaded in the SWATATLAS website and can be freely downloaded from http://www.peptideatlas.org/PASS/PASS01110. Any TripleToF user worldwide can immediately use the ion libraries we are hereby sharing. Commercially available standard peptides for retention time recalibration other than the PepCalMix sold by SCIEX can in principle be used. The ion library must then be updated with the corresponding MRM and RT data. This can easily be done by using a text editor.
Subject areaProteomics, Neuroscience
More specific subject areaSWATH proteomics
Type of dataIon library for SWATH proteomics
How data was acquired2D-LC MS/MS in Data-Dependent mode
Data formatRaw data and a compiled ion library
Experimental factorsTryptic digest of mouse adult hippocampal neural stem cells fractionated by 2D LC
Experimental featuresData dependent MS/MS acquisition followed by database search against Mus Musculus reference proteome.
Data source locationFondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
Data accessibilityAll data are freely downloadable fromwww.swathatlas.org
  13 in total

Review 1.  Approaches for targeted proteomics and its potential applications in neuroscience.

Authors:  Sumit Sethi; Dipti Chourasia; Ishwar S Parhar
Journal:  J Biosci       Date:  2015-09       Impact factor: 1.826

2.  The Paragon Algorithm, a next generation search engine that uses sequence temperature values and feature probabilities to identify peptides from tandem mass spectra.

Authors:  Ignat V Shilov; Sean L Seymour; Alpesh A Patel; Alex Loboda; Wilfred H Tang; Sean P Keating; Christie L Hunter; Lydia M Nuwaysir; Daniel A Schaeffer
Journal:  Mol Cell Proteomics       Date:  2007-05-27       Impact factor: 5.911

3.  Nonlinear fitting method for determining local false discovery rates from decoy database searches.

Authors:  Wilfred H Tang; Ignat V Shilov; Sean L Seymour
Journal:  J Proteome Res       Date:  2008-08-14       Impact factor: 4.466

4.  Graphene Oxide Nanosheets Disrupt Lipid Composition, Ca(2+) Homeostasis, and Synaptic Transmission in Primary Cortical Neurons.

Authors:  Mattia Bramini; Silvio Sacchetti; Andrea Armirotti; Anna Rocchi; Ester Vázquez; Verónica León Castellanos; Tiziano Bandiera; Fabrizia Cesca; Fabio Benfenati
Journal:  ACS Nano       Date:  2016-07-05       Impact factor: 15.881

Review 5.  PeptideAtlas: a resource for target selection for emerging targeted proteomics workflows.

Authors:  Eric W Deutsch; Henry Lam; Ruedi Aebersold
Journal:  EMBO Rep       Date:  2008-05       Impact factor: 8.807

6.  Artificial decoy spectral libraries for false discovery rate estimation in spectral library searching in proteomics.

Authors:  Henry Lam; Eric W Deutsch; Ruedi Aebersold
Journal:  J Proteome Res       Date:  2010-01       Impact factor: 4.466

7.  An off-line high pH reversed-phase fractionation and nano-liquid chromatography-mass spectrometry method for global proteomic profiling of cell lines.

Authors:  Hang Wang; Shengnan Sun; Yi Zhang; Si Chen; Ping Liu; Bin Liu
Journal:  J Chromatogr B Analyt Technol Biomed Life Sci       Date:  2014-11-04       Impact factor: 3.205

8.  Unbiased False Discovery Rate Estimation for Shotgun Proteomics Based on the Target-Decoy Approach.

Authors:  Lev I Levitsky; Mark V Ivanov; Anna A Lobas; Mikhail V Gorshkov
Journal:  J Proteome Res       Date:  2016-12-13       Impact factor: 4.466

9.  Improving Protein Detection Confidence Using SWATH-Mass Spectrometry with Large Peptide Reference Libraries.

Authors:  Jemma X Wu; Dana Pascovici; Vera Ignjatovic; Xiaomin Song; Christoph Krisp; Mark P Molloy
Journal:  Proteomics       Date:  2017-10       Impact factor: 3.984

Review 10.  Connecting Brain Proteomics with Behavioural Neuroscience in Translational Animal Models of Neuropsychiatric Disorders.

Authors:  Zoltán Sarnyai; Paul C Guest
Journal:  Adv Exp Med Biol       Date:  2017       Impact factor: 2.622

View more
  6 in total

1.  Quantification of Changes in Protein Expression Using SWATH Proteomics.

Authors:  Clarissa Braccia; Nara Liessi; Andrea Armirotti
Journal:  Methods Mol Biol       Date:  2021

2.  MiR-135a-5p Is Critical for Exercise-Induced Adult Neurogenesis.

Authors:  Meritxell Pons-Espinal; Caterina Gasperini; Matteo J Marzi; Clarissa Braccia; Andrea Armirotti; Alexandra Pötzsch; Tara L Walker; Klaus Fabel; Francesco Nicassio; Gerd Kempermann; Davide De Pietri Tonelli
Journal:  Stem Cell Reports       Date:  2019-05-23       Impact factor: 7.765

3.  The Generation of a Comprehensive Spectral Library for the Analysis of the Guinea Pig Proteome by SWATH-MS.

Authors:  Pawel Palmowski; Rachael Watson; G Nicholas Europe-Finner; Magdalena Karolczak-Bayatti; Andrew Porter; Achim Treumann; Michael J Taggart
Journal:  Proteomics       Date:  2019-07-22       Impact factor: 3.984

4.  Probing SWATH-MS as a tool for proteome level quantification in a nonmodel fish.

Authors:  Alison A Monroe; Huoming Zhang; Celia Schunter; Timothy Ravasi
Journal:  Mol Ecol Resour       Date:  2020-08-04       Impact factor: 7.090

5.  Plasma Proteomic Analysis in Morquio A Disease.

Authors:  José V Álvarez; Susana B Bravo; María Pilar Chantada-Vázquez; Sofía Barbosa-Gouveia; Cristóbal Colón; Olalla López-Suarez; Shunji Tomatsu; Francisco J Otero-Espinar; María L Couce
Journal:  Int J Mol Sci       Date:  2021-06-07       Impact factor: 5.923

6.  Proteomic Analysis in Morquio A Cells Treated with Immobilized Enzymatic Replacement Therapy on Nanostructured Lipid Systems.

Authors:  J Víctor Álvarez; Susana B Bravo; María García-Vence; María J De Castro; Asteria Luzardo; Cristóbal Colón; Shunji Tomatsu; Francisco J Otero-Espinar; María L Couce
Journal:  Int J Mol Sci       Date:  2019-09-18       Impact factor: 5.923

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.