Exploration of human cerebrospinal fluid: A large proteome dataset revealed by trapped ion mobility time-of-flight mass spectrometry.

Charlotte Macron, Regis Lavigne, Antonio Núñez Galindo, Michael Affolter, Charles Pineau, Loïc Dayon.

Abstract

Cerebrospinal fluid (CSF) is a biofluid in direct contact with the brain and as such constitutes a sample of choice in neurological disorder research, including neurodegenerative diseases such as Alzheimer's or Parkinson's disease. Human CSF has nevertheless been studied less with proteomic technologies than other biological fluids such as blood plasma or serum. In this work, a pool of "normal" human CSF samples was analysed using a shotgun proteomic workflow that combined removal of highly abundant proteins by immunoaffinity depletion and isoelectric focussing fractionation of tryptic peptides to reduce the complexity of the biofluid. The resulting 24 fractions were analysed using liquid chromatography coupled to a high-resolution and high-accuracy timsTOF Pro mass spectrometer. This state-of-the-art mass spectrometry-based proteomic workflow allowed the identification of 3'174 proteins in CSF. The dataset reported herein adds to the pool of the most comprehensive human CSF proteomes obtained so far. An overview of the identified proteins is provided based on gene ontology annotation. Mass and tandem mass spectra are made available as a possible starting point for further studies exploring the human CSF proteome.
© 2020 The Authors.

Keywords:  Cerebrospinal fluid; LC-MS/MS; Large-scale proteome; Mass spectrometry; Proteomics

Year:  2020        PMID: 32478154      PMCID: PMC7251648          DOI: 10.1016/j.dib.2020.105704

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications table

Value of the data

A comprehensive proteomic profile of “normal” human CSF, among the largest reported so far using LC-MS/MS, is provided.
The data are useful for enhanced characterization and annotation of the human CSF proteome.
The data are valuable to the proteomic community for spectral library generation and as a starting point for clinical studies focussing on CSF and neurological disorders.
The data provide information for targeted protein/peptide assay development in human CSF.

Data description

The dataset presented herein identified 3’174 proteins and their respective 25’227 peptides in “normal” CSF; protein and peptide lists are provided in Supplementary Table S1. The human CSF sample analyzed in this report was previously analyzed with different LC-MS/MS instrumentation to assess the throughput and robustness of an automated pipeline for biomarker discovery [4] and to deeply characterize the human CSF proteome in the quest for identification of missing proteins [1,5]. In the present work, the previously prepared sample was analyzed again using the recent timsTOF Pro mass spectrometer to evaluate its capabilities in terms of CSF proteome coverage. MS data were thus acquired by analysing CSF depleted of abundant proteins, after tryptic digestion and peptide fractionation, using a nanoElute LC system coupled to a timsTOF Pro mass spectrometer. MS raw files were then converted into peaklists with MSConvert and searched against the human UniProtKB/Swiss-Prot database using Mascot and X! Tandem. The Scaffold software, specifying a false discovery rate (FDR) of 1% at both protein and peptide level, and a one unique peptide criterion, was used to report protein identifications. Gene Ontology (GO) annotation was performed with the Panther software (Fig. 1). Binding and Catalytic activity represented 78% of the molecular functions. Cellular process was the most represented biological process (i.e., 23% of all genes); lastly, Cell and Cell part (21% each) were the major cellular components identified in this dataset.
Fig. 1

GO terms of the genes representative of the 3’174 proteins identified in the CSF dataset. The Panther software was used for the GO annotation on the three ontologies, (a) molecular function (b) biological process and (c) cellular component.

A GO enrichment analysis was also performed with GOrilla [6] to identify terms enriched in this “normal” human CSF sample with respect to the whole human proteome (Table 1). Terms related to semaphorin/neuropilin/plexin, such as “semaphorin receptor activity”, “axon guidance receptor activity” or “semaphorin-plexin signaling pathway involved in neuron projection guidance”, were particularly enriched in this dataset.
Table 1

GO term enrichment for the genes representative of the 3’174 proteins identified in the CSF dataset. GO term enrichment analysis was performed with GOrilla [6] on the three ontologies: (a) molecular function, (b) biological process and (c) cellular component. The background used for the enrichment analysis was the full human proteome (UniProtKB/Swiss-Prot 2020/02 release). In the table, only terms with a p-value below 10⁻⁵ and a fold enrichment above 5 are displayed. All the enrichment results are presented in Supplementary Tables S2-4.

(a) Molecular Function

| GO number | GO term | Number of proteins identified in CSF | Total number of proteins in human UniProtKB/Swiss-Prot | Fold enrichment |
|---|---|---|---|---|
| GO:0097493 | structural molecule activity conferring elasticity | 12 | 12 | 5.96 |
| GO:0048407 | platelet-derived growth factor binding | 11 | 11 | 5.96 |
| GO:0030023 | extracellular matrix constituent conferring elasticity | 10 | 10 | 5.96 |
| GO:0031995 | insulin-like growth factor II binding | 8 | 8 | 5.96 |
| GO:0031994 | insulin-like growth factor I binding | 12 | 13 | 5.51 |
| GO:0045499 | chemorepellent activity | 23 | 25 | 5.49 |
| GO:0017154 | semaphorin receptor activity | 11 | 12 | 5.47 |
| GO:0008046 | axon guidance receptor activity | 8 | 9 | 5.30 |
| GO:0008191 | metalloendopeptidase inhibitor activity | 14 | 16 | 5.22 |
| GO:0086080 | protein binding involved in heterotypic cell-cell adhesion | 11 | 13 | 5.05 |

(b) Biological Process

| GO number | GO term | Number of proteins identified in CSF | Total number of proteins in human UniProtKB/Swiss-Prot | Fold enrichment |
|---|---|---|---|---|
| GO:0006957 | complement activation, alternative pathway | 13 | 13 | 5.96 |
| GO:0097104 | postsynaptic membrane assembly | 10 | 10 | 5.96 |
| GO:0048251 | elastic fiber assembly | 9 | 9 | 5.96 |
| GO:0099545 | trans-synaptic signaling by trans-synaptic complex | 8 | 8 | 5.96 |
| GO:1902669 | positive regulation of axon guidance | 8 | 8 | 5.96 |
| GO:1902284 | neuron projection extension involved in neuron projection guidance | 8 | 8 | 5.96 |
| GO:0048846 | axon extension involved in axon guidance | 8 | 8 | 5.96 |
| GO:0061684 | chaperone-mediated autophagy | 7 | 7 | 5.96 |
| GO:0048842 | positive regulation of axon extension involved in axon guidance | 7 | 7 | 5.96 |
| GO:1902285 | semaphorin-plexin signaling pathway involved in neuron projection guidance | 12 | 13 | 5.51 |
| GO:1902287 | semaphorin-plexin signaling pathway involved in axon guidance | 11 | 12 | 5.47 |
| GO:0001941 | postsynaptic membrane organization | 11 | 12 | 5.47 |
| GO:0042340 | keratan sulfate catabolic process | 11 | 12 | 5.47 |
| GO:0097090 | presynaptic membrane organization | 10 | 11 | 5.42 |
| GO:0097105 | presynaptic membrane assembly | 9 | 10 | 5.37 |
| GO:0034371 | chylomicron remodeling | 8 | 9 | 5.30 |
| GO:0071526 | semaphorin-plexin signaling pathway | 31 | 35 | 5.28 |
| GO:0099560 | synaptic membrane adhesion | 22 | 25 | 5.25 |
| GO:0042730 | fibrinolysis | 19 | 22 | 5.15 |
| GO:0030207 | chondroitin sulfate catabolic process | 12 | 14 | 5.11 |
| GO:0048841 | regulation of axon extension involved in axon guidance | 26 | 31 | 5.00 |

(c) Cellular Component

| GO number | GO term | Number of proteins identified in CSF | Total number of proteins in human UniProtKB/Swiss-Prot | Fold enrichment |
|---|---|---|---|---|
| GO:0005577 | fibrinogen complex | 8 | 8 | 5.96 |
| GO:0005593 | FACIT collagen trimer | 7 | 7 | 5.96 |
| GO:0005579 | membrane attack complex | 7 | 7 | 5.96 |
| GO:0005583 | fibrillar collagen trimer | 11 | 12 | 5.47 |
| GO:0002116 | semaphorin receptor complex | 10 | 11 | 5.42 |
| GO:0032279 | asymmetric synapse | 8 | 9 | 5.30 |
| GO:0098651 | basement membrane collagen trimer | 8 | 9 | 5.30 |
| GO:0042627 | chylomicron | 11 | 13 | 5.05 |
| GO:0071682 | endocytic vesicle lumen | 16 | 19 | 5.02 |
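The fold-enrichment column of Table 1 can be sanity-checked from the two count columns. The sketch below assumes the usual enrichment definition, (b/n)/(B/N), where b is the number of term members identified in CSF, B the number of term members in the background, and n and N the sizes of the CSF and background gene lists. The source does not state n or N, so the ratio N/n ≈ 5.96 is inferred here from the rows in which every background member was identified (b = B). The function and constant names are illustrative.

```python
# Ratio N/n, inferred (not stated in the source) from rows where b == B.
BACKGROUND_TO_TARGET_RATIO = 5.96


def fold_enrichment(found_in_csf: int, total_in_background: int,
                    ratio: float = BACKGROUND_TO_TARGET_RATIO) -> float:
    """Enrichment = (b / n) / (B / N), rewritten as (b / B) * (N / n)."""
    return found_in_csf / total_in_background * ratio


# (GO number, proteins in CSF, proteins in Swiss-Prot, tabulated enrichment)
rows = [
    ("GO:0031994", 12, 13, 5.51),
    ("GO:0045499", 23, 25, 5.49),
    ("GO:0071526", 31, 35, 5.28),
    ("GO:0071682", 16, 19, 5.02),
]

for go_id, b, big_b, tabulated in rows:
    computed = fold_enrichment(b, big_b)
    print(f"{go_id}: computed {computed:.2f}, tabulated {tabulated:.2f}")
```

Small differences in the second decimal are expected because the inferred ratio is itself a rounded value.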
When we compared this dataset to our previous data acquired with an Orbitrap Fusion Lumos instrument, which identified 20’689 peptides mapping to 3’379 proteins [1], we found that 57.4% of the proteins (i.e., 2’390 proteins) were common to both datasets, as well as almost 14’000 peptides (i.e., 43.8%) (Fig. 2).
Fig. 2

Comparison of protein and peptide identifications in CSF between our previously published dataset obtained with an Orbitrap Fusion Lumos instrument [1], and the current dataset obtained with a timsTOF Pro mass spectrometer.
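The overlap percentages quoted for the two instruments appear to be expressed relative to the union of both datasets rather than to either dataset alone. This is an inference from the numbers, not something the source states. The sketch below reproduces the protein figure under that assumption and inverts the same formula to estimate the shared peptide count, which the text gives only as "almost 14’000". All names are illustrative.

```python
def overlap_percent_of_union(size_a: int, size_b: int, shared: int) -> float:
    """Shared identifications as a percentage of the union of two datasets."""
    union = size_a + size_b - shared
    return 100.0 * shared / union


# Protein-level numbers quoted in the text.
TIMSTOF_PROTEINS = 3174
ORBITRAP_PROTEINS = 3379
SHARED_PROTEINS = 2390

protein_overlap = overlap_percent_of_union(
    TIMSTOF_PROTEINS, ORBITRAP_PROTEINS, SHARED_PROTEINS
)
print(f"Protein overlap: {protein_overlap:.1f}% of the union")

# Peptide level: solve shared / (total - shared) = f for the shared count.
TIMSTOF_PEPTIDES = 25227
ORBITRAP_PEPTIDES = 20689
QUOTED_FRACTION = 0.438

estimated_shared_peptides = (
    QUOTED_FRACTION * (TIMSTOF_PEPTIDES + ORBITRAP_PEPTIDES) / (1 + QUOTED_FRACTION)
)
print(f"Estimated shared peptides: {estimated_shared_peptides:.0f}")
```

Under the union assumption the estimate lands just below 14,000, which is consistent with the wording in the text.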


Experimental design, materials, and methods

Sample preparation

The sample preparation was performed previously [1,5]. Briefly, 96 aliquots of 400 μL of a commercial pooled CSF sample (Analytical Biological Services) were evaporated with a vacuum centrifuge (Thermo Scientific). The dried samples were diluted in depletion Buffer A (Agilent Technologies) containing 9.65 µg/mL of β-lactoglobulin from bovine milk. Abundant CSF proteins were removed using MARS columns (Agilent Technologies) and HPLC systems (Thermo Scientific) equipped with an HTC-PAL (CTC Analytics AG) fraction collector. Buffer exchange was performed with Strata-X 33u polymeric reversed-phase (RP) (30 mg/1 mL) cartridges mounted on a 96-hole holder and a vacuum manifold, as previously described [7]. Samples were subsequently evaporated and subjected to reduction, alkylation, digestion, tandem mass tag (TMT) 6-plex (Thermo Scientific) labeling, pooling and purification using a 4-channel Microlab Star liquid handler workstation (Hamilton) in a 96-well-plate format and according to previously reported protocols [4,7-9]. Briefly, each sample was dissolved in 95 μL of triethylammonium bicarbonate (TEAB) 100 mM and 5 μL of 2% sodium dodecyl sulfate. A volume of 5.3 μL of tris(2-carboxyethyl)phosphine (20 mM) was added and incubation was performed for 1 h at 55°C. A volume of 5.5 μL of iodoacetamide 150 mM was added (incubation for 1 h in darkness). Enzymatic digestion was performed via the addition of 10 μL of trypsin/Lys-C at 0.25 μg/μL in 100 mM TEAB (incubation overnight at 37°C). TMT labeling was performed via the addition of 0.8 mg of TMT 6-plex reagent in 41 μL of CH3CN (incubation for 1 h at room temperature). After reaction, a volume of 8 μL of hydroxylamine 5% in H2O was added to each tube to react for 15 min. Samples from a given TMT 6-plex experiment were pooled together in a new tube.
Pooled samples (i.e., 16 pools in total from an original 96-sample set) were purified by solid phase extraction with Oasis HLB cartridges from Waters and Strata-X-C 33u polymeric strong cation cartridges from Phenomenex. All samples were resuspended in 200 μL of H2O/CH3CN/formic acid 96.9/3/0.1; 75 μL of each of the 16 resulting pooled samples were mixed together (to get enough material for sample fractionation), dried, and dissolved in 3232.8 µL H2O with 345.6 µL glycerol 50% and 21.6 µL of IPG buffer pH 3-10 (GE Healthcare Life Sciences). The sample was separated into 24 fractions by isoelectric focusing according to a previously published protocol [10], using the 3100 OFFGEL Fractionator (Agilent Technologies) and Immobiline DryStrip pH 3-10 (24 cm) (GE Healthcare Life Sciences).
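The volumes used to redissolve the mixed sample add up to a round total, which can be checked with a few lines of arithmetic. The sketch below is only a worked calculation from the quoted volumes. The per-well volume assumes that the solution is distributed equally over the 24 wells, and the final concentrations assume simple volume additivity; neither is stated explicitly in the text. Variable names are illustrative.

```python
N_FRACTIONS = 24

# Volumes quoted in the text (µL).
water_ul = 3232.8
glycerol_50pct_ul = 345.6
ipg_buffer_ul = 21.6

total_ul = water_ul + glycerol_50pct_ul + ipg_buffer_ul

# Assumption: equal loading of every well of the fractionator.
per_well_ul = total_ul / N_FRACTIONS

# Final concentrations (% v/v), assuming volumes are additive.
glycerol_final_pct = 100 * (glycerol_50pct_ul * 0.50) / total_ul
ipg_final_pct = 100 * ipg_buffer_ul / total_ul

# Upstream of the drying step: 96 samples in 6-plex sets, 75 µL taken per pool.
n_tmt_pools = 96 // 6
pooled_input_ul = n_tmt_pools * 75

print(f"Total volume: {total_ul:.1f} µL, {per_well_ul:.1f} µL per well")
print(f"Glycerol: {glycerol_final_pct:.1f}% v/v, IPG buffer: {ipg_final_pct:.1f}% v/v")
print(f"{n_tmt_pools} pools, {pooled_input_ul} µL mixed before drying")
```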

RP-LC MS/MS analysis

The purified 24 fractions were dissolved in 50 µL H2O/CH3CN/formic acid (FA) 96.9/3/0.1%. A volume of 3 µL of each fraction was then diluted with 7 µL of H2O/FA 99.9/0.1%, and only 2 µL of each diluted fraction was injected for separation on a 75 µm × 250 mm Aurora 2 C18 column (Ion Opticks). A typical RP gradient (Solvent A: 0.1% FA, 99.9% H2O MilliQ; Solvent B: 0.1% FA, 99.9% CH3CN) was run on a nanoflow LC system (nanoElute, Bruker Daltonik GmbH) at a flow rate of 400 nL/min. Column temperature was controlled at 50°C. The LC run lasted 120 min (2% to 15% of Solvent B during 60 min; up to 25% at 90 min; up to 37% at 100 min; up to 95% at 110 min; and finally 95% for 10 min to wash the column). The column was coupled online to a timsTOF Pro with a CaptiveSpray ion source (both from Bruker Daltonik GmbH). The temperature of the ion transfer capillary was set at 180°C. Ions were accumulated for 123 ms, and mobility separation was achieved by ramping the entrance potential from −160 V to −20 V within 123 ms. Mass and tandem mass spectra were acquired with average resolutions of 60,000 and 50,000 (full width at half maximum; mass range 100-1700 m/z), respectively. To enable the parallel accumulation-serial fragmentation (PASEF) method, precursor m/z and mobility information was first derived from full scan TIMS-MS experiments (with a mass range of m/z 100-1700). Singly charged precursors were excluded by their position in the m/z-ion mobility plane and precursors that reached a ‘target value’ of 20,000 a.u. were dynamically excluded for 0.4 min. For fragmentation, the quadrupole isolation width was set to 2 Th for m/z < 700 and 3 Th for m/z ≥ 700, and the collision energies varied between 31 and 52 eV depending on precursor mass and charge. TIMS, MS operation and PASEF were controlled and synchronized using the instrument control software OtofControl 5.1 (Bruker Daltonik).
LC-MS/MS data were acquired using the PASEF method with a total cycle time of 1.23 s, including 1 TIMS MS scan and 10 PASEF MS/MS scans. The 10 PASEF scans (123 ms each) contained on average 12 MS/MS scans per PASEF scan. Ion mobility resolved mass spectra, nested ion mobility versus m/z distributions, as well as summed fragment ion intensities were extracted from the raw data file with DataAnalysis 5.1 (Bruker Daltonik).
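The LC gradient described above is given only as a series of breakpoints. As a minimal sketch, the function below returns the Solvent B percentage at any time of the 120 min run, assuming linear ramps between the stated breakpoints (the text does not specify the ramp shape). The names `GRADIENT` and `percent_b` are illustrative and not part of any instrument software.

```python
from bisect import bisect_right

# (time in min, % Solvent B) breakpoints taken from the method description.
GRADIENT = [(0, 2.0), (60, 15.0), (90, 25.0), (100, 37.0), (110, 95.0), (120, 95.0)]


def percent_b(t_min: float) -> float:
    """Solvent B percentage at t_min, assuming linear ramps between breakpoints."""
    if not GRADIENT[0][0] <= t_min <= GRADIENT[-1][0]:
        raise ValueError("time outside the 0-120 min run")
    times = [t for t, _ in GRADIENT]
    i = bisect_right(times, t_min) - 1
    if i == len(GRADIENT) - 1:  # exactly at the end of the run
        return GRADIENT[-1][1]
    (t0, b0), (t1, b1) = GRADIENT[i], GRADIENT[i + 1]
    return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)


for t in (0, 30, 75, 105, 115):
    print(f"{t:>3} min: {percent_b(t):.1f}% B")
```

A lookup of this kind can help relate a peptide's retention time to the approximate solvent composition at elution, within the limits of the linear-ramp assumption and ignoring system delay volume.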

Data processing and analysis

Protein identification was performed against the human UniProtKB/Swiss-Prot database (2020/02 release) comprising 20’367 protein sequences in total. Mascot (version 2.4.6 from Matrix Science) was used as search engine. Variable amino acid modifications were: oxidized methionine, deamidated asparagine/glutamine, and 6-plex TMT-labeled peptide amino terminus; 6-plex TMT-labeled lysine and carbamidomethylation of cysteine were set as fixed modifications. Trypsin was selected as the proteolytic enzyme, with a maximum of two potential missed cleavages. Peptide and fragment ion tolerances were set to 15 ppm and 0.05 Da, respectively. All Mascot result files were loaded into Scaffold Q+S 4.8.4 (Proteome Software) to be further searched with X! Tandem (The GPM, thegpm.org; version CYCLONE (2010.12.01.1)). The FDR in Scaffold was set to 1% at protein and peptide level, with a one unique peptide criterion to report protein identification.
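The two search tolerances are expressed in different units: the peptide tolerance is relative (ppm) and the fragment tolerance is absolute (Da). The sketch below shows what the 15 ppm peptide tolerance means in absolute terms at a few illustrative m/z values, and what the fixed 0.05 Da fragment tolerance corresponds to in ppm. The m/z values and function names are hypothetical examples and are not taken from the dataset or from any search engine's API.

```python
PEPTIDE_TOLERANCE_PPM = 15.0
FRAGMENT_TOLERANCE_DA = 0.05


def ppm_to_da(mz: float, ppm: float) -> float:
    """Absolute width (Da, or Th for m/z) of a relative tolerance at a given m/z."""
    return mz * ppm * 1e-6


def da_to_ppm(mz: float, delta_da: float) -> float:
    """Relative size (ppm) of an absolute tolerance at a given m/z."""
    return delta_da / mz * 1e6


def tolerance_window(mz: float, ppm: float = PEPTIDE_TOLERANCE_PPM) -> tuple[float, float]:
    """Lower and upper bounds of the accepted range around mz."""
    delta = ppm_to_da(mz, ppm)
    return mz - delta, mz + delta


# Illustrative m/z values spanning the 100-1700 acquisition range.
for mz in (400.0, 700.0, 1000.0, 1700.0):
    low, high = tolerance_window(mz)
    print(
        f"m/z {mz:7.1f}: ±{ppm_to_da(mz, PEPTIDE_TOLERANCE_PPM):.4f} "
        f"({low:.4f} to {high:.4f}); "
        f"0.05 Da = {da_to_ppm(mz, FRAGMENT_TOLERANCE_DA):.1f} ppm"
    )
```

Across the acquired mass range the absolute fragment tolerance is therefore always wider, in relative terms, than the 15 ppm peptide tolerance.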

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships which have, or could be perceived to have, influenced the work reported in this article. C. Macron, A. Núñez Galindo, M. Affolter and L. Dayon are employees of the Société des Produits Nestlé SA.
Subject: Proteomics
Specific subject area: Comprehensive proteome profiling of “normal” human cerebrospinal fluid (CSF) using mass spectrometry (MS).
Type of data: Liquid chromatography tandem mass spectrometry (LC-MS/MS) data.
How data were acquired: LC-MS/MS acquisition on a nanoElute LC system coupled to a timsTOF Pro mass spectrometer.
Data format: Raw and processed.
Parameters for data collection: We re-analyzed samples previously analyzed in a report by Macron et al. [1]. A commercial pool of “normal” human CSF samples was prepared according to a previously published proteomic workflow [2,3], described in the Methods section. Sample fractionation was used.
Description of data collection: LC-MS/MS analyses of the resulting 24 fractions were performed using a nanoElute LC system coupled to a timsTOF Pro mass spectrometer, to evaluate the instrumental performance for the proteomic profiling of CSF with respect to other LC-MS technologies [1]. Mass spectral data were searched using the Mascot and X! Tandem search engines before being visualized and validated with the Scaffold software.
Data source location: Nestlé Research, 1015 Lausanne, Switzerland.
Data accessibility: Protein and peptide lists are provided in Supplementary Table S1. Repository name: ProteomeXchange Consortium. Data identification number: PXD018369.
  10 in total

1.  Tandem mass tags: a novel quantification strategy for comparative analysis of complex protein mixtures by MS/MS.

Authors:  Andrew Thompson; Jürgen Schäfer; Karsten Kuhn; Stefan Kienle; Josef Schwarz; Günter Schmidt; Thomas Neumann; R Johnstone; A Karim A Mohammed; Christian Hamon
Journal:  Anal Chem       Date:  2003-04-15       Impact factor: 6.986

2.  Relative protein quantification by MS/MS using the tandem mass tag technology.

Authors:  Loïc Dayon; Jean-Charles Sanchez
Journal:  Methods Mol Biol       Date:  2012

3.  Deep Dive on the Proteome of Human Cerebrospinal Fluid: A Valuable Data Resource for Biomarker Discovery and Missing Protein Identification.

Authors:  Charlotte Macron; Lydie Lane; Antonio Núñez Galindo; Loïc Dayon
Journal:  J Proteome Res       Date:  2018-08-31       Impact factor: 4.466

4.  Analyzing Cerebrospinal Fluid Proteomes to Characterize Central Nervous System Disorders: A Highly Automated Mass Spectrometry-Based Pipeline for Biomarker Discovery.

Authors:  Antonio Núñez Galindo; Charlotte Macron; Ornella Cominetti; Loïc Dayon
Journal:  Methods Mol Biol       Date:  2019

5.  A Versatile Workflow for Cerebrospinal Fluid Proteomic Analysis with Mass Spectrometry: A Matter of Choice between Deep Coverage and Sample Throughput.

Authors:  Charlotte Macron; Antonio Núñez Galindo; Ornella Cominetti; Loïc Dayon
Journal:  Methods Mol Biol       Date:  2019

6.  Comprehensive and Scalable Highly Automated MS-Based Proteomic Workflow for Clinical Biomarker Discovery in Human Plasma.

Authors:  Loïc Dayon; Antonio Núñez Galindo; John Corthésy; Ornella Cominetti; Martin Kussmann
Journal:  J Proteome Res       Date:  2014-07-24       Impact factor: 4.466

7.  Relative quantification of proteins in human cerebrospinal fluids by MS/MS using 6-plex isobaric tags.

Authors:  Loïc Dayon; Alexandre Hainard; Virginie Licker; Natacha Turck; Karsten Kuhn; Denis F Hochstrasser; Pierre R Burkhard; Jean-Charles Sanchez
Journal:  Anal Chem       Date:  2008-03-01       Impact factor: 6.986

8.  Proteomics of Cerebrospinal Fluid: Throughput and Robustness Using a Scalable Automated Analysis Pipeline for Biomarker Discovery.

Authors:  Antonio Núñez Galindo; Martin Kussmann; Loïc Dayon
Journal:  Anal Chem       Date:  2015-10-16       Impact factor: 6.986

9.  Identification of Missing Proteins in Normal Human Cerebrospinal Fluid.

Authors:  Charlotte Macron; Lydie Lane; Antonio Núñez Galindo; Loïc Dayon
Journal:  J Proteome Res       Date:  2018-08-17       Impact factor: 4.466

10.  GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists.

Authors:  Eran Eden; Roy Navon; Israel Steinfeld; Doron Lipson; Zohar Yakhini
Journal:  BMC Bioinformatics       Date:  2009-02-03       Impact factor: 3.169

