Literature DB >> 33855135

Data processing for fennel protein characterization by Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR-MS).

Maria Teresa Melfi1, Basem Kanawati2, Philippe Schmitt-Kopplin2,3, Luigi Macchia4, Diego Centonze1, Donatella Nardiello1.   

Abstract

An untargeted shot-gun approach is described for the ultra-high-resolution analysis of fennel proteins by Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR-MS) combined with a home-made Matlab search algorithm. The first step of the proposed bioinformatic strategy was the development of a custom-made fennel protein database, starting from the well-known, on-line available, protein NCBI database, under Foeniculum Vulgare organism, consisting of 231 total proteins. Partial and redundant forms of proteins, repeatedly included in the official NCBI database under different codes, were removed. In the final custom-made database, in addition to the 92 fennel specific non-redundant proteins, 10 proteins belonging to recognized allergenic sources associated with spice-mugwort-allergy syndrome (celery, carrot, parsley, birch, and mugwort) were also included. The second step was the in-silico enzymatic digestion, performed on all the 102 proteins, to obtain a theoretical list of m/z dataset of tryptic peptides. The Matlab processing data was the third and crucial step, necessary to search for in-silico mass calculated peptide sequences in the high resolution ICR mass spectra of the digested fennel extract. The final step was based on database searching in Peptide Mass Fingerprint (PMF) mode by using the matched m/z values as input data. The PMF search results confirmed the presence of 70 proteins (61 fennel specific and 9 allergenic proteins) inside the fennel extract.
© 2021 The Author(s). Published by Elsevier Inc.

Entities:  

Keywords:  Custom-made protein database; Direct-infusion FT-ICR-MS; Fennel proteins; In-silico enzymatic digestion; Matlab data processing; Peptide mass fingerprint

Year:  2021        PMID: 33855135      PMCID: PMC8027264          DOI: 10.1016/j.dib.2021.106960

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the Data

An innovative analytical platform by ultra-high-resolution FT-ICR-MS analysis combined with a home-made new Matlab search algorithm is proposed for the characterization of fennel proteins. The bioinformatics workflow allows for reliable protein characterizations by database searching in Peptide Mass Fingerprint mode, taking full advantages of the high measurement accuracy of the FT-ICR mass analyzer. The described strategy (direct-infusion FT-ICR-MS, peak list extraction, production of a sub-database of non-redundant specific protein entries, calculation of tryptic digestions, comparison between the theoretical and the experimental m/z datasets, and finally, the Mascot database searching in PMF mode) represents a very informative approach for a rapid and accurate protein characterization in food products, as well as in biological and clinical samples. The development of a custom-made protein database has allowed to overcome the intrinsic limits of NCBI protein database in terms of redundancy and incompleteness.

Data Description

All tables and Figures are reported in [1]. Table 1 shows the custom-made fennel protein database (protein name, code, amino acid sequence and molecular weight), generated starting from the official NCBI database associated to the organisms Foeniculum Vulgaris. From the original list of 231 total proteins, the redundant and partial proteins were removed and 92 fennel specific, non-redundant proteins were obtained. In Table 2, the list of the most important allergenic proteins originating from other plants (celery, carrot, parsley, birch and mugwort), known as responsible, together with fennel, of the so-called birch-weed or fruit-spice syndrome [2], [3], [4] are listed. Therefore, in total, the custom-made protein database is composed of 102 different proteins: 92 fennel specific and 10 recognized allergenic proteins. In Fig. 1, the primary structures of all the proteins found in the fennel extracts are shown, highlighting in bold red the amino acid sequences which correspond to the identified peptides. For each protein, peptide sequences and mass are reported in Table 3. For each identification, the number of matched peptides ranged from 3 (for smaller proteins, such as Cytochrome b6/ f complex subunit IV) and 99 (for larger proteins, such as Hypothetical chloroplast RF1). Further details are reported in the related scientific article [5]. Fig. 2A and B shows expanded views of the fennel MS spectrum in the mass segments 600–700 and 700–800 amu, respectively, also showing the amino acidic sequences matching the in silico peptide digests. Other mass segments: 800–1000 and 1000–1200 amu are shown in Fig. 3A and B, respectively. The MS spectra raw data files and the corresponding mass peak lists have been uploaded in a public repository [6]. Fig. 4 shows a comparison between the primary structure of the major pollen allergen Bet v 1-A, from Betula pendula, and the putative homologue protein found in the fennel extract; the bold gray strings correspond to the identified peptides.

Experimental Design, Materials and Methods

Protein extraction from fennel and enzymatic digestion

An amount of 100 g of the edible part of fresh Foeniculum Vulgare (purchased in local supermarkets) was washed properly, minced and homogenated (Heidolph DIAX 900 homogenizer with a Heidolph 10 F probe) for 15 min at 25,000 revolutions per minute (rpm), on ice, in the presence of phosphate buffer solution (PBS) containing Ca2+, Mg2+, EDTA at a final concentration of 2 mM, and 700 µL of plant cell-specific protease inhibitor cocktail composed of 4-(2-aminoethyl) benzenesulfonyl fluoride hydrochloride (AEBSF), 1,10-phenantroline, pepstatin A, bestatin and trans-epoxysuccinyl-Lleucylamido-(4-guanidino)butane (E64), at unknown concentration. The homogenate was then centrifuged at 12,000 xg, for 20 min at 4 °C and the supernatant was recovered. This step was twice repeated. Successively, the sample was ultra-centrifuged at 100,000 xg, for 2 h, at 4 °C. From an initial amount of 100 g of fresh fennel, 40 mL of 100,000 xg supernatant were obtained. The extract was kept at −80 °C, until used. The protein content, determined according to the colorimetric Bradford method [7], was 3.5 mg L−1. The in-solution enzymatic digestion of the raw fennel extracts, diluted 1:10 dilution with water, was performed in duplicate, following the procedure of Khodadadi et al. with slight modifications [8]. Briefly, 400 µL of methanol, 100 µL of chloroform and 300 µL of water were added to 100 µl of fennel protein extract and mixed thoroughly. After centrifugation at 15,000 rpm for 15 min, the upper aqueous phase was discarded, whereas 300 µL of methanol was added slowly to the lower phase. Then, the extract was further centrifuged at 15,000 rpm for 15 min. After drying, the resulting pellet was resuspended in 50 mM NH4HCO3 to reach a pH of 8.5. After reduction with 50 mM dithiothreitol for 60 min at 56 °C, and alkylation with 50 mM iodoacetamide for 60 min at 37 °C in the dark, the enzymatic digestion was performed with trypsin at a 1:100 enzyme/protein concentration for 18 h of incubation at 37 °C. The resulting peptides mixtures were acidified with 5µL of 5% formic acid (FA) and centrifuged at 15,000 rpm for 15 min. Before ESI-FT-ICR mass spectrometry analyses, the peptide mixtures were diluted 1:10 in a mixture of acetonitrile/water (70:30, v/v) containing 0.2% FA.

ESI-FT-ICR mass spectrometry analyses

Ultra-high resolution ESI(+) mass spectra were acquired on a SolariX ion cyclotron resonance Fourier transform mass spectrometer (Bruker Daltonics GmbH, Bremen, DE) equipped with an Apollo II ESI source (Bruker Daltonics GmbH, Bremen, DE) and a 12 T superconducting magnet (Magnex Scientific, Yarnton, UK). The Bruker Solarix contains a linear ion beam guide, which consists of a splitted quadrupole (directly connected to an ion funnel), a quadrupole, which serves as a mass selection filter and a hexapole, which represents a relatively high pressure collision cell, for MS/MS experiments, which are assisted with argon atoms (collision gas). Samples were injected with a flow rate of 2 µL min−1. The MS was calibrated with a 5 mg L−1 arginine solution reaching a mass error below 100 ppb and was tuned in order to obtain the highest sensitivity for peptides in the mass/charge (m/z) range of 122–3000 in broadband detection mode. The resolution was on average of R = 400,000 at m/z 400, enabling an excellent signal differentiation on a molecular level. Tryptic peptide mixtures of fennel proteins were analyzed by direct flow injections by the use of Electrospray Ionization double Quadrupole Fourier Transform-Ion Cyclotron Resonance mass spectrometry (ESI qQ-FT-ICR-MS). The used electrospray voltage was 3600 V in the positive ionization electrospray mode. No in-source ion fragmentation was done. This allowed for detection of the intact peptides resulting from the digested fennel proteins. A sine mathematic apodization function was applied on the time domain transients and is used to significantly reduce the FT-artifacts side lobes (wiggles), which do not count for the real (chemically relevant) signals in the mass spectra [9]. Technical replicates were performed by multiple analysis (at least 3) of the same sample fennel peptide mixture. Extraction of mass spectra peak-lists, mass annotation and deconvolution were performed by using Data Analysis 4.4 (Bruker Daltonics). The mass spectra were calibrated by the use of the cluster ions of arginine in positive Electrospray ion mode, which range from m/z 175 from the [M + H]+ monomer ion until reaching m/z 1220 heptamer [7M+H]+ cluster ion.

Data processing and database searching

Starting from the official protein NCBI database under Foeniculum Vulgare organism (consisting in 231 total proteins), a custom-made fennel database of 92 different proteins was derived by removing partial and redundant forms of proteins, repeatedly included in the original NCBI database under different codes. Then, in the final version of the custom-made protein database, 10 additional proteins belonging to recognized allergenic sources associated with spice-mugwort-allergy syndrome (celery, carrot, parsley, birch, and mugwort) were also included. Each protein of the above described NCBI subset was subjected to simulated tryptic digestion by the use of the ExPaSy peptide mass calculator tool (ExPaSy Bioinformatics Resource Portal, https://web.expasy.org/peptide_mass) by setting the following parameters: enzyme trypsin, two allowed missed cleavages; mass range from 0 to unlimited Dalton; cysteines treated with iodoacetamide. For each protein, the in silico enzymatic digestion was performed by selecting consecutively the option to save the theoretical peptide masses in form of [M + H]+, [M + 2H]2+ and [M + 3H]3+ in order to have a complete and realistic mass list of all the putative m/z ions, as a result of the use of an ESI source. The obtained tryptic peptide theoretical m/z lists were compared to the experimental m/z dataset of the fennel FT-ICR mass spectrum by a home-made Matlab search algorithm (as reported in the Supplementary Material), with a mass tolerance of 5 ppm for this step. The matched m/z values (z=+1; +2; and +3) were converted in the query format required for database Peptide Mass Fingerprint (PMF) searching (MASCOT search engine, Matrix Science, London, UK) as [M + H]+ ions according to Eq.: [M + H]+=z*m/z +(1-z)proton mass, where z is the charge state). By using as input data the matched [M + H]+ lists, the PMF searching was performed under the Viridiplantae category of the official full redundant NCBI database. A maximum number of 2 missed cleavages were allowed and the option carbamidomethylation at cysteine residues was checked. A peptide mass tolerance of 2 ppm was set in the error window.

CRediT Author Statement

Study concept and design: B. Kanawati and D. Nardiello: Acquisition of data; M.T. Melfi and B. Kanawati: Analysis and interpretation of data; B. Kanawati, D. Nardiello and M.T. Melfi: Drafting of the manuscript; D. Nardiello, B. Kanawati and M.T. Melfi: Clinical studies on hypersensitive patients with fennel allergy; L. Macchia: Critical revision of the manuscript for important intellectual content; P. Schmitt-Kopplin and D. Centonze.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships which have or could be perceived to have influenced the work reported in this article.
SubjectProteomics, Analytical Chemistry, Food Chemistry
Specific subject areaAnalytical Chemistry applied to protein analysis and food chemistry
Type of dataTableFigure
How data were acquiredStarting from the official NCBI database, a custom-made fennel protein database was derived by removing redundant and partial proteins and including recognized allergenic proteins associated with spice-mugwort-allergy syndrome. Tryptic peptide theoretical lists were obtained by in-silico enzymatic digestion of custom-made database proteins and compared to the experimental m/z values of the FT-ICR mass spectra of the digested fennel extract by a home-made Matlab search algorithm.A list of proteins inside the fennel extract was obtained by MASCOT Peptide Mass Fingerprint database search by using as input data that matched theoretical/experimental m/z dataset.
Data formatRawAnalyzed
Parameters for data collectionFT-ICR-MS analysis parameters, data processing and bioinformatics for protein characterization are briefly described in this paper and fully provided in the related research article.
Description of data collectionData were collected by SolariX ion cyclotron resonance Fourier transform mass spectrometer (Bruker Daltonics GmbH, Bremen, DE) equipped with an Apollo II ESI source (Bruker Daltonics GmbH, Bremen, DE) and a 12 T superconducting magnet (Magnex Scientific, Yarnton, UK).
Data source locationFennel extracts were prepared in the laboratories of the Section: Allergy and Clinical Immunology at the University of Bari (Bari, ITALY).The fennel tryptic peptide mixtures were prepared in the laboratories of the Research Unit Analytical BioGeoChemistry at Helmholtz Zentrum München (Neuherberg, Germany)
Data accessibilityRepository name: Mendeley Data: Data processing for fennel protein characterization by Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR-MS)Data identification number: http://dx.doi.org/10.17632/s354fkt6fp.1Direct URL to data: http://dx.doi.org/10.17632/s354fkt6fp.1Repository name: Mendeley Data: Supplementary Data for: Data processing for fennel protein characterization by Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR-MS)Data identification number: http://dx.doi.org/10.17632/9fxgd8wb2j.1Direct URL to data: http://dx.doi.org/10.17632/9fxgd8wb2j.1
Related research articleM.T. Melfi, B. Kanawati, P. Schmitt-Kopplin, L. Macchia, D. Centonze, D. Nardiello, Investigation of fennel protein extracts by shot-gun Fourier transform ion cyclotron resonance mass spectrometry, Food Research International 139 (2021) 109,919 https://doi.org/10.1016/j.foodres.2020.109919
  7 in total

1.  Relevance of pollen-specific IgE levels to the development of Apiaceae hypersensitivity in patients with birch pollen allergy.

Authors:  R Asero
Journal:  Allergy       Date:  1997-05       Impact factor: 13.146

2.  A rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding.

Authors:  M M Bradford
Journal:  Anal Biochem       Date:  1976-05-07       Impact factor: 3.365

3.  Fourier transform (FT)-artifacts and power-function resolution filter in Fourier transform mass spectrometry.

Authors:  Basem Kanawati; Theresa M Bader; Karl-Peter Wanczek; Yan Li; Philippe Schmitt-Kopplin
Journal:  Rapid Commun Mass Spectrom       Date:  2017-10-15       Impact factor: 2.419

4.  Investigation of fennel protein extracts by shot-gun Fourier transform ion cyclotron resonance mass spectrometry.

Authors:  Maria Teresa Melfi; Basem Kanawati; Philippe Schmitt-Kopplin; Luigi Macchia; Diego Centonze; Donatella Nardiello
Journal:  Food Res Int       Date:  2020-11-30       Impact factor: 6.475

5.  Characterization of allergens in Apiaceae spices: anise, fennel, coriander and cumin.

Authors:  E Jensen-Jarolim; A Leitner; R Hirschwehr; D Kraft; B Wüthrich; O Scheiner; J Graf; C Ebner
Journal:  Clin Exp Allergy       Date:  1997-11       Impact factor: 5.018

6.  Fennel allergy is a lipid-transfer protein (LTP)-related food hypersensitivity associated with peach allergy.

Authors:  Elide A Pastorello; Laura Farioli; Chrysi Stafylaraki; Joseph Scibilia; Maria G Giuffrida; Ambra Mascheri; Marta Piantanida; Cristina Baro; Laura Primavesi; Michele Nichelatti; Jan W Schroeder; Valerio Pravettoni
Journal:  J Agric Food Chem       Date:  2013-01-14       Impact factor: 5.279

7.  Leaf proteomics of drought-sensitive and -tolerant genotypes of fennel.

Authors:  Ehsaneh Khodadadi; Barat Ali Fakheri; Saeed Aharizad; Abbasali Emamjomeh; Majid Norouzi; Setsuko Komatsu
Journal:  Biochim Biophys Acta Proteins Proteom       Date:  2017-09-05       Impact factor: 3.036

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.