Literature DB >> 33715301

Mass spectrometry-based top-down and bottom-up approaches for proteomic analysis of the Moroccan Buthus occitanus scorpion venom.

Khadija Daoudi1,2, Christian Malosse3, Ayoub Lafnoune1,2, Bouchra Darkaoui1,2, Salma Chakir1, Jean-Marc Sabatier4, Julia Chamot-Rooke3, Rachida Cadi2, Naoual Oukkache1.   

Abstract

Buthus occitanus (B. occitanus) is one of the most dangerous scorpions in the world. Despite the involvement of B. occitanus scorpion in severe cases of envenomation in Morocco, no study has focused yet on the proteomic composition of the Moroccan B. occitanus scorpion venom. Mass spectrometry-based proteomic techniques are commonly used in the study of scorpion venoms. The implementation of top-down and bottom-up approaches for proteomic analyses facilitates screening by allowing a global view of the structural aspects of such complex matrices. Here, we provide a partial overview of the venom of B. occitanus scorpion, in order to explore the diversity of its toxins and hereafter understand their effects. To this end, a combination of top-down and bottom-up approaches was applied using nano-high liquid chromatography coupled to nano-electrospray tandem mass spectrometry (nano-LC-ESI MS/MS). The LC-MS results showed that B. occitanus venom contains around 200 molecular masses ranging from 1868 to 16 720 Da, the most representative of which are those between 5000 and 8000 Da. Interestingly, combined top-down and bottom-up LC-MS/MS results allowed the identification of several toxins, which were mainly those acting on ion channels, including those targeting sodium (NaScTxs), potassium (KScTxs), chloride (ClScTxs), and calcium channels (CaScTx), as well as antimicrobial peptides (AMPs), amphipathic peptides, myotropic neuropeptides, and hypothetical secreted proteins. This study reveals the molecular diversity of B. occitanus scorpion venom and identifies components that may have useful pharmacological activities.
© 2021 The Authors. FEBS Open Bio published by John Wiley & Sons Ltd on behalf of Federation of European Biochemical Societies.

Entities:  

Keywords:  Buthus occitanus scorpion; bottom-up; top-down; toxins; venom; venomic

Mesh:

Substances:

Year:  2021        PMID: 33715301      PMCID: PMC8255848          DOI: 10.1002/2211-5463.13143

Source DB:  PubMed          Journal:  FEBS Open Bio        ISSN: 2211-5463            Impact factor:   2.693


acetonitrile antimicrobial peptides Buthus  occitanus neurotoxins affecting calcium channels neurotoxins affecting chloride channels Dalton Electron‐Transfer/Higher‐Energy Collision Dissociation formic acid higher‐energy C‐trap dissociation iodoacetamide kilodalton neurotoxins affecting potassium channels liquid chromatography coupled to tandem mass spectrometry liquid chromatography coupled to mass spectrometry mass spectrometry molecular weight nano‐liquid chromatography coupled to electrospray tandem mass spectrometry neurotoxins affecting sodium channels quadrupole total ion chromatogram Each year, scorpion stings record new cases of envenomation over the world with an incidence of more than 1.5 million and over 2600 deaths, mainly in tropical and subtropical countries of South America, Asia, and North Africa [1]. Most of these envenomation cases were caused by scorpions belonging to the Buthidae family, which contains dangerous species known by their lethal venoms [2]. The venom of these family members contains a heterogeneous cocktail of compounds, including inorganic substances, enzymes, mucopolysaccharides, allergenic compounds, and peptides with high toxicity toward ionic channels of excitable cells [3, 4, 5, 6]. In Morocco, 26 819 cases of scorpion stings were reported in 2019 by the Poison Control and Pharmacovigilance Center of Morocco, with an incidence of 75.3 cases per 100 000 inhabitants [7]. These statistics are due to the diversified scorpion fauna represented by over 50 species, mainly widespread in the middle and southwestern provinces of the kingdom [8]. Among these species, the yellow scorpion Buthus occitanus (B.occitanus) seems to be one of the most dangerous scorpions, on account of its toxic venom causing the majority of envenomation cases [9]. Although several studies had been carried out on this venom [10, 11, 12, 13], no study has yet focused on the proteomic composition of the Moroccan B. occitanus scorpion venom despite its medical importance. Moreover, there are various strategies to screen scorpion venoms, from using conventional strategies for targeting one single toxin, to applying the most throughput equipment of screening for a detailed view of all toxic components. Nowadays, mass spectrometry‐based proteomic approaches are still one of the most fundamental tools to decrypt the complexity of such matrices, owing to the revolutionary advances in instrumentation and software, in addition to improvement in omics strategies (peptidomic, proteomic, transcriptomic, and genomic) [14, 15, 16, 17, 18, 19]. Among the approaches that have improved significantly the proteomics workflow, there are the top‐down process, which designates a rapid analytical workflow of intact proteins, and the bottom‐up approach, which requires prior proteolytic digestion of proteins before mass spectrometry analysis. These approaches lead to acquiring mass fingerprints, primary structural information, and post‐translational modifications [20, 21, 22, 23]. The application of these approaches, singly or complementary, in several proteomic studies has increased the number of characterized venoms and identified toxins [24, 25, 26, 27, 28, 29]. In this context, this work aimed to ensure an overview of the peptidome of B. occitanus scorpion (< 30 kDa), so exploring its toxins arsenal, using a combination of the top‐down and bottom‐up approaches applied on nano‐high liquid chromatography coupled to a nano‐electrospray tandem mass spectrometry (nano‐LC‐ESI MS/MS).

Materials and methods

Venom preparation

Venom milking

Specimens of B. occitanus were collected from the region of Oualidia (32°44′N 9°01′W), in eastern Morocco. The crude venom was milked by electrical stimulation, pooled, centrifuged at 10 000  for 20 min, freeze‐dried, and stored at −20 °C until use [30].

Venom Reduction/Alkylation

At first, 2 mg of B. occitanus crude venom was subjected to a 30 kDa ‘cutoff’ filter (Amicon® Ultra Centrifugal Filters, Merck Millipore, Tulagreen, Ireland), then centrifuged at 16 900  for 15 min. Disulfide‐bridged half‐cysteine residues of this venom filtrate were reduced by 10 mm of DTT in ammonium bicarbonate buffer (50 mm, pH 8.3), for 45 min at a temperature of 56 °C. Cysteine residues were carboxamido‐methylated by incubation with 50 mm iodoacetamide [IAA in ammonium bicarbonate (50 mm, pH 8.3)] for 1 h in the dark. Then, these proteins/peptides were desalted by ZipTip C4 (Millipore Corporation ‐ Billerica, USA) and concentrated on a Savant SpeedVac (Thermo Scientific, San Jose, CA, USA).

Mass spectrometry‐based proteomic approaches

Top‐down proteomics

Intact and reduced/alkylated B. occitanus venom filtrates were carried out on an Orbitrap Fusion™ Lumos™ mass spectrometer (Thermo Scientific™ Waltham, MA, USA), equipped with a Dionex HPLC (Fig. 1).
Fig. 1

Experimental workflow performed in this study. At first, B. occitanus venom was milked by electrical stimulation and applied to a 30 kDa filter. For the top‐down venomic, the flow‐through containing toxins < 30 kDa was analyzed by the Thermo Scientific ™ Orbitrap Fusion Lumos Tribrid Mass Spectrometer. For the bottom‐up approach, two digest methods were achieved: 1) in‐solution digestion, the flow‐through containing toxin < 30 kDa was directly reduced with DTT, alkylated with IAA, and digested with trypsin; and 2) in‐gel digestion, the unstained gel was excised to small cubes, reduced, alkylated, and digested. The digest peptides were then desalted with ZipTip and applied to the Orbitrap Q‐Exactive mass spectrometer.

Experimental workflow performed in this study. At first, B. occitanus venom was milked by electrical stimulation and applied to a 30 kDa filter. For the top‐down venomic, the flow‐through containing toxins < 30 kDa was analyzed by the Thermo Scientific ™ Orbitrap Fusion Lumos Tribrid Mass Spectrometer. For the bottom‐up approach, two digest methods were achieved: 1) in‐solution digestion, the flow‐through containing toxin < 30 kDa was directly reduced with DTT, alkylated with IAA, and digested with trypsin; and 2) in‐gel digestion, the unstained gel was excised to small cubes, reduced, alkylated, and digested. The digest peptides were then desalted with ZipTip and applied to the Orbitrap Q‐Exactive mass spectrometer. For the online peptide fractionation, 2 µg of samples was loaded to a C4 µ‐precolumn cartridge (300 µm i.d. × 5 mm, C4 PepMap 300 particles with 5 µm size and 300 Å pores); the column was equilibrated with solution A [0.1% (v/v) formic acid (FA)]. The separation was maintained over 120 min at 250 nL·min−1, using a linear gradient from 5% to 60% of solution B [acetonitrile (ACN) and 0.1% (v/v) FA]. Proteins/peptides were eluted directly from the column into the mass spectrometer and operated in positive mode with a spray voltage of 1.6 kV. MS spectra were acquired at a resolution setting of 120 000. MS/MS analysis was performed on data‐dependent acquisition, the top 10 abundant precursor ions were selected for an EThcD fragmentations (Electron‐Transfer/Higher‐Energy Collision Dissociation) with a dynamic exclusion time of 90 s. MS/MS spectra were acquired at a resolution setting of 120 000, and the mass range was set from 150 to 2000 m/z.

Bottom‐up proteomics

In‐solution digestion

Reduced/alkylated venom filtrate was digested overnight at a temperature of 37 °C with 0.1 μg of trypsin (Promega, Madison, WI, USA). Tryptic digests were analyzed on a Q‐Exactive Plus instrument (Thermo Fisher Scientific, Bremen, Germany) coupled to an EASY‐nLC 1200 chromatography system (Thermo Fisher Scientific). Two micrograms was loaded on an in‐house packed 50‐cm nano‐HPLC column (75 μm inner diameter) filled with C18 resin (1.9 μm particles, 100 Å pore size, Reprosil‐Pur Basic C18‐HD resin; Maisch GmbH, Ammerbuch‐Entringen, Germany) and equilibrated in 97% solvent A and 3% solvent B (ACN, 0.1% (v/v) FA). Peptides were eluted at 250 nL·min−1, using 3–22% gradient of solvent B for 112 min, then 22–38% gradient of solvent B for 35 min, and finally 38–60% gradient of solvent B for 15 min. The instrument method for the Q‐Exactive Plus was set up in the data‐dependent acquisition mode. MS and MS‐MS spectra were acquired at a resolution of 60 000, 10 of the most abundant precursor ions were selected for HCD fragmentation with collision energy adjusted to 27. Mono‐charged precursors and those with a charge state of > 7 were excluded.

In‐gel digestion

At first, 2 mg of venom filtrate was unfolded for 5 min at 95 °C in sample buffer (LDS sample buffer) and then subjected to a SDS/PAGE using a 4–20% of polyacrylamide gel (SDS Precast Gel RunBlue, 4–20%, 12 well; Expedeon, CA, USA). The electrophoresis was performed, on a Bio‐Rad system, at a constant voltage of 140 V, and the separated proteins were stained with Coomassie Brilliant Blue R (InstantBlue; Expedeon, CA, USA). Stained bands corresponding to proteins/peptides with masses < 30 kDa (Fig. S1) were manually excised into equal small cubes of 1 mm3, then washed with Milli‐Q water, ammonium bicarbonate 50 mm, and ACN 50%. Subsequently, the slices were submitted to an in‐gel reduction with DTT (10 mm) in ammonium bicarbonate buffer (50 mm, pH 8.3) for 45 min at a temperature of 56 °C. Reduced slices were alkylated with IAA (50 mm) in ammonium bicarbonate (50 mm, pH 8.3) buffer for 20 min in the dark, followed by an overnight digestion with 0.1 μg of trypsin (Promega) at a temperature of 37 °C [31]. The enzymatic reaction was stopped by adding 5 µL of FA 5%, and desalted by loading the peptides onto ZipTip C18. After drying, digested peptides were dissolved in 100 μL of 0.1% (v/v) FA and applied on a liquid chromatography coupled to tandem mass spectrometry (LC‐MS/MS) system, composed of a nano‐flow HPLC pump and an Orbitrap Q‐Exactive mass spectrometer (Thermo Scientific) with a nano‐electrospray ion source, as described in the section above.

Data analysis

The top‐down liquid chromatography coupled to mass spectrometry (LC‐MS) data analysis of native B. occitanus venom filtrate was deconvoluted using the Xtract algorithm within Thermo Scientific xcalibur 2.2 software (Thermo Fisher Scientific). For protein identification, data from both of the venomic nano‐LC‐MS/MS approaches were processed using the proteome discover 2.2 software (Thermo Fisher Scientific), against the UniProtKB database, downloaded in 2016 10 11, taxon identifier: 6855 and 4309 entries. Parameters of processing were as follows: a mass tolerance of MS set at 50 p.p.m. and 0.3 Da for MS/MS. One unique peptide was required for protein identification, minimum peptide length was required at five amino acids, and the false discovery rate cutoff was 1%. Trypsin was chosen as the specific enzyme, with a maximum number of two missed cleavages for the bottom‐up analysis. Variable modifications included oxidation of methionine and carbamidomethylation, while no fixed modification was set.

Results

The whole proteomic approaches are based only on the UniProtKB database‐dependent analysis without any manually de novo sequence annotation; therefore, the majority of reported peptide annotations are still an approximation. Also, it is important to stress that the relative abundances and the percentages of the described peptides are purely based on total number counts and not concentrations as long as no quantitative analysis was performed. The total ion chromatogram (TIC) generated from the top‐down LC‐MS analysis of native B. occitanus venom filtrate (Fig. 2) gave a partial picture of the venom complexity, with around 60 peaks, most of them detected with high relative abundance.
Fig. 2

TIC of native B. occitanus venom filtrate, generated from top‐down mass spectrometry analysis (MS1). The x‐axis represents the relative abundance (%), and the y‐axis, the retention time (min). Spectra were deconvoluted, and generated monoisotopic masses were distributed according to their MW.

TIC of native B. occitanus venom filtrate, generated from top‐down mass spectrometry analysis (MS1). The x‐axis represents the relative abundance (%), and the y‐axis, the retention time (min). Spectra were deconvoluted, and generated monoisotopic masses were distributed according to their MW. The mass fingerprint of B. occitanus venom was generated from a manual deconvolution of spectra gained by top‐down LC‐MS approach, thus detecting a total of 197 monoisotopic masses ranging from 1868 to 16 720 Da (Table 1). We get one mass less than 2000 Da, 28 molecular masses ranging between 2000 and 5000 Da, 147 mass values from 5000 to 8000 Da, and 21 masses for those over 8000 Da.
Table 1

List of the 197 monoisotopic masses detected by the top‐down LC‐MS analysis.

Retention time (min)MW (Da)
0–10N.D
10–201868.0157
20–302208.2634; 2506.4634
30–402813.4212; 2851.4287; 2966.3848; 3124.4545; 3219.5691; 3233.4756; 3461.4966; 3486.7774;3538.283; 3550.4334; 3670.8935; 3718.7023; 3823.4412; 3937.8078; 4093.8732; 4321.8654; 4366.9752; 4366.986; 5731.6152; 5919.5155.
40–503522.2898;3614.8741; 3807.4466; 3937.7725; 4333.933; 4366.9856; 4568.7172; 4572.9253; 5185.3781; 6148.8879; 6423.7104; 6439.6786; 6527.7246; 6539.6502; 6541.7326; 6595.7719; 6606.8166; 6610.768; 6611.7946; 6635.0442; 6744.712; 6829.8098; 6831.8926; 6832.876; 6860.9183; 6861.9012; 6872.9404; 6876.9037; 6877.9284; 6893.9821; 6940.948; 6952.1809; 6974.2357; 6979.0052; 6995.0399; 6997.024; 7014.2508; 7016.0204; 7022.0148; 7024.0653; 7107.2902; 7152.0763; 7162.3796; 7177.1647; 7218.3026; 7220.0387; 7220.2052; 7243.2414; 7297.2395; 7393.2604.
50–606488.9021; 6609.8127; 6611.7977; 6629.8447; 6677.8651; 6749.8876; 6765.9533; 6779.2433;6807.922; 6823.1194; 6836.974; 6837.8837; 6862.9698; 6879.9966; 6907.3347; 6919.9628; 6972.7789; 7007.0404; 7011.1444; 7012.1231; 7020.055; 7028.0976; 7035.2491; 7024.1049; 7051.0799; 7061.1245; 7062.1114; 7069.1111; 7079.1299; 7082.3444; 7115.0302; 7115.2113; 7122.274; 7130.9674; 7143.0368; 7250.1077; 7262.1172; 7266.1721; 7268.152; 7283.1496; 7307.2070; 7328.1353; 7394.3224; 7394.5252; 7400.289; 7416.5358; 7435.2763; 7449.3831; 7468.4297; 7491.1348; 7506.1972; 7534.4067; 7607.5077; 7681.4621; 7777.5363; 7840.6401; 7894.5677; 7912.5297; 7924.5736; 7943.5256; 8174.6428; 8344.5958; 9875.9204; 6896.9694; 6880.9937; 7016.998; 7056.1905; 7074.1478; 7104.0354; 7122.2913; 7115.9848; 7175.0715; 7309.2612; 7414.4224; 7600.5; 7654.5083; 7798.6334; 7817.6424; 7832.6366; 7833.6635; 8140.6441; 8159.4822; 8345.5484; 9959.0054; 11068.3376; 11243.5823;16720.7335.
60–706896.9694; 6880.9937; 7016.998; 7056.1905; 7074.1478; 7104.0354; 7122.2913; 7115.9848; 7175.0715; 7309.2612; 7414.4224; 7600.5; 7654.5083; 7798.6334; 7817.6424; 7832.6366; 7833.6635; 8140.6441; 8159.4822; 8345.5484; 9959.0054; 11068.3376; 16720.7335.
70–806809.9307; 6857.9428; 6859.9368; 6865.9432; 6875.9565; 6880.9796; 6982.0159; 6913.9378; 7009.0523; 7104.9914; 7172.1987; 7200.1528; 7214.1558; 7316.2804; 7377.2599; 7300.0933; 7394.5084.
80–907377.2678; 7301.1747; 9140.1069; 11377.1636; 12971.6074; 13004.7435.
90–1007390.4025; 7466.4483; 7482.4543; 7500.4753; 7704.4655; 7791.5128; 7792.5813; 8672.6993; 8882.0067; 8978.0645; 14577.4253.
100–1109302.1043; 12990.2825; 12985.6009.
110–120N.D

N.D: not determined.

List of the 197 monoisotopic masses detected by the top‐down LC‐MS analysis. N.D: not determined. The most representative molecular masses were those from 5000 to 8000 Da, followed by those between 2000 and 5000 Da, which represents respectively 74% and 10% of the total number of measured molecular masses (Fig. 3).
Fig. 3

Molecular mass distribution of the monoisotopic masses from MS1 spectra deconvolution. 197 components were detected, with their MW ranging from 1868 to 16 720 Da. These peptides distributed from 1000 to 17 000 Da with 1000 Da mass range windows. The x‐axis represents the MW in Da, and the y‐axis represents the percentage (%) based on total number counts.

Molecular mass distribution of the monoisotopic masses from MS1 spectra deconvolution. 197 components were detected, with their MW ranging from 1868 to 16 720 Da. These peptides distributed from 1000 to 17 000 Da with 1000 Da mass range windows. The x‐axis represents the MW in Da, and the y‐axis represents the percentage (%) based on total number counts. The analysis of reduced/alkylated B. occitanus venom filtrate by tandem mass spectrometry allowed the identification of 68 peptides with a molecular weight (MW) from 1959.13 to 7943.53 Da. The detected experimental sequences are shown in Table 2; five of the entries were identified with 100% sequence coverage: neurotoxin BmK‐II (P59360), beta‐insect depressant toxin BotIT4 (P55903), beta‐insect depressant toxin BaIT2 (P80962), insect toxin LqhIT5 (P81240), and insect toxin BsIT4 (P82814). These toxins were reported for the first time in this Moroccan venom, they corresponded to toxins already identified in other scorpion venom. The determined sequence of the neurotoxin BmK‐II (P59360) showed 100% similarity with the database sequence, whereas the observed sequences of the other toxins showed methylation in the N‐terminal part compared with sequences reported in Uniprot database (Fig. 4). Therefore, the other peptides corresponded approximately to toxins, previously identified in other scorpion species with a sequence identity ranging from 17% to 98% (Fig. S2).
Table 2

List of the identified peptides by top‐down analysis of the reduced/alkylated B. occitanus venom filtrate. Data sets generated from the mass spectrometer were analyzed by the proteome discover 2.2 software, against UniProtKB/Swiss‐Prot database. The amino acids sequences colored in black were those detected by the analysis. Peptide entries in bold were identified by both top‐down and bottom‐up approaches.

CategoryAccessionDescriptionIdentified SequenceCoverage (%)Measured MW (Da)No. of peptidesNo. of PSMsNo. of unique peptidesNo. of protein groupsNo. of AAscalc.pI
NaScTxP59356Alpha‐like toxin Lqh6

MVRDGYIAQPENCVYHCIPDCDTLCKDNGGTGGHCGFKLGHGIACWCNALPDNVGIIV

DGVKCHK

98.466974.211411656.48
P13488 Alpha‐like toxin Bom3

MGRDGYIAQPENCVYHCFPGSSGCDTLCKEKGATSGHCGFLPGSGVACWCDNLPNK

VPIVVGGEKCH

98.57012.141111676.71
P56678Alpha‐like toxin Lqh3

MVRDGYIAQPENCVYHCFPGSSGCDTLCKEKGGTSGHCGFKVGHGLACWCNALPDNV

GIIVEGEKCHS

98.527215.3112211686.48
Q9NJC4Chain (toxin BmKaTx17) [10–73] in toxin BmKaTx17

MLLMTGVESGRDAYIAKNYNCVYHCFRDDYCNGLCTENGADSGYCYLAGKYGNACWC

INLPDDKPIRIPGKCHRR

847062.131111757.58
Q4TUA4Chain (alpha‐toxin 4) [20–85] in alpha‐toxin 4

MNYLVFFSLALLLMTGVESVRDGYIADDKNCAYFCGRNAYCDDECKKKGAESGYCQWA

GVYGNACWCYKLPDKVPIRVPGRCNGG

77.647218.311100857.5
P59863 Beta‐toxin BotIT2

MDGYIKGYKGCKITCVINDDYCDTECKAEGGTYGYCWKWGLACWCEDLPDEKRWKSE

TNTC

98.366564.781111614.84
P60163Toxin Cg2

MKDGYLVNKSTGCKYSCIENINDSHCNEECISSIRKGSYGYCYKFYCYCIGMPDSTQVYP

IPGKTCSTE

88.46871.921111696.92
P60256Toxin Boma6b

MVRDAYIAQNYNCVYDCARDAYCNELCTKNGAKSGHCEWFGPHGDACWCIDLPNNVPI

KVEGKCHRK

98.57307.231411677.2
O77091 Chain(beta‐insect excitatory toxin BmK IT‐AP) [19–90] in beta‐insect excitatory toxin BmK IT‐AP

MKFFLIFLVIFPIMGVLGKKNGYAVDSSGKVAECLFNNYCNNECTKVYYADKGYCCLLKC

YCFGLADDKPVLDIWDSTKNYCDVQIIDLS

807943.532621905.36
P21150 Toxin AaHIT4

MEHGYLLNKYTGCKVWCVINNEECGYLCNKRRGGYYGYCYFWKLACYCQGARKSELW

NYKTNKCDL

98.487791.581611668.46
P80962Beta‐insect depressant toxin BaIT2

MDGYIRRRDGCKVSCLFGNEGCDKECKAYGGSYGYCWTWGLACWCEGLPDDKTWKS

ETNTCG

1006845.91411625.31
P01485 Alpha‐mammal toxin Bot3; chain (alpha‐mammal toxin Bot3) [10–73] in alpha‐mammal toxin Bot3

MLVMAGVESVKDGYIVDDRNCTYFCGRNAYCNEECTKLKGESGYCQWASPYGNACYC

YKVPDHVRTKGPGRCN

87.677289.181511737.53
Q86BW9Chain (Makatoxin‐2) [20–83] in Makatoxin‐2

MNYLIVISFALLLMTSVESGRDAYIADSENCTYFCGSNPYCNDLCTENGAKSGYCQWAG

RYGNACWCIDLPDKVPIRIPGPCRGR

75.297062.111411855.25
G4V3T9Neurotoxin BmK AGAP‐SYPU2

MVKDGYIVDDKNCAYFCGRNAYCDDECEKNGAESGYCQWAGVYGNACWCYKLPDKV

PIRVPGRCNG

98.487289.181611665.31
P84614Alpha‐toxin Bs‐Tx28

MGVRDAYIADDKNCVYTCGSNSYCNTECTKNGAESGYCQWFGRWGNGCWCIKLPDKV

PIRIPGKCR

98.487214.21111668.12
Q9BLM4Toxin AahP1005; Chain (toxin AahP1005) [20–83] in toxin AahP1005

MNYLVMISLALLFMTGVESKKDGYIVDDKNCTFFCGRNAYCNDECKKKGAESGYCQWA

SPYGNACYCYKLPDRVSTKKKGGCNGR

75.297316.261311858.46
P86408Neurotoxin MeuNaTx‐1

MVRDGYIADDKNCAYFCGRNAYCDEECKKKGAESGYCQWAGQYGNACWCYKLPDK

VPIKVSGKCN

98.467218.311611657.85
P60255 Toxin Boma6a

MVRDAYIAQNYNCVYDCARDAYCNDLCTKNGAKSGYCEWFGPHGDACWCIDLPNNV

PIKVEGKCHRK

98.57221.1811211677.09
P15225Neurotoxin Os3

MGVRDGYIAQPHNCVYHCFPGSGGCDTLCKENGATQGSSCFILGRGTACWCKDLPDR

VGVIVDGEKCH

98.526957.152621686.71
P45697Alpha‐like toxin BmK‐M1; Chain (alpha‐like toxin BmK‐M1) [20–83] in alpha‐like toxin BmK‐M1

MNYLVMISFALLLMTGVESVRDAYIAKPHNCVYECARNEYCNDLCTKNGAKSGYCQWV

GKYGNGCWCIELPDNVPIRVPGKCHR

76.197429.41411847.88
E4VP24Chain [20–85] in sodium channel neurotoxin MeuNaTxalpha‐1

MNSLVMISLALLVMTGVESVRDGYIADDKNCAYFCGRNAYCDEECKKKGAESGYCQW

AGQYGNACWCYKLPDKVPIKVSGKCNGR

77.647336.321111857.85
P55902 Alpha‐insect toxin BotIT1

MVRDAYIAQNYNCVYFCMKDDYCNDLCTKNGASSGYCQWAGKYGNACWCYALPDNV

PIRIPGKCHS

98.487345.151211667.55
E7CAU3Chain (neurotoxin BmK AGP‐SYPU1) [2–65] in neurotoxin BmK AGP‐SYPU1

MGRDAYIAQNYNCVYHCFRDDYCNGLCTENGADSGYCYLAGKYGHACW

CINLPDDKPIRIPGKCHRR

98.57488.322821677.61
Q1I178Toxin Td9

MIGMVAECKDGYLVGDDGCKMHCFTRPGHYCASECSRVKGKDGYCYAW

LACYCYNMPNWAPIWNSATNSCGKGK

86.487076.011211747.84
A0A146CJ90Chain [20–87] in Venom toxin meuNa32

MNYLILISFALLVITGVESARDAYIAQNYNCVYFCLNPWSSYCDDLCTKNGAK

SGYCQIFGKYGNACWCIDLPDKVPIRIPGKCHFA

78.167690.371111877.53
P68410Alpha‐mammal toxin Ts2

MKEGYAMDHEGCKFSCFIRPAGFCDGYCKTHLKASSGYCAWPACYCYGV

PDHIKVWDYATNKC

98.416655.841611637.61
P68726Chain (Insect toxin 2–53) [22–82] in Insect toxin 2–53

MKLLLLLIVSASMLIESLVNADGYIKRRDGCKVACLVGNEGCDKECKAYGGSY

GYCWTWGLACWCEGLPDDKTWKSETNTCGGKK

71.766739.871111857.5
Q1I163Toxin Td8; chain (toxin Td8) [21–83] in toxin Td8

MTRFVLFLSCFFLIGMVVECKDGYLVGDDGCKMHCFTRPGHYCASECSRVK

GKDGYCYAWLACYCYNMPNWAPIWNSATNRCRGRK

73.256986.051311868.34
P56569Makatoxin‐1

MGRDAYIADSENCTYTCALNPYCNDLCTKNGAKSGYCQWAGRYGNACWCI

DLPDKVPIRISGSCR

98.467240.241200657.5
D8UWD3Sodium channel neurotoxin MeuNaTxalpha‐7

MARDGYIADDKNCAYFCGRNAYCDEECKKKGAESGYCQWAGQYGNACWC

YKLPDKVPIKVSGKCNGR

98.57295.241111678.1
P0DMH9Chain (alpha‐toxin BmalphaTx47) [20–83] in alpha‐toxin BmalphaTx47

MNYLIVISFALLLMTGVQSGRDAYIADSENCTYTCALNPYCNDLCTKNGAKSG

YCQWAGRYGNACWCIDLPDKVPIRISGSCRGR

75.297240.241100857.87
P01483Neurotoxin Bot2

MGRDAYIAQPENCVYECAKNSYCNDLCTKNGAKSGYCQWLGRWGNACYC

IDLPDKVPIRIEGKCHF

98.487240.2411911667.55
P17728 Chain (alpha‐insect toxin LqhaIT) [20–85] in alpha‐insect toxin LqhaIT

MNHLVMISLALLLLLGVESVRDAYIAKNYNCVYECFRDAYCNELCTKNGASS

GYCQWAGKYGNACWCYALPDNVPIRVPGKCHRK

77.647173.211211858.12
P01496Chain (toxin‐3) [15–76] in toxin‐5

MLVVVCLLTAGTEGKKDGYPVEYDNCAYICWNYDNAYCDKLCKDKKADSGY

CYWVHILCYCYGLPDSEPTKTNGKCKSGKK

76.547105.0312011817.49
Q1EG64Chain [20–85] in sodium toxin peptide BmKTb'

MNYLVMISFAFLLMTGVESARDAYIAQNYNCVYHCARDAYCNELCTKNGAKS

GSCPYLGEHKFACYCKDLPDNVPIRVPGKCNGG

77.647321.091211857.58
P01488 alpha‐toxin Bot1

MGRDAYIAQPENCVYECAQNSYCNDLCTKNGATSGYCQWLGKYGNACWC

KDLPDNVPIRIPGKCHF

98.487074.141311666.92
P45698 Chain (neurotoxin BmK‐M9) [15–78] in neurotoxin BmK‐M9

MISFALLLMTGVESVRDAYIAKPENCVYHCATNEGCNKLCTDNGAESGYCQW

GGRYGNACWCIKLPDRVPIRVPGKCHR

81.017015.191111797.88
P83644 Toxin Lqh4

MGVRDAYIADDKNCVYTCGANSYCNTECTKNGAESGYCQWFGKYGNACWC

IKLPDKVPIRIPGKCR

98.487155.251311668.1
P01487Alpha‐insect toxin Lqq3

MVRDAYIAKNYNCVYECFRDSYCNDLCTKNGASSGYCQWAGKYGNACWC

YALPDNVPIRVPGKCH

98.56980.0121221657.87
H1ZZI7Toxin Tpa6

MSIFPIALALLLIGLEEGEAARDGYPLSKNNNCKIYCPDTDVCKDTCKNRASAP

DGKCDGWNSCYCFKVPDHIPVWGDPGTKPCMT

74.417059.121211865.38
B8XGY6Chain [20–85] in Putative alpha‐toxin Tx17

MNYLILISLAVLLTSGVESVRDAYIAQNYNCVYTCFKDAYCNDLCTKNGATSGY

CQWVGKYGNGCWCYALPDNVPIRVPGKCHSR

77.647313.21211857.87
P81504 Insect toxin AaHIT5

MDGYIKRHDGCKVTCLINDNYCDTECKREGGSYGYCYSVGFACWCEGLPDD

KAWKSETNTCD

98.386894.891811624.83
P68722Chain (beta‐insect excitatory toxin LqhIT1b) [19–88] in beta‐insect excitatory toxin LqhIT1b

MKFFLLFLVVLPIMGVLGKKNGYAVDSKGKAPECFLSNYCNNECTKVHYADK

GYCCLLSCYCFGLNDDKKVLEISDTTKKYCDFTIIN

79.547924.561111887.87
P60257Toxin Boma6c

MVRDAYIAQNYNCVYTCFKDAHCNDLCTKNGASSGYCQWAGKYGNACWCY

ALPDNVPIRIPGKCHRK

98.57308.2121421678.31
M1J7U4Putative sodium channel alpha‐toxin Acra5

MVRDGYIMIKDTNCKFSCNIFKKWEYCSPLCQSKGAETGYCYNFGCWCLDL

PDDVPVYGDRGVICRTR

98.527741.511111687.5
Q9N682Chain (neurotoxin BmK‐M11) [20–83] in neurotoxin BmK‐M11

MNYLVMISFALLLMTGVESVRDAYIAKPENCVYHCATNEGCNKLCTDNGAESG

YCQWGGKYGNACWCIKLPDDVPIRVPGKCHR

77.387179.212221847.09
P55903beta‐insect depressant toxin BotIT4

MDGYIRRRDGCKVSCLFGNEGCDKECKAYGGSYGYCWTWGLACWCEGLPDD

KTWKSETNTCG

1006837.961411625.31
A0A0K0LBU9Chain [20–83] in sodium channel blocker AbNaTx26

MRAALLLAFSSLILTGVLTKKSGYPTQHDGCKIWCVFNHFCSNYCETYGGSGYCYT

WKLACWCDNIHDWVPTWSYATTKCRAK

77.17505.21111838.31
P0C910Alpha‐toxin Amm3

MGRDGYIVDTKNCVYHCYPPCDGLCKKNQAKSGSCGFLYPSGLACWCVALPENV

PIKDPNDDCHK

98.467011.141111657.09
P59360 Neurotoxin BmK‐II

VRDAYIAKPHNCVYECARNEYCNDLCTKDGAKSGYCQWVGKYGNGCWCIELPDNV

PIRIPGNCH

1007431.3321421657.09
P81240Insect toxin LqhIT5

MDGYIRGGDGCKVSCVIDHVFCDNECKAAGGSYGYCWGWGLACWCEGLPADREWK

YETNTCG

1006611,81311624.72
P01497Chain (beta‐insect excitatory toxin 1) [19–88] in beta‐insect excitatory toxin 1

MKFLLLFLVVLPIMGVFGKKNGYAVDSSGKAPECLLSNYCNNECTKVHYADKGYCCLL

SCYCFGLNDDKKVLEISDTRKSYCDTTIIN

79.547928.5411011887.53
V9P3B8Chain [23–82] in Chain [23–82] in Meutoxin‐3

MKILTVFMIFIANFLSMTQVFSLKDRFLLINGSYELCLYEENLDEDCERLCKEQNASDG

FCRQPHCFCADMPDDYPTRPTTR

73.177074.131111824.75
Q8T3T0Depressant insect toxin BmK ITa1

MKLFLLLLISASMLIDGLVNADGYIRGSNGCKVSCLWGNEGCNKECGAYGASYGYCW

TWGLACWCEGLPDDKTWKSESNTCGGKK

71.766632.7111811856.38
Q9GQW3Chain (toxin BmKaIT1) [20–83] in toxin BmKaIT1

MNYLVMISFAFLLMTGVESVRDAYIAQNYNCVYHCARDAYCNELCTKNGAKSGSCPY

LGEHKFACYCKDLPDNVPIRVPGKCHRR

75.297012.231311858.12
Q95WX6Beta‐insect depressant toxin BmKITb

MKLFLLLVISASMLIDGLVNADGYIRGSNGCKVSCLWGNEGCNKECKAFGAYYGYCW

TWGLACWCQGLPDDKTWKSESNTCGGKK

71.766775.931411857.85
P0C5H1Beta‐toxin Isom1

MKKNGYAVDSSGKAPECLLSNYCNNECTKVHYADKGYCCLLSCYCFGLSDDKKVLEIS

DTRKKYCDYTIIN

98.597895.4713511717.53
Q9GNG8Toxin BmKaTX15

MNYLVFFSLALLVMTGVESVRDGYIADDKNCAYFCGRNAYCDDECKKNGAESGYCQW

AGVYGNACWCYKLPDKVPIRVPGKCNGG

77.647211.141111856.4
M1JMR8 Sodium channel alpha‐toxin Acra8

MVRDGYIVDDKNCTFFCGRNAYCNDECKKKGGESGYCQWASPYGNACWCYKLPDRV

PIKEKGRCNGR

98.57218.31111678.29
A0A0U4RDS7Chain [20–87] in sodium channel toxin NaTx4

MNHLVMISLAFLFMTGVASVRDGYIAQPETCAYHCIPGSSGCYTLCKEKKGESGHCGWK

SGHGSAWWCNDLPDKEGIIVDGKGCTRR

78.167243.292421877.66
P82814Insect toxin BsIT4

MDGYIKGNKGCKVSCVINNVFCNSMCKSSGGSYGYCWSWGLACWCEGLPAAKKWLY

AATNTCG

1006954.151111638.31
B8XGX9Chain [20–87] in Putative alpha‐toxin Tx2

MNYLIMISLALLLMTGVESGTGVRDAYIADDKNCVYTCALNSYCNTECTKNGAESGYCQ

WLGQYGNACWCIKLPDRVPIRIPGKCRG

78.167394.281311877.5
Q17254 Alpha‐insect toxin Bot14

MSSLMISTAMKGKAPYRQVRDGYIAQPHNCAYHCLKISSGCDTLCKENGATSGHCGH

KSGHGSACWCKDLPDKVGIIVHGEKCHR

78.827184.31511858.5
KScTxA0A059UI30Chain (potassium channel toxin Meg‐beta‐KTx1) [28–91] in potassium channel toxin Meg‐beta‐KTx1

MQRNLVVLLFLGMVALSSCGLREKHFQKLVKYAVPEGTLRTIIQTAVHKLGKTQFGCPA

YQGYCDDHCQDIKKQEGFCHGFKCKCGIPMGF

70.326889.31911918.76
Q9N661 Potassium channel toxin BmTXK‐beta‐2

MQRNLVVLLFLGMVALSSCGLREKHFQKLVKYAVPEGTLRTIIQTAVHKLGKTQFGCP

AYQGYCDDHCQDIKKEEGFCHGFKCKCGIPMGF

25.272506.461111918.57
AMPA0A0A1I6E7AMP AcrAP1

MEIKYLLTVFLVLLIVSDHCQAFLFSLIPHAISGLISAFKGRRKRDLDGQIDRFRNFRKRD

AELEELLSKLPIY

24.321959.131111749.31
Myotropic neuropeptideF8THJ9Putative orcokinin

MMFGIWILCGTAFFFCHVDAYLEYSNMAPGYNALVRRRSMKQPSEGRMFDNLGYNQE

SLVKRNFDEIDNVGFNDFGPASRPGSGRSWFPKRNWELARYNLRRLVKRATQD

ELMENKRQELDEIDKSGFGGFHKRNFDEIDRSGFNDFGKRSFDRFKLVRRADFNN

16.963112.4511111659.29
Hypothetical secreted proteinF1CIZ9Hypothetical secreted protein

MQNIFWILIGVGICITAVQCDSEMESSIRDILTKRRYLKYARSVLDDLNNQLDTLHKRSC

VLNLPGMDCEYGDITGSGKDQDYWTSGRTPGKKRRSYCSLGIGNSEECLTKQLKDDM

TDFNSWNDKFRPGKK

25.753939.7911111327.99
Fig. 4

The detected amino acid sequences of the five toxins identified with 100% coverage by the top‐down LC‐MS/MS; neurotoxin BmK‐II (P59360); beta‐insect depressant toxin BaIT2 (P80962); insect toxin BsIT4 (P82814); insect toxin LqhIT5 (P81240); and beta‐insect depressant toxin BotIT4 (P55903).

List of the identified peptides by top‐down analysis of the reduced/alkylated B. occitanus venom filtrate. Data sets generated from the mass spectrometer were analyzed by the proteome discover 2.2 software, against UniProtKB/Swiss‐Prot database. The amino acids sequences colored in black were those detected by the analysis. Peptide entries in bold were identified by both top‐down and bottom‐up approaches. MVRDGYIAQPENCVYHCIPDCDTLCKDNGGTGGHCGFKLGHGIACWCNALPDNVGIIV DGVKCHK MGRDGYIAQPENCVYHCFPGSSGCDTLCKEKGATSGHCGFLPGSGVACWCDNLPNK VPIVVGGEKCH MVRDGYIAQPENCVYHCFPGSSGCDTLCKEKGGTSGHCGFKVGHGLACWCNALPDNV GIIVEGEKCHS MLLMTGVESGRDAYIAKNYNCVYHCFRDDYCNGLCTENGADSGYCYLAGKYGNACWC INLPDDKPIRIPGKCHRR MNYLVFFSLALLLMTGVESVRDGYIADDKNCAYFCGRNAYCDDECKKKGAESGYCQWA GVYGNACWCYKLPDKVPIRVPGRCNGG MDGYIKGYKGCKITCVINDDYCDTECKAEGGTYGYCWKWGLACWCEDLPDEKRWKSE TNTC MKDGYLVNKSTGCKYSCIENINDSHCNEECISSIRKGSYGYCYKFYCYCIGMPDSTQVYP IPGKTCSTE MVRDAYIAQNYNCVYDCARDAYCNELCTKNGAKSGHCEWFGPHGDACWCIDLPNNVPI KVEGKCHRK MKFFLIFLVIFPIMGVLGKKNGYAVDSSGKVAECLFNNYCNNECTKVYYADKGYCCLLKC YCFGLADDKPVLDIWDSTKNYCDVQIIDLS MEHGYLLNKYTGCKVWCVINNEECGYLCNKRRGGYYGYCYFWKLACYCQGARKSELW NYKTNKCDL MDGYIRRRDGCKVSCLFGNEGCDKECKAYGGSYGYCWTWGLACWCEGLPDDKTWKS ETNTCG MLVMAGVESVKDGYIVDDRNCTYFCGRNAYCNEECTKLKGESGYCQWASPYGNACYC YKVPDHVRTKGPGRCN MNYLIVISFALLLMTSVESGRDAYIADSENCTYFCGSNPYCNDLCTENGAKSGYCQWAG RYGNACWCIDLPDKVPIRIPGPCRGR MVKDGYIVDDKNCAYFCGRNAYCDDECEKNGAESGYCQWAGVYGNACWCYKLPDKV PIRVPGRCNG MGVRDAYIADDKNCVYTCGSNSYCNTECTKNGAESGYCQWFGRWGNGCWCIKLPDKV PIRIPGKCR MNYLVMISLALLFMTGVESKKDGYIVDDKNCTFFCGRNAYCNDECKKKGAESGYCQWA SPYGNACYCYKLPDRVSTKKKGGCNGR MVRDGYIADDKNCAYFCGRNAYCDEECKKKGAESGYCQWAGQYGNACWCYKLPDK VPIKVSGKCN MVRDAYIAQNYNCVYDCARDAYCNDLCTKNGAKSGYCEWFGPHGDACWCIDLPNNV PIKVEGKCHRK MGVRDGYIAQPHNCVYHCFPGSGGCDTLCKENGATQGSSCFILGRGTACWCKDLPDR VGVIVDGEKCH MNYLVMISFALLLMTGVESVRDAYIAKPHNCVYECARNEYCNDLCTKNGAKSGYCQWV GKYGNGCWCIELPDNVPIRVPGKCHR MNSLVMISLALLVMTGVESVRDGYIADDKNCAYFCGRNAYCDEECKKKGAESGYCQW AGQYGNACWCYKLPDKVPIKVSGKCNGR MVRDAYIAQNYNCVYFCMKDDYCNDLCTKNGASSGYCQWAGKYGNACWCYALPDNV PIRIPGKCHS MGRDAYIAQNYNCVYHCFRDDYCNGLCTENGADSGYCYLAGKYGHACW CINLPDDKPIRIPGKCHRR MIGMVAECKDGYLVGDDGCKMHCFTRPGHYCASECSRVKGKDGYCYAW LACYCYNMPNWAPIWNSATNSCGKGK MNYLILISFALLVITGVESARDAYIAQNYNCVYFCLNPWSSYCDDLCTKNGAK SGYCQIFGKYGNACWCIDLPDKVPIRIPGKCHFA MKEGYAMDHEGCKFSCFIRPAGFCDGYCKTHLKASSGYCAWPACYCYGV PDHIKVWDYATNKC MKLLLLLIVSASMLIESLVNADGYIKRRDGCKVACLVGNEGCDKECKAYGGSY GYCWTWGLACWCEGLPDDKTWKSETNTCGGKK MTRFVLFLSCFFLIGMVVECKDGYLVGDDGCKMHCFTRPGHYCASECSRVK GKDGYCYAWLACYCYNMPNWAPIWNSATNRCRGRK MGRDAYIADSENCTYTCALNPYCNDLCTKNGAKSGYCQWAGRYGNACWCI DLPDKVPIRISGSCR MARDGYIADDKNCAYFCGRNAYCDEECKKKGAESGYCQWAGQYGNACWC YKLPDKVPIKVSGKCNGR MNYLIVISFALLLMTGVQSGRDAYIADSENCTYTCALNPYCNDLCTKNGAKSG YCQWAGRYGNACWCIDLPDKVPIRISGSCRGR MGRDAYIAQPENCVYECAKNSYCNDLCTKNGAKSGYCQWLGRWGNACYC IDLPDKVPIRIEGKCHF MNHLVMISLALLLLLGVESVRDAYIAKNYNCVYECFRDAYCNELCTKNGASS GYCQWAGKYGNACWCYALPDNVPIRVPGKCHRK MLVVVCLLTAGTEGKKDGYPVEYDNCAYICWNYDNAYCDKLCKDKKADSGY CYWVHILCYCYGLPDSEPTKTNGKCKSGKK MNYLVMISFAFLLMTGVESARDAYIAQNYNCVYHCARDAYCNELCTKNGAKS GSCPYLGEHKFACYCKDLPDNVPIRVPGKCNGG MGRDAYIAQPENCVYECAQNSYCNDLCTKNGATSGYCQWLGKYGNACWC KDLPDNVPIRIPGKCHF MISFALLLMTGVESVRDAYIAKPENCVYHCATNEGCNKLCTDNGAESGYCQW GGRYGNACWCIKLPDRVPIRVPGKCHR MGVRDAYIADDKNCVYTCGANSYCNTECTKNGAESGYCQWFGKYGNACWC IKLPDKVPIRIPGKCR MVRDAYIAKNYNCVYECFRDSYCNDLCTKNGASSGYCQWAGKYGNACWC YALPDNVPIRVPGKCH MSIFPIALALLLIGLEEGEAARDGYPLSKNNNCKIYCPDTDVCKDTCKNRASAP DGKCDGWNSCYCFKVPDHIPVWGDPGTKPCMT MNYLILISLAVLLTSGVESVRDAYIAQNYNCVYTCFKDAYCNDLCTKNGATSGY CQWVGKYGNGCWCYALPDNVPIRVPGKCHSR MDGYIKRHDGCKVTCLINDNYCDTECKREGGSYGYCYSVGFACWCEGLPDD KAWKSETNTCD MKFFLLFLVVLPIMGVLGKKNGYAVDSKGKAPECFLSNYCNNECTKVHYADK GYCCLLSCYCFGLNDDKKVLEISDTTKKYCDFTIIN MVRDAYIAQNYNCVYTCFKDAHCNDLCTKNGASSGYCQWAGKYGNACWCY ALPDNVPIRIPGKCHRK MVRDGYIMIKDTNCKFSCNIFKKWEYCSPLCQSKGAETGYCYNFGCWCLDL PDDVPVYGDRGVICRTR MNYLVMISFALLLMTGVESVRDAYIAKPENCVYHCATNEGCNKLCTDNGAESG YCQWGGKYGNACWCIKLPDDVPIRVPGKCHR MDGYIRRRDGCKVSCLFGNEGCDKECKAYGGSYGYCWTWGLACWCEGLPDD KTWKSETNTCG MRAALLLAFSSLILTGVLTKKSGYPTQHDGCKIWCVFNHFCSNYCETYGGSGYCYT WKLACWCDNIHDWVPTWSYATTKCRAK MGRDGYIVDTKNCVYHCYPPCDGLCKKNQAKSGSCGFLYPSGLACWCVALPENV PIKDPNDDCHK VRDAYIAKPHNCVYECARNEYCNDLCTKDGAKSGYCQWVGKYGNGCWCIELPDNV PIRIPGNCH MDGYIRGGDGCKVSCVIDHVFCDNECKAAGGSYGYCWGWGLACWCEGLPADREWK YETNTCG MKFLLLFLVVLPIMGVFGKKNGYAVDSSGKAPECLLSNYCNNECTKVHYADKGYCCLL SCYCFGLNDDKKVLEISDTRKSYCDTTIIN MKILTVFMIFIANFLSMTQVFSLKDRFLLINGSYELCLYEENLDEDCERLCKEQNASDG FCRQPHCFCADMPDDYPTRPTTR MKLFLLLLISASMLIDGLVNADGYIRGSNGCKVSCLWGNEGCNKECGAYGASYGYCW TWGLACWCEGLPDDKTWKSESNTCGGKK MNYLVMISFAFLLMTGVESVRDAYIAQNYNCVYHCARDAYCNELCTKNGAKSGSCPY LGEHKFACYCKDLPDNVPIRVPGKCHRR MKLFLLLVISASMLIDGLVNADGYIRGSNGCKVSCLWGNEGCNKECKAFGAYYGYCW TWGLACWCQGLPDDKTWKSESNTCGGKK MKKNGYAVDSSGKAPECLLSNYCNNECTKVHYADKGYCCLLSCYCFGLSDDKKVLEIS DTRKKYCDYTIIN MNYLVFFSLALLVMTGVESVRDGYIADDKNCAYFCGRNAYCDDECKKNGAESGYCQW AGVYGNACWCYKLPDKVPIRVPGKCNGG MVRDGYIVDDKNCTFFCGRNAYCNDECKKKGGESGYCQWASPYGNACWCYKLPDRV PIKEKGRCNGR MNHLVMISLAFLFMTGVASVRDGYIAQPETCAYHCIPGSSGCYTLCKEKKGESGHCGWK SGHGSAWWCNDLPDKEGIIVDGKGCTRR MDGYIKGNKGCKVSCVINNVFCNSMCKSSGGSYGYCWSWGLACWCEGLPAAKKWLY AATNTCG MNYLIMISLALLLMTGVESGTGVRDAYIADDKNCVYTCALNSYCNTECTKNGAESGYCQ WLGQYGNACWCIKLPDRVPIRIPGKCRG MSSLMISTAMKGKAPYRQVRDGYIAQPHNCAYHCLKISSGCDTLCKENGATSGHCGH KSGHGSACWCKDLPDKVGIIVHGEKCHR MQRNLVVLLFLGMVALSSCGLREKHFQKLVKYAVPEGTLRTIIQTAVHKLGKTQFGCPA YQGYCDDHCQDIKKQEGFCHGFKCKCGIPMGF MQRNLVVLLFLGMVALSSCGLREKHFQKLVKYAVPEGTLRTIIQTAVHKLGKTQFGCP AYQGYCDDHCQDIKKEEGFCHGFKCKCGIPMGF MEIKYLLTVFLVLLIVSDHCQAFLFSLIPHAISGLISAFKGRRKRDLDGQIDRFRNFRKRD AELEELLSKLPIY MMFGIWILCGTAFFFCHVDAYLEYSNMAPGYNALVRRRSMKQPSEGRMFDNLGYNQE SLVKRNFDEIDNVGFNDFGPASRPGSGRSWFPKRNWELARYNLRRLVKRATQD ELMENKRQELDEIDKSGFGGFHKRNFDEIDRSGFNDFGKRSFDRFKLVRRADFNN MQNIFWILIGVGICITAVQCDSEMESSIRDILTKRRYLKYARSVLDDLNNQLDTLHKRSC VLNLPGMDCEYGDITGSGKDQDYWTSGRTPGKKRRSYCSLGIGNSEECLTKQLKDDM TDFNSWNDKFRPGKK The detected amino acid sequences of the five toxins identified with 100% coverage by the top‐down LC‐MS/MS; neurotoxin BmK‐II (P59360); beta‐insect depressant toxin BaIT2 (P80962); insect toxin BsIT4 (P82814); insect toxin LqhIT5 (P81240); and beta‐insect depressant toxin BotIT4 (P55903). Therefore, the detected peptides were divided into five categories on the basis of their molecular functions according to the UniProtKB database (https://www.uniprot.org); 63 neurotoxins acting on sodium channels (NaScTxs), constitute 93% of the components and represent a MW from 6564.78 to 7943.53 Da; two neurotoxins acting on potassium channels (KScTxs) (2.94%, 2506.46–6889.3 Da); one antimicrobial peptide (AMP) (1.47%, 1959.13 Da); one myotropic neuropeptide (1.47%, 3112.45 Da); and one hypothetical secreted protein (1.47%, 3939.79 Da) (Fig. 5A).
Fig. 5

(A) Relative abundance of the different peptide categories identified in reduced/alkylated B. occitanus venom filtrate by the top‐down LC‐MS/MS analysis. Peptides were divided on the basis of their molecular functions into: neurotoxins active on sodium channels (NaScTxs), neurotoxins active on potassium channels (KScTxs), myotropic neuropeptide, AMP, and hypothetical secreted protein. (B) Relative abundance of the different peptide categories identified in reduced/alkylated and digested B. occitanus venom by bottom‐up LC‐MS/MS analysis. The peptides were divided on the basis of their molecular functions into: neurotoxins active on sodium channels (NaScTxs), neurotoxins active on potassium channels (KScTxs), neurotoxins active on chloride channels (ClScTxs), neurotoxins active on calcium channels (CaScTx), toxin Acra, and amphipathic peptide.

(A) Relative abundance of the different peptide categories identified in reduced/alkylated B. occitanus venom filtrate by the top‐down LC‐MS/MS analysis. Peptides were divided on the basis of their molecular functions into: neurotoxins active on sodium channels (NaScTxs), neurotoxins active on potassium channels (KScTxs), myotropic neuropeptide, AMP, and hypothetical secreted protein. (B) Relative abundance of the different peptide categories identified in reduced/alkylated and digested B. occitanus venom by bottom‐up LC‐MS/MS analysis. The peptides were divided on the basis of their molecular functions into: neurotoxins active on sodium channels (NaScTxs), neurotoxins active on potassium channels (KScTxs), neurotoxins active on chloride channels (ClScTxs), neurotoxins active on calcium channels (CaScTx), toxin Acra, and amphipathic peptide. Additionally, we have observed, that between these 68 peptides, 27 of them (40%) were detected as chains or fragments, for example, venom toxin meuNa32 (A0A146CJ90); potassium channel toxin Meg‐beta‐KTx1 (A0A059UI30); putative alpha‐toxin Tx2 (B8XGX9); sodium channel toxin NaTx4 (A0A0U4RDS7); toxin BmKaIT1(Q9GQW3); sodium channel blocker AbNaTx26 (A0A0K0LBU9); neurotoxin BmK‐M11 (Q9N682); beta‐insect excitatory toxin LqhIT1b (P68722); toxin‐5 (P01496); toxin Td8 (Q1I163); alpha‐like toxin BmK‐M1 (P45697); toxin AahP1005 (Q9BLM4); makatoxin‐2 (Q86BW9); and alpha‐mammal toxin Bot3 (P01485) (Table 2). For the bottom‐up workflow, two digest methods were performed: (a) in‐solution digestion, the flow‐through containing toxin < 30 kDa was directly reduced with DTT, alkylated with IAA, and digested with trypsin; and (b) in‐gel digestion, the gel spot corresponding to peptides under 30 kDa (Fig. S1) was excised to small cubes, which after series of washings, were reduced, alkylated, and digested. The results generated by the bottom‐up approach using the in‐gel digestion yielded the identification of 36 peptides, whereas 37 was the total of the identified peptide by in‐solution digestion. The detected peptides showed similarity of sequences with peptides from other scorpion species, and with their sequence coverage ranging from 10.23% (P68721) to 86.15% (P01489) and from 8.75% (P0C294) to 92.86% (P80669) for the in‐gel and in‐solution digestions, respectively. The identified categories of peptides using the in‐gel digestion were as follows: 27 NaScTxs; seven KscTxs; and two ClTxs (Table 3). While, through the in‐solution digestion, we identified in addition to 24 NaScTxs, eight KScTxs and three ClScTxs, one entry that shares 60% of similarity with neurotoxin Tx‐2 (P83406) purified from Hottentotta judaicus, could correspond to a calcium channel activator ‘CaScTx’ scorpion. Besides neurotoxins, one amphipathic peptide was detected by this digestion method (Table 4).
Table 3

Bottom‐up data generated from in‐gel digestion of B. occitanus venom filtrate using nano‐LC‐MS/MS. Data sets generated from the mass spectrometer were analyzed by the proteome discover 2.2 software, against UniProtKB/Swiss‐Prot database.

CategoryAccessionDescriptionScoreCoverageNo. of proteinsNo. of unique peptidesNo. of peptidesNo. of PSMsNo. of AAsMW [kDa]calc. pI
NaScTx Q86SE0 Toxin Aam2 OS = Androctonus amoreuxi PE = 1 SV = 1 ‐ [SCX2_ANDAM]198.7424.42%9236869.37.87
P21150 Toxin AaHIT4 OS = Androctonus australis PE = 1 SV = 1 ‐ [SIX4_ANDAU]85.8129.23%2125657.88.46
P13488 Alpha‐like toxin Bom3 OS = Buthus occitanus mardochei PE = 1 SV = 1 ‐ [SCX3_BUTOM]169.7556.06%23315666.96.71
P68721 Beta‐insect excitatory toxin LqhIT1a OS = Leiurus quinquestriatus hebraeus PE = 3 SV = 1 ‐ [SIX1A_LEIQH]54.8110.23%2123889.98.09
P0DJH8 Alpha‐toxin Bu1 OS = Buthacus macrocentrus PE = 1 SV = 1 ‐ [SCX1_BUTMA]346.3271.64%1357677.58.48
P86406Neurotoxin MeuNaTx‐6 OS = Mesobuthus eupeus PE = 1 SV = 1 ‐ [SCXN6_MESEU]134.5615.15%3114667.87.87
P83644 Toxin Lqh4 OS = Leiurus quinquestriatus hebraeus PE = 1 SV = 1 ‐ [SCX4_LEIQH]305.5346.15%8137657.28.1
P01489 Alpha‐toxin Lqq4 OS = Leiurus quinquestriatus quinquestriatus PE = 1 SV = 1 ‐ [SCX4_LEIQU]531.9586.15%92511657.28.1
P01486 Alpha‐toxin Bot11 OS = Buthus occitanus tunetanus PE = 1 SV = 1 ‐ [SCXB_BUTOC]106.1935.38%7137657.57.87
P60255 Toxin Boma6a OS = Buthus occitanus mardochei PE = 3 SV = 1 ‐ [SCXA_BUTOM]65.8415.15%2112667.57.09
P17728 Alpha‐insect toxin LqhaIT OS = Leiurus quinquestriatus hebraeus PE = 1 SV = 2 ‐ [SCXA_LEIQH]174.2831.76%4123859.68.12
P04098 Neurotoxin 8 (Fragment) OS = Buthus occitanus tunetanus PE = 1 SV = 1 ‐ [SCX8_BUTOC]202.8372.22%2224364.16.24
P55902 Alpha‐insect toxin BotIT1 OS = Buthus occitanus tunetanus PE = 1 SV = 1 ‐ [SIX1_BUTOC]211.5941.54%2124657.37.55
P01488 Alpha‐toxin Bot1 OS = Buthus occitanus tunetanus PE = 1 SV = 2 ‐ [SCX1_BUTOC]136.5220.00%1112657.36.92
P81504 Insect toxin AaHIT5 OS = Androctonus australis PE = 1 SV = 1 ‐ [SIX5_ANDAU]51.4424.59%1112616.94.83
P59863 Beta‐toxin BotIT2 OS = Buthus occitanus tunetanus PE = 1 SV = 1 ‐ [SIX2_BUTOC]109.6643.33%1223606.94.84
Q17254 Alpha‐insect toxin Bot14 OS = Buthus occitanus tunetanus PE = 2 SV = 1 ‐ [SCXE_BUTOC]44.9918.82%1113859.28.5
D5HR52Alpha‐toxin Ac3 (Fragment) OS = Androctonus crassicauda PE = 3 SV = 1 ‐ [SCX3A_ANDCR]139.8663.77%102410697.87.87
P55904Beta‐insect depressant toxin BotIT5 OS = Buthus occitanus tunetanus PE = 1 SV = 1 ‐ [SIX5_BUTOC]64.6727.87%21229616.85.31
O77091 Beta‐insect excitatory toxin BmK IT‐AP OS = Mesobuthus martensii GN = IT‐AP PE = 1 SV = 1 ‐ [SIXP_MESMA]126.2617.78%82249010.25.36
P59864 Beta‐insect depressant toxin BotIT6 OS = Buthus occitanus tunetanus PE = 1 SV = 1 ‐ [SIX6_BUTOC]32.3711.29%1111627.38.1
P68723Beta‐insect excitatory toxin LqhIT1c OS = Leiurus quinquestriatus hebraeus PE = 1 SV = 1 ‐ [SIX1C_LEIQH]182.3111.36%1238889.98.1
P59360 Neurotoxin BmK‐II OS = Mesobuthus martensii PE = 1 SV = 1 ‐ [SCX2_MESMA]48.2615.63%3111647.27.09
P15224Toxin Os1 OS = Orthochirus scrobiculosus PE = 1 SV = 1 ‐ [SCX1_ORTSC]39.1419.70%1111667.67.88
D5HR50Alpha‐toxin Ac1 (Fragment) OS = Androctonus crassicauda PE = 2 SV = 1 ‐ [SCX1A_ANDCR]37.6611.11%2112818.77.55
M1JMR8Sodium channel alpha‐toxin Acra8 OS = Androctonus crassicauda PE = 3 SV = 1 ‐ [SCX8_ANDCR]66.8240.91%3235667.58.29
M1JBC0Sodium channel alpha‐toxin Acra4 OS = Androctonus crassicauda PE = 1 SV = 1 ‐ [SCX4_ANDCR]37.3929.23%1124657.18.31
KScTxP0C161Potassium channel toxin alpha‐KTx 2.8 OS = Centruroides elegans PE = 1 SV = 1 ‐ [KAX28_CENEL]45.5717.95%2111394.38.94
Q9NJC6 Potassium channel toxin BmTXK‐beta OS = Mesobuthus martensii PE = 2 SV = 1 ‐ [KBX2_MESMA]264.7823.33%21269010.48.82
P59869Potassium channel toxin alpha‐KTx 5.4 OS = Mesobuthus tamulus PE = 1 SV = 1 ‐ [KAX54_MESTA]40.722.58%2122313.58.02
B8XH40Potassium channel toxin BuTXK‐beta OS = Buthus occitanus israelis PE = 2 SV = 1 ‐ [KBX1_BUTOS]298.6542.86%225189110.28.57
Q9N661 Potassium channel toxin BmTXK‐beta‐2 OS = Mesobuthus martensii PE = 2 SV = 1 ‐ [KBX1_MESMA]230.6242.86%214139110.28.57
B3EWX9Potassium channel toxin alpha‐KTx 9.11 OS = Mesobuthus gibbosus PE = 1 SV = 1 ‐ [KAX9B_MESGB]85.3340.74%4112272.95.01
B8XH42Potassium channel toxin alpha‐KTx 16.6 OS = Buthus occitanus israelis PE = 2 SV = 1 ‐ [KA166_BUTOS]23.2512.07%1111586.58.12
ClScTx P45639 Chlorotoxin OS = Leiurus quinquestriatus quinquestriatus PE = 1 SV = 1 ‐ [CTXL_LEIQU]41.5638.89%11133648.13
P86436 Chlorotoxin‐like peptide OS = Androctonus australis PE = 1 SV = 1 ‐ [CTXL_ANDAU]290.944.12%1117343.68.34

Underlined peptide entries were identified by in‐gel and in‐solution digestion methods.

Table 4

Bottom‐up data generated from in‐solution digestion of B. occitanus venom filtrate using nano‐LC‐MS/MS. Data sets generated from the mass spectrometer were analyzed by the proteome discover 2.2 software, against UniProtKB/Swiss‐Prot database.

CategoryAccessionDescriptionScoreCoverageNo. of proteinsNo. of unique peptidesNo. of peptidesNo. of PSMsNo. of AAsMW (kDa)calc. pI
NaScTxs Q86SE0 Toxin Aam2 OS = Androctonus amoreuxi PE = 1 SV = 1 ‐ [SCX2_ANDAM]250.6824.42%82415869.37.87
P21150 Toxin AaHIT4 OS = Androctonus australis PE = 1 SV = 1 ‐ [SIX4_ANDAU]192.2330.77%22312657.88.46
P01482Alpha‐toxin Amm5 OS = Androctonus mauretanicus mauretanicus PE = 1 SV = 1 ‐ [SCX5_ANDMA]96.5728.13%1112647.37.5
P01481Alpha‐mammal toxin Lqq5 OS = Leiurus quinquestriatus quinquestriatus PE = 1 SV = 1 ‐ [SCX5_LEIQU]77.7125.00%2124647.38.1
P13488 Alpha‐like toxin Bom3 OS = Buthus occitanus mardochei PE = 1 SV = 1 ‐ [SCX3_BUTOM]155.659.09%22418666.96.71
P45698Neurotoxin BmK‐M9 OS = Mesobuthus martensii PE = 1 SV = 1 ‐ [SCX9_MESMA]124.5426.58%111315798.87.88
P68721 Beta‐insect excitatory toxin LqhIT1a OS = Leiurus quinquestriatus hebraeus PE = 3 SV = 1 ‐ [SIX1A_LEIQH]55.4810.23%2124889.98.09
P0DJH8 Alpha‐toxin Bu1 OS = Buthacus macrocentrus PE = 1 SV = 1 ‐ [SCX1_BUTMA]272.8471.64%1249677.58.48
P83644 Toxin Lqh4 OS = Leiurus quinquestriatus hebraeus PE = 1 SV = 1 ‐ [SCX4_LEIQH]293.246.15%71310657.28.1
P01489 Alpha‐toxin Lqq4 OS = Leiurus quinquestriatus quinquestriatus PE = 1 SV = 1 ‐ [SCX4_LEIQU]569.990.77%82618657.28.1
P01486 Alpha‐toxin Bot11 OS = Buthus occitanus tunetanus PE = 1 SV = 1 ‐ [SCXB_BUTOC]76.6335.38%2137657.57.87
P60255 Toxin Boma6a OS = Buthus occitanus mardochei PE = 3 SV = 1 ‐ [SCXA_BUTOM]46.0115.15%2112667.57.09
P17728 Alpha‐insect toxin LqhaIT OS = Leiurus quinquestriatus hebraeus PE = 1 SV = 2 ‐ [SCXA_LEIQH]369.6151.76%42513859.68.12
P04098 Neurotoxin 8 (Fragment) OS = Buthus occitanus tunetanus PE = 1 SV = 1 ‐ [SCX8_BUTOC]536.3477.78%23313364.16.24
P55902 Alpha‐insect toxin BotIT1 OS = Buthus occitanus tunetanus PE = 1 SV = 1 ‐ [SIX1_BUTOC]296.3561.54%1139657.37.55
P01488 Alpha‐toxin Bot1 OS = Buthus occitanus tunetanus PE = 1 SV = 2 ‐ [SCX1_BUTOC]185.3520.00%1113657.36.92
P81504 Insect toxin AaHIT5 OS = Androctonus australis PE = 1 SV = 1 ‐ [SIX5_ANDAU]49.4224.59%1112616.94.83
P01485Alpha‐mammal toxin Bot3 (Fragment) OS = Buthus occitanus tunetanus PE = 1 SV = 2 ‐ [SCX3_BUTOC]436.1761.11%32561728.17.53
P59863 Beta‐toxin BotIT2 OS = Buthus occitanus tunetanus PE = 1 SV = 1 ‐ [SIX2_BUTOC]164.4141.67%1224606.94.84
Q17254 Alpha‐insect toxin Bot14 OS = Buthus occitanus tunetanus PE = 2 SV = 1 ‐ [SCXE_BUTOC]91.7818.82%1119859.28.5
O77091 Beta‐insect excitatory toxin BmK IT‐AP OS = Mesobuthus martensii GN = IT‐AP PE = 1 SV = 1 ‐ [SIXP_MESMA]50.9317.78%81259010.25.36
P59864 Beta‐insect depressant toxin BotIT6 OS = Buthus occitanus tunetanus PE = 1 SV = 1 ‐ [SIX6_BUTOC]78.8353.23%1237627.38.1
P0C294Toxin Acra I‐3 OS = Androctonus crassicauda PE = 2 SV = 1 ‐ [TX13_ANDCR]43.588.75%1111808.88.25
P59360 Neurotoxin BmK‐II OS = Mesobuthus martensii PE = 1 SV = 1 ‐ [SCX2_MESMA]62.8815.63%3112647.27.09
KScTxsP0CC12Potassium channel toxin alpha‐KTx 8.5 OS = Odontobuthus doriae PE = 1 SV = 1 ‐ [KAX85_ODODO]86.2548.28%2112293.25.1
P83407Potassium channel toxin alpha‐KTx 19.1 OS = Mesobuthus martensii PE = 1 SV = 1 ‐ [KA191_MESMA]82.2232.26%1115313.38.73
Q95NJ8Potassium channel toxin alpha‐KTx 17.1 OS = Mesobuthus martensii PE = 1 SV = 1 ‐ [KA171_MESMA]79.7916.36%1115556.28
P80669Potassium channel toxin alpha‐KTx 9.3 OS = Leiurus quinquestriatus hebraeus PE = 1 SV = 1 ‐ [KAX93_LEIQH]211.7892.86%32292836.98
Q9NJC6 Potassium channel toxin BmTXK‐beta OS = Mesobuthus martensii PE = 2 SV = 1 ‐ [KBX2_MESMA]135.0527.78%22239010.48.82
Q9N661 Potassium channel toxin BmTXK‐beta‐2 OS = Mesobuthus martensii PE = 2 SV = 1 ‐ [KBX1_MESMA]96.942.86%32479110.28.57
P86399Neurotoxin lamda‐MeuTx OS = Mesobuthus eupeus PE = 1 SV = 2 ‐ [TXL_MESEU]26225.00%2118647.27.12
P80670Toxin GaTx2 OS = Leiurus quinquestriatus hebraeus PE = 1 SV = 1 ‐ [KAX83_LEIQH]86.0248.28%2112293.25.1
CaScTxsP83406Neurotoxin Tx‐2 OS = Buthotus judaicus PE = 1 SV = 1 ‐ [SCBT2_BUTJU]287.6360.71%12210282.94.89
ClScTxs P86436 Chlorotoxin‐like peptide OS = Androctonus australis PE = 1 SV = 1 ‐ [CTXL_ANDAU]993.1867.65%13365343.68.34
P45639 Chlorotoxin OS = Leiurus quinquestriatus quinquestriatus PE = 1 SV = 1 ‐ [SCXL_LEIQU]588.3838.89%122383648.13
P01498Neurotoxin P2 OS = Androctonus mauretanicus mauretanicus PE = 1 SV = 1 ‐ [SCXP_ANDMA]188.4871.43%1225353.77.88
Amphipathic peptideB8XH50Amphipathic peptide Tx348 OS = Buthus occitanus israelis PE = 2 SV = 1 ‐ [NDB5R_BUTOS]87.2719.40%4111677.89.19

Underlined peptide entries were identified by in‐gel and in‐solution digestion methods.

Bottom‐up data generated from in‐gel digestion of B. occitanus venom filtrate using nano‐LC‐MS/MS. Data sets generated from the mass spectrometer were analyzed by the proteome discover 2.2 software, against UniProtKB/Swiss‐Prot database. Underlined peptide entries were identified by in‐gel and in‐solution digestion methods. Bottom‐up data generated from in‐solution digestion of B. occitanus venom filtrate using nano‐LC‐MS/MS. Data sets generated from the mass spectrometer were analyzed by the proteome discover 2.2 software, against UniProtKB/Swiss‐Prot database. Underlined peptide entries were identified by in‐gel and in‐solution digestion methods. According to the results, 23 of the entries were detected by both digestion methods (Tables 3 and 4). Thus, 14 peptides were identified only by the in‐solution digestion method, for example, alpha‐toxin Amm5 (P01482), alpha‐mammal toxin Bot3 (P01485), potassium channel toxin alpha‐KTx 9.3 (P80669), neurotoxin Tx‐2 (P83406), neurotoxin P2 (P01498), and amphipathic peptide Tx348 (B8XH50). Otherwise, regarding the in‐gel digestion results, 13 peptides were identified only by this method of digestion, for example, potassium channel toxin alpha‐KTx 9.11 (B3EWX9); sodium channel alpha‐toxin Acra4 (M1JBC0); sodium channel alpha‐toxin Acra8 (M1JMR8), alpha‐toxin Ac3 (fragment) (D5HR52); and beta‐insect depressant toxin BotIT5 (P55904). Since the aim of using two methods of digestions was to identify the maximum of peptide, the data generated by bottom‐up approaches using in‐gel and in‐solution digestions were then summarized in Table 5; the repeated molecules were deleted and thus allowed the detection of a total of 50 peptides, which were divided into different categories according to their molecular functions. The generated data from the bottom‐up process confirmed that the family with the most diverse members in this venom is neurotoxins, with 31 NaScTxs (62%, 4.3–10.2 kDa), 13 KScTxs (26%, 2.9–10.4 kDa), three ClScTxs (6%, 3.6–4 kDa), one CaScTx (2%, 2.9 kDa), and one toxin Acra (2%, 8.8 kDa).
Table 5

List of the 50 peptides detected by the bottom‐up analysis of the reduced/alkylated B. occitanus venom filtrate. Data sets generated from the mass spectrometer were analyzed by the proteome discover 2.2 software, against UniProtKB/Swiss‐Prot database.

CategoryAccessionDescriptionMW (kDa)SpeciesDigestion method
NaScTxP86406Neurotoxin MeuNaTx‐67.8 Mesobuthus eupeus In‐gel digestion
P59863 Beta‐toxin BotIT26.9 Buthus occitanus tunetanus Both
D5HR52Alpha‐toxin Ac3 (Fragment)7.8 Androctonus crassicauda In‐gel digestion
P55904Beta‐insect depressant toxin BotIT56.8 Buthus occitanus tunetanus In‐gel digestion
O77091 Beta‐insect excitatory toxin BmK IT‐AP10.2 Mesobuthus martensii Both
P68723Beta‐insect excitatory toxin LqhIT1c9.9 Leiurus quinquestriatus hebraeus In‐gel digestion
P59360 Neurotoxin BmK‐II7.2 Mesobuthus martensii Both
P15224Toxin Os17.6 Orthochirus scrobiculosus In‐gel digestion
D5HR50Alpha‐toxin Ac1 (Fragment)8.7 Androctonus crassicauda In‐gel digestion
M1JMR8 Sodium channel alpha‐toxin Acra87.5 Androctonus crassicauda Both
M1JBC0Sodium channel alpha‐toxin Acra47.1 Androctonus crassicauda In‐gel digestion
Q86SE0Toxin Aam29.3 Androctonus amoreuxi Both
P21150 Toxin AaHIT47.8 Androctonus australis Both
P01482Alpha‐toxin Amm57.3 Androctonus mauretanicus mauretanicus In‐solution digestion
P01481Alpha‐mammal toxin Lqq57.3 Leiurus quinquestriatus quinquestriatus In‐solution digestion
P13488 Alpha‐like toxin Bom36.9 Buthus occitanus mardochei Both
P45698 Neurotoxin BmK‐M98.8 Mesobuthus martensii In‐solution digestion
P68721Beta‐insect excitatory toxin LqhIT1a9.9 Leiurus quinquestriatus hebraeus Both
P0DJH8Alpha‐toxin Bu17.5 Buthacus macrocentrus Both
P83644 Toxin Lqh47.2 Leiurus quinquestriatus hebraeus Both
P01489Alpha‐toxin Lqq47.2 Leiurus quinquestriatus quinquestriatus Both
P01486Alpha‐toxin Bot117.5 Buthus occitanus tunetanus In‐solution digestion
P60255 Toxin Boma6a7.5 Buthus occitanus mardochei Both
P17728 Alpha‐insect toxin LqhaIT9.6 Leiurus quinquestriatus hebraeus Both
P04098Neurotoxin 8 (Fragment)4.1 Buthus occitanus tunetanus Both
P55902 Alpha‐insect toxin BotIT17.3 Buthus occitanus tunetanus Both
P01488 Alpha‐toxin Bot17.3 Buthus occitanus tunetanus Both
P81504 Insect toxin AaHIT56.9 Androctonus australis Both
P01485 Alpha‐mammal toxin Bot3 (Fragment)8.1 Buthus occitanus tunetanus In‐solution digestion
P83406Neurotoxin Tx‐22.9 Buthotus judaicus In‐solution digestion
Q17254 Alpha‐insect toxin Bot149.2 Buthus occitanus tunetanus Both
P59864Beta‐insect depressant toxin BotIT67.3 Buthus occitanus tunetanus In‐solution digestion
P0C294Toxin Acra I‐38.8 Androctonus crassicauda In‐solution digestion
KScTxB3EWX9Potassium channel toxin alpha‐KTx 9.112.9 Mesobuthus gibbosus In‐gel digestion
P0C161Potassium channel toxin alpha‐KTx 2.84.3 Centruroides elegans In‐gel digestion
B8XH42Potassium channel toxin alpha‐KTx 16.66.5 Buthus occitanus israelis Both
P0CC12Potassium channel toxin alpha‐KTx 8.53.2 Odontobuthus doriae In‐solution digestion
P59869Potassium channel toxin alpha‐KTx 5.43.5 Mesobuthus tamulus In‐gel digestion
B8XH40Potassium channel toxin BuTXK‐beta10.2 Buthus occitanus israelis In‐gel digestion
Q95NJ8Potassium channel toxin alpha‐KTx 17.16.2 Odontobuthus doriae In‐solution digestion
P83407Potassium channel toxin alpha‐KTx 19.13.3 Mesobuthus martensii in‐solution digestion
P80669Potassium channel toxin alpha‐KTx 9.33 Leiurus quinquestriatus hebraeus In‐solution digestion
P86399Neurotoxin lamda‐MeuTx7.2 Mesobuthus eupeus In‐solution digestion
Q9NJC6Potassium channel toxin BmTXK‐beta10.4 Mesobuthus martensii Both
Q9N661 Potassium channel toxin BmTXK‐beta‐210.2 Mesobuthus martensii Both
ClScTxP01498Neurotoxin P23.7 Androctonus mauretanicus mauretanicus in‐solution digestion
P86436Chlorotoxin‐like peptide3.6 Androctonus australis Both
P45639Chlorotoxin4 Leiurus quinquestriatus quinquestriatus Both
P80670Toxin GaTx23.2 Leiurus quinquestriatus hebraeus In‐solution digestion
Amphipathic peptideB8XH50Amphipathic peptide Tx3487.8 Buthus occitanus israelis In‐solution digestion

Peptide entries in bold were identified by both top‐down and bottom‐up approaches.

In addition to these neurotoxins, we identified one amphipathic peptide (2%, 7.8 kDa) (Fig. 5B). Also, some peptides were detected as fragments (10% of total): alpha‐toxin Ac1 (D5HR50) and Ac3 (D5HR52); alpha‐mammal toxin Bot3 (P01485); and neurotoxin 8 (P04098). List of the 50 peptides detected by the bottom‐up analysis of the reduced/alkylated B. occitanus venom filtrate. Data sets generated from the mass spectrometer were analyzed by the proteome discover 2.2 software, against UniProtKB/Swiss‐Prot database. Peptide entries in bold were identified by both top‐down and bottom‐up approaches. As we mentioned above, we aimed to gain a deeper understanding of the B. occitanus peptidome (under 30 kDa), so the molecular diversity of its toxins. In this context, we combined data from the top‐down and bottom‐up analyses and then analyzed the generated data to infer a global and comprehensive characterization of this venom. According to this study, a total of 118 peptides were identified from B. occitanus venom; among them, 16 were identified by both approaches, for example, potassium channel toxin BmTXK‐beta‐2 (Q9N661); toxin AaHIT4 (P21150); and alpha‐mammal toxin Bot3 (Fragment) (P01485). Among the 102 identified peptides, the most representative category is neurotoxins, mainly NaScTxs (77%), followed by KScTxs (14%), ClScTxs (3%), CaScTx (1%), and toxin Acra (1%). We also characterized other peptides with low percentage such as AMPs (1%), amphipathic peptides (1%), hypothetical secreted proteins (1%), and myotropic neuropeptides (1%) (Fig. 6).
Fig. 6

Summary of the total peptides identified by top‐down and bottom‐up approaches. The 102 peptides were divided into neurotoxins, including NaScTxs, KScTxs, ClScTxs, CaScTx and toxin Acra, amphipathic peptide, myotropic neuropeptide, AMPs, and hypothetical secreted protein.

Summary of the total peptides identified by top‐down and bottom‐up approaches. The 102 peptides were divided into neurotoxins, including NaScTxs, KScTxs, ClScTxs, CaScTx and toxin Acra, amphipathic peptide, myotropic neuropeptide, AMPs, and hypothetical secreted protein. The majority of described peptides were identified for the first time in this Moroccan B. occitanus scorpion venom. The identified peptides showed sequence similarities with toxins previously detected from several genera of scorpions (Fig. 7), principally Mesobuthus sp (30%), Buthus Sp (20%), and Androctonus sp (18%).
Fig. 7

Percentage of B. occitanus peptides, which showed similarity of sequences with others from several scorpion genera.

Percentage of B. occitanus peptides, which showed similarity of sequences with others from several scorpion genera.

Discussion

Envenomation following scorpion stings constitutes one of the most encountered emergencies in large parts of the world, especially in North Africa, where the data show the highest incidence and lethality [1]. Morocco is a country known for a high risk of envenomation owing to its huge and diversified scorpion fauna. Among the different scorpion species living in this country, the yellow scorpion B. occitanus is one of the most dangerous species with venom responsible for severe cases of envenomation. Due to the limited knowledge about the composition and toxin arsenal of B. occitanus venom, we aimed in this study to elaborate the first exhaustive view of this scorpion venom peptidome and its molecular diversity, using mass spectrometry‐based top‐down and bottom‐up approaches. Top‐down data sets showed that the venom of B. occitanus is very complex, counting around 200 MWs ranging from 1868 to 16 720 Da. A similar number of components have been revealed by previous studies [32, 33, 34], others showed fewer components, as well as Leiurus abdullahbayrami (45 masses) and Opisthacanthus elatus (106 masses) [35, 36], whereas some other scorpion venoms were more complex, such as the Pandinus cavimanus (390 masses) and Centruroides limpidus (395 masses) [37, 38]. Additionally, the repartition of MWs showed that < 1% were components with molecular masses < 2000 Da, 14% were those from 2000 to 5000 Da, 74% were those between 5000 and 8000 Da, and 10% were those over than 8000 Da, while the repartition of MW from the French B. occitanus scorpion venom showed an abundance of molecules ranging from 2000 to 3000 Da and those less than 2000 Da [39]. Most importantly, the whole sequences of five toxins were identified with 100% sequence coverage using the top‐down approach. These neurotoxins were detected for the first time in this venom; they all belong to the NaScTxs category and shared high similarities of sequence with toxins identified from other scorpion species: neurotoxin BmK‐II (P59360), beta‐insect depressant toxin BotIT4 (P55903), beta‐insect depressant toxin BaIT2 (P80962), insect toxin LqhIT5 (P81240), and insect toxin BsIT4 (P82814). It is important to stress that the observed sequence of the P59360 entry with a MW of 7431.33 Da showed 100% similarity with the sequence of neurotoxin BmK‐II isolated from the Chinese scorpion Mesobuthus martensii, this neurotoxin is active in mammal and insect Nav channel [40]. In contrast, the detected sequence of the P81240 entry (6611.8 Da) showed the presence of methionine in the N‐terminal compared with the database sequence of the Insect toxin LqhIT5, an excitatory insect beta‐toxin from the Leiurus hebraeus scorpion [41]. Similar to the P82814 entry (6954.15 Da), in which the observed sequence corresponds 100% to the insect toxin BsIT4, a depressant insect beta‐toxins was isolated from Hottentotta tamulus sindicus [42]. Also, the observed sequence of the peptide corresponding to the depressant toxin BotIT4 (6837. 96 Da) presents methionine in N‐terminal compared with the database sequence. This toxin, identified for the first time from the Tunisian Buthus tunetanus [43], showed also 100% sequence identity with the P80962 entry (6845.9 Da), referred to the beta‐insect depressant toxin BaIT2 isolated from the Buthacus arenicola scorpion [44]. The high similarity of the amino acid sequence, in both detected depressant toxins and in the other peptides is commonly observed in scorpion toxins. Interestingly, the combined top‐down and bottom‐up data sets of B. occitanus venom provide the identification of 102 different peptides, whereas 147 proteins were characterized from the yellow Brazilian scorpion Tityus serrulatus, 60 of which were detected by the top‐down approach [45]. The major representative category of components identified in our venom was neurotoxins, mainly NaScTxs (77%), these neurotoxins are abundant in species from the Buthidae family [38, 46, 47] and less representative in scorpions from the non‐Buthidae family [33, 48, 49]. Those toxins are the ones responsible for envenomation symptoms [39]; their high content in the B. occitanus venom could explain the involvement of this scorpion in lethal cases of envenoming in the country. Between the entries corresponding to NaScTxs, there are alpha‐like toxins, this type of toxins had been already identified in several Buthus sp; yet, the alpha‐toxin Bot1 (P01488) has never been found in other Moroccan Buthus subspecies except from Buthus mardochei [39, 50, 51, 52, 53], but identified herein with a high sequence coverage (98.48% on top‐down data set). We should mention also that we identified for the first time, in this scorpion venom, peptides corresponding to atypical NaScTxs, as well as makatoxin‐1, fragment from makatoxin‐2, toxin Cg2, chain [20‐87] in venom toxin meuNa32, and AaHIT4 toxin (which could bind on receptor site 3 or 4 of sodium channel) [33]. Besides NaScTxs and KScTxs (14%), ClScTxs (3%) were identified, these categories of peptides showed activities against autoimmune disease and cancers, respectively [54, 55, 56, 57, 58]; also, we identified one entry that shared 60% of similarity with neurotoxin Tx‐2 (P83406), a calcium channel activator identified for the first time from the Buthotus judaicus, this category of toxins was identified in few scorpion species, for example, Parabuthus transvaalicus (Kurtoxin) and Parabuthus granulatus (Kurtoxin‐like I) but never been detected in a Moroccan scorpion venom [59, 60]. And last but not least, peptides referring to toxin Acra category have also been screened in B. occitanus venom, these toxins probably acting on ion channels. Some peptides with antibacterial activities were also found, for example, amphipathic peptide (B8XH50) and AMP AcrAP1 (A0A059UI30); this category was commonly present in scorpion venom due to its role in the protection of venom glands and its involvement in the neurotoxic effects [61, 62, 63, 64, 65]. Additionally, other components were identified with a low percentage, such as orcokinin, a myotropic neuropeptide identified from crustaceans, insects, and arachnids [17, 66], and hypothetical secreted proteins, which are proteins with unknown activities. Finally, we notice that some of the detected toxins were identified as fragments and chains, which may be due to the proteolysis of toxins. This process seems to be a usual PTM in scorpion and snake venoms, whereas its biological pertinence remains obscure [17, 45]. This study decrypted the peptidome arsenal of the Moroccan B. occitanus scorpion venom through proteomic view without the de novo sequence annotation. These findings constitute a step forward to a ‘deeper’ understanding of this scorpion venom; nevertheless, complete identification of this complex matrix is still a challenging task, especially with the lack of a specific database and/or a complete sequenced genome of this venom.

Conclusion

Herein; we reported the first proteomic study of the Moroccan B. occitanus scorpion peptidome, using mass spectrometry‐based top‐down and bottom‐up venomic approaches. The combination of these approaches allowed the identification of 102 components classified, with approximation, on different categories, mainly neurotoxins (96%), including NaScTxs (77%), KScTxs (14%), ClScTxs (3%), CaScTx (1%), and toxin Acra (1%). We also identified AMPs (1%), amphipathic peptides (1%), hypothetical secreted proteins (1%), and myotropic neuropeptides (1%). This study constitutes for sure a step forward to a deeper understanding of the B. occitanus venom; nevertheless, complete identification of this complex matrix is still a challenging task, especially with the lack of a specific database and a complete sequenced genome.

Conflict of interest

The authors declare no conflict of interest.

Author contributions

NO and JCR conceived the research. KD and CM performed experiments. KD and CM analyzed the data. KD interpreted data and wrote the manuscript. AL, BD, and SC participated in writing. JMS and RC reviewed the manuscript. NO designed the project, supervised the study, and reviewed the manuscript. All authors read and approved the final version for publication. Fig. S1. SDS/PAGE profile of the < 30 kDa filtrate of Buthus occitanus venom. Molecular weight markers (MM) are indicated in kDa. Proteins/Peptides were stained with Coomassie Brilliant Blue R (InstantBlue, Expedeon, CA, USA). Stained bands corresponding to proteins/peptides with massed < 30 kDa were manually excised into equal small cubes of 1 mm3 and subjected to a nanoLC‐MS/MS analysis. Click here for additional data file. Fig. S2. Detected amino acid sequences of the 68 peptides identified by Top‐down approach. Click here for additional data file.
  64 in total

1.  Purification, characterization, and primary structure of four depressant insect-selective neurotoxin analogs from scorpion (Buthus sindicus) venom.

Authors:  S A Ali; S Stoeva; J G Grossmann; A Abbasi; W Voelter
Journal:  Arch Biochem Biophys       Date:  2001-07-15       Impact factor: 4.013

2.  Moving pieces in a venomic puzzle: unveiling post-translationally modified toxins from Tityus serrulatus.

Authors:  Thiago Verano-Braga; Alexandre A A Dutra; Ileana R León; Marcella N Melo-Braga; Peter Roepstorff; Adriano M C Pimenta; Frank Kjeldsen
Journal:  J Proteome Res       Date:  2013-06-13       Impact factor: 4.466

3.  A depressant insect-selective toxin analog from the venom of the scorpion Leiurus quinquestriatus hebraeus--purification and structure/function characterization.

Authors:  H Moskowitz; R Herrmann; A D Jones; B D Hammock
Journal:  Eur J Biochem       Date:  1998-05-15

4.  Two neurotoxins (BmK I and BmK II) from the venom of the scorpion Buthus martensi Karsch: purification, amino acid sequences and assessment of specific activity.

Authors:  Y H Ji; P Mansuelle; S Terakawa; C Kopeyan; N Yanaihara; K Hsu; H Rochat
Journal:  Toxicon       Date:  1996-09       Impact factor: 3.033

5.  Purification and characterization of chlorotoxin, a chloride channel ligand from the venom of the scorpion.

Authors:  J A DeBin; J E Maggio; G R Strichartz
Journal:  Am J Physiol       Date:  1993-02

6.  Chlorotoxin, a scorpion-derived peptide, specifically binds to gliomas and tumors of neuroectodermal origin.

Authors:  Susan A Lyons; Jeffrey O'Neal; Harald Sontheimer
Journal:  Glia       Date:  2002-08       Impact factor: 7.452

7.  Top-down venomics of the East African green mamba, Dendroaspis angusticeps, and the black mamba, Dendroaspis polylepis, highlight the complexity of their toxin arsenals.

Authors:  Daniel Petras; Paul Heiss; Robert A Harrison; Roderich D Süssmuth; Juan J Calvete
Journal:  J Proteomics       Date:  2016-06-16       Impact factor: 4.044

8.  A multi-protease, multi-dissociation, bottom-up-to-top-down proteomic view of the Loxosceles intermedia venom.

Authors:  Dilza Trevisan-Silva; Aline V Bednaski; Juliana S G Fischer; Silvio S Veiga; Nuno Bandeira; Adrian Guthals; Fabricio K Marchini; Felipe V Leprevost; Valmir C Barbosa; Andrea Senff-Ribeiro; Paulo C Carvalho
Journal:  Sci Data       Date:  2017-07-11       Impact factor: 6.444

9.  Antimicrobial and Antibiofilm Effects of Peptides from Venom of Social Wasp and Scorpion on Multidrug-Resistant Acinetobacter baumannii.

Authors:  Rogério Coutinho das Neves; Márcia Renata Mortari; Elisabeth Ferroni Schwartz; André Kipnis; Ana Paula Junqueira-Kipnis
Journal:  Toxins (Basel)       Date:  2019-04-10       Impact factor: 4.546

Review 10.  It is time for top-down venomics.

Authors:  Rafael D Melani; Fabio C S Nogueira; Gilberto B Domont
Journal:  J Venom Anim Toxins Incl Trop Dis       Date:  2017-10-18
View more
  2 in total

1.  Anti-Cancer Effect of Moroccan Cobra Naja haje Venom and Its Fractions against Hepatocellular Carcinoma in 3D Cell Culture.

Authors:  Ayoub Lafnoune; Su-Yeon Lee; Jin-Yeong Heo; Imane Gourja; Bouchra Darkaoui; Zaineb Abdelkafi-Koubaa; Fatima Chgoury; Khadija Daoudi; Salma Chakir; Rachida Cadi; Khadija Mounaji; Najet Srairi-Abid; Naziha Marrakchi; David Shum; Haeng-Ran Seo; Naoual Oukkache
Journal:  Toxins (Basel)       Date:  2021-06-04       Impact factor: 4.546

2.  Toxinology in the proteomics era: a review on arachnid venom proteomics.

Authors:  Filipi Calbaizer Marchi; Edneia Mendes-Silva; Lucas Rodrigues-Ribeiro; Lucas Gabriel Bolais-Ramos; Thiago Verano-Braga
Journal:  J Venom Anim Toxins Incl Trop Dis       Date:  2022-02-28
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.