Literature DB >> 35868020

An integrated strategy for the comprehensive profiling of the chemical constituents of Aspongopus chinensis using UPLC-QTOF-MS combined with molecular networking.

Fengyu Zhang1,2, Bichen Li1,2, Ying Wen1,2, Yanyang Liu1,2, Rong Liu1,2, Jing Liu1,2, Shao Liu1,2, Yueping Jiang1,2.   

Abstract

CONTEXT: The extracts of Aspongopus chinensis Dallas (Pentatomidae), an insect used in traditional Chinese medicine, have a complex chemical composition and possess multiple pharmacological activities.
OBJECTIVE: This study comprehensively characterizes the chemical constituents of A. chinensis by an integrated targeted and untargeted strategy using UPLC-QTOF-MS combined with molecular networking.
MATERIALS AND METHODS: The ultra-performance liquid chromatography-tandem quadrupole time-of-flight mass spectrometry (UPLC-QTOF-MS) combined with molecular networking-based dereplication was proposed to facilitate the identification of the chemical constituents of aqueous and ethanol extracts of A. chinensis. The overall strategy was designed to avoid the inefficiency and costliness of traditional techniques. The targeted compounds discovered in the A. chinensis extracts were identified by searching a self-built database, including fragment ions, precursor ion mass, and other structural information. The untargeted compounds were identified by analyzing the relationship between different categories, fragmentation pathways, mass spectrometry data, and the structure of the same cluster of nodes within the molecular network. The untargeted strategy was verified using commercial standard samples under the same mass spectrometry conditions.
RESULTS: The proposed integrated targeted and untargeted strategy was successfully applied to the comprehensive profiling of the chemical constituents of aqueous and ethanol extracts of A. chinensis. A total of 124 compounds such as fatty acids, nucleosides, amino acids, and peptides, including 74 compounds that were reported for the first time, were identified in this study.
CONCLUSIONS: The integrated strategy using LC tandem HRMS combined with molecular networking could be popularised for the comprehensive profiling of chemical constituents of other traditional insect medicines.

Entities:  

Keywords:  Dereplication; cluster of nodes; mass spectrometry data; self-built database; targeted and untargeted strategy; traditional insect medicine

Mesh:

Substances:

Year:  2022        PMID: 35868020      PMCID: PMC9310793          DOI: 10.1080/13880209.2022.2096078

Source DB:  PubMed          Journal:  Pharm Biol        ISSN: 1388-0209            Impact factor:   3.889


Introduction

Aspongopus chinensis Dallas is an insect belonging to the Pentatomidae family (Hemiptera: Heteroptera: Pentatomidae); it has a shape resembling a turtle. It is widely distributed in China, especially in the provinces of Guizhou, Sichuan, Guangxi, and Yunnan (Tan et al. 2019). According to previous reports, the main chemical components in A. chinensis are fatty acids, proteins, amino acids, and other nutrients, as well as odour components, nucleosides, and dopamine compounds (Li et al. 2020). It is commonly used as a traditional medicine to relieve pain, warm the stomach, and treat nephropathy (Yan et al. 2019). Modern pharmacology shows that the extracts of A. chinensis possess strong protective effects including anticancer (Hou et al. 2012), antibacterial (Wu and Jin 2005), anti-inflammatory (Shi et al. 2014), antioxidant, anticoagulation (Xu 2019), antiulcer, and antifatigue activities (Li et al. 2020). The aqueous extract of A. chinensis improves the reproductive ability and protects against reproductive damage (He et al. 2016). A study showed that the fatty oil from the extract of A. chinensis has an antiulcer effect (Hou et al. 2012). Previous studies showed that trichloromethane extracts from A. chinensis as well as serum of this insect inhibit the proliferation of human gastric, colon, and breast cancer cells (Fan et al. 2011; Wei et al. 2019; Zhao et al. 2021). Moreover, the aqueous extracts from A. chinensis have a significant inhibitory effect on the proliferation, migration, and invasion of the human lung adenocarcinoma cell line A549 (Wu et al. 2020). However, in addition to these widely present components, unique small molecules in the insect are largely unknown. Therefore, in this work, a rapid method was used to examine the compounds included in the aqueous and ethanol extracts of A. chinensis and their differences were discussed. Our findings represent a further development in the use of A. chinensis extracts. The Global Natural Products Social Molecular Networking (GNPS, https://gnps.ucsd.edu/ProteoSAFe/static/gnps-splash.jsp) is an open-access online platform that generates automated molecular networking (MN) for analyzing mass spectrometry (MS/MS) datasets (Wang et al. 2016). MN is useful to speculate on the properties of multiple unannotated compounds in complex matrices by relating them to known compounds, especially structural analogues, based on the mass spectrum information and fragmentation pathway (Zhao et al. 2022). This platform shows strong integration and classification abilities of information obtained with MS, playing an important role in the rapid identification of compounds on a large scale and the discovery of novel framework compounds. Since molecular networking could identify the types of new compounds, experiments could be designed to isolate target compounds based on the characteristics of the identified type of compound. Thereby, the procedure for discovering new compounds can be simplified and the purpose of targeted separation can be achieved. Furthermore, the time necessary to discover new active compounds is greatly shortened using MN. At present, MN has been successfully used in the discovery of novel natural products, microbes, fungi as well as in marine life research (Chen et al. 2021; Rodrigues et al. 2022; Wang et al. 2022). The MN combined with UPLC-QTOF-MS is mostly used to analyze the metabolic components of fungi and one type of components in plants used in traditional Chinese medicines (TCMs) (Messaili et al. 2020; Santos et al. 2021; Wang et al. 2021), for example, dihydrochalcones in Star Fruit (Wang et al. 2021). UPLC-QTOF-MS is a powerful technique enabling the rapid characterization of the components in herbal medicines, resulting in an accurate mass determination and comprehensive MS data. This technique is widely used in the analysis of complex samples thanks to its high resolution and sensitivity (Huang et al. 2021). However, the processing of a large amount of data obtained from MS is time-consuming and complex. Several processing strategies have been applied to handle large amounts of data for a rapid elucidation of the structure of the non-targeted and complex constituents of TCMs including multifold characteristic ion filtering combined with statistical analysis (Jiang et al. 2019), key ion filtering, high-resolution neutral loss filtering, diagnostic ion filtering, high-resolution diagnostic product ions/neutral loss filtering, mass defect and fragment filtering, and pharmacophore filtering (Jiang et al. 2018). However, none of these strategies can rapidly, directly and comprehensively profile non-targeted constituents or metabolomics in non-targeted mass spectral data without the help of reference standards. Some strategies of processing data, such as combined diagnostic fragment ion filtering with reverse diagnostic fragment loss filtering and structural features guided ‘fishing’ can profile non-targeted constituents or metabolomics in non-targeted mass spectral data without the need for reference standards (Liu et al. 2019; Wang et al. 2020). However, while these strategies are accurate when analyzing a certain type of compound in TCMs, they tend to be time-consuming and unclear when multiple classes of components in TCMs are analyzed. At present, most mass spectra in the GNPS database are natural products of microorganisms and marine organisms, while the mass spectra of plant-source compounds are limited. Therefore, the database may not be effective for some natural products only derived from plants. However, the GNPS database was quite suitable for the identification of chemical constituents in traditional insect medicines, because the insects’ TCMs mainly contain these compounds such as fatty acids, amino acids nucleosides as well as dopamine compounds that were contained in the GNPS database. The chemical composition analysis of insect TCMs is few investigated. MN combined with UPLC-QTOF-MS is a powerful method for rapid analysis of the chemical constituents of medicines from insects in TCMs. To our knowledge, this is the first systematic and comprehensive method for analytical screening which allows the rapid analysis of the chemical constituents of medicines from insects in TCM. Therefore, in this study, a rapid and practical UPLC-QTOF-MS method was combined with a reliable and powerful data processing approach, molecular network and an in-house database. This method simultaneously recognized known and unknown chemical constituents of the traditional Chinese medicine from the insect A. chinensis in a short time. Our results demonstrated that this strategy could be suitable for the comprehensive profiling of complex components of TCMs without the help of reference standards.

Materials and methods

Reagents and materials

Acetonitrile (LC-MS grade, Cat. no. JA 105030) and methanol (HPLC grade, Cat. no. I1136107 107) were purchased from Merck (Darmstadt, Germany). LC-MS grade formic acid was purchased from Sigma-Aldrich (St. Louis, MO, USA, Cat. no. 197147). Distilled water was purchased from Watsons Distilled Water Company (Hong Kong, China, Cat. no.202107291357070E3). Methanol at 99.5% (Analytical reagent, Cat. no. 20211020801) and ethanol at 99.5% (Analytical reagent, Cat. no. 2020032602) were purchased from Chengdu Kelong Chemical Co., Ltd. (Chengdu, Sichuan, China). Aspongopus chinensis was purchased from the medicinal materials market of Bozhou City in 2020 (Bozhou City, Anhui, China) and authenticated by Prof. Shao Liu, Department of Pharmacy, Xiangya Hospital, Central South University. The voucher specimen of A. chinensis (ID 20200320) was deposited at the authors’ laboratory. Cyclo (Val-Val-) (Cat. no. S68110) at 95% was purchased from Acmec Technology Co. Ltd. (Shanghai, China); kynurenic acid (Cat. no. B50754, HPLC ≥ 98%) was purchased from Yuanye Technology Co. Ltd. (Shanghai, China). N-(2-Hydroxyethyl) adenosine (Cat. no. Lc0328024) at 98.39% was purchased from Leyan (Shanghai Haohong Biomedical Technology Co. Ltd., Shanghai, China). 1,2,3,4-tetrahydro-β-carboline-3-carboxylic acid (Cat. no. C12570529) at 97% was purchased from Macklin Technology Co. Ltd. (Shanghai, China).

Sample preparation

The processed product from A. chinensis (1 kg) was pulverized, and extracted with 4 L water two times, each time for 1 h (40 kHz, 210 W, 40 °C). Next, distilled water was filtered to obtain the aqueous extract and the medicine dregs. Then, the medicine dregs were extracted with 80% ethanol two times, each time for 1 h. The 80% ethanol extracts were filtered. The aqueous extracts and ethanol extracts were vacuum freeze-dried in a freeze dryer (Chaist, Alpha 1-2 LD plus, Germany) and quantified. Each of the extracts of A. chinensis powder (10 mg) was accurately weighed, 1 mL methanol was added, and then the mixture was shaken for 1 min at 25 °C. A test solution with a mass concentration of 10 mg/mL was obtained by filtration through a 0.22 µm membrane (Millipore, Merck Millipore Ltd., Germany).

UPLC-QTOF MS analysis

The chromatographic evaluation was performed using an Agilent 1290 series UPLC system (Agilent Corp., Santa Clara, CA, USA) equipped with a binary pump, micro degasser, autosampler, and temperature-controlled column compartment. The sample was separated on an Agilent C clipsePlusC18 (1.8 µm, 2.1 × 100 mm, Agilent Corp., Santa Clara, USA). The mobile phase was composed of two solvents (A and B). The mobile phase (A) was aqueous formic acid (0.1%, v/v), and the mobile phase (B) was acetonitrile. The following mobile phase gradient was selected: 0–2 min: 2% B; 2–5 min: 2–10% B; 5–20 min: 10–30% B; 20–23 min: 30–50% B; 23–25 min: 50–60% B; 25–30 min: 60% B. Aspongopus chinensis mainly contain alkaloids and amino acids which are easier to ionize in positive ion mode. Therefore, a positive mode was used for the ionization of A. chinensis components. The flow rate was 0.3 mL/min, the injection volume was 4 μL, and the column temperature was 30 °C. The separated components were passed through an Agilent 6545 A Q-TOF mass spectrometer (Agilent Corp., Santa Clara, CA, USA), equipped with an ESI interface. The operating parameters were as follows. The drying N2 gas flow rate was 8 L/min, the temperature was 325 °C, the nebulizer was at 35 Psi, the capillary was 4000 V, MS/MS data acquisition mode was Auto MS/MS, the max precursor per cycle was 5, and the acquisition time was 250 ms/spectrum. The samples were analyzed in positive ion mode, and mass spectra data were recorded across a range of 50–1700 m/z. The reference mass of 121.0509 (purine) and 922.0098 (HP-0921) were utilized for the internal mass calibration during runs in positive ion mode. Fixed collision energies of 10.00, 20.00, 40.00 V were chosen at a scan rate of 4.0 spectra/s using Auto MS/MS data acquisition with a medium MS/MS isolation width.

Molecular network design

The original data of MS/MS spectra were converted into an mzML format that contains all the information regarding the analysis. Then, these data were uploaded to the GNPS platform for analysis (Chen et al. 2021; Rodrigues et al. 2022; Wang et al. 2022). Finally, the MS/MS spectra were compared pairwise to search for spectral similarities, including the same fragment ions and/or neutral losses. The optimum parameters were the following, including parent mass and fragments: tolerance 0.02 Da; cosine score ≥0.7; matched peaks ≥6; network TopK 10; maximum connected component size 100; minimum cluster size 1, No run MSCluster. The results were downloaded and exported to be visualized on Cytoscape 3.8.2 software [https://cytoscape.org]. The identification of the compounds was supported by spectral libraries of the GNPS. The MS/MS spectra of A. chinensis compounds were compared to the MS/MS spectra of the compounds contained in the GNPS library platform, using the following parameters, including library search min matched peaks of 6; score threshold of 0.7; maximum analog search mass difference of 100.

Results and discussion

Frame of integrated targeted and untargeted strategy

The objective of this work was to target and identify, tentatively, components from A. chinensis extract. The total ion chromatogram obtained with MS is shown in Figure 1. This study proposed an integrated interpretation strategy illustrated in Figure 2 in order to reach the purpose of efficient organization and annotation of data. A comprehensive analysis based on UPLC-QTOF-MS and GNPS platform, easy to perform, was proposed for targeted screening of known compounds. MN was utilized for structural classification by non-targeted data organization based on MS/MS spectral similarity. Starting with the nodes of known compounds, the adjacent unreported compounds in the visualized networks were identified by detailed analysis of the MS/MS spectra, and online database searching.
Figure 1.

Total ion Chromatogram (TIC) of extract of Aspongopus chinensis in positive ion mode.

Figure 2.

Aspongopus chinensis compounds identification strategy.

Total ion Chromatogram (TIC) of extract of Aspongopus chinensis in positive ion mode. Aspongopus chinensis compounds identification strategy. The targeted identification strategy was performed as follows. First, Pubmed (https://pubmed.ncbi.nlm.nih.gov/), PubChem (https://pubchem.ncbi.nlm.nih.gov/), China National Knowledge Infrastructure (CNKI; https://www.cnki.net/) and other Chinese and international databases. The related data were researched in literature to build an information database by results from high-resolution chromatography and mass spectrometry, including m/z ratio, ultraviolet absorption characteristics, relative molecular mass, fragments ion mass spectrometry, and accurate [M + H]+. Second, the library matching function of the Agilent MassHunter Qualitative Analysis B08.00 software was used to match the [M + H]+ and the fragment ion with the data from the TCM and Metlin databases that are built into the software. Chemical formulas with a score of ≥90 were selected for the next step of identification. Third, the MS/MS data obtained from the aqueous extract/ethanol extract matched with the data in the information database established in steps (1) and (2) identified the compounds found in the A. chinensis extract. The untargeted identification strategy was as follows. The identification of unannotated points was based on the characterization of a similar mass spectrum of the same type of compound, and each MN was performed using the same type of compound. The classification of annotated points was analyzed in order to determine the type of network where they were located. Then, the unannotated peaks were processed according to the following steps. First, the element composition of unannotated points was automatically deduced by the formula predictor of the Qualitative Navigator B.08.00 software. The elements carbon (C), hydrogen (H), oxygen (O), nitrogen (N), phosphorus (P), and sulphur (S) were selected to calculate the elemental composition of the compounds, and only consistent chemical formulas were considered. The maximum MS mass tolerance was set at ≤5 ppm. Second, the molecular formula predicted by the molecular predictor was uploaded into the SciFinder database (https://origin-scifinder.cas.org/scifinder/) to search for potential matching compounds. If they were found, the compound category was set to further filter the potential structural formula. Then, the fracture modes of the annotated points were summarised in the same network. The chromatographic characteristics and the MS/MS data, as well as the bibliographic information, were utilized for the identification of compounds. Potential structural formulas were verified by fragment ions and fracture modes to determine the structural formula of the unknown compound. Finally, Cytoscape 3.8.2 was used to visualize and edit the entire MN. Verification was performed as follows. Four potentially active compounds deduced by MN were chosen as standard reference substances. Their primary/secondary MS/MS data were obtained under the corresponding mass spectrometry conditions. Then, the retention time of the substance used as standard reference and the second-level mass spectrometry fragments were compared to the deduced unannotated points to confirm whether it was the same compound as the unannotated points or not.

UPLC-QTOF MS analysis result

Targeted identification result

UPLC-QTOF-MS was utilized to evaluate components of A. chinensis extracts. The A. chinensis extracts could be analyzed within 30 min. The molecular formula was accurately assigned, within a mass error of 5 ppm. The fragment ions were used to further confirm the chemical structure. In total, 49 compounds were identified from the A. chinensis extracts on the basis of the MS2 fragmentation information, TCM database, Metlin database, and literature. These compounds included nucleosides and their analogues, dopamine derivatives, sesquiterpenes, alkaloids, and cyclic peptides. The total ion chromatogram of the fraction of A. chinensis extracts in positive ion mode is shown in Figure 1. The information on these 49 compounds, including the compound name, retention time, formula, precursor ions, and fragment ions are listed in Table 1.
Table 1.

Characterisation of constituents in aqueous/ethanol extract of Aspongopus chinensis Dallas by UPLC-Q-TOF-MS/MS and MS2 analysis in positive ion mode.

NumbertR (min)[M + H]+MS2FormulaIdentificationModeSource
10.757243.097560.0812/84.080/1104.1069/124.9913C10H14N2O5Thymidine[M + H]+Alcohol
20.828148.061056.0495/57.0336/84.0441/85.0501/102.0547C5H9NO4Glutamic acid[M + H]+Water/alcohol
30.839252.1084136.0378/136.0745C10H13N5ODeoxyadenosine[M + H]+Water/alcohol
41.238126.055053.0389 68.0131 108.0442C6H7NO26-(Hydroxymethy1) pyridin-3-ol[M + H]+Water
51.318123.055353.0390 78.0340C6H6N2ONicotinamide[M + H]+Water/alcohol
61.466259.092556.0501 84.0444 124.0764 170.0779C10H14N2O62’-O-Methyluridine[M + H]+Water
71.655153.040755.0294 69.0082 81.0091 93.0069 110.0348 136.0144C5H4N4O2Xanthine[M + H]+Water/alcohol
81.707113.034653.9384 70.0656 95.0314C4H4N2O2Uracil[M + H]+Water/alcohol
91.767245.076855.0550 70.0291 113.0347 140.1088 173.1052 214.1260C9H12N2O6Uridine[M + H]+Water/alcohol
101.89486.060061.0287 69.0351C4H7NO2-Pyrrolidinone[M + H]+Water
112.490268.103755.0290/57.0326/69.0339/87.0017/94.0391/109.0488/135.1923C10H13N5O4Adenosine[M + H]+Alcohol
122.746144.065555.0186 70.0662 104.0496C6H9NO3N-(2-Hydroxyethyl) succinimide[M + H]+Water/alcohol
133.632100.075755.0553 72.9383C5H9NOValerolactam[M + H]+Water
143.943166.086051.0233/65.0389/77.0392/91.0548/103.0547/120.0802C9H11NO2L-Phenylalanine[M + H]+Water/alcohol
154.018138.091253.0378/137.0781C8H11NO2-Ethyl-3-hydroxy-6-methylpyridine[M + H]+Water
165.256127.050254.0333/84.0438/110.0222/127.0502C5H6N2O2Thymine[M + H]+Water
176.793171.112855.0551 67.0550 90.9066 113.0350C8H14N2O2Cyclo(gly-L-leu)[M + H]+Water
187.071205.0973142.0650/130.0651/117.0572/103.0550/91.0543/77.0385C11H12N2O2Tryptophan[M + H]+Water/alcohol
197.337146.0600117.0566/90.0499/78.0420C9H7NO2-Quinolinol[M + H]+Water/alcohol
207.343146.060051.0238/65.0385/77.0386/91.0543/101.0358/118.0641/132.0776/146.0577C9H7NOIndole-3-aldehyde[M + H]+Water/alcohol
217.812162.055065.0386/89.0381/116.0488/149.1066C9H7NO2Indole-β-carboxylic acid[M + H]+Water/alcohol
227.812190.0499190.0489/144.0449C10H7NO3Transtorine[M + H]+Water/alcohol
237.815162.055451.0226/64.0187/77.0387/89.0389/116.0503/117.0519/144.0446C9H7NO2Indole-3-carboxylic acid[M + H]+Water/alcohol
247.990237.134655.0549/79.0537/106.0657/118.0653/134.0981/144.0817/163.0911/178.1226/206.1183C11H16N4O2Asponguanine B[M + H]+Water
258.364264.145569.0691/97.0648/134.0976/150.0780/169.0351/202.0975/247.1415C12H17N5O2Aspongadenine A[M + H]+Water
268.492331.128862.9825/84.0810/104.9935/153.0236/177.0507/199.0298/233.1527/268.1346C17H18N2O5Aspongamides C[M + H]+Water
278.513236.150655.0536/67.0274/91.0534/123.0911/147.1026/164.1044/186.1149C11H17N5OAspongadenine B[M + H]+Water/alcohol
288.999136.061655.0288/65.0137/67.0297/77.0410/92.0243/109.0478C5H5N5Adenine[M + H]+Water
299.291385.1393355.2017C20H20N2O6Aspongopusamide B[M + H]+Alcohol
3010.571251.113970.0659/86.0962/114.9779/137.0452/153.0388/180.1031/206.0983/233.1251C11H14N4O3Asponguanine A[M + H]+Water/alcohol
3110.607137.045755.0292/65.0378/67.0550/82.0409/83.0233/92.0236/94.0395/109.0502/119.0345120.0383/C5H4N4OHypoxanthine[M + H]+Water
3210.704172.096855.0179/84.9577/142.0607C8H13NO3Asponglactam A[M + H]+Water/alcohol
3311.330226.107457.0331/72.9373/105.0670/122.0279/156.0803/168.0794/184.1138/210.0652C11H15NO4N-[2-(3,4-Dihydroxyphenyl)-2-methoxyethyl]-acetamide[M + H]+Water
3411.755382.172157.0332/96.0800/135.0286/152.0554/189.0741/306.1020/232.1181C16H23N5O6Asponguanosines A[M + H]+Water/alcohol
3511.787382.1721/C16H23N5O6Asponguanosines B[M + H]+Water/alcohol
3612.291192.065553.0385/65.0378/77.0385/94.0659/104.0490/123.0428/146.0927/158.0979/176.1336C10H9NO3Aspongopusin[M + H]+Alcohol
3612.303192.0655同上C10H9NO3Aspongopusin[M + H]+Water
3713.176387.1542150.0561/193.0746/328.1190/269.0816/123.0435/137.0586/328.1190C20H22N2O6trans-2-(3′4′-Dihydroxyphenyl)-3-acetylamino-6-(N-acetyl-2′′-aminoethyl)-1,4-benzodioxane[M + H]+Water/alcohol
3813.201194.0788117.0380/140.9607/178.0494/194.1551C8H13NO3(±)-Asponglactam A[M + Na]+Alcohol
3913.985385.1393150.0561/193.0746/328.1190/269.0816/123.0435/137.0586/328.1190C20H20N2O6trans-2-(3′4′-Dihydroxyphenyl)-3-acetylamino-7-(N-acetyl-2-amino-ethylene)-1,4-benzodioxane[M + H]+Water/alcohol
4014.442/ 14.398189.1114190.15C9H16O4Aspongester A[M + H]+Water/alcohol
4114.851385.1393150.2973C20H20N2O6Aspongopusamide A[M + H]+Alcohol
4215.021387.1542150.0561/193.0746/328.1190/269.0816/123.0435/137.0586/328.1190C20H22N2O6trans-2-(3′4′-Dihydroxyphenyl)-3-acetylamino-7-(N-acetyl-2′′-aminoethyl)-1,4-benzodioxane[M + H]+Water/alcohol
4315.644385.1393150.0558/122.0605/94.0652C20H20N2O6trans-2-(3′4′-Dihydroxyphenyl)-3-acetylamino-6-(N-acetyl-2′′-aminoethylene)-1,4-benzodioxane[M + H]+Alcohol
4417.547578.213560.0438/89.0369/123.0444/150.0552/164.0689/181.0612/193.0720/206.0814/239.0705/269.0791/286.1037/312.0851/328.1181/345.1466/366.7094/401.1060/417.1175/443.1217/459.1297/521.1740C30H31N3O9(±)-Aspongamide A[M + H]+Alcohol
4521.796111.044155.9330 72.9369C6H6O21,2-Benzenediol[M + H]+Water
4624.680285.17064.0157/213.1140/285.1694/287.6971C15H24O5Aspongnoid C[M + H]+Water
4724.925283.152055.0188/139.0764/171.1024C18H34O2Oleic acid[M + H]+Water/alcohol
4827.107255.2323137.1306/69.0702/55.0546C16H30O2(Z)-Hexadec-11-enoic acidI[M + H]+Alcohol
4827.368255.2323137.1306/69.0702/55.0546C16H30O2(Z)-Hexadec-11-enoic acid II[M + H]+Water
4927.702281.248455.0546/69.0696/95.0858/133.0987/147.1141C18H32O2Linoleic acid[M + H]+Alcohol
4828.454255.2323137.1306/69.0702/55.0546C16H30O2(Z)-Hexadec-11-enoic acid III[M + H]+Alcohol
4829.509255.2323137.9628/69.0690/55.0550C16H30O2(Z)-Hexadec-11-enoic acid IV[M + H]+Alcohol
Characterisation of constituents in aqueous/ethanol extract of Aspongopus chinensis Dallas by UPLC-Q-TOF-MS/MS and MS2 analysis in positive ion mode.

Untargeted identification results

MN was formed by GNPS, which is able to identify analogues or derivatives of known compounds, based on the relationship between the nodes within the same network, and the relationship between the spectra of the nodes. The relationship among network nodes was visualized using Cytoscape 3.8.2. Thus, MN was used to discover complex crude extracts. The disadvantages of low efficiency, the high cost of using traditional techniques involved in the discovery of traditional compounds, and the repeated separation of known compounds could effectively be avoided. In this study, A total of 13 clusters of A. chinensis aqueous and ethanol extracts were analyzed and shown in Figure 3. 75 compounds were obtained by MN analysis, and among them, 74 were reported for the first time. The compound names, retention times, formula, precursor ions, and fragment ions are shown in Table 2.
Figure 3.

Thirteen analysable clusters obtained through molecular networks. Green represents fatty acid; orange represents nucleoside; red represents amino acid; yellow represents peptide; purple represent other type. The circles represent the annotated point; the squares represent unannotated point.

Table 2.

The constituents in aqueous/ethanol extract of Aspongopus chinensis Dallas annotated by UPLC-Q-TOF-MS/MS and molecular network.

NumberIdentificationFormula[M + H]+MS2IdentificationTypeCluster
50Palmitamide*C16H33NO256.263557.0703/71.0500/83.0879/88.0749/ 102.0935Water/alcoholFatty acidsMN1
51Tetradecyldiethanolamine*C18H39NO2302.305457.0700/70.0651/71.0855/88.0755/102.0906/106.0856/284.2933Water/alcoholFatty acidsMN1
52N-Lauryldiethanolamine*C16H35NO2274.274157.0710/70.0659/88.0760/106.0869/274.2710Water/alcoholFatty acidsMN1
532-[2-Hydroxyethyl(2-octoxyethyl)amino]ethanolC14H31NO3262.237757.0699/70.0638/88.0753/102.0903/176.1427/200.2001Water/alcoholFatty acidsMN1
54N,N-bis(2-Hydroxyethyl)tridecylamineC17H37NO2288.289757.0703/70.0654/88.0753/102.0916/106.0870/270.2827Water/alcoholFatty acidsMN1
552,2′-(Decylimino)bisethanolC14H31NO2246.242857.0699/70.0657/88.0756/106.0853/228.2307Water/alcoholFatty acidsMN1
56StearyldiethanolamineC22H47NO2358.368057.0708/70.0664/88.0766/106.0870/128.0656/340.3619Water/alcoholFatty acidsMN1
57N-Hexadecyl diethanolamineC20H43NO2330.336757.0705/88.0759/106.0868/312.3272Water/alcoholFatty acidsMN1
582,2′-(Octylazanediyl)diethanolC12H27NO2218.211557.0703/70.0652/88.0755/106.0861/200.2008Water/alcoholFatty acidsMN1
59Ethanol,2,2′-(heptadecylimino)bis-C21H45NO2344.352359.0318/70.0648/88.0750/102.0874/160.9885/267.2069/268.2992/287.1601/312.3217Water/alcoholFatty acidsMN1
60n-OleyldiethanolamineC22H45NO2356.352355.0546/69.0698/70.0652/83.0854/88.0759/106.0861/338.3512Water/alcoholFatty acidsMN1
6116-[bis(2-Hydroxyethyl)amino]hexadecane-1,15-diolC20H43NO4362.326557.0698/70.0650/88.0751/132.1014/256.2611/300.2884Water/alcoholFatty acidsMN1
621-[bis(2-Hydroxyethyl)amino]octadecan-2-olC22H47NO3374.362957.0698/70.0653/88.0758/102.0902/312.3245Water/alcoholFatty acidsMN1
631-[bis(2-Hydroxyethyl)amino]icosan-2-olC24H51NO3402.394257.0703/70.0657/88.0762/102.0918/340.3586Water/alcoholFatty acidsMN1
644-[bis(2-Hydroxyethyl)amino]-1-octoxybutan-2-olC16H35NO4306.263957.0706/70.0661/86.0970/132.1026/146.1172/244.2266Water/alcoholFatty acidsMN1
651-[bis(2-Hydroxyethoxy)amino]octadecan-2-olC22H47NO5406.352757.0697/70.0644/88.0750/300.2846/344.3121Water/alcoholFatty acidsMN1
661-[2-[bis(2-Hydroxyethyl)amino]ethoxy] hexadecan-2-olC22H47NO4390.357857.0701/70.0659/88.0761/132.1031/146.1177/276.1986/328.3216Water/alcoholFatty acidsMN1
671-[bis(2-Hydroxyethyl)amino]hexadecan-2-olC20H43NO3346.331657.0700/70.0655/88.0755/102.0907/284.2947Water/alcoholFatty acidsMN1
689-Octadecenamide*C18H35NO282.279157.0706/69.0709/83.0871/97.1022/111.1188/121.1017/135.1174/149.1365/240/247.2439/282.2825WaterFatty acidsMN2
69Linoleamide(9,12-octadecadienamide)C18H33NO280.263555.0540/57.0696/69.0697/83.0850/97.1005/135.``52/149.1296/247.2398WaterFatty acidsMN2
70Linolenic acidC18H30O2279.231957.0701/69.0698/83.0853/97.1006/149.1310/163.1474/247.2412WaterFatty acidsMN2
71Myristoyl Ethanolamide*C16H33NO2272.258457.0698/62.0598/67.0542/71.0852/95.0845AlcoholFatty acidsMN3
72Linoleoyl Ethanolamide*C20H37NO2324.289762.0608/67.0552/83.0474/95.0867AlcoholFatty acidsMN3
73Palmitoyl Ethanolamide*C18H37NO2300.289757.0700/62.0602/71.0857/283.2630AlcoholFatty acidsMN3
74Palmitoleoyl EthanolamideC18H35NO2298.274155.0550/62.0605/81.0701/281.2434AlcoholFatty acidsMN3
70Linolenic acid*C18H30O2279.231955.0545/67.0545/81.0699/95.0855/109.1012/123.1175/137.1299/149.0237AlcoholFatty acidsMN4
75Coriolic acidC18H32O3297.242455.0530/67.0546/81.0695/95.057/147.1158/279.2315AlcoholFatty acidsMN4
76Succinoadenosine*C14H17N5O8384.1150136.0606/162.0765/188.0560/192.0500/206.0669/234.0618/252.0725WaterNucleosidesMN5
77N-(2-Hydroxyethyl)adenosineC12H17N5O5312.1302136.0750/162.0753/180.0875/255.2666WaterNucleosidesMN5
786-[[9-[(2R,3R,4S,5R)-3,4-Dihydroxy-5-(hydroxymethyl)oxolan-2-yl]purin-6-yl]amino]hexanoic acidC16H23N5O6382.1721136.0538/152.0557/190.0729/206.1031/250.1279WaterNucleosidesMN5
79Isopentenyl-Adenine-7-glucoside or Isopentenyl-Adenine-9-glucosideC16H23N5O5366.1772136.0610/190.1083/234.1340WaterNucleosidesMN5
80N-Fructosyl phenylalanine*C15H21NO7328.139191.0561/120.0805/132.0805/166.0852/178.0855/264.1218/292.1168/310.1255WaterAmino acidMN6
81N-Fructosyl isoleucine*C12H23NO7294.154786.0966/212.1286/230.1387/258.1340/276.1449WaterAmino acidMN6
82Boc-Glu(OMe)-OmeC12H21NO6276.144286.0958/132.1012/144.1011/161.0663/212.1267/230.1370/258.1313WaterAmino acidMN6
83N-Fructosyl valineC11H21NO7280.139172.0816/84.0811/118.0868/130.0854/138.0523/150.0759/198.1108/216.1244/221.0196/244.1173/262.1264WaterAmino acidMN6
84beta-D-Glucopyranosyl-O-L-tyrosineC15H21NO8344.134097.0277/136.0761/165.0563/182.0821/194.0817/280.1192/308.1148/326.1252WaterAmino acidMN6
85N-(1-Deoxy-D-sorbopyranose-1-yl)-L-glutamic acidC11H19NO9310.113397.0296/130.0508/138.0550/148.0600/160.0597/180.0668/210.0740/226.0724/246.0976274.0929/292.1032WaterAmino acidMN6
86N-(1-Deoxy-D-fructosyl)-L-prolineC11H19NO7278.123472.0799/118.0852/128.0692/216.1216/232.1176/242.1014/244.1187/260.1122WaterAmino acidMN6
87O-MannopyranosylthreonineC10H19NO8282.118357.0327/102.0551/120.0642/132.0654/138.0556/144.1018/186.0755/200.0895/246.0968/264.1082WaterAmino acidMN6
882-[(1-Methylethyl)oxy]-8-(methyloxy)-1H-purin-6-amineC9H13N5O2224.114262.0593/74.0594/86.0596/97.0279/158.0797/188.0921/206.1014WaterAmino acidMN6
89Allyl 2-acetamido-2-deoxy-beta-D-glucopyranosideC11H19NO6262.128555.0163/57.0324/72.0798/84.0795/112.0370/118.0847/130.0854/143.0552/198.1075/216.1204/244.1141WaterAmino acidMN6
90L-Tyrosine*C9H11NO3182.081291.0549/119.0497/123.0447/136.0764/147.0444/165.077WaterAmino acidMN7
91N-(4-Carboxy-2-methylbutanoyl)-L-tyrosineC15H19NO6310.128557.0697/113.0595/139.0385/181.0511/204.1026/240.1330/247.0066/251.0915/264.1225/293.1020WaterAmino acidMN7
92Cyclo(L-Val-L-Pro)*C10H16N2O2197.128555.0543/70.0650/72.0803/98.0593/124.1111/154.0722/169.1306Water/alcoholPeptidesMN8
93Cyclo(L-Phe-L-Pro)*C14H16N2O2245.128555.0532/70.0656/98.0595/120.0802/154.0741Water/alcoholPeptidesMN8
94Cyclo(Pro-Leu)*C11H18N2O2211.144155.0535/70.0658/86.0964/98.0593/138.1273/194.1200Water/alcoholPeptidesMN8
95Cyclo(L-Val-L-Leu)*C11H20N2O2213.159855.0537/72.0806/86.0962/140.1440/168.1375/185.1640Water/alcoholPeptidesMN8
96Cyclo(Val-Val-)C10H18N2O2199.144155.0546/72.0804/84.9592/100.0757/126.1269/171.1481Water/alcoholPeptidesMN8
97Val-valC10H20N2O3217.154755.0552/72.0811/84.9598/114.9827Water/alcoholPeptidesMN8
98PyroGlu-Pro*C10H14N2O4227.102670.0653/84.0447/116.0703/181.0970/209.0887/227.1382WaterPeptidesMN9
99PyroGlu-Val*C10H16N2O4229.118357.0347/72.0814/84.0045/86.0966/116.0709/138.0922/183.1109WaterPeptidesMN9
100LeucylprolineC11H20N2O3229.154770.0656/86.0960/126.0898/140.0817/142.0868WaterPeptidesMN9
101Leu-ValC11H22N2O3231.170355.0553/72.0809/86.0964/126.1272/172.1321/186.1490WaterPeptidesMN9
102Boc-L-leucine/Boc-L-isoleucineC11H21NO4232.154355.0546/69.0707/72.0811/97.0653/126.1283/136.0631/172.1333/186.150WaterPeptidesMN9
103Ile-Gly-Ile*C14H27N3O4302.207486.0968/132.1021/143.1178/171.1125WaterPeptidesMN10
104Gamma-Glu-leuC11H20N2O5261.144586.0949WaterPeptidesMN10
105Leu-Gly-ValC13H25N3O4288.191855.0556/72.0811/86.0969/132.1009/189.1240/229.0979WaterPeptidesMN10
106(E,2R)-2-[(2-Acetyloxyacetyl)amino]-6-ethoxy-6-oxohex-4-enoic acidC12H17NO7288.107872.08170/86.0952/132.1025/143.1185/189.1211/229.0923WaterPeptidesMN10
107Asp-Leu*C10H18N2O5247.128869.07/86.0968/88.0386/132.0627/141.1004/201.1427WaterPeptidesMN11
108L-Asparaginyl-L-IsoleucineC10H19N3O4246.144855.0540/57.0335/69.0698/86.0954/110.0227/132.0981/141.1012/166.0851/201.1230/212.0925/229.1163WaterPeptidesMN11
109N(6)-(2-Carboxyethyl)lysineC9H18N2O4219.133957.0694/60.0444/84.9589/86.0959/132.1008/143.0017/161.0965/176.1043/173.1263WaterPeptidesMN11
110Kynurenic Acid*C10H7NO3190.049989.0387/116.0494/144.0440Water/alcoholOthersMN12
111Indole-3-carboxylic acidC9H7NO2162.055089.0380/116.0488/144.0438Water/alcoholOthersMN12
112Tetrahydroharman-3-carboxylic acid*C13H14N2O2231.112874.0224/130.0646/143.0726/158.0959/168.0796/188.0680/214.0830WaterOthersMN13
1131,2,3,4-Tetrahydro-beta-carboline-3-carboxylic acidC12H12N2O2217.0972144.0807/77.0385/74.0233143.0715WaterOthersMN13
114Undecaethylene glycol*C22H46O12503.306287.0438/89.0595/133.0848/177.1104Water/alcoholFatty acids 
115sn-Glycero-3-phosphocholine*C8H20NO6P258.110160.0811/86.0962/104.1067/184.0730WaterFatty acids 
1162-(3-Carboxypropanoylamino)-3-methylpentanoic acid*C10H17NO5232.117955.0188/69.0710/86.9565/132.0785WaterFatty acids 
1171-(9Z-Octadecenoyl)-sn-glycero-3-phosphocholine*C26H52NO7P522.355460.0811/86.0957/104.1066/184.0731/504.34AlcoholFatty acids 
1181-Hexadecanoyl-sn-glycero-3-phosphocholine*C24H50NO7P496.339860.0811/86.0957/104.1066/184.0731/478.3244AlcoholFatty acids 
1191-Stearoyl-2-hydroxy-sn-glycero-3-phosphoethanolamine*C23H48NO7P482.324157.0692,62.0509,227.2013,155.01, 310.3108,341.3047,464.3073AlcoholFatty acids 
1201-Palmitoyl-2-hydroxy-sn-glycero-3-phosphoethanolamine*C21H44NO7P454.292857.0347/62.0594/71.0861/85.1024/98.9848/155.0098/282.2801/313.2770AlcoholFatty acids 
121N,N-Diethyl-3-methylbenzamide*C12H17NO192.138391.0528/109.0720/119.0720AlcoholFatty acids 
1222-Methyl-4′-(methylthio)-2-morpholinopropiophenone*C15H21NO2S280.136670.0647/88.0389/150.0764AlcoholAlkaloids 
123Riboflavin*C17H20N4O6377.145669.0347/99.0440/172.0876/200.0824/243.0886/377.1502AlcoholAlkaloids 
124Xanthurenic acid*C10H7NO4206.044851.0237/77.0385/104.0490/132.0447/160.0397AlcoholAlkaloids 

*Means the points which are annotated by GNPS in molecular network.

Thirteen analysable clusters obtained through molecular networks. Green represents fatty acid; orange represents nucleoside; red represents amino acid; yellow represents peptide; purple represent other type. The circles represent the annotated point; the squares represent unannotated point. The constituents in aqueous/ethanol extract of Aspongopus chinensis Dallas annotated by UPLC-Q-TOF-MS/MS and molecular network. *Means the points which are annotated by GNPS in molecular network.

Analysis of the unannotated points of fatty acid clusters

The common characteristic of fatty acids is the successive loss of H2O (18 Da) in their MS2 spectra and the cleavage of the carbon chain. MN1 is the representative cluster of fatty acids. Thus, tetradecyl diethanolamine (51) was chosen to be the compound that could help investigate the MS2 fragmentation patterns of fatty acids (Figure 4). The [M + H]+ quasi-molecular ion of m/z 302 of tetradecyl diethanolamine (C18H39NO2) in the positive ion mode can easily be formed. The parent ion could be dehydrated (losing H2O) to produce the fragment ion of m/z 284. Then the carbon chain of m/z 284 ion can be broken to obtain a fragment at m/z 102. The quasi-molecular ion of [M + H]+ could be losing C14H30NO2 to obtain the fragment of m/z 57 or losing the carbon chain to obtain the fragment of m/z 106 directly. Then the latter ion could be dehydrated to obtain a fragment of m/z 88 or further dehydrate for the production of the fragment ions of m/z 70. The cleavage of the carbon chain within the fragment at m/z 302 easily formed the fragment at m/z 106, which was a characteristic fragment ion of this fatty acid cluster. Moreover, the fragment ions at m/z 88, m/z 70 and m/z 57 were obtained from the fragment ion at m/z 106, and they were also characteristic fragment ions. N-lauryl diethanolamine (52) and tetradecyl diethanolamine have the same fragmentation pathway. The carbon chains of both of them were broken, then continuously dehydrated, thereby producing fragment ions of the same mass. Their fragmentation pattern is shown in Figure 4. Other 15 fatty acids were identified by the fragmentation pattern of tetradecyl diethanolamine, and their chemical structures along with that of tetradecyl diethanolamine (51) are shown in Figure 4, with the associated information listed in Table 2.
Figure 4.

Analysis of fatty acid. Green circles are annotated points; grey squares are inferred unannotated point.

Analysis of fatty acid. Green circles are annotated points; grey squares are inferred unannotated point. The m/z 106/88/70/55 fragment ion peaks appeared in the compounds 53∼60. The [M + H]+ quasi-molecular ion peaks differed by an integer multiple of 14 Da, suggesting that they were consistent with the tetradecyl diethanolamine skeleton. The deduction of the unannotated point N-hexadecyl diethanolamine (57, [M + H]+ 330.3367) was utilized as an example. First, compound 57 was directly connected to tetradecyl diethanolamine in the cluster, and the [M + H]+ was 28 Da more than the quasi-molecular ion peak of compound 51. The basic skeleton was consistent with 51. The results of the formula predictor (ppm ≤ 5) led to two different molecular formulae: C20H43NO2 (ppm = 1.99) and C18H41N4O (ppm = −2.08). The above two molecular formulae were imported into Scifinder in order to search for potential compounds. Then, the type of the compounds was set and the Chemical Abstracts Service Registry Number of the potential compound was obtained. The structure of the potential compound was obtained from PubChem, which led to the discovery of N-hexadecyl diethanolamine. The self-build database, which included mass spectrum, chromatographic data, CNKI, MassBank (http://www.massbank.jp/RecordDisplay?id=PR300821), National Institute of Standards and Technology (NIST, https://www.nist.gov/) and other databases, as well as the literature, were used to search the N-hexadecyl diethanolamine. The comparison of the mass spectrometry data of compound 51, the mass splitting, structural formula and other information finally led to the discovery that the unannotated point 330.336 was the compound N-hexadecyl diethanolamine. The m/z 106/88/70/57 fragment ions were shown on compounds 61∼57. There was one more dehydration (H2O) fragment ion peak in their structures compared to the structure of tetradecyl diethanolamine, indicating that there were more hydroxyl (OH) molecules than in compounds 53∼60, as determined by Scifinder. Their structure can be determined by combining their quasi-molecular ion peak of [M + H]+ and other information. All 13 compounds were identified from the A. chinensis extracts for the first time (Table 2). The mass splitting of the annotated points in the three MNs (MN2, MN3, and MN4) had continuous methylene fragment ions, which was in line with the typical fatty acid fragmentation pattern. Through the structural formula and fracture pattern of these annotated points, the unannotated points can be deduced (Figure 4). These four compounds were identified from the A. chinensis extracts for the first time.

Analysis of the unannotated points of nucleoside clusters

Adenosine derivatives easily lose their glycosyl group (C5H9O4) to produce the characteristic ion [M‒C5H9O4+H]+. Succinoadenosine (76, C14H17N5O8) is a succinic acid derivative of adenosine. It was selected as the reference compound to study the MS2 fragmentation pathway of the nucleoside MN5. The [M + H]+ quasi-molecular ion of 76 was m/z 384 in the positive ion mode. The glycosyl group (C5H9O4) could be easily released to form the characteristic fragment ions [M‒C5H9O4+H]+ of m/z 252. Then, the fragment ions passed through a series of cracking, thereby forming the intermediate ions of m/z 188 and m/z 162. Finally, the succinic acid group (C4H6O4) was fully removed in order to form the fragment ion of m/z 136. The ion of [M‒C5H9O4+H]+ and [M‒C5H9O4‒C4H6O4+H]+ were two characteristic fragment ions of this cluster. Three more nucleoside compounds were annotated in MN5 (Figure 5) by the screening of the online databases, including GNPS, MassBank, METLIN, and ChemSpider, with related information listed in Table 2.
Figure 5.

Analysis of nucleosides. Orange circles are annotated points of nucleosides; grey squares are inferred unannotated points.

Analysis of nucleosides. Orange circles are annotated points of nucleosides; grey squares are inferred unannotated points.

Analysis of the unannotated points of amino acid clusters

The amino acid cluster was divided into MN6 and MN7. N-fructosyl phenylalanine (80) and N-fructosyl isoleucine (81) in MN6, and L-tyrosine (90) in MN7 were selected for analysis of the fragmentation pathway of amino acids (Figure 6). MN6 is an amino acid with glycosyl substitution. The neutral loss of H2O (18 Da) and the loss of glycosyl (C5H10O5) were the features of MN6. The [M + H]+ quasi-molecular ion m/z 328 of N-fructosyl phenylalanine could be formed easily in the positive ion mode. The quasi-molecular ion was easily and continuously dehydrated to produce [M‒H2O + H]+ of m/z 310 and [M‒2H2O + H]+ of m/z 292. Additionally, the glycosyl group (C5H10O5) was removed to form the characteristic fragment ion of m/z 178 or the C6H11O5 was removed to generate the m/z 166 ion. The quasi-molecular ion could also generate the [M + H‒2H2O‒CO2]+ fragment ion of m/z 264 firstly. Then, the dissociation of the C5H8O4 formed the ion of m/z 132. Likewise, the cleavage of the formic acid group, the continuous dehydration, and the loss of the glycosyl group from the N-fructosyl isoleucine was also observed. The structures of the other eight compounds in MN6 could be deduced on the basis of the pathway of these two compounds. N-(4-carboxy-2-methylbutanoyl)-l-tyrosine is a compound that is characterized by the replacement of one hydrogen on the amino group of tyrosine with 4-carboxy-2-methylbutanoyl. The substituent was easily removed to produce an ion at m/z 204 in the positive ion mode, and [M + H]+ quasi-molecular ion m/z 310. This was conformed by the fragmentation pattern of tyrosine. Thus, the unannotated point at m/z 310.128 was recognized as N-(4-carboxy-2-methylbutanoyl)-l-tyrosine. The chemical structures of the compounds in MN6 and MN7 are shown in Figure 6, and the relevant information is listed in Table 2.
Figure 6.

Analysis of amino acids. The red circle is the annotated point of nucleosides; the grey square is inferred unannotated point.

Analysis of amino acids. The red circle is the annotated point of nucleosides; the grey square is inferred unannotated point.

Analysis of the unannotated points of peptide clusters

Peptides were included in four clusters (MN8, MN9, MN10, and MN11). Cyclo (l-Phe-l-Pro) (43) in MN8 and PyroGlu-Val (99) in MN9 were selected as the reference compounds to investigate the MS2 fragmentation patterns of the peptide (Figure 7). The points in MN9 that exhibited the highest frequency of ions at m/z 55/57, m/z 70/72, m/z 84/86 were the elementary characteristic ions (Figure 7). The cyclic dipeptides could be easily open from the [M + H]+ quasi-molecular ion m/z 245 of the cyclo (l-Phe-l-Pro) in the positive ion mode to form the fragment of m/z 120 and m/z 98, which is one of the characteristic fragment ions of this MN. In addition, the carbonyl (CO) compound could dissociate from m/z 98 to generate the characteristic fragment ions of m/z 70. Finally, the opening of the tetrahydropyrrole could lead to the production of the characteristic fragment ions of m/z 55. Similar fragmentation pathways were observed on MN9, MN10 and MN11, including mainly deamination, decarboxylation, decarboxylation and peptide chain scission (i.e., PyroGlu-Val). Their chemical structure is shown in Figure 7, and the relevant information is listed in Table 2.
Figure 7.

Analysis of peptides. The yellow circle is the annotated point of peptides; the grey square is inferred unannotated point.

Analysis of peptides. The yellow circle is the annotated point of peptides; the grey square is inferred unannotated point.

Analysis of the unannotated points of other clusters

Other clusters were included in MN12 and MN13. Kynurenic acid (110) in MN12 and tetrahydroharman-3-carboxylic acid (112) in MN13 were selected as reference compounds to investigate the MS2 fragmentation patterns. In MN12, the carboxyl group (COOH) could be easily lost from the [M + H]+ quasi-molecular ion m/z 190 of kynurenic acid to form the fragment at m/z 144 in the positive mode. Then, the carbonyl group (CO) dissociated from the m/z 144 to form a fragment ion at m/z 116. Finally, the amino group was removed from the m/z 116 to generate the fragment ion of m/z 89. The fragment ions, m/z 144, m/z 116, and m/z 98, appeared in two other compounds belonging to MN12, which suggests that their skeletons are similar. Two this two unannotated components in the cluster were identified. Their chemical structure and that of kynurenic acid are shown in Figure 8, and the relevant information is listed in Table 2.
Figure 8.

Analysis of other categories. Purple circles are annotated points; grey squares are inferred unannotated points.

Analysis of other categories. Purple circles are annotated points; grey squares are inferred unannotated points. MN is an online method that is continuously updated and allows for the visualization and targeting of natural products, thereby enabling biological research and biotechnology applications in a wide range of fields. The generation of MN is based on the analysis of mass spectrometry of crude extracts. The MS/MS spectra of compounds are compared in pairs to find similarities in fragmentation pathways, that is, the same fragment ion or similar neutral loss. The nodes in MN represent spectra or compounds, and the edges between nodes represent the similarity between the two spectra. Compounds with similar fragmentation patterns are grouped into the same cluster. Additionally, those with different MS/MS spectra are not relevant (Wang et al. 2019). In other words, the two nodes connected by a line in MN represent two compounds that have similar structures and fracture patterns. The identification of certain compounds by direct use of standard compounds to generate MS/MS spectra, or the use of public spectral libraries (i.e., GNPS or referencing known documents) is sufficient to determine the molecular family of related compounds, thereby facilitating the identification of unknown compounds (Li et al. 2020). The MN of A. chinensis was generated using GNPS. The aqueous extract produced a total of 76 clusters, of which 11 were analyzable. Moreover, a total of 47 compounds matched with compounds within the database. The ethanol extract of A. chinensis produced 54 clusters, and seven of them were analyzable. A total of 34 compounds matched with compounds in the database. Since the five clusters obtained from the aqueous and ethanol extracts were exactly the same, 13 clusters of A. chinensis crude extract were visualized by Cytoscape 3.8.2 and shown in Figure 3. The annotated point was indicated by a circle, while the deduced unannotated point was indicated by a square. Each node represented a compound, and inside the node, the molecular mass of the parent ion, such as the molecular ion, fragment ion, or adduct mass of the compound, was indicated, linked by a line indicating the similarity among the compounds. Moreover, fatty acids were represented by a green node; nucleosides were represented by an orange node; amino acids were represented by a red node. Peptides and other clusters were represented by yellow and purple points, respectively. A total of 34 fatty acids, 6 nucleosides, 18 peptides, 12 amino acids and 4 other compounds were initially identified, among which 74 compounds were reported for the first time. The relevant information is listed in Table 2. Previous studies on A. chinensis have mainly focussed on nucleosides and their analogues, as well as the dopamine derivatives contained in its extracts. In this study, 124 compounds were found in the extracts of A. chinensis, and they were subdivided into five categories, which represented an informative supplement to fill the gap in previous research. Among the ingredients reported for the first time in this study, fatty acids, peptides, and amino acids were the majority. Nucleosides and amino acids were detected only in the aqueous extract. The aqueous extract contained a higher number of peptides as compared to the amount in the ethanol extract. The ethanol extract contained a higher content of fatty acids as compared to the amount in the aqueous extract. The difference in composition was related to the polarity of the compound itself. Nucleosides, amino acids and peptides mostly had polar carboxyl groups and glycosyl groups. Thus, they were mostly present in the aqueous extract, while the polarity of fatty acids was lower. Therefore, fatty acids were found in the ethanol extract. Previous studies have revealed that the haemolymph of A. chinensis has a high protein content and physiologically active peptides, which effectively inhibit the growth of gastric cancer and human breast cancer. In addition, a small peptide with antibacterial properties was purified from its haemolymph. The protective effect of A. chinensis insect extract on nephropathy is related to its rich content in dopamine, indole, and piperidine, while its antiulcer effect resides in the fatty oil extract (Hou et al. 2012). However, the biologically active substances in A. chinensis are largely unknown. Therefore, in this study, the compounds in the extracts of A. chinensis were subdivided into five categories. Our findings revealed a further development in the utilization of A. chinensis.

Validation of the molecular network result

N-(2-Hydroxyethyl) adenosine (28, adenosine), cyclo (Val-Val-) (47, peptide chain), kynurenic acid (61, other categories: quinoline acid) and 1,2,3,4-tetrahydro-β-carboline-3-carboxylic acid (64, other types: carboxylic acid) were the unannotated points deduced from the molecular network. Hence, these compounds were selected to verify whether the molecular network speculation was correct. These four compounds were used as reference substances and were purchased on the China Reagent website. They were dissolved in methanol to prepare a mixed control solution of 1.0 mg/mL. Then, the primary/secondary mass spectrum data were obtained according to the method described in the section ‘UPLC-QTOF MS analysis’. The results in Table 3 and Figure 9 demonstrate that the retention time and ion fragments of the reference substances matched with the predicted compound, which confirms the accuracy of our method, which used MN analogues to cluster compounds with similar fragmentation pathways.
Table 3.

The mass spectrum/chromatographic information of the four compounds in the standard reference substance and Aspongopus chinensis Dallas extract.

IdentificationFormula[M + H]+MS2tR (min)Source 
N-(2-Hydroxyethyl)adenosineC12H17N5O5312.1302180.0875/136.0750/162.0753/255.26665.850WaterInference
N-(2-Hydroxyethyl)adenosineC12H17N5O5312.1302180.0882/136.0623/162.07765.853WaterReference
Cyclo(Val-Val-)C10H18N2O2199.144155.0546/72.0804/84.9592/100.0757/ 126.1269/171.148110.686Water/alcoholInference
Cyclo(Val-Val-)C10H18N2O2199.144155.0538/72.0809/84.9597/100.0768/ 126.1274/171.149010.702Water/alcoholReference
Kynurenic AcidC10H7NO3190.049989.0387/116.0494/144.04407.815Water/alcoholInference
Kynurenic AcidC10H7NO3190.049989.0393/116.0500/144.04497.789Water/alcoholReference
1,2,3,4-Tetrahydro-beta-carboline-3-carboxylic acidC12H12N2O2217.0972144.0807/77.0385/74.0233/143.07158.567WaterInference
1,2,3,4-Tetrahydro-beta-carboline-3-carboxylic acidC12H12N2O2217.0972144.0813/77.0390/74.0252/143.07268.533WaterReference
Figure 9.

The extracted ion chromatogram (EIC) and MS2 spectrum of the four compounds in the standard reference substance and Aspongopus chinensis extract.

The extracted ion chromatogram (EIC) and MS2 spectrum of the four compounds in the standard reference substance and Aspongopus chinensis extract. The mass spectrum/chromatographic information of the four compounds in the standard reference substance and Aspongopus chinensis Dallas extract.

Conclusions

UPLC-QTOF-MS performed in full MS/dd MS2 mode represented a rapid and reliable method to determine the composition of A. chinensis aqueous and ethanol extracts. Furthermore, a practical, integrated, targeted and untargeted strategy of data processing was used, in combination with an in-built database with MN, enabling, for the first time, the rapid profiling of non-target constituents of TCMs from insects. The efficiency in the identification of unknown compounds was greatly improved by multiple database matching and fragmentation rules, which avoids the high cost and inefficiency of traditional techniques that are used to discover compounds in traditional Chinese medicine, being helpful for the discovery of new compounds. A method using the retention time of the substance used as standard reference and the mass spectrometry fragments to verify the accuracy of the unannotated compound was proposed, which avoids the complexity of the identification of first-report compounds. This study reported a total of 124 compounds in A. chinensis extracts, which provides a reference for the future development and use of A. chinensis. In short, this might be a promising study providing a convenient and powerful processing method of the data for the rapid profiling of non-target chemical constituents of insects used in TCMs. Due to the limitations of MN database, compounds in some clusters cannot be identified because there are no annotated compounds in these clusters. These identified compounds in this study need to be further isolated for activity screening in vitro or in vivo.
  19 in total

1.  Discovery of the bioactive peptides secreted by Bifidobacterium using integrated MCX coupled with LC-MS and feature-based molecular networking.

Authors:  Shengshuang Chen; Guoxin Huang; Weilin Liao; Shilin Gong; Jianbo Xiao; Jiao Bai; W L Wendy Hsiao; Na Li; Jian-Lin Wu
Journal:  Food Chem       Date:  2021-01-07       Impact factor: 7.514

2.  Systematic investigation of the pharmacological mechanism for renal protection by the leaves of Eucommia ulmoides Oliver using UPLC-Q-TOF/MS combined with network pharmacology analysis.

Authors:  Qi Huang; Fengyu Zhang; Shao Liu; Yueping Jiang; Dongsheng Ouyang
Journal:  Biomed Pharmacother       Date:  2021-05-19       Impact factor: 6.529

3.  Gamma linolenic acid suppresses hypoxia-induced proliferation and invasion of non-small cell lung cancer cells by inhibition of HIF1α.

Authors:  Yan Wang; Jian Shi; Liya Gong
Journal:  Genes Genomics       Date:  2020-07-04       Impact factor: 1.839

4.  Identification of flavonoid-3-O-glycosides from leaves of Casearia arborea (Salicaceae) by UHPLC-DAD-ESI-HRMS/MS combined with molecular networking and NMR.

Authors:  Augusto L Santos; Marisi G Soares; Lívia S de Medeiros; Marcelo J P Ferreira; Patricia Sartorelli
Journal:  Phytochem Anal       Date:  2021-02-08       Impact factor: 3.373

5.  Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking.

Authors:  Mingxun Wang; Jeremy J Carver; Vanessa V Phelan; Laura M Sanchez; Neha Garg; Yao Peng; Don Duy Nguyen; Jeramie Watrous; Clifford A Kapono; Tal Luzzatto-Knaan; Carla Porto; Amina Bouslimani; Alexey V Melnik; Michael J Meehan; Wei-Ting Liu; Max Crüsemann; Paul D Boudreau; Eduardo Esquenazi; Mario Sandoval-Calderón; Roland D Kersten; Laura A Pace; Robert A Quinn; Katherine R Duncan; Cheng-Chih Hsu; Dimitrios J Floros; Ronnie G Gavilan; Karin Kleigrewe; Trent Northen; Rachel J Dutton; Delphine Parrot; Erin E Carlson; Bertrand Aigle; Charlotte F Michelsen; Lars Jelsbak; Christian Sohlenkamp; Pavel Pevzner; Anna Edlund; Jeffrey McLean; Jörn Piel; Brian T Murphy; Lena Gerwick; Chih-Chuang Liaw; Yu-Liang Yang; Hans-Ulrich Humpf; Maria Maansson; Robert A Keyzers; Amy C Sims; Andrew R Johnson; Ashley M Sidebottom; Brian E Sedio; Andreas Klitgaard; Charles B Larson; Cristopher A Boya P; Daniel Torres-Mendoza; David J Gonzalez; Denise B Silva; Lucas M Marques; Daniel P Demarque; Egle Pociute; Ellis C O'Neill; Enora Briand; Eric J N Helfrich; Eve A Granatosky; Evgenia Glukhov; Florian Ryffel; Hailey Houson; Hosein Mohimani; Jenan J Kharbush; Yi Zeng; Julia A Vorholt; Kenji L Kurita; Pep Charusanti; Kerry L McPhail; Kristian Fog Nielsen; Lisa Vuong; Maryam Elfeki; Matthew F Traxler; Niclas Engene; Nobuhiro Koyama; Oliver B Vining; Ralph Baric; Ricardo R Silva; Samantha J Mascuch; Sophie Tomasi; Stefan Jenkins; Venkat Macherla; Thomas Hoffman; Vinayak Agarwal; Philip G Williams; Jingqui Dai; Ram Neupane; Joshua Gurr; Andrés M C Rodríguez; Anne Lamsa; Chen Zhang; Kathleen Dorrestein; Brendan M Duggan; Jehad Almaliti; Pierre-Marie Allard; Prasad Phapale; Louis-Felix Nothias; Theodore Alexandrov; Marc Litaudon; Jean-Luc Wolfender; Jennifer E Kyle; Thomas O Metz; Tyler Peryea; Dac-Trung Nguyen; Danielle VanLeer; Paul Shinn; Ajit Jadhav; Rolf Müller; Katrina M Waters; Wenyuan Shi; Xueting Liu; Lixin Zhang; Rob Knight; Paul R Jensen; Bernhard O Palsson; Kit Pogliano; Roger G Linington; Marcelino Gutiérrez; Norberto P Lopes; William H Gerwick; Bradley S Moore; Pieter C Dorrestein; Nuno Bandeira
Journal:  Nat Biotechnol       Date:  2016-08-09       Impact factor: 54.908

6.  Combination of molecular network and centrifugal partition chromatography fractionation for targeting and identifying Artemisia annua L. antioxidant compounds.

Authors:  Souhila Messaili; Cyril Colas; Laëtitia Fougère; Emilie Destandau
Journal:  J Chromatogr A       Date:  2019-12-12       Impact factor: 4.759

Review 7.  [Advances in studies on chemical constituents, pharmacological effects and clinical application of Aspongopus chinensis].

Authors:  Sha Li; Lei Li; Hong-Bing Peng; Xiao-Jing Ma; Lu-Qi Huang; Juan Li
Journal:  Zhongguo Zhong Yao Za Zhi       Date:  2020-01

8.  A Molecular Networking Based Discovery of Diketopiperazine Heterodimers and Aspergillicins from Aspergillus caelatus.

Authors:  Xinhui Wang; Rachel Serrano; Victor González-Menéndez; Thomas A Mackenzie; Maria C Ramos; Jens C Frisvad; Thomas O Larsen
Journal:  J Nat Prod       Date:  2022-01-19       Impact factor: 4.050

9.  Straightforward N-Acyl Homoserine Lactone Discovery and Annotation by LC-MS/MS-based Molecular Networking.

Authors:  Alice M S Rodrigues; Raphaël Lami; Karine Escoubeyrou; Laurent Intertaglia; Clément Mazurek; Margot Doberva; Pedro Pérez-Ferrer; Didier Stien
Journal:  J Proteome Res       Date:  2022-02-01       Impact factor: 4.466

10.  Antiproliferative and Proapoptotic Effects of a Protein Component Purified from Aspongopus chinensis Dallas on Cancer Cells In Vitro and In Vivo.

Authors:  Jun Tan; Ying Tian; Renlian Cai; Tianci Yi; Daochao Jin; Jianjun Guo
Journal:  Evid Based Complement Alternat Med       Date:  2019-01-03       Impact factor: 2.629

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.