Literature DB >> 35529375

A proposed protocol based on integrative metabonomics analysis for the rapid detection and mechanistic understanding of sulfur fumigation of Chinese herbal medicines.

Dai Shengyun1,2, Wang Yuqi1, Wang Fei1,3, Mei Xiaodan1, Zhang Jiayu4,5.   

Abstract

In the current work, Lonicera japonica Flos (FLJ) was selected as a model Chinese herbal medicine (CHM) and a protocol was proposed for the rapid detection of sulfur-fumigated (SF) CHMs. A multiple metabonomics analysis was conducted using HPLC, NIR spectroscopy and a UHPLC-LTQ-Orbitrap mass spectrometer. First, the group discriminatory potential of each technique was respectively investigated based on PCA. Then, the effect of mid-level metabonomics data fusion on sample spatial distribution was evaluated based on data obtained using the above three technologies. Furthermore, based on the acquired HRMS data, 76 markers discriminating SF from non-sulfur-fumigated (NSF) CHMs were observed and 49 of them were eventually characterized. Moreover, NIR absorptions of 18 sulfur-containing markers were identified to be in close correlation with the discriminatory NIR wavebands. In conclusion, the proposed protocol based on integrative metabonomics analysis that we established for the rapid detection and mechanistic explanation of the sulfur fumigation of CHMs was able to achieve variable selection, enhance group separation and reveal the intrinsic mechanism of the sulfur fumigation of CHMs. This journal is © The Royal Society of Chemistry.

Entities:  

Year:  2019        PMID: 35529375      PMCID: PMC9072333          DOI: 10.1039/c9ra05032a

Source DB:  PubMed          Journal:  RSC Adv        ISSN: 2046-2069            Impact factor:   4.036


Introduction

Sulfur fumigation is a traditional storage method for Chinese herbal medicines (CHMs). It was first applied to the processing of Dioscoreae rhizoma, and has been widely misused in various CHMs in the last two decades, such as in the processing of Chrysanthemi Flos, Gastrodia Rhizoma, Radix Paeoniae Alba and Lonicera japonica Flos (FLJ).[1-3] It plays an important role in the production and post-harvest handling of CHMs due to its usage in moisturizing, bleaching, retaining freshness and killing parasites.[4-6] However, sulfur fumigation induces significant chemical transformations in inherent herbal constituents, resulting in alterations in the bioactivities, pharmacokinetics and toxicities of CHMs.[7,8] Besides, it often leads to the production in CHMs of excessive sulfur dioxide (SO2), sulfates, sulfites, heavy metal residues and other detrimental exogenous materials, which exhibit harmful potential toxicities or side effects to human health.[9-11] Although the use of sulfur fumigation has been officially restricted in China since 2005,[12] some illicit herbal farmers and wholesalers still misuse sulfur fumigation during the post-harvest handling and storage of CHMs. Moreover, SO2 residue-based detection standards formulated by many countries and organizations are often ineffective at evaluating the degree of sulfur fumigation because of the high volatility of SO2. Current studies mainly focus on total SO2 residues and neglect the transformations of inherent herbal constituents and corresponding mechanisms.[13-16] Therefore, the development of rapid and sensitive approaches based on stable quality-markers (Q-markers), such as sulfur-containing derivatives, to discriminate sulfur-fumigated (SF) CHMs from non-sulfur-fumigated (NSF) CHMs is urgently needed.[17] Integrative omics combining and interpreting data from multiple sources have already been adopted to successfully elucidate the mechanisms of human diseases, such as diabetes, obesity and schizophrenia.[18,19] Besides, integrative omics analysis has been used to characterize genes in the context of the molecular pathophysiology of the disease and its interacting genes and pathways.[20,21] Likewise, multi-omics data collected using various detection technologies such as liquid chromatography combined with mass spectrometry (LC-MS), high-performance liquid chromatography (HPLC) and near infrared (NIR) spectroscopy have been used for the screening and identification of Q-markers for the analysis of CHMs.[22,23] Of these technologies, HPLC retains the practicality and principles of LC, while increasing the overall interlaced attributes of sensitivity and resolution, MS has emerged as a powerful tool for quantitative and qualitative analysis of the complex components in CHMs, and NIR spectroscopy is a very rapid and alternative non-destructive method that shows electromagnetic absorption signals in the NIR region associated with specific chemical structures and that can be assigned to specific chemical functional groups and molecular structures. Nevertheless, although each technique has its own powerful capabilities for specific issues, any data set obtained by one single technique cannot capture the complexity of the overall system. Thus, integrative metabonomics analysis based on multiple levels of data fusion and correlation combines the information provided by various analytical technologies so as to achieve much better statistical predictions and interpretations than those obtained from any individual technique. FLJ, also known as Jin Yin Hua in Chinese, is one of the most well-known CHMs. It is derived from the dried buds or flowers of Lonicera japonica Thunb. and contains various biological ingredients such as organic acids, flavonoids and iridoid glycosides.[24-27] Pharmacological investigations indicated that FLJ displays various pharmacological activities, such as hepatoprotective, cytoprotective, anti-microbial, anti-oxidative, anti-viral and anti-inflammatory activities.[28-30] FLJ is also used in many food products, such as FLJ tea, a well-known health drink that has been highly praised for thousands of years for clearing away heat and toxic materials and treating exogenous pathogenic wind-heat.[31] However, in the last two decades, sulfur fumigation has been frequently misused in post-harvest handling during the drying and storage of FLJ. Therefore, we used FLJ as a study case to present a proposal for a protocol based on integrative metabonomics analysis in order to clarify the inherent chemical transformations of CHMs and to classify the CHMs based on these transformations. SF and non-sulfur-fumigated (NSF) FLJ along with organic acids, flavonoids and iridoid glycosides were used to verify the effectiveness of the established strategy.

Materials and methods

Chemicals, reagents and herbal materials

A total of 22 batches of authenticated NSF FLJ samples were collected from several different areas in China (Table S1†). All of their identities were authenticated using morphological and histological methods to be the dried buds of L. japonica Thunb. according to the monograph of Chinese Pharmacopoeia (version 2015).[32] The authenticated specimens were deposited in the Beijing Research Institute of Chinese Medicine, Beijing University of Chinese Medicine, China. Standard substances including 3-caffeoylquinic acid (3-CQA), 4-CQA, 5-CQA, 3,4-dicaffeoylquinic acid (3,4-DiCQA), 3,5-diCQA, 4,5-diCQA, lonicerin, secologanic acid, swertiamarin and luteolin 7-O-β-glucoside were purchased from Chengdu Bio-purify Phytochemicals Ltd (Sichuan, China) with their purities all greater than 98% (Fig. S1 and Table S2†). HPLC-grade acetonitrile and methanol were provided by Fisher Scientific (Fisher, Fair Lawn, NJ, USA). Formic acid was provided by Aldrich (St. Louis, MO). All of the other chemicals were of analytical grade and obtained commercially from Beijing Chemical Works (Beijing, China). De-ionized water was purified using a Milli-Q Gradient A 10 System (Millipore, Billerica, MA). The 0.22 μm membranes were purchased from Xinjinghua Co. (Shanghai, China).

Sample preparation

SF FLJ herbal samples

Fifteen batches of SF FLJ samples were prepared following the modified procedures that were employed by farmers and illicit wholesalers.[33,34] A total of 200 g of FLJ dried buds were wetted with water and allowed to stand for 0.5 h. Afterwards, a proper amount of sulfur powder was heated until it burned, and then the burning sulfur and wetted FLJ were carefully put into, respectively, the lower and upper layers of a desiccator. The desiccator was then kept closed for 12 h in order to achieve a sufficient sulfur fumigation. Meanwhile, the SF 5-CQA that was utilized for result validation was also prepared with this same method.

Reference solutions

A certain amount of each of 3-CQA, 4-CQA, 5-CQA, 3,4-CQA, 3,5-diCQA, 4,5-diCQA, lonicerin, secologanic acid, swertiamarin and luteolin 7-O-β-glucoside was respectively weighed accurately and then dissolved in methanol to obtain the mixed reference solutions (0.01 mg mL−1). These reference solutions were stored at 4 °C prior to analysis.

FLJ sample solutions

FLJ powders ground and sieved through a 65 mesh sieve were soaked for 30 min. A total of 1.0 g of powder was accurately weighed and then extracted with 25 mL methanol/water (70 : 30, v/v) in an ultrasonic bath (40 kHz, Eima Ultrasonics Corp., Germany) for 30 min at room temperature. Then the same solvent was added to compensate for the lost weight during the extraction process. The methanol solution was subjected to centrifugation (10 000 g) for 10 minutes, and then filtered through a 0.22 μm microporous membrane before being injected into an LC-MS system for analysis. To ensure the quality of the HPLC and LC-MS-based metabonomics data, pooled quality control (QC) samples were prepared by mixing equal amounts of 37 sample solutions. NIR, HPLC-DAD and UHPLC-LTQ-Orbitrap MS data were collected from these samples. The conditions for the three methods are listed in the ESI.†

Primary metabonomics data processing

All of the data were subjected to principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) to interpret the interrelationships between the samples. With respect to PLS-DA, samples were divided into a calibration set for modeling and a validation set for the established model evaluation. The prediction set consisted of samples that were not used for the calibration set. The very high quantities of acquired UHPLC-LTQ-Orbitrap MS raw data were processed with an Xcalibur 2.1 workstation (Thermo Scientific, Germany). The normalization was accomplished using Sieve 2.1 software (Thermo Scientific, USA), which was specifically used to perform background subtraction, component detection and peak alignment. SIMCA-P+ 11.5 (Umetrics, Sweden) and Unscrambler 7.0 (CAMO, Norway) software were utilized to carry out the spectral pre-processing. PCA and PLS were conducted using Matlab version R2009a (The MathWorks, Inc., USA) with Statistical Toolbox and in-house functions. The iToolbox utilized to run synergy interval partial least squares (siPLS) analysis algorithms was downloaded from http://www.models.kvl.dk/ for the NIR wavelength selection.

Integrative metabonomics analysis

Metabonomics data fusion

In order to integrate the information acquired from the different technologies, mid-level metabonomics data fusion was carried out to investigate the three obtained metabonomics data sets based on NIR, HPLC and LC-MS approaches. Much more attention was paid to the comparison between two methods, i.e., metabonomics data fusion with or without variable selection. The first method specifically applied PCA to describe the metabonomics data and determine the data structure without variable selection. To overcome the shortage of differences in scores and eigenvalues, all of them were standardized and eigenvalues were converted into percentage of explained variance. Scores were then multiplied by eigenvalues, and matrices were combined (HPLC, NIR and HRMS). The second method was based on the selection of relevant variables for each analytical method using PLS-DA with variable selection, which was then investigated using PCA. In this way, the number of explanatory variables was markedly reduced, and the selected variables only described the primary differences between all of the samples.

Method validation

To ensure the reliability of the experimental results, the reference standard, 5-CQA, was obtained to perform the SF process. SF chlorogenic acid samples (0.5, 1.0, 2.0, 2.5 and 5.0 mg) were respectively weighed, and each sample was then thoroughly mixed with 5.0 mg of dextrin. Meanwhile, NSF chlorogenic acid samples were prepared in the same way to test the reliability of the results. These ten samples were analyzed using the same NIR and LC-MS methods.

Results and discussion

Basic data sets for the three technologies

Typical NIR, HPLC-DAD and total ion chromatogram (TIC) results of the representative SF and NSF samples are presented in Fig. S2.† After peak alignment and removal of the missing values, 90 features were finally obtained for the HPLC-based metabonomics analysis. Meanwhile, for each of 37 samples, the NIR and MS data sets included 1557 and 5000 variables, respectively.

Primary metabonomics data analysis

Primary data analysis results were obtained respectively from unsupervised PCA and supervised PLS-DA for the three different techniques (Table 1 and Fig. 1).

Primary data analysis results for the three techniquesa

TechniquePCAPLS
Lvs R 2(X)Lvs R 2(Y) Q 2 Q 2-interceptPermuted R2 value p-Value
HPLC-DAD335.7%394%63.5%−0.2620.6241.16 × 10−5
NIR539.0%397.2%55.2%0.5040.5040.037
LC-MS672.0%294.9%82.9%−0.2640.4393.73 × 10−11

Lvs: the number of latent variables.

Fig. 1

The results of primary metabonomics data fusion analysis. (A–C) PCA for HPLC-DAD, NIR and LC-MS; (D–F) PLS-DA for HPLC-DAD, NIR and LC-MS.

Lvs: the number of latent variables. As for PCA, SF and NSF FLJ samples were not explicitly clustered into two groups with regards to HPLC-DAD and NIR analysis (The preprocess method was SG9+2nd, and the results obtained from the other preprocess methods are illustrated in Fig. S3 and Table S3†). While for the LC-MS analysis results, the 37 SF and NSF samples did segregate into two distinct groups. Fig. 1A–C show plots of these scores for these three techniques, respectively. A distinct classification trend could be observed in the LC-MS score plot. However, the results for these samples were nevertheless scattered considerably, with that for sample number 16 attributed to the NSF group being located in the SF group, which indicated that some of the variation in the samples cannot be obtained from the PCA. Therefore, PLS-DA was performed to improve the group separation. The PLS-DA model resulted in a clear separation of the SF and NSF samples for each of the three different technologies (Fig. 1D–F). As for the HPLC and LC-MS analyses, statistical models were considered to be statistically significant when the corresponding Q2-intercept values (−0.262 and −0.264) for the permutation model were negative. Meanwhile, the permuted R2 values (0.624 and 0.439) were lower than the original R2-values (0.903 and 0.927). Additionally, analysis of variance of the cross-validated predictive residuals (CV-ANOVA) tests were performed to confirm that the SF and NSF groups discriminated by PLS-DA were significantly different. The common practice was to interpret a p value (1.16 × 10−5 and 3.73 × 10−11) dramatically lower than 0.05 as contributing a significant model. As for the NIR analysis (for which the preprocess method was SG9+2nd, and the results obtained from the other preprocess methods are shown in Fig. S4†), the Q2-intercept value was lower than 0.5, indicating the poor predictive capability of PLS-DA here. The poor predictive capability was also verified by the finding of a positive Q2-intercept value from the permutation test. Therefore, the use of the NIR and HPLC technologies did not achieve a satisfactory classification based on PCA while the use of MS did so.

Mid-level metabonomics data fusion analysis without variable selection

Data fusion is defined as the integration of data blocks from different analytical platforms into a single model so as to improve the capability of statistical prediction and facilitate interpretation.[35] Low-level, mid-level and high-level data fusion have been three commonly used strategies. As the most commonly used strategy, mid-level data fusion can either combine variables after a relevant selection procedure or concatenate latent variables extracted from different statistical methods. Herein, the mid-level metabonomics fusion of NIR, HPLC and MS data was investigated. PCA was respectively conducted with various combinations of these techniques, specifically NIR-HPLC, HPLC-MS, NIR-MS and NIR-HPLC-MS (Table 2).

The results of the mid-level data fusion analysis for the three techniques

TechniquesPCAPLS-PCA
Lvs R 2(X)Lvs R 2(X)
NIR-HPLC593.5%555.2%
HPLC-MS571.4%679.4%
NIR-MS675.0%566.9%
NIR-HPLC-MS674.4%566.7%
Compared to the primary metabonomics analyses for HPLC and NIR alone, the principal factor total cumulative based on the results of the fusion of NIR and HPLC data was higher, with a value of 93.5%. However, the discrimination was still unsatisfactory. NSF and SF samples were dispersed in the three-dimensional space, indicating that the key information about the discrimination between the two analyses might not be captured (Fig. 2A). HPLC-MS-, NIR-MS- and NIR-HPLC-MS-based metabonomics data fusion generated summarized principal factorial plane results that were similar to the result for MS analysis mentioned above. Besides, many more variables were in the MS data set than in the HPLC data set, and hence the information contained in the MS data set could cover up the limited information of HPLC to some extent. Therefore, the results from MS-HPLC metabonomics data fusion were similar to those from primary metabonomics based on the MS data set (Table 3).
Fig. 2

The results of mid-level metabonomics data fusion analysis. (A–D) Mid-level metabonomics data fusion analysis without variable selection for NIR-HPLC, HPLC-MS, NIR-MS and NIR-HPLC-MS. (E–H) Mid-level metabonomics data fusion analysis with variable selection for NIR-HPLC, HPLC-MS, NIR-MS and NIR-HPLC-MS.

Identification of discriminatory markers in SF and NSF FLJ using UHPLC-LTQ-Orbitrap MS

No. t R Experimental massFormula [M–H]MS/MS fragment ionsIdentification
M1a4.47353.0869C16H17O9MS2[353]: 191, 179, 1353-CQA
M2a6.91353.0858C16H17O9MS2[353]: 191, 179, 1615-CQA
M3a7.73353.0856C16H17O9MS2[353]: 173, 179, 191, 1354-CQA
M4a2.14373.1122C16H21O10MS2[373]: 193, 149, 167, 179, 119Swertiamarin
M5a5.30373.1118C16H21O10MS2[373]: 211, 167, 149, 193, 179Secologanic acid
M67.88373.1118C16H21O10MS2[373]: 193, 149, 167, 179Swertiamarin isomer
M74.23375.1292C16H23O10MS2[375]: 213, 169, 151Loganin acid isomer
M84.84375.1280C16H23O10MS2[375]: 213, 169, 151, 195Loganin acid
M95.84375.1273C16H23O10MS2[375]: 213, 169, 151Loganin acid isomer
M106.63375.1292C16H23O10MS2[375]: 195, 151Loganin acid isomer
M111.81391.1231C16H23O11MS2[391]: 229, 211, 193, 185, 167, 149Secologanic acid hydrate
M122.45391.1255C16H23O11MS2[391]: 211, 229, 193, 167, 149, 185Secologanic acid hydrate
M1314.33403.1223C17H23O11MS2[403]: 371, 223, 179, 121, 91Secologanin
M141.63433.0428C16H17O12SMS2[433]: 241, 415, 353, 161, 191, 287CQA sulfate
M152.53433.0427C16H17O12SMS2[433]: 415, 387, 353, 241, 353CQA sulfate
M162.66433.0433C16H17O12SMS2[433]: 241, 415, 387, 259, 353CQA sulfate
M174.62433.0423C16H17O12SMS2[433]: 415.387, 259CQA sulfate
M185.01433.0419C16H17O12SMS2[433]: 415, 241, 161, 259, 387CQA sulfate
M191.12435.0591C16H17O12SMS2[435]: 353, 191, 179CQA sulfite
M203.15437.0720C16H21O12SMS2[437]: 193, 149, 373, 355Secologanic acid sulfite
M21a19.06447.0916C21H19O11MS2[447]: 285Luteolin-7-O-glucoside
M2221.06447.0918C21H19O11MS2[447]: 285Luteolin-7-O-glucoside isomer
M231.93455.0822C16H23O13SMS2[455]: 373, 411, 437, 193, 211Secologanic acid sulfite
M242.15455.0836C16H23O13SMS2[455]: 373, 437, 411, 193, 211Secologanic acid sulfite
M2518.22463.0854C21H19O12MS2[463]: 301, 271, 445Hyperoside isomer
M2618.73463.0861C21H19O12MS2[463]: 301, 445, 271Hyperoside
M2723.05499.1231C25H23O11MS2[499]: 337, 173, 335, 3534-pCo-1-CQA
M2823.49499.1233C25H23O11MS2[499]: 353, 337, 191, 335, 1795-pCo-3-CQA
M2925.21499.1230C25H23O11MS2[499]: 353, 337, 179, 1913-pCo-4-CQA
M30a20.36515.1155C25H23O11MS2[515]: 353, 335, 173, 1793,4-DiCQA
M31a20.85515.1155C25H23O11MS2[515]: 353, 191, 179, 3353,5-DiCQA
M32a22.44515.1163C25H23O11MS2[515]: 353, 191, 179, 335, 3534,5-DiCQA
M3317.34527.0494C21H19O14SMS2[527]: 447, 285, 481Luteolin-7-O-glucoside sulfate
M3423.82529.1343C26H25O12MS2[529]: 367, 179, 335, 353, 1933-C-4-FQA
M3524.60529.1340C26H25O12MS2[529]: 353, 367, 191, 1795-C-3-FQA
M3625.86529.1335C26H25O12MS2[529]: 353, 367, 173, 335 cis-5-C-3-FQA
M378.73543.0431C21H19O15SMS2[543]: 463, 381, 525, 301Hyperoside sulfate
M3812.76543.0432C21H19O15SMS2[543]: 381, 301, 381, 463Hyperoside sulfate
M3918.80593.1488C27H29O15MS2[593]: 285, 447Lonicerin isomer
M4019.71593.1483C27H29O15MS2[593]: 285, 447Lonicerin isomer
M41a20.50593.1486C27H29O15MS2[593]: 285Lonicerin
M4216.70595.0737C25H23O15SMS2[595]: 549, 577, 415, 241, 259DiCQA sulfate
M4316.98595.0750C25H23O15SMS2[595]: 549, 577, 415, 301, 397DiCQA sulfate
M4417.61595.0748C25H23O15SMS2[595]: 577, 549, 415, 433, 241, 259DiCQA sulfate
M4517.89595.0737C25H23O15SMS2[595]: 577, 549, 415, 433, 241, 259DiCQA sulfate
M4619.38595.0745C25H23O15SMS2[595]: 577, 549, 415, 433, 259DiCQA sulfate
M4721.25595.0745C25H23O15SMS2[595]: 577, 415, 549, 433, 259, 241DiCQA sulfate
M4822.70607.1653C28H31O15MS2[607]: 299Chrysoeriol-7-O-β-d-neohesperidoside
M4918.30609.1403C27H29O16MS2[609]: 301, 300, 271, 255, 179, 591Rutin

Identified by comparison with reference standards; CQA, caffeoylquinic acid; DiCQA, dicaffeoylquinic acid; pCoCQA, p-coumaroylcaffeoylquinic acid; CFQA, caffeoylferuloylquinic acid.

Identified by comparison with reference standards; CQA, caffeoylquinic acid; DiCQA, dicaffeoylquinic acid; pCoCQA, p-coumaroylcaffeoylquinic acid; CFQA, caffeoylferuloylquinic acid.

Mid-level metabonomics data fusion analysis with variable selection

Neither primary data analysis nor data fusion of different analytical technologies without variables selection could distinctly discriminate SF from NSF samples. This observation indicated that data fusion could not increase the classification capability, as the obtained results from metabonomics data fusion analysis were not improved greatly when compared to those obtained from the primary data analysis. Subsequently, we investigated the mid-level metabonomics data fusion with PLS-DA to improve the group separation. The results from NIR-HPLC, HPLC-MS, NIR-MS and NIR-HPLC-MS data fusions were respectively presented after preliminarily screening all of data acquired from an individual platform according to variable importance values (VIP > 1.0). The initial NIR, HPLC and MS data sets included 1557, 90, and 5000 variables, respectively. After the screening based on the VIP scores, 607, 27 and 1843 variables, respectively, were considered to be the most effective variables and hence retained for the subsequent discrimination. So now, a new PCA model could be constructed to enhance the group discrimination of SF and NSF samples based on the generated variables data set. HPLC-NIR data fusion using the new PCA model yielded much better results than ever before (Fig. 2E), even though 37 batches of FLJ were not distinctly clustered into two groups. Meanwhile, the HPLC-MS data fusion generated much better results without any misclassification, while one misclassification was still found in the NIR-MS data fusion results (no. 16 was still far from the NSF group) (Fig. 2F and G). Fig. 2H shows the results obtained from NIR-MS-HPLC metabonomics data fusion. Although no. 16 was not correctly allocated into the NSF group, the group discrimination potential was significantly improved when compared with those obtained from metabonomics data fusion without variable selection. Thus it could be seen from the results that metabonomics data fusion with variable selection made greater improvements in class separation than did the metabonomics data fusion without variable selection.

Identification of the markers discriminating NSF and SF FLJ samples based on LC-MS metabonomics analysis

As LC-MS could make up for the drawbacks in the structural identification of the discriminatory markers, a UHPLC-LTQ-Orbitrap high-resolution mass spectrometer was employed to perform the discrimination of the SF and NSF FLJ samples. To obtain satisfactory group separation based on differential variables and precisely distinguish the discriminatory markers, we applied the VIP values to filter several variables that contributed to them. Meanwhile, S-plots and t-tests were also typically used for identification of the discriminatory markers and selection of the informative correlations between the markers and the modeled classes. Therefore, the filtered markers required certain conditions to be satisfied, specifically position in S plot (|p| > 0.05 and |pcorr| > 0.3), VIP value (>1.0) and t-test (p < 0.05) (Fig. 3E). As a result, the number of original variables was 5000. Then, 76 peaks were chosen as potential markers. Because the peak areas of most screened discriminatory variables were too low to obtain their MS data, a parent ion list-dynamic exclusion (PIL-DE)-based method for acquiring data was utilized to accomplish the comprehensive acquisition of HRMS[1] and MS data sets, which greatly helped in the following structural identification.[36]
Fig. 3

The mass fragmentation behaviors of identified markers. (A) HRMS1 spectrum of M14. (B) ESI-MS2 spectrum of M20. (C) HRMS1 spectrum of M23. (D) ESI-MS2 spectrum of M24. (E) The S-plot of LC-MS metabonomics analysis.

Markers 1, 2 and 3 yielded identical [M–H]− ions at an m/z value of 353.0867 (C16H23O10, mass error within ±5 ppm) in negative ion mode. Their deprotonated molecular ions all generated a series of diagnostic fragment ions including those with m/z values of 191 [M–H–caffeoyl]−, 179 [caffeic–H]− and 173 [M–H–caffeoyl–H2O]−.[37] CQAs attributed to three different linkage positions of caffeoyl groups on quinic acid have been reported to display different intensities of their ESI-MS2 base peak ions and predominant product ions. Meanwhile, based on retention times and MS spectra of the corresponding reference substances and literature data, markers 1–3 were identified to be 5-CQA (Fig. S5†), 3-CQA and 4-CQA, respectively. Markers 14–18 generated their deprotonated [M–H]− molecular ions each at an m/z of 433.0435 (C16H17O12S, mass error within ±5 ppm). In their ESI-MS2 spectra, the diagnostic product ions were at m/z values of 415 [M–H–H2O]−, 387 [M–H–H2O–CO]−, 353 [M–H–SO3]−, 259 [caffeic–H + SO3]− and 241 [caffeic–H + SO3–H2O]−. The observation of the pair of ions at m/z values of 433 and 353 (Fig. 3A) further confirmed that the sulfate moiety was introduced to the CQA molecule, which has to the best of our knowledge never been reported before. Finally, markers 14–18 were tentatively identified as isomeric CQA sulfate. Similarly, the ESI-MS2 spectrum of marker 20 (Fig. 3B) showed an m/z signal corresponding to its deprotonated [M–H]− molecular ion at a value of 437.0748 (C16H21O12S, error within ±5 ppm). Moreover, the characteristic product ions at m/z values of 373 [M–H–SO2]−, 193 [M–H–SO2–Glc–H2O]− and 149 [M–H–SO2–Glc–H2O–CO2]− were all observed. Based on the observation of the signals at the m/z values of 193 and 149 coupled with its [M–H]− ion, marker 20 may be concluded to be secologanic acid.[38] Meanwhile, the observation of the product ion at the m/z value of 373 confirmed that the sulfite moiety was introduced into the iridoid molecule. Accordingly, marker 20 was tentatively identified as isomeric secologanic acid sulfate. In addition, a combination of the isotopic pattern combined and chromatography analyses was used for screening sulfur-containing compounds in the complex systems, mainly because the 34S isotopic ion has been shown to be drastically affected by 13C2 and 18O.[39] Markers 23 and 24 produced their [M–H]− ions each at an m/z of 455.0822 (C16H23O12S, error within ±5 ppm). And both of them generated a series of fragment ions at m/z values of 437 [M–H–H2O]−, 411 [M–H–CO2]−, 373 [M–H–H2SO3]−, 211 [M–H–H2SO3–Glc]− and 193 [M–H–H2SO3–Glc–H2O]−. Furthermore, they simultaneously produced the isotopic patterns of the 34S ion at an m/z of 457.07822 and of the 13C2 + 18O ion at an m/z of 457.11760. Their characteristic product ions at m/z values of 437 and 373 probably resulted from the occurrence of the sulfite moiety in some of the iridoid molecules. Accordingly, markers 23 and 24 were putatively identified as secologanic acid-sulfite or its isomers (Fig. 3C and D). Taken together, a total of 49 discriminatory markers (Table S4†) attributed to iridoids, organic acids and flavones were screened and characterized according to the fragmentation behaviors, isotopic patterns and diagnostic product ions obtained using the UHPLC-LTQ-Orbitrap MS coupled with the established integrated strategy. Eighteen of these markers were assigned to sulfate/sulfite derivatives of iridoid and chlorogenic acid, which could be chosen as the characteristic Q-markers for SF FLJ discrimination.[40] (Note that Fig. S6† shows a histogram of signal intensities of sulfur derivatives.)

Multi-omics correlation analysis (MOCA)

At first, we performed a selection of specific wavenumbers according to the NIR-based metabonomics data analysis. Analysis of the available NIR spectra, specifically of the different NIR wavebands, quickly provided vast amounts of useful chemical information. However, it was in fact unable to present selective valid wavebands with discriminating potential. In order to interpret the sulfur fumigation process and screen the potential wavebands that presented the significant differences between SF and NSF samples, synergy interval partial least squares (siPLS) analysis with three intervals was employed to obtain the significant differences between SF and NSF samples. To eliminate the influence of overfitting, we set the latent variables to be within the range 1–10. As demonstrated in Table 4, the siPLS analysis with SG11+2nd optimized (the other siPLS methods are shown in Fig. S7 and S8†) was the best discriminatory model and achieved a value of 0.859 for R2(Y) and 0.471 for Q2; i.e., this method produced a better model performance than did any of the other preprocessing methods. Furthermore, the SF and NSF FLJ samples were clearly separated (Fig. 4A). The wavenumbers of the optical subinterval combinations ranged from 5000 to 5200 cm−1. Thereafter, NIR spectra were integrated with two-dimensional correlation spectra (2D-COS) that were expected to clearly identify the screened wavebands, discriminate features and details of the structural changes of SF and NSF FLJ samples. For the synchronous 2D-COS auto-peak analysis, two main sensitive variables were identified and displayed wavenumbers ranging from 5000 to 5200 cm−1 (see Fig. 4B, which shows the diagonal data of the 2D-COS plot, and Fig. S9,† which shows the 2D-COS plot), whose correlation analysis was in accordance with the wavebands screened using the siPLS method. Similar to that previously reported, strong absorptions in the wavenumber range 5000–5200 cm−1 were attributed to S–H and S–OH.[33,41] As mentioned above, some sulfur-containing components were identified as the discriminatory markers. As a result, a close relationship was identified between the screened wavebands (5000–5200 cm−1) and the sulfur-containing markers generated in the sulfur fumigation process.

The results of SiPLS analysis

Preprocessing methodPCAPLS-DA
Lv R 2(X) R 2(Y)Lvs R 2(X) R 2(Y) Q 2
Baseline30.9980.99730.9970.5010.326
Spectroscopic transformation30.9990.99830.9990.3900.248
MSC60.9990.99730.8580.4540.206
Normalization50.9990.99930.9750.4940.309
Original30.9990.99930.9990.4970.249
SG91st50.8990.83140.8450.8270.601
SG92nd50.6210.38130.4250.8270.309
SG111st40.6930.57830.5830.7910.476
SG112nd60.5820.23430.2920.9220.418
SNV40.9970.99630.9930.5170.336
WDS30.9990.99930.9990.5380.159
Fig. 4

The MOCA for the FLJ. (A) The discriminatory information of preprocess method of SG9+1st; (B) the synchronous 2D-COS auto-peak analysis of the SF and NSF samples; (C) the mass fragmentation behaviors of SF chlorogenic acid; (D) the synchronous 2D-COS auto-peak analysis of the SF and NSF chlorogenic acid samples; (E) the mass fragmentation behaviors of SF chlorogenic acid.

To validate the above-mentioned results, one of the main representative chemical constituents, namely chlorogenic acid (5-CQA), was subjected to sulfur fumigation and analyzed using the same methods. The autocorrelation curves of the SF and NSF chlorogenic acid samples (Fig. 4D) were derived from their respective 2D-COS spectra. Obvious differences between the SF and NSF chlorogenic acid samples in the wavebands between 5000 and 5200 cm−1 were observed, which was also in accordance with the wavebands screened using the siPLS model. The subsequent LC-HRMS analysis of an SF chlorogenic acid mixture also indicated the presence of newly generated constituents (Fig. 4C and E) except for the prototype drug during the process of sulfur fumigation. Through analyzing the fragment ions of the S-derivatives, it was found that SO3 (79.9568) and H2SO3 (81.9725) were the characteristic neutral losses of organic sulfates or sulfites. The assignments of these newly emerged peaks were confirmed to be the rudimentary sulfate derivatives of chlorogenic acid based on the HRMS data, which indicated a mass 79.95 Da (SO3) more than that of standard reference. It also indicated that the results of NIR were reliable and credible in the discrimination of SF FLJ.

Proposed protocol for the rapid detection and mechanistic explanation of the sulfur fumigation of CHMs using SF FLJ as a study case

Evaluating the quality of CHMs and judging their authenticity are two major challenges. Sulfur fumigation has attracted increasing attention due to its alteration of CHM quality resulting from its damage to bioactive components, generating excessive sulfur dioxide residue and especially changing the chemically active ingredients in CHMs. NIR, HPLC and LC-MS were proposed to be used to evaluate the quality of CHMs. However, the variations during sulfur fumigation were much more complicated than expected. Furthermore, the amount of data obtained based on one single method was still limited, making it difficult to expound on the mechanism of sulfur fumigation. To experimentally support our inference, we selected FLJ as a model herb in this study. With the development of a few high-throughput strategies, integrative metabonomics analysis was applied to integrate the multiple interactions of NIR spectra, HPLC chromatograms and HRMS data. The results aimed to reveal whether the herb underwent sulfur fumigation and to illuminate the inherent mechanism of the NIR judgment method by associating NIR with HPLC and UHPLC-MS analyses. Our established approach was applied to rapidly discriminate SF FLJ among many unknown samples, and is expected to be greatly beneficial for guaranteeing CHM quality. To perform the analysis of the sulfur fumigation of CHMs, the process of sulfur fumigation was first simulated in the laboratory. According to the characteristics of each analytical technology, optimum analytical conditions were adopted and the corresponding high-quality data of SF and NSF CHM samples were obtained. All of these experiments provided the foundation for subsequent data analysis, which is illustrated in Fig. 5.
Fig. 5

A suggested protocol for the rapid discrimination of SF CHMs and an explanatory mechanism.

Step 1: Performing PCA and PLS-DA for the single technology. This step was focused on the analytical capability of each single technology and whether the SF and NSF CHMs could be distinguished. Our study demonstrated that NIR spectroscopy based on a data preprocessing method (SG9+2nd) with a multivariate calibration approach such as PCA and PLS-DA was the appropriate tool to discriminate SF from NSF FLJ samples. The chemical constituents in FLJ samples displayed strong ultraviolet absorption, which was observed with HPLC-DAD at 330 nm, 238 nm, 254 nm and 280 nm. Peak areas (≥150 mAU) were selected separately through the data fusion of the four wavelengths and then analyzed by performing PCA and PLS-DA. In addition, a UHPLC-LTQ-Orbitrap high-resolution MS was employed to comprehensively and dynamically profile the chemical constituents in FLJ. The derivative content during the sulfur fumigation process was not abundant enough, i.e., the signals of sulfur-containing analytes would have been drowned out by the contribution of inherent constituents. Step 2 and Step 3: Performing the integrative metabonomics analysis, such as mid-level metabonomics data fusion analysis without/with variable selection. In our previous study, the data fusion of NIR- and HRMS-based metabonomics-like analysis was successfully applied to accomplishing the discrimination of SF Ophiopogon Radix.[17] Herein, we combined three kinds of analytical techniques including NIR, HPLC and UHPLC-HRMS to obtain the dimensional information of SF samples, and investigated two types of mid-level metabonomics data fusion strategies as illustrated in Step 2 and Step 3. For that, informative features of the raw data from a single instrument were separately extracted using their own protocol from sample preparation to data preprocessing. The comparison between the unique model and the metabonomics data fusion model is illustrated in Fig. 6. No single analytical platform could be utilized to accurately discriminate the SF samples based on PCA score plots. HPLC and NIR led to classification without rhyme or reason and HRMS could not correctly discriminate one of the SF samples (no. 16). Thus, we believed that utilizing mid-level metabonomics data fusion without variable selection to obtain more accurate characteristics of the samples might be a much better choice. As a result, the potential to discriminate between of NSF and SF samples was actually improved with no. 16 still in the wrong class, and the results were worse than the individual application of MS. Mid-level fusion with variable selection was employed and clearly improved the class separation, as samples were correctly classified and less scattered (Fig. 6K). Taking the classification into account, the fusion of NIR and HRMS data, accomplished with high accuracy, provided the best model (Fig. 6I and J). Moreover, variables that were selected before classification generated better classification results than those obtained when all variables were used. Overall, the proposed metabonomics data fusion approach demonstrated an ability to effectively discriminate key information from raw analysis data.
Fig. 6

Comparison between the unique model and the metabonomics data fusion model. (A–C) PCA for HPLC-DAD, NIR and LC-MS. (D–G) Result for HPLC-NIR, HPLC-MS, NIR-MS and NIR-HPLC-MS data fusion without variable selection analysis. (H–K) Result for HPLC-NIR, HPLC-MS, NIR-MS and NIR-HPLC-MS data fusion with variable selection analysis.

The results demonstrated that the mid-level metabonomics data fusion methods were much better than all of the primary analyses, which meant that the information obtained from individual techniques was in fact insufficient. The results from both kinds of mid-level fusion strategies accomplished the effective discrimination of SF FLJ samples. Step 4: Identifying the discriminatory markers attributed to group separation. LTQ-Orbitrap high-resolution MS has been one of the most powerful approaches used for the rapid identification of multiple constituents in CHMs.[42,43] It has been used to combine high trapping capacity and multiple data acquisition of linear ion traps to generate a large amount of information from MS1 and MS data. In this study, a highly sensitive and effective strategy was utilized for rapidly screening and identifying SF FLJ by using PIL-DE acquisition based on a hybrid LTQ-Orbitrap mass spectrometer to accomplish the overall acquisition of data sets, which helped allow for a search of a greater number of potential active compounds especially for the sulfur-containing constituents. As a result, 49 markers including iridoids, organic acids and the sulfur-containing derivatives were positively or tentatively identified. Step 5: Application of MOCA for deriving mechanistic explanations of the sulfur fumigation process and the corresponding method validation. An NIR spectrum was constructed from different wavebands, but not every waveband displayed a special discrimination ability. Therefore, siPLS analysis was employed to screen the potential wavebands that presented the significant differences between SF and NSF samples. In step 4 mentioned above, some sulfur-containing constituents were identified that would explain the potential NIR wavebands. Chlorogenic acid was selected as the example to validate whether the new sulfur-containing derivatives were produced after the sulfur fumigation process.

Conclusions

Our work indicated that the proposed protocol for the rapid detection of SF CHMs is also beneficial for revealing the intrinsic mechanism of sulfur fumigation and boosting the ability to discriminate SF from NSF FLJ samples, and could serve as an example for future research on rapidly detecting other SF CHMs. Integrative metabonomics analysis was also found to be beneficial for evaluating the quality of and rapidly detecting CHMs. Our work also suggests a future trend of integrating multiple metabonomics datasets from different technologies to achieve a sound evaluation.

Conflicts of interest

All authors declare that they have no conflict of interest.
  37 in total

1.  Approach based on high-performance liquid chromatography fingerprint coupled with multivariate statistical analysis for the quality evaluation of Gastrodia Rhizoma.

Authors:  Xuerong Zhang; Ziwan Ning; Yi Chen; Chunqin Mao; Tulin Lu
Journal:  J Sep Sci       Date:  2015-10-09       Impact factor: 3.645

2.  Conjugates of a secoiridoid glucoside with a phenolic glucoside from the flower buds of Lonicera japonica Thunb.

Authors:  Yoshiki Kashiwada; Yuka Omichi; Shin-ichiro Kurimoto; Hirofumi Shibata; Yoshiyuki Miyake; Tsukasa Kirimoto; Yoshihisa Takaishi
Journal:  Phytochemistry       Date:  2013-10-10       Impact factor: 4.072

3.  UPLC-QTOF-MS/MS-guided isolation and purification of sulfur-containing derivatives from sulfur-fumigated edible herbs, a case study on ginseng.

Authors:  Li Zhang; Hong Shen; Jun Xu; Jin-Di Xu; Zhen-Ling Li; Jie Wu; Ye-Ting Zou; Li-Fang Liu; Song-Lin Li
Journal:  Food Chem       Date:  2017-11-01       Impact factor: 7.514

4.  Flavonoids Isolated from Flowers of Lonicera japonica Thunb. Inhibit Inflammatory Responses in BV2 Microglial Cells by Suppressing TNF-α and IL-β Through PI3K/Akt/NF-kb Signaling Pathways.

Authors:  Min Ho Han; Won Sup Lee; Arulkumar Nagappan; Su Hyun Hong; Ji Hyun Jung; Cheol Park; Hye Jung Kim; Gi-Young Kim; GonSup Kim; Jin-Myung Jung; Chung Ho Ryu; Sung Chul Shin; Soon Chan Hong; Yung Hyun Choi
Journal:  Phytother Res       Date:  2016-08-18       Impact factor: 5.878

5.  Sulfur dioxide residue in sulfur-fumigated edible herbs: The fewer, the safer?

Authors:  Su-Min Duan; Jun Xu; Ying-Jia Bai; Yan Ding; Ming Kong; Huan-Huan Liu; Xiu-Yang Li; Qing-Shan Zhang; Hu-Biao Chen; Li-Fang Liu; Song-Lin Li
Journal:  Food Chem       Date:  2015-07-03       Impact factor: 7.514

Review 6.  Sulfur fumigation, a better or worse choice in preservation of Traditional Chinese Medicine?

Authors:  Xue Jiang; Lin-Fang Huang; Si-Hao Zheng; Shi-Lin Chen
Journal:  Phytomedicine       Date:  2012-11-03       Impact factor: 5.340

7.  Structural elucidation of a pectin from flowers of Lonicera japonica and its antipancreatic cancer activity.

Authors:  Liyan Lin; Peipei Wang; Zhenyun Du; Wucheng Wang; Qifei Cong; Changping Zheng; Can Jin; Kan Ding; Chenghao Shao
Journal:  Int J Biol Macromol       Date:  2016-03-18       Impact factor: 6.953

Review 8.  Integrative omics for health and disease.

Authors:  Konrad J Karczewski; Michael P Snyder
Journal:  Nat Rev Genet       Date:  2018-02-26       Impact factor: 53.242

Review 9.  Approaches to establish Q-markers for the quality standards of traditional Chinese medicines.

Authors:  Wenzhi Yang; Yibei Zhang; Wanying Wu; Luqi Huang; Dean Guo; Changxiao Liu
Journal:  Acta Pharm Sin B       Date:  2017-05-23       Impact factor: 11.413

10.  Novelty application of multi-omics correlation in the discrimination of sulfur-fumigation and non-sulfur-fumigation Ophiopogonis Radix.

Authors:  Shengyun Dai; Zhanpeng Shang; Fei Wang; Yanfeng Cao; Xinyuan Shi; Zhaozhou Lin; Zhibin Wang; Ning Li; Jianqiu Lu; Yanjiang Qiao; Jiayu Zhang
Journal:  Sci Rep       Date:  2017-08-30       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.