| Literature DB >> 28671555 |
Wenbin Liu1, Bingyang Zhang2, Zhongquan Xin3, Dabing Ren4, Lunzhao Yi5.
Abstract
This present study aims to identify the key bioactive components in acorus tatarinowii rhizoma (ATR), a traditional Chinese medicine (TCM) with various bioactivities. Partial least squares regression (PLSR) was employed to describe the relationship between the radical scavenging activity and the volatile components. The PLSR model was improved by outlier elimination and variable selection and was evaluated by 10-fold cross-validation and external validation in this study. Based on the PLSR model, eleven chemical components were identified as the key bioactive components by variable importance in projection. The final PLS regression model with these components has good predictive ability. The Q² was 0.8284, and the root mean square error for prediction was 2.9641. The results indicated that the eleven components could be a pattern to predict the radical scavenging activity of ATR. In addition, we did not find any specific relationship between the radical scavenging ability and the habitat of the ATRs. This study proposed an efficient strategy to predict bioactive components using the combination of quantitative chromatography fingerprints and PLS regression, and has potential perspective for screening bioactive components in complex analytical systems, such as TCM.Entities:
Keywords: acori tatarinowii rhizoma; gas chromatography–mass spectrometry; partial least squares regression; radical scavenging activity
Mesh:
Substances:
Year: 2017 PMID: 28671555 PMCID: PMC5535835 DOI: 10.3390/ijms18071342
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Representative gas chromatography–mass spectrometry (GC-MS) fingerprint of acorus tatarinowii rhizoma (ATR): (A) total ion chromatogram (TIC) of ATR at 3–13 min; (B)TIC of ATR at 13–24 min; (C) TIC of ATR at 24–35 min. Eighty volatile components were detected by GC-MS.
Comparison of partial least squares regression models before and after variable selection.
| Matrix | nLV | RMSEC | RMSEP | RMSECV | ||
|---|---|---|---|---|---|---|
| 49 × 80 a | 8 | 0.8284 | 0.7824 | 3.2597 | 3.2210 | 5.0261 |
| 43 × 80 b | 6 | 0.9090 | 0.8124 | 2.3084 | 3.0996 | 3.4109 |
| 43 × 15 c | 5 | 0.9069 | 0.7969 | 2.3440 | 3.1713 | 2.9716 |
| 43 × 11 d | 6 | 0.8955 | 0.8284 | 2.4745 | 2.9641 | 3.3196 |
| 43 × 10 e | 5 | 0.9050 | 0.7940 | 2.3679 | 3.1932 | 2.9927 |
a: the PLS regression model established by dataset of all samples; b: the PLS regression model established by dataset of samples after outlier elimination; c: the variables were selected by regression coefficients (RC); d: the variable were selected by variable importance in projection (VIP); e: the common variables selected by RC and VIP. nLV: number of latent variables; R2: determination coefficient for calibration set; Q2: determination coefficient for validation set; RMSEC: root mean square error of calibration; RMSEP: root mean square error of prediction (validation set); RMSECV: root mean square error of cross validation.
Figure 2Distribution of the predicted mean and standard deviation values of 2,2-diohenyl-1-picryl-hydrazyl (DPPH) radical scavenging assay for 49 ATR samples by Monte-Carlo cross-validation (MCCV) method. A 2000-time Monte-Carlo sampling was conducted for the dataset of all samples (49 × 80).
Figure 4Screening of the key bioactive components: (A) The Q2 of the partial least squares regression models of different combinations of variables. The first one was the variable with the highest variable importance in projection (VIP) or regression coefficient (RC) value. The second combination was the first one plus the second one, then the first three, and so on. In this study, the number of variables changed from one to twenty; (B) RCs of the PLS regression model (43 × 80) for the 80 components; (C) VIP value of each component. Fifteen and eleven components were selected by RC and VIP, respectively. There are ten common components selected by the two methods. Components 1–16: 1, Estragole; 2, Methyleugenol; 3, cis-Methylisoeugenol; 4, Shyobunone; 5, Ledene; 6, Isoshyobunone; 7, δ-Cadinene; 8, Calacorene; 9, γ-Asarone; 10, β-Asarone; 11, cis-Calamenene; 12, Dehydroxy-isocalamendiol; 13, α-Cadinol; 14, α-Asarone; 15, Calamusenone; 16, Isocalamendiol.
Figure 3The partial least squares regression model between the radical scavenging activity and the volatile components: (A) Selection of the optimal latent variables by 10-fold cross validation. The first six latent variables were selected; (B) actual measured DPPH values versus their predicted values obtained by partial least squares regression model. The size of data set is 43 × 80. (▲) calibration set; (●) validation set.
Figure 5The result of standard deviation versus mean value for each sample calculated by Monte-Carlo cross-validation method. Samples in the red region are identified as normal, whereas the others are outliers.