| Literature DB >> 28337449 |
Michelle Chun-Har Ng1, Tsui-Yan Lau2, Kei Fan1, Qing-Song Xu3, Josiah Poon4, Simon K Poon4, Mary K Lam5, Foo-Tim Chau2, Daniel Man-Yuen Sze6.
Abstract
The current use of a single chemical component as the representative quality control marker of herbal food supplement is inadequate. In this CD80-Quantitative-Pattern-Activity-Relationship (QPAR) study, we built a bioactivity predictive model that can be applicable for complex mixtures. Through integrating the chemical fingerprinting profiles of the immunomodulating herb Radix Astragali (RA) extracts, and their related biological data of immunological marker CD80 expression on dendritic cells, a chemometric model using the Elastic Net Partial Least Square (EN-PLS) algorithm was established. The EN-PLS algorithm increased the biological predictive capability with lower value of RMSEP (11.66) and higher values of Rp2 (0.55) when compared to the standard PLS model. This CD80-QPAR platform provides a useful predictive model for unknown RA extract's bioactivities using the chemical fingerprint inputs. Furthermore, this bioactivity prediction platform facilitates identification of key bioactivity-related chemical components within complex mixtures for future drug discovery and understanding of the batch-to-batch consistency for quality clinical trials.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28337449 PMCID: PMC5350422 DOI: 10.1155/2017/3923865
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1The HPLC-DAD chromatographic profiles of each RA extract from batches A, B, and C.
Figure 2The immunomodulatory effect (relative percentage change of CD80 expression to the blank, after standardization) of each RA extract from three batches on THP-1 cell.
The immunomodulatory effect (relative change of CD80 expression to the blank, %, after standardization) of each RA extract from three batches on THP-1 cell.
| Batch A | Batch B | Batch C | |||
|---|---|---|---|---|---|
| Sample | Act. activity (%) | Sample | Act. activity (%) | Sample | Act. activity (%) |
| A1 | +12.70 | B1 | +1.87 | C1 | +15.58 |
| A2 | +8.33 | B2 | −2.25 | C2 | +20.29 |
| A3 | +15.48 | B3 | +2.25 | C3 | +20.29 |
| A4 | +14.68 | B4 | +17.23 | C4 | +43.48 |
| A5 | +15.08 | B5 | +8.61 | C5 | +35.14 |
| A6 | +9.52 | B6 | −4.87 | C6 | +26.45 |
| A7 | +15.08 | B7 | +20.60 | C7 | +22.46 |
| A8 | +5.95 | B8 | −7.12 | C8 | +57.25 |
| A9 | +7.54 | B9 | +16.10 | C9 | +25.36 |
| A10 | 0 | B10 | −1.12 | C10 | +25.36 |
| A11 | 0 | B11 | −4.12 | C11 | +29.35 |
| A12 | +2.38 | B12 | −8.24 | C12 | +27.17 |
| A13 | +0.79 | B13 | −15.36 | C13 | +27.54 |
| A14 | +2.78 | B14 | −10.11 | C14 | +19.20 |
| A15 | 0 | B15 | −7.87 | C15 | +27.17 |
| A16 | +3.57 | B16 | −7.12 | C16 | +26.81 |
| A17 | −10.32 | B17 | −7.49 | C17 | +21.38 |
| A18 | −8.73 | B18 | −8.24 | C18 | +19.20 |
| A19 | −6.35 | B19 | −4.87 | C19 | +19.20 |
| A20 | −14.68 | B20 | −12.36 | C20 | +19.93 |
| A21 | +25.00 | B21 | −11.75 | C21 | +17.17 |
| A22 | +13.86 | B22 | −17.17 | C22 | +5.12 |
| A23 | +30.72 | B23 | −19.28 | C23 | +2.41 |
| A24 | +6.02 | B24 | −17.47 | C24 | −7.23 |
|
| |||||
|
|
|
| |||
|
|
|
| |||
|
|
|
| |||
|
|
|
| |||
The results of the models built by three algorithms (PLS and EN-PLS).
| Model | # of variables | Optimum # of PLS components | Training set | Test set | |||
|---|---|---|---|---|---|---|---|
|
| RMSET | RMSECV |
| RMSEP | |||
| PLS | 10493 | 8 | 0.87 | 5.95 | 16.63 | 0.34 | 12.70 |
| EN-PLS | 309 | 7 | 0.93 | 4.34 | 6.93 | 0.55 | 11.66 |
R 2 is correlation coefficient of regression between the predicted and experimental activities of the extracts (t refers to training set and p refers to the test set); RMSET is the fitting error of the model in the training; RMSECV is the Root Mean Squared Errors of Cross-Validation; RMSEP is Root Mean Squared Errors of Prediction of the test set; q2 is the cross-validated R2 which is calculated by the equation: q2 = 1 − ∑(Ypred − Yact)2/∑(Yact − Ymean)2.
Figure 3Plots of predicted versus experimental activity from training set data and test set data on (a) PLS and (b) EN-PLS. Open blue diamond and the open red triangle represent training set data and the test set data, respectively.
The actual CD80 activity (Act.) and the predicted values (Pred.) of the test set predicted by PLS and EN-PLS.
| Number | Act | PLS | EN-PLS | ||||
|---|---|---|---|---|---|---|---|
|
| REP | PRESS |
| REP | PRESS | ||
| 1 | 108.33 | 100.04 | −8.29 | 68.72 | 95.66 | −12.67 | 160.53 |
| 2 | 105.95 | 109.26 | 3.31 | 10.96 | 101.16 | −4.79 | 22.94 |
| 3 | 107.54 | 121.08 | 13.54 | 183.33 | 150.84 | 43.30 | 1874.89 |
| 4 | 100.00 | 95.54 | −4.46 | 19.89 | 110.57 | 10.57 | 111.72 |
| 5 | 100.79 | 94.68 | −6.11 | 37.33 | 97.03 | −3.76 | 14.14 |
| 6 | 89.68 | 103.86 | 14.18 | 201.07 | 101.68 | 12.00 | 144.00 |
| 7 | 91.27 | 105.41 | 14.14 | 199.94 | 87.64 | −3.63 | 13.18 |
| 8 | 85.32 | 121.30 | 35.98 | 1294.56 | 103.43 | 18.11 | 327.97 |
| 9 | 101.87 | 85.91 | −15.96 | 254.72 | 104.86 | 2.99 | 8.94 |
| 10 | 102.25 | 111.26 | 9.01 | 81.18 | 104.34 | 2.09 | 4.37 |
| 11 | 108.61 | 96.89 | −11.72 | 137.36 | 112.05 | 3.44 | 11.83 |
| 12 | 116.10 | 102.6 | −13.50 | 182.25 | 107.22 | −8.88 | 78.85 |
| 13 | 91.76 | 96.31 | 4.55 | 20.70 | 90.37 | −1.39 | 1.93 |
| 14 | 92.13 | 99.56 | 7.43 | 55.20 | 100.61 | 8.48 | 71.91 |
| 15 | 82.83 | 82.06 | −0.77 | 0.59 | 86.56 | 3.73 | 13.91 |
| 16 | 82.53 | 85.69 | 3.16 | 9.99 | 82.04 | −0.49 | 0.24 |
| 17 | 115.58 | 120.84 | 5.26 | 27.67 | 122.50 | 6.92 | 47.89 |
| 18 | 120.29 | 115.73 | −4.56 | 20.79 | 125.46 | 5.17 | 26.73 |
| 19 | 135.14 | 126.63 | −8.51 | 72.42 | 136.89 | 1.75 | 3.06 |
| 20 | 126.45 | 110.55 | −15.90 | 252.81 | 121.62 | −4.83 | 23.33 |
| 21 | 122.46 | 104.37 | −18.09 | 327.25 | 116.8 | −5.66 | 32.04 |
| 22 | 121.38 | 129.29 | 7.91 | 62.57 | 113.11 | −8.27 | 68.39 |
| 23 | 119.20 | 122.39 | 3.19 | 10.18 | 133.17 | 13.97 | 195.16 |
| 24 | 102.41 | 120.91 | 18.50 | 342.25 | 99.67 | −2.74 | 7.51 |
REP = relative error of prediction = (calculated value-measured value)/measured value.
PRESS = predicted error sum of square for test set = ∑(Ypred − Yact)2.
The changes of CD80 expression related to % changes of chromatogram regions with different correlation coefficients.
| Chromatogram region | Correlation coefficient generated by the prediction algorithm based on | % changes in CD80 expression when chromatogram region intensity (quantity of corresponding compound) was increased by 50, 100, or 200% | ||||||
|---|---|---|---|---|---|---|---|---|
| PLS | EN-PLS | PLS | EN-PLS | |||||
| 50% increase | 100% increase | 200% increase | 50% increase | 100% increase | 200% increase | |||
| 1 | 8.08 | 12.76 | 0.42 | 0.83 | 2.91 | 0.38 | 0.76 | 2.66 |
| 2 | −10.59 | −10.92 | −0.98 | −1.95 | −3.90 | −0.50 | −1.01 | −2.01 |
| 3 | 11.85 | 33.57 | 18.47 | 36.93 | 73.87 | 31.75 | 63.51 | 127.02 |
| 4 | −9.07 | −93.96 | −1.19 | −2.38 | −4.76 | −6.93 | −13.86 | −27.72 |
| 5 | −10.29 | −9.32 | −0.14 | −0.27 | 0.54 | −0.06 | −0.12 | −0.25 |
| 6 | 9.07 | 30.64 | 0.14 | 0.28 | 0.56 | 0.23 | 0.47 | 0.93 |
| 7 | −12.66 | −55.19 | −1.41 | −2.82 | −5.64 | −3.30 | −6.59 | −13.18 |
| 8 | −13.81 | −56.59 | −6.53 | −13.05 | −26.10 | −14.00 | −28.00 | −56.00 |
| 9 | 10.63 | 41.34 | 0.62 | 1.25 | 2.50 | 1.34 | 2.68 | 5.36 |
| 10 | 12.19 | 48.64 | 0.15 | 0.31 | 0.61 | 0.34 | 0.68 | 1.37 |
| 11 | −10.78 | −30.46 | −0.17 | −0.35 | −0.70 | −0.24 | −0.48 | −0.95 |
| 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |