| Literature DB >> 26329995 |
Hao Chen, Josiah Poon, Simon K Poon, Lizhi Cui, Kei Fan, Daniel Sze.
Abstract
BACKGROUND: Recent quality control of complex mixtures, including herbal medicines, is not limited to chemical chromatographic definition of one or two selected compounds; multivariate linear regression methods with dimension reduction or regularisation have been used to predict the bioactivity capacity from the chromatographic fingerprints of the herbal extracts. The challenge of this type of analysis requires a multi-dimensional approach at two levels: firstly each herb comprises complex mixtures of active and non-active chemical components; and secondly there are many factors relating to the growth, production, and processing of the herbal products. All these factors result in the significantly diverse concentrations of bioactive compounds in the herbal products. Therefore, it is imminent to have a predictive model with better generalisation that can accurately predict the bioactivity capacity of samples when only the chemical fingerprints data are available.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26329995 PMCID: PMC4705500 DOI: 10.1186/1471-2105-16-S12-S4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
the MSEof each predictive model build by PCR, PLSR, OPLS, EN and SMLR
|
| |||||
|---|---|---|---|---|---|
| 1 | 0.18545 | 0.17693 | 0.17693 | 0.14299 | 0.16524 |
| 2 | 0.26334 | 0.29223 | 0.29223 | 0.27688 | 0.27026 |
| 3 | 0.22515 | 0.21094 | 0.21094 | 0.18428 | 0.18602 |
| 4 | 0.27171 | 0.25480 | 0.25480 | 0.26347 | 0.25632 |
| 5 | 0.20965 | 0.19050 | 0.19050 | 0.22645 | 0.19704 |
| 6 | 0.33327 | 0.32225 | 0.32225 | 0.30530 | 0.30847 |
| 7 | 0.23789 | 0.22115 | 0.22115 | 0.24703 | 0.21343 |
| 8 | 0.25015 | 0.22506 | 0.23109 | 0.24708 | 0.22395 |
| 9 | 0.18473 | 0.17900 | 0.17900 | 0.20893 | 0.18507 |
| 10 | 0.27251 | 0.26961 | 0.26961 | 0.27560 | 0.26843 |
the MSEof each predictive model build by PCR, PLSR, OPLS, EN and SMLR
|
| |||||
|---|---|---|---|---|---|
| 1 | 0.05722 | 0.01562 | 0.01562 | 0.04591 | 0.02273 |
| 2 | 0.07726 | 0.02304 | 0.02304 | 0.0688 | 0.0339 |
| 3 | 0.05432 | 0.00992 | 0.00992 | 0.00278 | 0.01183 |
| 4 | 0.0372 | 0.02113 | 0.02113 | 0.17149 | 0.03334 |
| 5 | 0.02094 | 0.01609 | 0.01609 | 0.05693 | 0.0168 |
| 6 | 0.04337 | 0.02291 | 0.02291 | 0.20607 | 0.03839 |
| 7 | 0.03314 | 0.02129 | 0.02129 | 0.04031 | 0.01823 |
| 8 | 0.04345 | 0.01429 | 0.02959 | 0.04318 | 0.02377 |
| 9 | 0.0417 | 0.03122 | 0.03122 | 0.02044 | 0.02484 |
| 10 | 0.07736 | 0.01078 | 0.01078 | 0.07534 | 0.02725 |
Criteria of goodness of prediction
| Differences (%) | How Good is the Prediction |
|---|---|
| ≤ 10 | Excellent |
| >10 & ≤20 | Good |
| >20 & ≤30 | Acceptable |
| > 30 | Poor |
Average difference of predicted responses and observed responses in each test set for each model
| Difference (%) | |||||
|---|---|---|---|---|---|
| 1 | 12.72 | 12.089 | 12.089 | 10.9955 | 11.972 |
| 2 | 14.6231 | 14.972 | 14.972 | 13.635 | 14.2868 |
| 3 | 12.8561 | 12.6299 | 12.6299 | 11.5313 | 12.1758 |
| 4 | 15.1355 | 14.5171 | 14.5171 | 15.3884 | 14.8151 |
| 5 | 13.0635 | 12.3538 | 12.3538 | 13.8607 | 12.8326 |
| 6 | 14.6371 | 13.954 | 13.954 | 13.9972 | 13.9497 |
| 7 | 15.2913 | 14.5146 | 14.5146 | 14.8474 | 14.132 |
| 8 | 14.4245 | 14.0604 | 14.3823 | 13.9682 | 13.9149 |
| 9 | 12.1609 | 11.7033 | 11.7033 | 12.918 | 12.0041 |
| 10 | 14.4008 | 13.6742 | 13.6742 | 14.2981 | 13.9341 |
One-way ANOVA for the MSE of each model of PCR, PLSR, OPLS, EN and SMLR
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| 0.00135 | 4 | 0.00034 | 0.14765 | 2.57874 | ||
| 0.10264 | 45 | 0.00228 | ||||
| 0.10399 | 49 |
One-way ANOVA for differences of predicted and observed responses of PCR, PLSR, OPLS, EN, SMLR
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| 0.00018 | 4 | 4.56E-05 | 0.32640 | 2.57874 | ||
| 0.00629 | 45 | 0.00014 | ||||
| 0.00647 | 49 |