| Literature DB >> 28098186 |
Haiqing Li1, Wei Zhang1, Ying Chen2, Yumeng Guo1, Guo-Zheng Li1, Xiaoxin Zhu2.
Abstract
Excavating from small samples is a challenging pharmacokinetic problem, where statistical methods can be applied. Pharmacokinetic data is special due to the small samples of high dimensionality, which makes it difficult to adopt conventional methods to predict the efficacy of traditional Chinese medicine (TCM) prescription. The main purpose of our study is to obtain some knowledge of the correlation in TCM prescription. Here, a novel method named Multi-target Regression Framework to deal with the problem of efficacy prediction is proposed. We employ the correlation between the values of different time sequences and add predictive targets of previous time as features to predict the value of current time. Several experiments are conducted to test the validity of our method and the results of leave-one-out cross-validation clearly manifest the competitiveness of our framework. Compared with linear regression, artificial neural networks, and partial least squares, support vector regression combined with our framework demonstrates the best performance, and appears to be more suitable for this task.Entities:
Mesh:
Year: 2017 PMID: 28098186 PMCID: PMC5241636 DOI: 10.1038/srep40652
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
L9(34) test schedule of Wuji pill.
| Category | Coptis | Evodia | Peony |
|---|---|---|---|
| 1# | 0.48 | 0.15 | 0.23 |
| 2# | 0.48 | 0.29 | 0.47 |
| 3# | 0.48 | 0.88 | 0.93 |
| 4# | 0.96 | 0.15 | 0.47 |
| 5# | 0.96 | 0.29 | 0.93 |
| 6# | 0.96 | 0.88 | 0.23 |
| 7# | 1.92 | 0.15 | 0.93 |
| 8# | 1.92 | 0.29 | 0.23 |
| 9# | 1.92 | 0.88 | 0.47 |
| 10# | 0.48 | 0 | 0 |
| 11# | 0.96 | 0 | 0 |
| 12# | 1.92 | 0 | 0 |
| 13# | 0 | 0.15 | 0 |
| 14# | 0 | 0.29 | 0 |
| 15# | 0 | 0.88 | 0 |
| 16# | 0 | 0 | 0.23 |
| 17# | 0 | 0 | 0.47 |
| 18# | 0 | 0 | 0.93 |
Figure 1Data distribution of original data and fitted curve.
Only part of the Cmax data was chosen as original data in order to show it more clearly. The dashed curve was obtained by polynomial regression, whose order was set to two.
Figure 2Multi-target Regression Framework.
Relation among Time-series data is utilized here to improve the precision of prediction.
The Multi-target Regression Algorithm.
| Input: Training dataset Tr(x, y) and test dataset Ts procedure: |
| 1) Train a model L on the training set Tr with the first time node by using regression algorithm and calculated the training error. |
| 2) Add the prediction value of previous target to the feature set, then use regression algorithm to predict the current target and calculate the training error. |
| 3) Perform step 2 iteratively up to and including the last target. Combine all the parameters and run on the test data. |
| Output: Test error E0 |
Comparison among the mean absolute errors of each component, processed with LR, PR, SVR, ANN and PLS.
| MAE | ber | pal | evo | rut | pae |
|---|---|---|---|---|---|
| LR-ST | 61.92 ± 25.78 | 35.90 ± 9.58 | 21.91 ± 9.23 | 12.34 ± 7.39 | 80.94 ± 43.51 |
| LR-MT | 55.54 ± 31.17 | 27.49 ± 14.84 | 10.39 ± 6.04 | 10.21 ± 7.01 | 47.04 ± 30.26 |
| LR-RMT | 51.20 ± 32.10 | 26.86 ± 12.76 | 20.65 ± 7.32 | 13.62 ± 8.24 | 84.09 ± 47.76 |
| PR-ST | 67.42 ± 33.70 | 35.05 ± 8.02 | 18.30 ± 7.43 | 14.89 ± 8.79 | 65.19 ± 33.99 |
| PR-MT | 57.79 ± 35.14 | 27.88 ± 8.99 | 18.18 ± 6.87 | 13.57 ± 7.88 | 53.23 ± 25.74 |
| PR-RMT | 61.78 ± 33.33 | 30.80 ± 8.87 | 19.57 ± 8.38 | 14.98 ± 8.84 | 70.73 ± 39.22 |
| SVR-ST | 62.23 ± 28.11 | 35.90 ± 8.50 | 20.62 ± 8.89 | 11.47 ± 6.46 | 74.47 ± 42.51 |
| SVR-MT | |||||
| SVR-RMT | 62.99 ± 26.37 | 33.11 ± 12.57 | 19.86 ± 8.00 | 12.04 ± 6.73 | 78.71 ± 45.61 |
| ANN-ST | 109.6 ± 46.73 | 59.84 ± 25.89 | 34.27 ± 13.32 | 18.01 ± 9.28 | 106.08 ± 55.64 |
| ANN-MT | 95.26 ± 60.89 | 47.53 ± 13.99 | 23.82 ± 9.55 | 17.86 ± 10.72 | 82.89 ± 60.43 |
| ANN-RMT | 88.68 ± 61.26 | 51.68 ± 22.84 | 30.28 ± 14.45 | 18.15 ± 8.00 | 110.23 ± 51.89 |
| PLS-ST | 64.97 ± 37.20 | 33.32 ± 7.08 | 21.39 ± 8.63 | 18.70 ± 9.47 | 66.24 ± 32.85 |
| PLS-MT | 63.09 ± 31.63 | 43.65 ± 14.13 | 64.38 ± 27.25 | 32.04 ± 21.64 | 73.38 ± 45.61 |
| PLS-RMT | 70.04 ± 32.40 | 33.90 ± 9.77 | 24.18 ± 10.69 | 22.02 ± 14.29 | 74.65 ± 37.64 |
Here ST denotes Single-target regression, MT denotes Multi-Target regression with last target, RMT denotes Multi-Target regression with random target.
All data here means MAE ± SD, SD is standard deviation.
Comparison among the mean absolute percentage errors of each component, processed with LR, PR, SVR, ANN and PLS.
| MAPE | ber | pal | evo | rut | pae |
|---|---|---|---|---|---|
| LR-ST | 0.464 ± 0.128 | 1.073 ± 0.491 | 1.950 ± 0.581 | 0.692 ± 0.144 | 0.878 ± 0.224 |
| LR-MT | 0.375 ± 0.062 | 0.752 ± 0.431 | 0.800 ± 0.446 | 0.491 ± 0.111 | 0.525 ± 0.199 |
| LR-RMT | 0.394 ± 0.156 | 0.895 ± 0.620 | 1.874 ± 0.688 | 0.750 ± 0.177 | 0.926 ± 0.217 |
| PR-ST | 0.511 ± 0.159 | 1.136 ± 0.333 | 1.383 ± 0.391 | 0.677 ± 0.132 | 0.651 ± 0.125 |
| PR-MT | 0.383 ± 0.074 | 0.890 ± 0.374 | 1.422 ± 0.391 | 0.633 ± 0.123 | 0.493 ± 0.146 |
| PR-RMT | 0.436 ± 0.103 | 0.983 ± 0.346 | 1.512 ± 0.407 | 0.694 ± 0.136 | 0.737 ± 0.201 |
| SVR-ST | 0.468 ± 0.172 | 0.947 ± 0.303 | 1.413 ± 0.356 | 0.540 ± 0.131 | 0.711 ± 0.136 |
| SVR-MT | |||||
| SVR-RMT | 0.493 ± 0.203 | 0.889 ± 0.382 | 1.302 ± 0.331 | 0.544 ± 0.139 | 0.820 ± 0.259 |
| ANN-ST | 0.872 ± 0.317 | 1.846 ± 0.746 | 3.620 ± 1.667 | 0.895 ± 0.269 | 1.286 ± 0.391 |
| ANN-MT | 0.654 ± 0.490 | 1.411 ± 0.454 | 2.265 ± 0.893 | 0.957 ± 0.258 | 1.068 ± 0.712 |
| ANN-RMT | 0.668 ± 0.404 | 1.594 ± 0.816 | 2.573 ± 1.193 | 1.063 ± 0.247 | 1.373 ± 0.362 |
| PLS-ST | 0.462 ± 0.137 | 1.035 ± 0.348 | 1.529 ± 0.402 | 0.890 ± 0.288 | 0.701 ± 0.144 |
| PLS-MT | 0.471 ± 0.234 | 1.507 ± 1.136 | 5.595 ± 2.686 | 1.602 ± 0.639 | 0.899 ± 0.767 |
| PLS-RMT | 0.523 ± 0.137 | 1.051 ± 0.361 | 1.769 ± 0.461 | 1.039 ± 0.413 | 0.775 ± 0.239 |
Here ST denotes Single-target regression, MT denotes Multi-Target regression with last target, RMT denotes Multi-Target regression with random target.
All data here means MAPE ± SD.
Comparison among the root mean square errors of each component, processed with LR, PR, SVR, ANN and PLS.
| RMSE | ber | pal | evo | rut | pae |
|---|---|---|---|---|---|
| LR-ST | 66.88 ± 27.08 | 37.46 ± 9.99 | 22.49 ± 9.31 | 13.08 ± 7.48 | 82.05 ± 43.61 |
| LR-MT | 60.92 ± 32.56 | 29.38 ± 14.66 | 11.07 ± 6.07 | 11.17 ± 7.09 | 48.61 ± 30.77 |
| LR-RMT | 28.52 ± 13.55 | 21.43 ± 7.56 | 14.39 ± 8.41 | 85.85 ± 48.09 | |
| PR-ST | 71.83 ± 34.23 | 36.63 ± 8.51 | 18.89 ± 7.58 | 15.58 ± 8.84 | 66.50 ± 34.10 |
| PR-MT | 63.03 ± 37.46 | 29.53 ± 9.23 | 18.78 ± 6.95 | 14.32 ± 7.92 | 54.68 ± 25.97 |
| PR-RMT | 66.71 ± 34.41 | 32.46 ± 9.28 | 20.36 ± 8.66 | 15.75 ± 8.96 | 72.10 ± 39.36 |
| SVR-ST | 67.09 ± 28.61 | 37.37 ± 8.91 | 21.18 ± 8.98 | 12.22 ± 6.62 | 75.79 ± 42.53 |
| SVR-MT | 55.72 ± 30.95 | ||||
| SVR-RMT | 68.00 ± 26.80 | 34.98 ± 12.70 | 20.67 ± 8.31 | 12.81 ± 6.92 | 80.33 ± 45.75 |
| ANN-ST | 113.71 ± 47.65 | 61.09 ± 26.01 | 34.73 ± 13.38 | 18.80 ± 9.44 | 107.3 ± 55.69 |
| ANN-MT | 99.38 ± 62.11 | 49.43 ± 14.31 | 24.48 ± 9.74 | 18.89 ± 10.79 | 84.48 ± 60.39 |
| ANN-RMT | 92.62 ± 63.76 | 53.80 ± 23.29 | 31.19 ± 14.59 | 19.11 ± 8.26 | 113.69 ± 53.18 |
| PLS-ST | 69.61 ± 38.04 | 34.91 ± 7.56 | 22.02 ± 8.77 | 19.31 ± 9.58 | 67.66 ± 32.99 |
| PLS-MT | 68.47 ± 32.59 | 45.02 ± 14.38 | 64.80 ± 27.23 | 32.64 ± 21.59 | 74.72 ± 45.71 |
| PLS-RMT | 75.07 ± 33.80 | 35.62 ± 9.92 | 24.89 ± 10.83 | 22.65 ± 14.39 | 76.24 ± 37.99 |
Here ST denotes Single-target regression, MT denotes Multi-Target regression with last target, RMT denotes Multi-Target regression with random target.
All data here means RMSE ± SD.
Figure 3Mean absolute percentage error of pal.
Pal data is trained with LR, PR, SVR, ANN, PLS, MT, RMT and leave-one-out cross validation is applied to test the performance.
Percentage improved by MT compared with ST in RMSE.
| MT-RMSE-imp | LR | PR | SVR | ANN | PLS |
|---|---|---|---|---|---|
| ber | 8.91 | 12.25 | 16.94 | 12.60 | 1.64 |
| pal | 21.57 | 19.39 | 29.30 | 19.09 | −28.98 |
| evo | 50.76 | 0.56 | 48.86 | 29.53 | −194.3 |
| rut | 14.65 | 8.07 | 11.36 | −0.05 | −69.01 |
| pae | 40.76 | 17.78 | 39.36 | 21.27 | −10.43 |
[a]MT-RMSE-imp = (ST_RMSE-MT_RMSE)/ST_RMSE*100%.
Percentage improved by MT compared with RMT in RMSE.
| RMT-RMSE-imp | LR | PR | SVR | ANN | PLS |
|---|---|---|---|---|---|
| ber | −9.73 | 5.52 | 18.06 | −7.30 | 8.79 |
| pal | −3.02 | 9.03 | 24.47 | 8.12 | −26.39 |
| evo | 48.34 | 7.76 | 47.61 | 21.51 | −160.35 |
| rut | 22.38 | 9.08 | 15.46 | 1.15 | −44.11 |
| pae | 43.38 | 24.16 | 42.79 | 25.69 | 1.99 |
[a]RMT-RMSE-imp = (RMT_RMSE-MT_RMSE)/RMT_RMSE*100%.
Percentage improved by MT compared with ST in MAPE.
| MT-MAPE-imp | LR | PR | SVR | ANN | PLS |
|---|---|---|---|---|---|
| ber | 19.13 | 24.95 | 27.66 | 24.97 | −2.07 |
| pal | 29.89 | 2163 | 39.37 | 23.57 | −45.56 |
| evo | 58.96 | −2.80 | 49.00 | 37.41 | −266.01 |
| rut | 29.11 | 6.50 | 21.10 | −0.69 | −80.01 |
| pae | 40.17 | 24.20 | 33.46 | 16.93 | −28.23 |
[a]MT-MAPE-imp = (ST_MAPE-MT_MAPE)/ST_MAPE*100%.
Percentage improved by MT compared with RMT in MAPE.
| RMT-MAPE-imp | LR | PR | SVR | ANN | PLS |
|---|---|---|---|---|---|
| ber | 4.82 | 12.16 | 31.44 | 2.10 | 9.94 |
| pal | 15.98 | 9.46 | 35.43 | 11.48 | −43.39 |
| evo | 57.31 | 5.95 | 44.62 | 11.97 | −216.28 |
| rut | 34.53 | 8.79 | 21.69 | 9.97 | −54.19 |
| pae | 43.30 | 33.11 | 42.32 | 22.21 | −16.00 |
[a]RMT-MAPE-imp = (RMT_MAPE-MT_MAPE)/RMT_MAPE*100%.
Percentage improved by MT compared with ST in MAE.
| MT-MAE-imp | LR | PR | SVR | ANN | PLS |
|---|---|---|---|---|---|
| ber | 10.31 | 14.30 | 19.07 | 13.08 | 2.90 |
| pal | 23.41 | 20.44 | 31.50 | 20.56 | −31.02 |
| evo | 52.55 | 0.62 | 50.63 | 30.48 | −200.84 |
| rut | 17.21 | 8.91 | 13.27 | 0.83 | −71.31 |
| pae | 41.89 | 18.35 | 40.10 | 21.87 | −10.78 |
[a]MT-MAE-imp = (ST_MAE-MT_MAE)/ST_MAE*100%.
Percentage improved by MT compared with RMT in MAE.
| RMT-MAE-imp | LR | PR | SVR | ANN | PLS |
|---|---|---|---|---|---|
| ber | −8.48 | 6.46 | 20.05 | −7.42 | 9.92 |
| pal | −2.35 | 9.48 | 25.73 | 8.03 | −28.76 |
| evo | 49.69 | 7.10 | 48.74 | 21.33 | −166.25 |
| rut | 25.04 | 9.41 | 17.36 | 1.60 | −45.50 |
| pae | 44.06 | 24.74 | 43.34 | 24.80 | 1.70 |
[a]RMT-MAE-imp = (RMT_MAE-MT_MAE)/RMT_MAE*100%.