| Literature DB >> 35093010 |
Qingyu An1, Jun Wu2, Jun Meng2, Zhijie Zhao2, Jin Jian Bai2, Xiaofeng Li3.
Abstract
BACKGROUND: Acquired immunodeficiency syndrome (AIDS) is a malignant infectious disease with high mortality caused by HIV (human immunodeficiency virus, and up to now there are no curable drugs or effective vaccines. In order to understand AIDS's development trend, we establish hybrid EMD-BPNN (empirical modal decomposition and Back-propagation artificial neural network model) model to forecast new HIV infection in Dalian and to evaluate model's performance.Entities:
Keywords: ARIMA; BPNN; EMD; HIV
Mesh:
Year: 2022 PMID: 35093010 PMCID: PMC8799978 DOI: 10.1186/s12879-022-07061-7
Source DB: PubMed Journal: BMC Infect Dis ISSN: 1471-2334 Impact factor: 3.090
Fig. 1The topological structure of BPNN model
Fig. 2Decomposition results of the original monthly HIV data series in Dalian by EMD
The different node number of hidden layer and the model’s fitting residuals
| The node number of one hidden layer | Fitting residuals (%) |
|---|---|
| 12 | 0.003581 |
| 11 | 0.003717 |
| 10 | 0.00381 |
| 9 | 0.003985 |
| 8 | 0.00363 |
| 7 | 0.003737 |
| 6 | 0.004291 |
| 5 | 0.00458 |
| 4 | 0.006355 |
| 3 | 0.008446 |
| 2 | 0.010625 |
| 1 | 0.019218 |
The number of HIV cases observed during 2018 and predicted values obtained from the EMD-BPNN model
| Month | IMF1 | IMF2 | IMF3 | IMF4 | Residue item | Predicted value | Observed value |
|---|---|---|---|---|---|---|---|
| 1 | − 0.4262 | − 2.6947 | − 2.1377 | − 3.1996 | 38.7255 | 30.2673 | 28 |
| 2 | − 1.2183 | − 1.6887 | − 1.0954 | − 3.1254 | 39.1768 | 32.049 | 30 |
| 3 | 5.6477 | 3.4268 | 0.1188 | − 3.193 | 39.5275 | 45.5278 | 46 |
| 4 | − 9.2236 | 4.842 | 1.3709 | − 3.3332 | 39.9215 | 33.5776 | 50 |
| 5 | 9.1589 | 2.0379 | 2.3587 | − 3.4737 | 40.2396 | 50.3214 | 66 |
| 6 | − 5.1057 | − 1.0484 | 2.5335 | − 3.5432 | 40.5752 | 33.4114 | 52 |
| 7 | 0.0473 | − 0.8585 | 2.2695 | − 3.5595 | 40.9097 | 38.8085 | 45 |
| 8 | 2.6705 | 0.1333 | 1.4116 | − 3.5628 | 41.104 | 41.7566 | 41 |
| 9 | 2.1908 | 0.5143 | 0.0681 | − 3.5633 | 41.2913 | 40.5012 | 38 |
| 10 | − 9.0812 | 0.9662 | − 1.0834 | − 3.5633 | 41.4653 | 28.7036 | 37 |
| 11 | 7.1544 | 0.6492 | − 2.2042 | − 3.5633 | 41.5809 | 43.617 | 25 |
| 12 | 1.4365 | − 0.7369 | − 2.798 | − 3.5633 | 41.6835 | 36.0218 | 35 |
| Total | 454.563 | 493 | |||||
| Absolute percentage error (%) | 7.797 |
The number of HIV cases observed during 2018 and predicted values obtained from the BPNN model
| Month | Predicted value | Observed value |
|---|---|---|
| 1 | 42.2638 | 28 |
| 2 | 48.471 | 30 |
| 3 | 37.5714 | 46 |
| 4 | 42.3757 | 50 |
| 5 | 56.0465 | 66 |
| 6 | 47.1118 | 52 |
| 7 | 52.4077 | 45 |
| 8 | 43.8192 | 41 |
| 9 | 48.7503 | 38 |
| 10 | 49.2205 | 37 |
| 11 | 30.0177 | 25 |
| 12 | 48.1137 | 35 |
| Total | 546.1693 | 493 |
| Absolute percentage error (%) | 10.785 |
Fig. 3The sequence of HIV incidence from 2004 to 2017
Residual diagnostics for different SARIMA models
| Model | Standard error (SE) | Log-likelihood | (AIC) | Schwarz Bayesian criterion (BIC) |
|---|---|---|---|---|
| ARIMA (1, 1, 0) (0, 1, 0) 12 | 0.612 | − 133.790 | 271.579 | 277.505 |
| ARIMA (1, 1, 0) (0, 1, 1) 12 | 0.485 | − 106.333 | 218.666 | 227.555 |
| ARIMA (1, 1, 0) (0, 1, 2) 12 | 0.462 | − 105.583 | 219.166 | 231.017 |
| ARIMA (1, 1, 1) (0, 1, 0) 12 | 0.532 | − 113.014 | 232.028 | 240.916 |
| ARIMA (1, 1, 1) (0, 1, 1) 12 | 0.402 | − 86.042 | 180.085 | 191.936 |
| ARIMA (1, 1, 1) (0, 1, 2) 12 | 0.424 | − 86.563 | 183.125 | 197.939 |
| ARIMA (1, 1, 2) (0, 1, 0) 12 | 0.521 | − 109.498 | 226.997 | 238.848 |
| ARIMA (1, 1, 2) (0, 1, 1) 12 | 0.401 | − 83.908 | 177.816 | 192.630 |
| ARIMA (1, 1, 2) (0, 1, 2) 12 | 0.403 | − 83.842 | 179.685 | 197.462 |
Autocorrelations analysis results of residuals for SARIMA (1,1,2)( 0,1,1)12 model
| LAG | Autocorrelation | Standard error | Box-Ljung Statistic | ||
|---|---|---|---|---|---|
| Value | Degrees of freedom | P value | |||
| 1 | − 0.043 | 0.081 | 0.276 | 1.000 | 0.599 |
| 2 | − 0.034 | 0.081 | 0.454 | 2.000 | 0.797 |
| 3 | − 0.127 | 0.081 | 2.927 | 3.000 | 0.403 |
| 4 | 0.002 | 0.080 | 2.927 | 4.000 | 0.570 |
| 5 | 0.013 | 0.081 | 2.953 | 5.000 | 0.707 |
| 6 | 0.051 | 0.080 | 3.360 | 6.000 | 0.762 |
| 7 | − 0.099 | 0.080 | 4.902 | 7.000 | 0.672 |
| 8 | 0.106 | 0.080 | 6.655 | 8.000 | 0.574 |
| 9 | 0.031 | 0.080 | 6.803 | 9.000 | 0.658 |
| 10 | − 0.052 | 0.080 | 7.236 | 10.000 | 0.703 |
| 11 | 0.097 | 0.079 | 8.733 | 11.000 | 0.647 |
| 12 | 0.081 | 0.079 | 9.778 | 12.000 | 0.635 |
| 13 | − 0.013 | 0.079 | 9.805 | 13.000 | 0.710 |
| 14 | − 0.041 | 0.079 | 10.077 | 14.000 | 0.757 |
| 15 | 0.074 | 0.078 | 10.974 | 15.000 | 0.754 |
| 16 | − 0.062 | 0.078 | 11.608 | 16.000 | 0.771 |
Fig. 4Autocorrelation function (ACF) of residuals for SARIMA (1,1,2)( 0,1,1)12 model
Fig. 5Partial autocorrelation functions of (PACF) of residuals for SARIMA ((1,1,2) (0,1,1)12
Observed number of HIV cases in 2018 and predicted values obtained from SARIMA (1,1,2)(0,1,1)12 model
| Month | Observed value | Predicted value | Predicted value95%CI |
|---|---|---|---|
| 1 | 28 | 33.128 | 14.607 ~ 75.130 |
| 2 | 30 | 24.133 | 10.639 ~ 54.742 |
| 3 | 46 | 45.609 | 19.774 ~ 105.198 |
| 4 | 50 | 43.043 | 18.563 ~ 99.809 |
| 5 | 66 | 39.427 | 16.782 ~ 92.624 |
| 6 | 52 | 43.186 | 18.297 ~ 101.933 |
| 7 | 45 | 35.672 | 14.906 ~ 85.371 |
| 8 | 41 | 35.021 | 14.572 ~ 84.166 |
| 9 | 38 | 37.461 | 15.431 ~ 90.938 |
| 10 | 37 | 28.312 | 11.602 ~ 69.090 |
| 11 | 25 | 37.745 | 15.270 ~ 93.297 |
| 12 | 35 | 43.519 | 17.519 ~ 108.106 |
| Total | 493 | 446.256 | |
| Absolute percentage error (%) | 9.482 |
Fig. 6The sequence of HIV incidence rate from 1999 to 2016
Fig. 7Autocorrelation function (ACF) for HIV incidence rate series with differenced once at the non-seasonal level
Fig. 8Partial autocorrelation functions of (PACF) for HIV incidence rate series with differenced once at the non-seasonal level
Autocorrelations analysis results of residuals for ARIMA (3,1,0) model
| LAG | Autocorrelation | Standard error | Box-Ljung Statistic | ||
|---|---|---|---|---|---|
| Value | Degrees of freedom | P value | |||
| 1 | 0.135 | 0.217 | 0.384 | 1.000 | 0.536 |
| 2 | 0.084 | 0.211 | 0.543 | 2.000 | 0.762 |
| 3 | − 0.199 | 0.204 | 1.489 | 3.000 | 0.685 |
| 4 | 0.015 | 0.197 | 1.494 | 4.000 | 0.828 |
| 5 | − 0.180 | 0.190 | 2.395 | 5.000 | 0.792 |
| 6 | − 0.041 | 0.183 | 2.446 | 6.000 | 0.875 |
| 7 | − 0.147 | 0.175 | 3.150 | 7.000 | 0.871 |
| 8 | − 0.129 | 0.167 | 3.749 | 8.000 | 0.879 |
| 9 | 0.030 | 0.158 | 3.785 | 9.000 | 0.925 |
| 10 | − 0.009 | 0.149 | 3.788 | 10.000 | 0.956 |
| 11 | − 0.110 | 0.139 | 4.415 | 11.000 | 0.956 |
| 12 | − 0.114 | 0.129 | 5.193 | 12.000 | 0.951 |
| 13 | 0.039 | 0.118 | 5.302 | 13.000 | 0.968 |
| 14 | − 0.009 | 0.105 | 5.309 | 14.000 | 0.981 |
| 15 | 0.067 | 0.091 | 5.848 | 15.000 | 0.982 |
| 16 | 0.054 | 0.075 | 6.365 | 16.000 | 0.984 |