| Literature DB >> 36164599 |
Abstract
Background: Tuberculosis (TB) remained one of the world's most deadly chronic communicable diseases. Future TB incidence prediction is a benefit for intervention options and resource-allocation planning. We aimed to develop rapid univariate prediction models for epidemics forecasting employment.Entities:
Keywords: SARIMA; SARIMA-ETS; TB incidence
Year: 2022 PMID: 36164599 PMCID: PMC9508881 DOI: 10.7717/peerj.13117
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 3.061
Hyperparameters used in themachine learning models.
| No | Series | Model | Estimated parameter | AICc/Accuracy | Residuals check (Ljung-Box test dat) |
|---|---|---|---|---|---|
| 1 | Training set of TB incidence (150 observations) | ARIMA(1,0,0)(2,1,0) by R, package: forecast, function: auto.arima | Coefficients: | AIC = −335.62, | Residuals from ARIMA(1,0,0)(2,1,0) with drift, Q* = 62.283, df = 20, |
| 2 | Simulation set of TB incidence (192 observations) | ARIMA(3,0,0)(2,1,0) by R, package: forecast, function: auto.arima | Coefficients: | AIC = −425.79 | Residuals from ARIMA(3,0,0)(2,1,0) with drift |
| 3 | Training set of TB incidence (150 observations) | ETS(A,A,A) | Smoothing parameters: alpha = 0.0102, beta = 0.0101, | AIC = −68.44, | Residuals from ETS(A,A,A); Q* = 61.132, df = 8, |
| 4 | Simulation set of TB incidence (192 observations) | ETS(A,A,A), | ETS(A,A,A) Call: ets (y = M) Smoothing parameters: alpha = 0.0738, | AIC = −25.56 | Residuals from ETS(A,A,A); Q* = 51.531, df = 8, |
| 5 | Training set of TB incidence (150 observations) | ARIMA-ETS | Hybrid forecast model comprised of the following models: arima with weight 0.5, ETS with weight 0.5 | RMSE = 0.0585 | Could not find appropriate degrees of freedom for this model |
| 6 | Simulation set of TB incidence (192 observations) | ARIMA-ETS | Hybrid forecast model comprised of the following models: arima with weight 0.5, ETS with weight 0.5 | RMSE = 0.0512 | Could not find appropriate degrees of freedom for this model |
Figure 1Tuberculosis incidence time series analysis of trend and seasonality.
(A) Tuberculosis incidence with a steady declining trend from 2005–2019. (B) Box-plotting to present a seasonality with two peaks occurring in March and May as well as a trough in November to the next February. (C) STL Decomposition of the time series.
Tuberculosis incidence forecasting until 2025 using the SARIMA and SARIMA-ETS-hybrid models.
| Ahead time | Actual-value (cases per 105-month | Models | Prediction (cases per 105-month) | 95% CI of PV | RMSE | MAE | MAPE | MASE |
|---|---|---|---|---|---|---|---|---|
| 3 months | 3.55 | SARIMA(1,0,0)(2,1,0)12 | 3.43 | [2.77–4.14] | 0.086 | 0.058 | 0.575 | 0.753 |
| ETS | 3.49 | [2.93–4.14] | 0.077 | 0.058 | 0.569 | 0.746 | ||
| Hybrid | 3.39 | [2.77–4.13] | 0.081 | 0.056 | 0.554 | 0.726 | ||
| 6 months | 3.3 | SARIMA | 3.34 | [2.76–4.15] | 0.069 | 0.047 | 0.459 | 0.606 |
| ETS | 3.29 | [2.76–3.97] | 0.060 | 0.046 | 0.449 | 0.593 | ||
| Hybrid | 3.4 | [2.79–4.15] | 0.064 | 0.045 | 0.445 | 0.587 | ||
| 12 months | 3.15 | SARIMA | 3.58 | [2.9–4.51] | 0.088 | 0.062 | 0.598 | 0.606 |
| ETS | 3.47 | [2.9–4.14] | 0.079 | 0.061 | 0.586 | 0.785 | ||
| Hybrid | 3.69 | [3.03–4.51] | 0.082 | 0.060 | 0.578 | 0.776 | ||
| 18 months | 3.15 | SARIMA | 3.16 | [2.55–4.0] | 0.090 | 0.069 | 0.668 | 0.898 |
| ETS | 3.09 | [2.55–3.73] | 0.080 | 0.065 | 0.627 | 0.849 | ||
| Hybrid | 3.23 | [2.62–4.0] | 0.084 | 0.066 | 0.639 | 0.859 | ||
| 24 months | 3 | SARIMA | 3.4 | [2.64–4.4] | 0.092 | 0.074 | 0.714 | 0.960 |
| ETS | 3.26 | [2.64–4.02] | 0.079 | 0.065 | 0.627 | 0.812 | ||
| Hybrid | 3.41 | [2.64–4.4] | 0.084 | 0.068 | 0.655 | 0.881 | ||
| 30 months | 3 | SARIMA | 2.89 | [2.38–3.51] | 0.091 | 0.073 | 0.699 | 0.942 |
| ETS | 2.90 | [2.28–3.69] | 0.079 | 0.064 | 0.619 | 0.833 | ||
| Hybrid | 2.98 | [2.28–3.9] | 0.084 | 0.067 | 0.646 | 0.870 |
Notes:
Evaluation of the models’ accuracy of the SARIMA. ETS and SARIMA-ETS hybrid in forecasting underlying trend (pseudo out-of-sample) performance.
Prediction: TB incidence (cases per 100,000-month); Hybrid: SARIMA-ETS-hybrid.
95% CI of PV: 95% confidence interval of predictive value for TB incidence.
Scheme 1Long-term predictions for tuberculosis incidence rates until 2030.
(A) SARIMA model, (B) ETS model, (C) SARIMA-ETS hybrid model.
The reductions in TB incidence predicted by SARIMA-ETS and SARIMA models.
| ARIMA | ETS | ARIMA-ETS-hybrid | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Year (reduction*) | TB incidence: TB cases per 100,000-year (*) | TB incidence: TB cases per 100,000-year (*) | TB incidence: TB cases per 100,000-year (*) | ||||||
| Forecast value | Lo. 99.5% | Hi. 99.5% | Forecast value | Lo. 99.5% | Hi. 99.5% | Forecast value | Lo. 99.5% | Hi. 99.5% | |
| 2021 | 32.380 | 26.317 | 39.839 | 32.471 | 27.016 | 39.028 | 32.495 | 27.589 | 38.150 |
| 2022 | 31.052 | 24.818 | 38.853 | 30.888 | 25.547 | 37.345 | 30.762 | 25.618 | 36.769 |
| 2023 | 29.261 | 23.008 | 37.214 | 29.382 | 24.158 | 35.735 | 29.331 | 23.790 | 36.121 |
| 2024 | 27.955 | 21.298 | 36.695 | 27.949 | 22.844 | 34.195 | 27.875 | 22.102 | 35.054 |
| 2025 | 26.646 (41.69%) | 19.933 (56.38%) | 35.620 (22.06%) | 26.586 (41.82%) | 21.601 (52.73%) | 32.721 (28.40%) | 26.531 (41.95%) | 20.561 (55.01%) | 34.138 (25.30%) |
| 2030 | 20.803 (54.48%) | 14.286 (68.74%) | 30.294 (33.71%) | 20.706 (54.69%) | 16.328 (64.27%) | 26.258 (42.54%) | 20.671 (54.77%) | 14.636 (67.97%) | 29.138 (36.24%) |
Note:
*Reduction = (Final incidence rate−incidence rate in 2015)/incidence rate in 2015. WHO milestone and targets for TB incidence rate reduction in 2020, 2025, 2030 and 2035 by 20%, 50%, 80% compared with the 2015 (Int J Tuberc Lung Dis. 2018 Jul; 22(7): 723–730.).