| Literature DB >> 25123979 |
Michael J Kane1, Natalie Price, Matthew Scotch, Peter Rabinowitz.
Abstract
BACKGROUND: Time series models can play an important role in disease prediction. Incidence data can be used to predict the future occurrence of disease events. Developments in modeling approaches provide an opportunity to compare different time series models for predictive power.Entities:
Mesh:
Year: 2014 PMID: 25123979 PMCID: PMC4152592 DOI: 10.1186/1471-2105-15-276
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Summary of the retrospective ARIMA model
| Autoregressive 1 | Moving average 1 | Moving average 2 | Intercept | |
|---|---|---|---|---|
| Coefficient estimate | 0.8367 | -0.2841 | -0.2469 | 6.2052 |
| Standard error | 0.0700 | 0.0915 | 0.0647 | 1.5203 |
| p-value | 0.0014 | 0.0054 | 0.0044 | 0.0041 |
Figure 1The retrospective predictions. The retrospective predictions for the ARIMA model and the Random Forest model along with the actual outbreak counts.
Figure 2The retrospective residuals. The retrospective residuals for the ARIMA and Random Forest model.
Retrospective random forest variable importance
| Variable name | Percent increase in MSE |
|---|---|
| Outbreak Lag 1 | 0.9132981 |
| Outbreak lag 2 | 3.753933 |
| Outbreak lag 3 | 10.2857484 |
| Temp lag 1 | 3.6919581 |
| Temp Lag 2 | 3.5478696 |
| Temp lag 3 | 4.3230635 |
| Humidity lag 1 | 3.6816384 |
| Humidity lag 2 | 6.4406314 |
| Humidity lag 3 | 3.5846449 |
Figure 3The simulated prospective predictions. The simulated prospective predictions for the ARIMA model and the Random Forest model along with the actual outbreak counts.
Figure 4The simulated prospective residuals. The simulated prospective residuals for the ARIMA and Random Forest model.
Comparing the MSE of the models
| Retrospective | Prospective | |
|---|---|---|
| ARIMA | 26.9597 | 28.7412 |
| Random Forest | 6.3195 | 24.8101 |
The confusion matrix under the null
| Predicted | |||
|---|---|---|---|
| Up | Down | ||
|
| Up | 0.3685 | 0.2222 |
| Down | 0.2553 | 0.154 | |
Simulated prospective Random Forest confusion matrix
| Predicted | |||
|---|---|---|---|
| Up | Down | ||
|
| Up | 0.5083 | 0.0825 |
| Down | 0.1155 | 0.2937 | |
Figure 5The actual and predicted changes for the Random Forest model. The actual and predicted changes in outbreak for each day of the simulated prospective study using the Random Forest model.
Figure 6Quantile plot of the residuals for the simulated prospective analysis using Random Forests. The normal quantile plot of the differences between the predicted changes and the actual changes using the Random Forest model.