| Literature DB >> 35937210 |
Tianyu Feng1, Zhou Zheng1, Jiaying Xu1, Minghui Liu1, Ming Li1, Huanhuan Jia1, Xihe Yu1.
Abstract
Objective: This cross-sectional research aims to develop reliable predictive short-term prediction models to predict the number of RTIs in Northeast China through comparative studies. Methodology: Seasonal auto-regressive integrated moving average (SARIMA), Long Short-Term Memory (LSTM), and Facebook Prophet (Prophet) models were used for time series prediction of the number of RTIs inpatients. The three models were trained using data from 2015 to 2019, and their prediction accuracy was compared using data from 2020 as a test set. The parameters of the SARIMA model were determined using the autocorrelation function (ACF) and the partial autocorrelation function (PACF). The LSTM uses linear as the activation function, the mean square error (MSE) as the loss function and the Adam optimizer to construct the model, while the Prophet model is built on the Python platform. The root mean squared error (RMSE), mean absolute error (MAE) and Mean Absolute Percentage Error (MAPE) are used to measure the predictive performance of the model. Findings: In this research, the LSTM model had the highest prediction accuracy, followed by the Prophet model, and the SARIMA model had the lowest prediction accuracy. The trend in medical expenditure of RTIs inpatients overlapped highly with the number of RTIs inpatients.Entities:
Keywords: comparative study; machine learning; predictive models; road traffic injuries; time series analysis
Mesh:
Year: 2022 PMID: 35937210 PMCID: PMC9354624 DOI: 10.3389/fpubh.2022.946563
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Figure 1Architecture of the RTIs predictive model.
Figure 2The LSTM cell consists of an input gate, an output gate and an oblivion gate. A and B are activation functions.
Figure 3Number and healthcare expenditure of RTIs inpatients for the period 2015–2020.
The healthcare expenditure arising from RTIs in Jilin Province from 2015 to 2020.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Maximum | 935,866.8 | 847,102.3 | 868,335.4 | 916,696.8 | 566,297.9 | 912,172 |
| Minimum | 33.27 | 50.5 | 49.34 | 19 | 7.5 | 6 |
| Mean | 29,908.97 | 50,232.74 | 42,422.14 | 36,734.02 | 34,335.39 | 43,454.7 |
| Q1 | 4,858.538 | 7,075.955 | 6,847.453 | 5,856.933 | 5,978.21 | 7,955.33 |
| Median | 13,152 | 27,712.31 | 22,004.19 | 18,998.58 | 18,013.31 | 19,147.01 |
| Q3 | 39,305.65 | 64,587.82 | 52,659.97 | 40,473.39 | 49,811.57 | 57,307.12 |
Figure 4ACF and PACF images of the SARIMA (1,1,0), (2,1,3)12 model.
Figure 5Residual analysis of the SARIMA (1,1,0),(2,1,3)12 model.
Prophet and LSTM parameters and their values.
|
|
|
|
|---|---|---|
| LSTM | Layers | 3 |
| No. of neurons | {16,32,64} | |
| Learning rate | 0.01 | |
| Dropout | 0.3 | |
| Optimizer | Adam | |
| Batch size | 3 | |
| Maximum Epochs | 1,000 | |
| Activation Function | Linear | |
| Prophet | Growth | Logistic |
| Changepoint Range | 0.8 | |
| Holidays | CN | |
| Changepoint Prior Scale | 0.05 |
Figure 6Loss function for LSTM models.
LSTM, Prophet, and SARIMA prediction effect evaluation parameters.
|
|
|
| |
|---|---|---|---|
|
| |||
| RMSE | 46.12 | 143.58 | 208.95 |
| MAE | 39.93 | 121.2717 | 191.48 |
| MAPE | 20.26% | 71.04% | 92.82% |
Actual values vs. predicted values from the three models.
|
|
|
|
|
|
|---|---|---|---|---|
| January | 259 | 225.29 | 379.84 | 408.59 |
| February | 64 | 148.88 | 349.81 | 361.90 |
| March | 276 | 223.70 | 388.75 | 363.91 |
| April | 287 | 267.50 | 407.86 | 528.79 |
| May | 268 | 299.20 | 486.73 | 602.42 |
| June | 332 | 379.61 | 503.07 | 601.83 |
| July | 487 | 448.03 | 562.20 | 663.29 |
| August | 551 | 504.08 | 560.51 | 668.26 |
| September | 501 | 510.49 | 564.63 | 658.03 |
| October | 571 | 490.83 | 543.43 | 621.43 |
| November | 377 | 359.91 | 451.45 | 541.87 |
| December | 197 | 214.41 | 371.83 | 447.52 |
Figure 7Visualizations of predicted and actual values for SARIMA, Prophet, and LSTM models.