| Literature DB >> 35899849 |
Xiaoqing Cheng1,2, Wendong Liu1, Xuefeng Zhang1, Minghao Wang3, Changjun Bao1, Tianxing Wu3.
Abstract
Hepatitis E is an increasingly serious worldwide public health problem that has attracted extensive attention. It is necessary to accurately predict the incidence of hepatitis E to better plan ahead for future medical care. In this study, we developed a Bi-LSTM model that incorporated meteorological factors to predict the prevalence of hepatitis E. The hepatitis E data used in this study are collected from January 2005 to March 2017 by Jiangsu Provincial Center for Disease Control and Prevention. ARIMA, GBDT, SVM, LSTM and Bi-LSTM models are adopted in this study. The data from January 2009 to September 2014 are used as the training set to fit models, and data from October 2014 to March 2017 are used as the testing set to evaluate the predicting accuracy of different models. Selecting models and evaluating the effectiveness of the models are based on mean absolute per cent error (MAPE), root mean square error (RMSE) and mean absolute error (MAE). A total of 44 923 cases of hepatitis E are detected in Jiangsu Province from January 2005 to March 2017. The average monthly incidence rate is 0.35 per 100 000 persons in Jiangsu Province. Incorporating meteorological factors of temperature, water vapour pressure, and rainfall as a combination into the Bi-LSTM Model achieved the state-of-the-art performance in predicting the monthly incidence of hepatitis E, in which RMSE is 0.044, MAPE is 11.88%, and MAE is 0.0377. The Bi-LSTM model with the meteorological factors of temperature, water vapour pressure, and rainfall can fully extract the linear and non-linear information in the hepatitis E incidence data, and has significantly improved the interpretability, learning ability, generalisability and prediction accuracy.Entities:
Keywords: Forecast; hepatitis E; mathematical model
Mesh:
Year: 2022 PMID: 35899849 PMCID: PMC9386790 DOI: 10.1017/S0950268822001303
Source DB: PubMed Journal: Epidemiol Infect ISSN: 0950-2688 Impact factor: 4.434
Fig. 1.The structure of our proposed model.
Fig. 2.The structure of a LSTM cell.
Fig. 3.The incidence of hepatitis E in Jiangsu Province from 01.2005 to 03.2017.
Results of five models for monthly incidence of hepatitis E prediction
| RMSE | MAPE (%) | MAE | |
|---|---|---|---|
| ARIMA | 0.139 | 48.23 | 0.130 |
| SVM | 0.074 | 19.46 | 0.061 |
| GBDT | 0.097 | 23.32 | 0.078 |
| LSTM | 0.060 | 19.27 | 0.052 |
| Bi-LSTM | 0.054 | 15.78 | 0.044 |
Combinations of meteorological factors, ascending by RMSE
| combinations of meteorological factors | RMSE |
|---|---|
| Temperature + Water Vapour Pressure + Rainfall | 0.044 |
| Water Vapour Pressure | 0.049 |
| Temperature + Water Vapour Pressure + Wind Speed | 0.049 |
| Atmosphere + Water Vapour Pressure + Humidity + Rainfall | 0.049 |
| Atmosphere + Water Vapour Pressure | 0.049 |
| Temperature + Water Vapour Pressure | 0.049 |
| Water Vapour Pressure + Rainfall | 0.049 |
| Temperature + Atmosphere + Water Vapour Pressure + Humidity | 0.050 |
| Temperature | 0.050 |
| Water Vapour Pressure + Wind Speed | 0.050 |
| Atmosphere + Humidity | 0.051 |
| Atmosphere + Water Vapour Pressure + Wind Speed + Rainfall | 0.051 |
| Wind Speed + Humidity + Rainfall | 0.051 |
| Atmosphere + Water Vapour Pressure + Rainfall | 0.052 |
| Temperature + Atmosphere | 0.052 |
Results of six models for monthly incidence of hepatitis E prediction
| RMSE | MAPE (%) | MAE | |
|---|---|---|---|
| ARIMA | 0.139 | 48.23 | 0.130 |
| SVM | 0.074 | 19.46 | 0.061 |
| GBDT | 0.097 | 23.32 | 0.078 |
| LSTM | 0.060 | 19.27 | 0.052 |
| Bi-LSTM | 0.054 | 15.78 | 0.044 |
| Bi-LSTM + Best Climate | 0.044 | 11.88 | 0.0377 |
Fig. 4.Plot of observed monthly incidence of hepatitis E and predicted values via different models.
Fig. 5.Predictive Intervals of (a) ARIMA model (b) LSTM model (c) BiLSTM model (d) BiLSTM model with best meteorological factors.