| Literature DB >> 33424432 |
Tao Han1, Francisco Nauber Bernardo Gois2, Ramsés Oliveira2, Luan Rocha Prates2, Magda Moura de Almeida Porto2.
Abstract
The COVID-19 pandemic continues to have a destructive effect on the health and well-being of the global population. A vital step in the battle against it is the successful screening of infected patients, together with one of the effective screening methods being radiology examination using chest radiography. Recognition of epidemic growth patterns across temporal and social factors can improve our capability to create epidemic transmission designs, including the critical job of predicting the estimated intensity of the outbreak morbidity or mortality impact at the end. The study's primary motivation is to be able to estimate with a certain level of accuracy the number of deaths due to COVID-19, managing to model the progression of the pandemic. Predicting the number of possible deaths from COVID-19 can provide governments and decision-makers with indicators for purchasing respirators and pandemic prevention policies. Thus, this work presents itself as an essential contribution to combating the pandemic. Kalman Filter is a widely used method for tracking and navigation and filtering and time series. Designing and tuning machine learning methods are a labor- and time-intensive task that requires extensive experience. The field of automated machine learning Auto Machine Learning relies on automating this task. Auto Machine Learning tools enable novice users to create useful machine learning units, while experts can use them to free up valuable time for other tasks. This paper presents an objective method of forecasting the COVID-19 outbreak using Kalman Filter and Auto Machine Learning. We use a COVID-19 dataset of Ceará, one of the 27 federative units in Brazil. Ceará has more than 235,222 confirmed cases of COVID-19 and 8850 deaths due to the disease. The TPOT automobile model showed the best result with a 0.99 of R 2 score.Entities:
Keywords: AutoML; COVID-19; Forecast; Kalman Filter
Year: 2021 PMID: 33424432 PMCID: PMC7783486 DOI: 10.1007/s00500-020-05503-5
Source DB: PubMed Journal: Soft comput ISSN: 1432-7643 Impact factor: 3.732
Fig. 1COVID-19 cases curve in the state of Ceará
Fig. 2COVID-19 cases curve in Fortaleza, capital of Ceará
Fig. 3Proposed use of Kalman Filter with hybrid database
Method errors to short term experiments
| Method | MAE | RMSE | |
|---|---|---|---|
| KF + SEIR + CE | 216.65 | 245.89 | 0.983 |
| Kalman Filter | 342.83 | 388.52 | 0.959 |
| KF + SEIR | 517.85 | 758.68 | 0.844 |
| H2O | 5.35 | 71.53 | 0.96 |
| TPOT | 1.35 | 11.38 | 0.99 |
Fig. 4Kalman Filter result short term Ceará
Fig. 5Kalman Filter predictions with data for 5, 10, or 15 days
Fig. 6Prediction for the COVID-19 death rate curve in the state of Ceará one month before the curve plateau
H2O AutoML Results for Ceará COVID-19 with H2O AutoML
| Name of model | Mean_ residual_ deviance | rmse | mse | mae | rmsle |
|---|---|---|---|---|---|
| GLM_1_AutoML_ 20200923 _163324 | 71.90 | 8.47 | 71.9087 | 5.61 | 0.52 |
| StackedEnsemble_ BestOfFamily _AutoML_20200923 _163324 | 75.21 | 8.67 | 75.21 | 5.65 | 0.47 |
| StackedEnsemble_ AllModels_ AutoML_20200923 _163324 | 75.701 | 8.70 | 75.70 | 5.67 | 0.47 |
| GBM_3_AutoML_ 20200923_163324 | 91.67 | 9.57 | 91.67 | 5.85 | 0.31 |
| DRF_1_AutoML_ 20200923_163324 | 92.31 | 9.60 | 92.3166 | 5.88 | 0.30 |
| GBM_1_AutoML_ 20200923_163324 | 94.72 | 9.73 | 94.72 | 5.99 | 0.30 |
| GBM_2_AutoML_ 20200923_163324 | 101.57 | 10.07 | 101.57 | 6.17 | 0.32 |
Fig. 7Prediction for the COVID-19 death rate curve in the state of Ceará with H2O.ai
Fig. 8Prediction for the COVID-19 death rate curve in the state of Ceará with TPOT AutoML
TPOT AutoML Results for Ceará COVID-19 deaths data set
| Generation 1—Current best internal CV score: − 4.615450248020459 | |
| Generation 2—Current best internal CV score: − 4.615450248020459 | |
| Generation 3—Current best internal CV score: − 4.615450248020459 | |
| Generation 4—Current best internal CV score: − 4.452279209324271 | |
| Generation 5—Current best internal CV score: − 3.961737356996695 | |
| Best pipeline: KNeighborsRegressor(MaxAbsScaler (PolynomialFeatures(input_matrix, degree = 2, include_bias = False, interaction_only = False)), n_neighbors = 60, p = 2, weights = distance) |
Method errors to long term predictions
| Method | MAE | RMSE | R2 |
|---|---|---|---|
| TPOT | 1.35 | 11.38 | 0.99 |
| KF + SEIR + CE | 216.65 | 245.89 | 0.983 |
| Prophet | 11,825.02 | 16,070.89 | 0.275 |
| Holt Winters | 9158.26 | 21,149.54 | 0.007 |
| SEIR | 564.79 | 723.29 | 0.858 |