| Literature DB >> 32565882 |
Zlatan Car1, Sandi Baressi Šegota1, Nikola Anđelić1, Ivan Lorencin1, Vedran Mrzljak1.
Abstract
Coronavirus (COVID-19) is a highly infectious disease that has captured the attention of the worldwide public. Modeling of such diseases can be extremely important in the prediction of their impact. While classic, statistical, modeling can provide satisfactory models, it can also fail to comprehend the intricacies contained within the data. In this paper, authors use a publicly available dataset, containing information on infected, recovered, and deceased patients in 406 locations over 51 days (22nd January 2020 to 12th March 2020). This dataset, intended to be a time-series dataset, is transformed into a regression dataset and used in training a multilayer perceptron (MLP) artificial neural network (ANN). The aim of training is to achieve a worldwide model of the maximal number of patients across all locations in each time unit. Hyperparameters of the MLP are varied using a grid search algorithm, with a total of 5376 hyperparameter combinations. Using those combinations, a total of 48384 ANNs are trained (16128 for each patient group-deceased, recovered, and infected), and each model is evaluated using the coefficient of determination (R2). Cross-validation is performed using K-fold algorithm with 5-folds. Best models achieved consists of 4 hidden layers with 4 neurons in each of those layers, and use a ReLU activation function, with R2 scores of 0.98599 for confirmed, 0.99429 for deceased, and 0.97941 for recovered patient models. When cross-validation is performed, these scores drop to 0.94 for confirmed, 0.781 for recovered, and 0.986 for deceased patient models, showing high robustness of the deceased patient model, good robustness for confirmed, and low robustness for recovered patient model.Entities:
Mesh:
Year: 2020 PMID: 32565882 PMCID: PMC7260624 DOI: 10.1155/2020/5714714
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1The process of AI modeling is shown. First, the data is collected and placed into a dataset. Part of that data is used for training and testing the various MLP hyperparameter combinations, in an attempt to find the best possible architecture. The most successful model can then be used to determine the future instances.
Figure 2Overview of geographical distributions for number of patients infected with COVID-19 at 15th of February 2020 (a), 1st of March 2020 (b), and 12th of March 2020 (c).
Hyperparameters used in training. First column lists the hyperparameter name, while the possible values of the hyperparameter are listed in the second column. The last column presents the number of hyperparameters, with the last row showing the total number of hyperparameter combinations, obtained and used during the grid search algorithm execution.
| Hyperparameter | Possible values | Count |
|---|---|---|
| Solver | Adam, LBFGS | 2 |
| Initial learning rate | 0.00001, 0.01, 0.1, 0.5 | 4 |
| Learning rate adjustment | Constant, adaptive, invscaling | 3 |
| Hidden layer sizes | (3), (6), (4, 4), (3, 3, 3), (6, 6, 6), (4, 3, 4), (12, 12, 12), (4, 4, 3, 3), (4, 4, 4, 4), (6, 6, 6, 6), (10, 5, 5, 10), (3, 3, 3, 3, 3), (10, 10, 10, 10, 10), (12, 12, 6, 6, 3, 3) | 14 |
| Activation functions | ReLU, identity, logistic, tanh | 4 |
| Regularization parameter | 0.00001, 0.001, 0.01, 0.1 | 4 |
| Total number of hyperparameter combinations | 5376 | |
Hyperparameters of MLPs for best models achieved. Each column is one of the models—predicting the number of infected, recovered, and deceased patients. Hyperparameters that resulted in the best model are shown in rows.
| Hyperparameter | Infected model | Recovered model | Deceased model |
|---|---|---|---|
| Solver | LBFGS | LBFGS | LBFGS |
| Initial learning rate | 0.1 | 0.5 | 0.01 |
| Learning rate adjustment | Constant | Constant | Adaptive |
| Hidden layer tuple | (4, 4, 4, 4) | (4, 4, 4, 4) | (4, 4, 4, 4) |
| Activation function | ReLU | ReLU | ReLU |
| L2 regularization parameter | 0.0001 | 0.001 | 0.01 |
Figure 3Comparison of real and modeled data. Comparison of the number of cases for each input into the dataset are shown for infected (a), deceased (c), and recovered (e) patients, while the trend of the data and model through the days analyzed are shown for the infected (b), deceased (d), and recovered (f) patients.
The results of k-fold cross-validation, (k = 5). Average scores for each goal and the standard deviation are shown.
| Goal | Average |
|
|---|---|---|
| Confirmed | 0.94 | 0.037 |
| Recovered | 0.781 | 0.072 |
| Deceased | 0.986 | 0.021 |
Training times in minutes for each goal, using 5-fold cross-validation and grid search of 5376 items. Training time measured using 16 48-thread HPC nodes. Average training time across all goals is shown in the bottom.
| Goal | Training time (min) |
|---|---|
| Confirmed | 2428 |
| Recovered | 2436 |
| Deceased | 2209 |
| Average | 2357.67 |