| Literature DB >> 36249903 |
Abstract
The last two years have been marked by the emergence of Coronavirus. The pandemic has spread in most countries, causing substantial changes all over the world. Many studies sought to analyze phenomena related to the pandemic from different perspectives. This study analyzes data from the governorates of the Kingdom of Saudi Arabia (the KSA), proposing a broad analysis that addresses three different research objectives. The first is to identify the main factors affecting the variations between KSA governorates in the cumulative number of COVID-19 infections. The study uses principal component regression. Results highlight the significant positive effects of the number of schools in each governorate, and classroom density within each school on the number of infections in the KSA. The second aim of this study is to use the number of COVID-19 infections, in addition to its significant predictors, to classify KSA governorates using the K-mean cluster method. Findings show that all KSA governorates can be grouped into two clusters. The first cluster includes 31 governorates that can be considered at greater risk of Covid infections as they have higher values in all the significant determinants of Covid infections. The last objective is to compare between traditional statistical methods and artificial intelligence techniques in predicting the future number of COVID-19 infections, with the aim of determining the method that provides the highest accuracy. Results also show that multilayer perceptron neural network outperforms others in forecasting the future number of COVID-19. Finally, the future number of infections for each cluster is predicted using multilayer perceptron neural network method.Entities:
Keywords: Health humanities; Medical humanities
Year: 2022 PMID: 36249903 PMCID: PMC9540145 DOI: 10.1057/s41599-022-01208-2
Source DB: PubMed Journal: Humanit Soc Sci Commun ISSN: 2662-9992
Fig. 1KSA regions classified by number of COVID-19 infections.
Descriptive statistics of the study variables.
| Variable | Scale | Code/Label | Mean | Standard deviation | Minimum | Maximum |
|---|---|---|---|---|---|---|
| Population size | Governorate | POPULATION | 196,364 | 572,128 | 6798 | 5,236,901 |
| Number of houses | Governorate | HOUSES | 255,471 | 263,336 | 3019 | 909,228 |
| Number of undergraduate male schools | Governorate | MSCHOOLS | 84 | 106 | 8 | 859 |
| Number of undergraduate female schools | Governorate | FSCHOOLS | 87 | 110 | 3 | 874 |
| Average of classroom intensity in male schools | Governorate | MAV INTENSITY | 25.23 | 6.68 | 12.73 | 42.73 |
| Average of classroom intensity in female schools | Governorate | FAV INTENSITY | 24.20 | 7.19 | 10.10 | 43.47 |
| Number of employees | region | EMPLOYEES | 744,361 | 1,110,039 | 64,847 | 3,887,768 |
| Percentage of people 65+ years old | region | OLD | 7.06 | 5.99 | 1.30 | 24.50 |
| Average of monthly salary for employees | region | AV SALARY | 14,849 | 2967 | 12,421 | 21,189 |
| Percentage of people covered by health insurance | region | INSURANCE | 27.61 | 10.50 | 16.50 | 49.24 |
| Number of hospitals | region | HOSPITALS | 3 | 7 | 1 | 54 |
| Number of beds | region | BEDS | 511 | 1479 | 50 | 14,110 |
| Number of nurses | region | NURSES | 935 | 3087 | 25 | 30,719 |
| Number of doctors | region | DOCTORS | 384 | 1283 | 18 | 12,386 |
| Number of healthy centers | region | CENTERS | 149 | 110 | 43 | 415 |
Means of the study variables in KSA regions.
| • | |
| • *OLD, INSURANCE and AV SALARY in each region are estimated by the average of all governorates in that region. |
Estimated principal component regression coefficients of the study regression model.
| Variable | Estimated coefficient | Standardized coefficient | ||
|---|---|---|---|---|
| 0.16*** | 0.13 | 9.12 | <0.00001 | |
| 0.19*** | 0.15 | 2.45 | 0.02 | |
| 0.19*** | 0.12 | 3.68 | 0.0009 | |
| 0.18*** | 0.12 | 3.40 | 0.0002 | |
| 0.65*** | 0.13 | 2.43 | 0.02 | |
| 0.48*** | 0.10 | 2.84 | 0.008 | |
| −0.02 | −0.02 | −0.57 | 0.58 | |
| 0.01 | 0.02 | 0.80 | 0.43 | |
| −0.58*** | −0.14 | −2.47 | 0.019 | |
| 0.14*** | 0.09 | 2.86 | 0.007 | |
| 0.11*** | 0.10 | 4.11 | 0.0003 | |
| 0.09*** | 0.10 | 4.81 | 0.00004 | |
| 0.10*** | 0.10 | 5.11 | 0.00002 |
***Significant at 99% confidence interval.
Fig. 2Scree diagram chart for defining number of clusters using elbow rule.
Fig. 3Silhouette analysis for optimal number of clusters.
Fig. 4Number and percentage of KSA governorates in each cluster.
Fig. 5Means of the cluster variables for each cluster.
Cluster centers for K-mean cluster.
| Variable | Cluster | |
|---|---|---|
| 1 | 2 | |
| 1.31 | −0.57 | |
| 1.53 | 0.51 | |
| 0.13 | −0.04 | |
| 1.06 | −0.32 | |
| 1.52 | −0.45 | |
| 1.52 | −0.45 | |
| 1.54 | −0.46 | |
| 1.29 | −0.39 | |
| 1.30 | −0.39 | |
| 0.84 | −0.39 | |
| 0.88 | −0.39 | |
| −0.04 | 0.01 | |
Estimated parameters for the forecasted models.
| Model | SARIMA | SES | MLP | LSTM |
|---|---|---|---|---|
| Cluster one | ARIMA(0,1,6)(0,0,1) | Winters’ Additive: alpha: | Type of training: Optimization algorithm: Max Epoch: Number of hidden layers: number of units: Activation function: | Activation function: Learn Rate Schedule: Learn rate: Gradient Threshold: Hidden layer : |
| Cluster two | ARIMA(3,1,0)(1,0,1) | Holt: alpha: |
Forecasting errors measures for the four compared methodsis.
| ● Arrows |
Fig. 6Forecasting the daily number of Covid infections in KSA governorates clusters.
a Observed and expected number of daily Covid infections for cluster one. b Observed and expected number of daily Covid infections for cluster two.