| Literature DB >> 34253768 |
Siawpeng Er1, Shihao Yang2, Tuo Zhao3.
Abstract
The global spread of COVID-19, the disease caused by the novel coronavirus SARS-CoV-2, has casted a significant threat to mankind. As the COVID-19 situation continues to evolve, predicting localized disease severity is crucial for advanced resource allocation. This paper proposes a method named COURAGE (COUnty aggRegation mixup AuGmEntation) to generate a short-term prediction of 2-week-ahead COVID-19 related deaths for each county in the United States, leveraging modern deep learning techniques. Specifically, our method adopts a self-attention model from Natural Language Processing, known as the transformer model, to capture both short-term and long-term dependencies within the time series while enjoying computational efficiency. Our model solely utilizes publicly available information for COVID-19 related confirmed cases, deaths, community mobility trends and demographic information, and can produce state-level predictions as an aggregation of the corresponding county-level predictions. Our numerical experiments demonstrate that our model achieves the state-of-the-art performance among the publicly available benchmark models.Entities:
Mesh:
Year: 2021 PMID: 34253768 PMCID: PMC8275764 DOI: 10.1038/s41598-021-93545-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Prediction of county-level and state-level weekly total number of deaths.
| Training size | Method | County level | State level | ||
|---|---|---|---|---|---|
| Week 1 | Week 2 | Week 1 | Week 2 | ||
| 0.5 (2020-03-07 to 2020-08-22) | Naive | 2.0000 | 2.2765 | 51.5932 | 71.8391 |
| State | – | – | 64.1419 | 81.9421 | |
| County | 1.9205 | 2.2808 | 48.0625 | 67.1583 | |
| Mixup | 1.9227 | 2.2442 | 51.4493 | 63.4738 | |
| COURAGE | 1.8691 | 2.1602 | 47.7390 | 60.8701 | |
| 0.6 (2020-03-07 to 2020-09-24) | Naive | 2.2302 | 2.5698 | 58.7599 | 84.1373 |
| State | – | – | 64.1419 | 81.9421 | |
| County | 2.1805 | 2.5331 | 53.0127 | 73.8226 | |
| Mixup | 2.0932 | 2.3526 | 52.5268 | 70.6528 | |
| COURAGE | 2.0984 | 2.3633 | 51.1680 | 68.7564 | |
| 0.7 (2020-03-07 to 2020-10-28) | Naive | 2.6275 | 3.0295 | 71.8298 | 102.1820 |
| State | – | – | 101.2264 | 123.0914 | |
| County | 2.4912 | 2.8654 | 65.3611 | 85.1719 | |
| Mixup | 2.5054 | 2.8034 | 66.9685 | 83.6469 | |
| COURAGE | 2.4583 | 2.7547 | 64.9884 | 81.9787 | |
| 0.8 (2020-03-07 to 2020-12-01) | Naive | 3.0036 | 3.3499 | 80.5957 | 106.7175 |
| State | – | – | 118.9543 | 147.5468 | |
| County | 2.8240 | 3.1477 | 74.7763 | 98.2125 | |
| Mixup | 2.8258 | 3.0506 | 71.6854 | 84.4188 | |
| COURAGE | 2.7670 | 2.9753 | 70.8481 | 86.3737 | |
The comparison metrics used is MAE, and the testing dataset starts from the 2020-08-23, 2020-09-25, 2020-10-29 and 2020-12-02 to 2021-02-07 for each corresponding training dataset.
Different prediction periods for the weekly total number of deaths.
| Prediction period | Model | Week 1 | Week 2 |
|---|---|---|---|
| 2020-08-23 to 2020-09-24 | Naive | 26.6183 | 28.9819 |
| County | 23.2328 | 25.2783 | |
| Mixup | 25.3481 | 26.5330 | |
| COURAGE | 23.5522 | 25.3955 | |
| 2020-09-25 to 2020-10-28 | Naive | 27.6227 | 41.1483 |
| County | 24.0228 | 31.3968 | |
| Mixup | 27.0248 | 37.3008 | |
| COURAGE | 24.1806 | 32.2430 | |
| 2020-10-29 to 2020-12-01 | Naive | 59.7121 | 95.9124 |
| County | 48.3641 | 62.4764 | |
| Mixup | 51.0948 | 64.5994 | |
| COURAGE | 49.1359 | 61.9947 | |
| 2020-12-02 to 2021-01-17 | Naive | 80.5957 | 106.7175 |
| County | 74.7763 | 98.2125 | |
| Mixup | 71.6854 | 84.4188 | |
| COURAGE | 70.8481 | 86.3737 | |
| 2021-01-18 to 2021-03-14 (use last trained model) | Naive | 84.5587 | 122.9295 |
| County | 75.8700 | 82.0913 | |
| Mixup | 76.3645 | 80.7036 | |
| COURAGE | 75.0621 | 79.7467 |
The prediction metrics reported is MAE.
Figure 1New York’s weekly total number of deaths for Week 1 (left) predictions and Week 2 (right) predictions. Vertical lines separate different prediction periods as in Table 2. The last dashed vertical line marks the prediction period of recent data using our last trained model. “Target” is the true reported number of deaths of New York. More plots for other major states are presented in Supplementary Information.
Comparison among different models for average MAE (from 2020-11-07 to 2021-02-06).
| Model | Week 1 | Week 2 | Average |
|---|---|---|---|
| Karlen[ | 56.2163 | 77.3440 | 66.7801 |
| Ensemble[ | 57.3227 | 79.1330 | 68.2278 |
| 61.1206 | 76.9096 | 69.0151 | |
| Mixup | 61.7890 | 77.1330 | 69.4610 |
| UMass-MB[ | 57.1578 | 84.5514 | 70.8546 |
| Oliver Wyman[ | 58.9645 | 85.6277 | 72.2961 |
| MOBS[ | 60.4486 | 84.9539 | 72.7012 |
| County | 62.7057 | 83.5585 | 73.1321 |
| GT-DeepCOVID[ | 67.1962 | 90.1549 | 78.6756 |
| USC[ | 68.1082 | 93.3954 | 80.7518 |
Figure 2Transformer based model architecture.
Figure 3Overview of prediction flow. The county-level and state-level predictions for weekly total number of deaths are for the next week (Week 1) and the second week (Week 2) from the current week (Week 0) input data.