| Literature DB >> 34764597 |
Zhijin Wang1, Bing Cai1.
Abstract
Predicting the number of COVID-19 cases in a geographical area is important for the management of health resources and decision making. Several methods have been proposed for COVID-19 case predictions but they have important limitations in terms of model interpretability, related to COVID-19's incubation period and major trends of disease transmission. To be able to explain prediction results in terms of incubation period and transmission trends, this paper presents the Multivariate Shapelet Learning (MSL) model to learn shapelets from historical observations in multiple areas. An experimental evaluation was done to compare the prediction performance of eleven algorithms, using the data collected from 50 US provinces/states. Results show that the proposed method is effective and efficient. The learned shapelets explain increasing and decreasing trends of new confirmed cases, and reveal that the COVID-19 incubation period in the USA is around 28 days.Entities:
Keywords: COVID-19; Interpretability; Multivariate; Prediction; Shapelet learning
Year: 2021 PMID: 34764597 PMCID: PMC8102854 DOI: 10.1007/s10489-021-02391-6
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.086
Symbols and semantics
| Symbol | Semantic |
|---|---|
| area number | |
| time step number | |
| shapelet number | |
| time step number in test set | |
| look-back window size | |
| outpatient cases matrix, | |
| shapelet matrix, | |
| weight matrix of shapelets, | |
| distance matrix, | |
| element of distance matrix | |
| softmin distance matrix, | |
| element of softmin distance matrix | |
| input matrix | |
| output matrix | |
| inputs | |
| outputs |
Fig. 1The schematic illustration of Multivariate Shapelet Learning (MSL)
The basic statistics of COVID-19 cases on provinces/states in US
| Province/State | Cumulative confirmed cases | New confirmed cases | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Jan. 22 | Sep. 17 | Min | Max | Mean | STD | Jan. 22 | Sep. 17 | Min | Max | Mean | STD | |
| Alabama | 0 | 141757 | 0 | 141757 | 36947.39 | 45718.81 | 0 | 670 | 0 | 2399 | 593.13 | 629.13 |
| Alaska | 0 | 6537 | 0 | 6537 | 1370.08 | 1880.15 | 0 | 105 | 0 | 186 | 27.35 | 35.81 |
| Arizona | 0 | 211660 | 0 | 211660 | 64232.76 | 79516.87 | 0 | 1753 | 0 | 4877 | 885.61 | 1179.11 |
| Arkansas | 0 | 73211 | 0 | 73211 | 18151.74 | 22659.75 | 0 | 992 | 0 | 1799 | 308.03 | 327.04 |
| California | 0 | 775037 | 0 | 775037 | 214083.88 | 256530.43 | 0 | 3716 | 0 | 15117 | 3242.83 | 3390.47 |
| Colorado | 0 | 63125 | 0 | 63125 | 23762.43 | 20826.01 | 0 | 459 | 0 | 978 | 264.12 | 196.09 |
| Connecticut | 0 | 55386 | 0 | 55386 | 28919.85 | 21610.45 | 0 | 220 | 0 | 2109 | 231.85 | 338.79 |
| Delaware | 0 | 19318 | 0 | 19318 | 7655.05 | 6635.96 | 0 | 84 | 0 | 458 | 81.01 | 82.31 |
| Florida | 0 | 674456 | 0 | 674456 | 179394.29 | 233791.23 | 0 | 3255 | 0 | 15300 | 2821.99 | 3582.07 |
| Georgia | 0 | 300903 | 0 | 300903 | 82195.76 | 96708.81 | 0 | 1847 | 0 | 4813 | 1259.01 | 1264.64 |
| Hawaii | 0 | 11105 | 0 | 11105 | 1791.38 | 2878.14 | 0 | 159 | 0 | 354 | 46.49 | 79.93 |
| Idaho | 0 | 36489 | 0 | 36489 | 8638.55 | 11587.30 | 0 | 396 | 0 | 729 | 152.68 | 194.31 |
| Illinois | 0 | 270294 | 0 | 270294 | 97540.25 | 86778.57 | 0 | 2056 | 0 | 5594 | 1130.94 | 903.57 |
| Indiana | 0 | 108646 | 0 | 108646 | 34472.90 | 33179.01 | 0 | 837 | 0 | 1660 | 454.59 | 347.82 |
| Iowa | 0 | 77204 | 0 | 77204 | 21994.40 | 23189.08 | 0 | 552 | 0 | 2681 | 323.03 | 323.78 |
| Kansas | 0 | 51164 | 0 | 51164 | 13198.72 | 14992.99 | 0 | 444 | 0 | 1019 | 214.08 | 233.71 |
| Kentucky | 0 | 59370 | 0 | 59370 | 14637.71 | 17108.50 | 0 | 606 | 0 | 1152 | 248.42 | 271.10 |
| Louisiana | 0 | 159304 | 0 | 159304 | 52856.44 | 53069.35 | 0 | 478 | 0 | 3840 | 667.04 | 778.09 |
| Maine | 0 | 4962 | 0 | 4962 | 2014.03 | 1720.29 | 0 | 21 | 0 | 78 | 20.77 | 17.34 |
| Maryland | 0 | 118519 | 0 | 118519 | 44839.28 | 40592.53 | 0 | 631 | 0 | 1784 | 495.91 | 376.92 |
| Massachusetts | 0 | 126128 | 0 | 129182 | 67834.98 | 51331.95 | 0 | 429 | 0 | 4973 | 559.68 | 699.48 |
| Michigan | 0 | 126722 | 0 | 126722 | 51045.24 | 40587.95 | 0 | 980 | 0 | 1991 | 530.22 | 451.32 |
| Minnesota | 0 | 86722 | 0 | 86722 | 26087.94 | 27645.40 | 0 | 909 | 0 | 1154 | 362.85 | 307.00 |
| Mississippi | 0 | 91935 | 0 | 91935 | 25341.64 | 29785.48 | 0 | 701 | 0 | 1775 | 384.67 | 409.11 |
| Missouri | 0 | 109557 | 0 | 109557 | 24191.67 | 30221.85 | 0 | 1704 | 0 | 2197 | 458.40 | 531.13 |
| Montana | 0 | 9647 | 0 | 9647 | 1832.43 | 2657.63 | 0 | 216 | 0 | 221 | 40.37 | 56.03 |
| Nebraska | 0 | 39921 | 0 | 39921 | 12990.62 | 12755.38 | 0 | 502 | 0 | 727 | 167.22 | 151.74 |
| Nevada | 0 | 74595 | 0 | 74595 | 19988.43 | 24952.28 | 0 | 347 | 0 | 1447 | 312.55 | 358.78 |
| New Hampshire | 0 | 7781 | 0 | 7781 | 3507.97 | 2887.02 | 0 | 0 | 0 | 217 | 32.82 | 35.41 |
| New Jersey | 0 | 198361 | 0 | 198361 | 109793.85 | 78471.83 | 0 | 569 | 0 | 4305 | 830.09 | 1123.26 |
| New Mexico | 0 | 27199 | 0 | 27199 | 9169.31 | 9371.14 | 0 | 158 | 0 | 460 | 113.80 | 97.31 |
| New York | 0 | 447262 | 0 | 447262 | 258941.11 | 176566.31 | 0 | 896 | 0 | 11434 | 1871.39 | 2717.55 |
| North Carolina | 0 | 189576 | 0 | 189576 | 52557.93 | 61548.51 | 0 | 1552 | 0 | 2603 | 793.21 | 725.05 |
| North Dakota | 0 | 16723 | 0 | 16723 | 3468.78 | 4219.70 | 0 | 390 | 0 | 467 | 70.12 | 91.57 |
| Ohio | 0 | 141585 | 0 | 141585 | 42668.13 | 44362.01 | 0 | 1067 | 0 | 1733 | 592.41 | 475.22 |
| Oklahoma | 0 | 73318 | 0 | 73318 | 16100.90 | 21232.70 | 0 | 1034 | 0 | 1400 | 306.77 | 358.16 |
| Oregon | 0 | 30060 | 0 | 30060 | 8186.89 | 9562.41 | 0 | 210 | 0 | 430 | 125.77 | 123.57 |
| Pennsylvania | 0 | 152775 | 0 | 152775 | 62616.12 | 51557.84 | 0 | 925 | 0 | 2297 | 639.23 | 487.05 |
| Rhode Island | 0 | 23488 | 0 | 23488 | 10640.30 | 8551.32 | 0 | 130 | 0 | 648 | 98.28 | 122.57 |
| South Carolina | 0 | 135446 | 0 | 135446 | 35195.04 | 44694.39 | 0 | 1324 | 0 | 2454 | 566.76 | 635.48 |
| South Dakota | 0 | 17686 | 0 | 17686 | 4863.82 | 4784.92 | 0 | 395 | 0 | 623 | 74.00 | 86.92 |
| Tennessee | 0 | 178140 | 0 | 178140 | 45448.44 | 55820.45 | 0 | 1053 | 0 | 3314 | 745.36 | 804.80 |
| Texas | 0 | 701350 | 0 | 701350 | 178059.59 | 231505.99 | 0 | 4543 | 0 | 14962 | 2934.52 | 3349.69 |
| Utah | 0 | 60658 | 0 | 60658 | 17292.81 | 19452.15 | 0 | 911 | 0 | 954 | 253.80 | 229.25 |
| Vermont | 0 | 1705 | 0 | 1705 | 840.71 | 590.89 | 0 | 3 | 0 | 72 | 7.14 | 10.12 |
| Virginia | 0 | 137367 | 0 | 137367 | 44273.90 | 44122.85 | 0 | 1098 | 0 | 2015 | 574.76 | 432.00 |
| Washington | 1 | 81198 | 1 | 81198 | 27132.35 | 26221.48 | 1 | 386 | 0 | 1738 | 339.90 | 293.07 |
| West Virginia | 0 | 13434 | 0 | 13434 | 3128.39 | 3716.86 | 0 | 232 | 0 | 351 | 56.23 | 65.37 |
| Wisconsin | 0 | 93819 | 0 | 93819 | 24643.87 | 27612.16 | 0 | 1660 | 0 | 1660 | 392.55 | 376.29 |
| Wyoming | 0 | 4652 | 0 | 4652 | 1269.30 | 1369.22 | 0 | 86 | 0 | 126 | 19.53 | 21.39 |
“STD” denotes standard deviation
Fig. 2The sensitiveness of shapelet size C in terms of RAE, RSE and CORR. The windows size T is fixed at 28. Both the optimal values of RAE and RSE are found when C = 3, and the optimal value of CORR is found at C = 2 aRAE versus C. b RSE versus C. c CORR versus C
Fig. 3The sensitiveness of window size T in terms of RAE, RSE and CORR. The shapelet size C is fixed at 3. The optimal values of these metrics are found at T = 28, see the red dash lines a RAE versus T. b RSE versus T. c CORR versus T
Fig. 4The comparison of twelve methods in terms of RAE, RSE and CORR. The windows size T is fixed at 28. The shapelet size C is set to 3 a RAE comparison. b RSE comparison. c CORR comparison
Fig. 5The visualization of normalized learned shapelets within MSL, given shapelet size C = 3 and window size T = 28 a Shapelet 1. b Shapelet 2. c Shapelet 3