| Literature DB >> 36105847 |
Aman Ullah1, Tao Wang1,2, Weixin Yao3.
Abstract
In this paper, under the stationary α-mixing dependent samples, we develop a novel nonlinear modal regression for time series sequences and establish the consistency and asymptotic property of the proposed nonlinear modal estimator with a shrinking bandwidth h under certain regularity conditions. The asymptotic distribution is shown to be identical to the one derived from the independent observations, whereas the convergence rate ( n h 3 in which n is the sample size) is slower than that in the nonlinear mean regression. We numerically estimate the proposed nonlinear modal regression model by the use of a modified modal expectation-maximization (MEM) algorithm in conjunction with Taylor expansion. Monte Carlo simulations are presented to demonstrate the good finite sample (prediction) performance of the newly proposed model. We also construct a specified nonlinear modal regression to match the available daily new cases and new deaths data of the COVID-19 outbreak at the state/region level in the United States, and provide forward predictions up to 130 days ahead (from 24 August 2020 to 31 December 2020). In comparison to the traditional nonlinear regressions, the suggested model can fit the COVID-19 data better and produce more precise predictions. The prediction results indicate that there are systematic differences in spreading distributions among states/regions. For most western and eastern states, they have many serious COVID-19 burdens compared to Midwest. We hope that the built nonlinear modal regression can help policymakers to implement fast actions to curb the spread of the infection, avoid overburdening the health system and understand the development of COVID-19 from some points.Entities:
Keywords: COVID‐19; MEM algorithm; dependent data; modal regression; nonlinear; prediction
Year: 2022 PMID: 36105847 PMCID: PMC9461089 DOI: 10.1111/rssa.12849
Source DB: PubMed Journal: J R Stat Soc Ser A Stat Soc ISSN: 0964-1998 Impact factor: 2.175
FIGURE 1Visualization of the total number of cases and deaths in the world‐23 August 2020; data source: Tencent News https://new.qq.com/ch/antip/
FIGURE 2Visualization of the total number of cases and deaths in the US‐23 August 2020; data source: the GitHub repository managed by The New York Times https://github.com/nytimes/covid‐19‐data
Results of simulations—DGP 1
| Modal estimation | Mean estimation | |||||||
|---|---|---|---|---|---|---|---|---|
| Sample size |
| MSE( |
| MSE( |
| MSE( |
| MSE( |
|
| 1.9329 (0.2288) | 0.0566 | 2.0078 (0.0574) | 0.0033 | 1.0073 (0.2371) | 0.0560 | 1.9974 (0.0402) | 0.0016 |
|
| 1.9604 (0.0924) | 0.0101 | 1.9995 (0.0313) | 0.0010 | 1.0000 (0.1816) | 0.0328 | 2.0008 (0.0241) | 0.0006 |
|
| 1.9620 (0.0817) | 0.0081 | 2.0003 (0.0237) | 0.0006 | 0.9956 (0.1440) | 0.0207 | 1.9999 (0.0193) | 0.0004 |
|
| 1.9620 (0.0603) | 0.0051 | 1.9985 (0.0178) | 0.0003 | 0.9886 (0.1043) | 0.0110 | 1.9990 (0.0153) | 0.0002 |
| True value |
|
|
|
| ||||
FIGURE 3Empirical density of the standardized estimate
FIGURE 4Boxplots of average of coverage probabilities: the numbers 2, 4, 6 and 10 represent the values of n = 200, 400, 600 and 1000 respectively
Model comparison results
| New cases‐mode | New deaths‐mode | New cases‐mean | New deaths‐mean | New cases‐median | New deaths‐median | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| State/region |
|
|
|
|
|
|
|
|
|
|
|
|
| AL | 0.1712 | 4.5563 |
|
| 0.1734 | 5.0252 | 0.6953 | 36.3657 |
|
| 0.9213 | 50.8288 |
| AK |
|
|
|
| 0.7199 | 17.8641 | 0.1198 | 79.8023 | 2.3017 | 33.9916 | 0.1682 | 89.3547 |
| AZ |
|
|
|
| 0.9339 | 12.2357 | 0.8226 | 24.8236 | 1.5375 | 17.1057 | 3.2440 | 32.2442 |
| AR | 0.1280 | 4.4861 |
|
| 0.1595 | 4.9279 | 0.6668 | 28.4478 |
|
| 0.9910 | 34.4291 |
| CA |
|
|
| 13.5211 | 0.2576 | 4.6241 | 0.5313 |
| 12.0594 | 36.3353 | 0.6386 | 13.9525 |
| CO |
|
|
|
| 0.1576 | 5.7275 | 1.5153 | 72.3817 | 0.4542 | 10.7173 | 2.0659 | 66.9498 |
| CT |
|
|
|
| 5.4543 | 51.1498 | 2.1439 | 179.9120 | 5.2566 | 48.3522 | 6.5791 | 171.9621 |
| DE |
| 7.7782 |
| 85.1702 | 0.2454 | 8.3739 | 0.8193 | 125.5970 | 0.2779 |
| 0.7644 |
|
| DC |
|
|
| 82.0909 | 0.7385 | 19.4940 | 1.6516 | 175.4791 | 0.4884 | 15.3523 | 0.8240 |
|
| FL |
|
|
|
| 0.2255 | 4.4887 | 0.4822 | 11.2948 | 0.1124 | 3.4112 | 0.3507 | 9.4744 |
| GA |
|
| 1.17771 | 22.4311 | 0.0953 | 3.2290 | 1.3582 | 24.5763 | 0.2448 | 5.4379 |
|
|
| HI |
|
|
|
| 4.5141 | 36.6774 | 0.5695 | 95.8169 | 8.1706 | 51.3927 | 0.6140 | 97.5388 |
| ID |
|
|
|
| 0.4780 | 10.1412 | 1.1558 | 49.4825 | 1.5833 | 20.1322 | 2.6503 | 91.1905 |
| IL |
|
| 2.0747 |
| 0.3712 | 7.1152 | 2.8030 | 523.2636 | 0.5619 | 9.2363 |
| 48.3631 |
| IN | 0.1591 | 5.0592 |
|
| 0.3958 | 8.5265 | 1.5668 | 40.3907 |
|
| 1.1215 | 51.2706 |
| KS |
| 19.9173 |
|
| 1.3489 | 19.4022 | 0.6648 | 44.1607 | 1.5040 |
| 0.6261 | 42.8812 |
| KY |
|
| 0.4734 |
| 0.3371 | 8.3026 | 0.8958 | 36.7502 | 0.2675 | 7.3576 |
| 23.6336 |
| LA |
| 14.2872 |
|
| 6.5771 |
| 2.9178 | 42.0480 | 6.5429 | 19.8462 | 2.6146 | 37.7073 |
| IA |
|
| 1.0872 | 39.7959 | 0.1445 | 5.5619 | 1.2304 | 44.0175 | 0.2799 | 7.1513 |
|
|
| ME | 0.7938 |
|
|
| 0.8567 | 25.3532 | 0.1676 | 91.5552 |
| 31.6516 | 0.3118 | 124.5080 |
| MD |
|
|
|
| 0.1471 | 5.1483 | 1.1255 | 41.9215 | 0.0374 | 2.5885 | 1.6134 | 62.2135 |
| MA |
|
|
|
| 0.4097 | 9.5495 | 1.6480 | 42.7498 | 4.0086 | 84.2134 | 2.1522 | 61.7372 |
| MI |
|
|
|
| 0.86664 | 13.7508 | 3.3902 | 76.3392 | 1.1114 | 15.5525 | 2.4700 | 59.1702 |
| MN |
|
| 2.1624 | 62.5585 | 0.1180 | 4.0501 | 2.8030 | 73.1131 | 0.7755 | 12.7442 |
|
|
| MS |
|
|
| 21.6170 | 0.2119 | 5.9219 | 0.5972 | 24.6271 | 0.3400 | 7.6840 | 0.4983 |
|
| MO |
|
|
|
| 0.2882 | 7.1869 | 0.9254 | 30.6017 | 2.0424 | 19.4914 | 0.7243 | 26.3912 |
| MT |
|
|
| 60.1072 | 1.6448 | 26.1567 | 0.4806 |
| 5.3211 | 48.6101 | 0.7768 | 95.9944 |
| NE |
|
|
| 37.3174 | 0.4029 | 9.7425 | 0.4927 |
| 0.5193 | 11.0812 | 0.5364 | 38.3479 |
| NV | 0.0630 |
|
|
|
| 3.0970 | 1.2392 | 36.1752 | 0.2832 | 7.4281 | 1.3796 | 36.8643 |
| NH |
|
| 0.6415 | 125.4310 | 0.5377 | 20.4409 | 0.7696 | 142.2721 | 0.4894 | 19.4310 |
|
|
| NJ |
|
|
|
| 2.0952 | 25.1485 | 1.0478 | 40.9881 | 1.3815 | 20.2057 | 7.0072 | 153.6538 |
| NM |
|
|
|
| 0.3769 | 10.7114 | 0.3389 | 28.0368 | 0.3004 | 9.3521 | 0.2964 | 32.2350 |
| NY |
|
|
|
| 0.1438 | 5.3248 | 1.2468 | 38.8174 | 1.1538 | 15.5820 | 2.7453 | 64.9087 |
| NC | 0.3222 | 7.0517 | 0.5711 | 27.0946 | 0.4167 | 8.2013 |
|
|
|
| 0.6438 | 28.1160 |
| ND |
|
|
|
| 0.4928 | 12.2427 | 0.5751 | 72.0739 | 0.8129 | 15.8381 | 0.3398 | 42.7843 |
| OH |
| 2.8443 |
|
| 0.0643 |
| 1.2982 | 46.2033 | 0.3008 | 7.2960 | 2.1708 | 73.6845 |
| OK |
|
|
|
| 0.2220 | 5.9597 | 1.5633 | 48.5069 | 1.0923 | 15.2377 | 1.4307 | 46.3320 |
| OR |
|
|
| 48.0773 | 0.0434 | 2.8604 | 0.6585 | 46.4323 | 1.2971 | 16.7406 | 0.9339 |
|
| PA |
|
| 2.0029 | 44.4196 | 0.1419 | 4.8088 | 3.6654 | 60.8070 | 0.2093 | 5.9761 |
|
|
| PR |
|
|
|
| 1.4482 | 15.9853 | 2.7094 | 75.2085 | 2.1473 | 21.9538 | 3.3145 | 83.7110 |
| RI |
|
|
|
| 7.0779 | 61.0051 | 1.9419 | 164.4378 | 9.9050 | 70.2124 | 0.9570 | 69.1517 |
| SC |
|
|
|
| 0.2432 | 6.1175 | 0.6080 | 18.1253 | 0.1153 | 4.0323 | 0.7424 | 20.3135 |
| SD |
|
|
|
| 0.3197 | 9.9167 | 0.2088 | 23.0946 | 1.0279 | 18.9569 | 0.7436 | 100.0031 |
| TN |
|
|
|
| 0.1475 | 4.2999 | 0.6678 | 21.8297 | 0.2820 | 6.3490 | 0.3492 | 16.3966 |
| TX |
|
|
|
| 0.3448 | 5.6411 | 0.7228 | 14.0341 | 0.1099 | 2.6563 | 0.2809 | 9.1692 |
| UT |
| 6.6142 | 0.5060 | 33.4387 | 0.5707 | 11.4507 |
|
| 0.2272 |
| 0.7189 | 45.1290 |
| VA |
|
|
|
| 0.0958 | 3.7875 | 1.0224 | 38.1446 | 0.1015 | 3.4359 | 1.6647 | 55.0330 |
| WA | 0.1298 | 4.8612 |
|
| 0.3073 | 7.4457 | 0.7100 | 27.0861 |
|
| 1.0449 | 31.2122 |
| WV |
|
|
|
| 0.3392 | 10.2841 | 1.5321 | 87.9995 | 1.0661 | 20.3992 | 2.3734 | 112.7576 |
| WI |
|
|
|
| 0.0697 | 3.3731 | 1.1918 | 45.2079 | 0.1367 | 4.6163 | 1.1583 | 44.6260 |
| VT |
|
| – | – | 0.7570 | 36.6008 | – | – | 1.8541 | 65.3600 | – | – |
| WY |
|
| – | – | 0.7930 | 12.4550 | – | – | 0.9138 | 16.4421 | – | – |
Notes: When the dataset has zero values, we instead use transformation for the whole data. When calculating MAPE, we eliminate all log(1)=0 values. For VT and WY, the existing death data are not sufficient for predicting (most values are zero). Thus, we do not have the predicted new deaths results for these two states. The bold numbers represent the best results among modal, mean and median regressions. Particularly, for MSE and MAPE, bold numbers represent the smallest values. For the sake of thoroughness, we also list the results obtained from the robust nonlinear regression with the bisquare weight in Table 7 in Online Appendix B, where modal regression still shows some advantages. Because does not translate well to modal and median regressions, we do not report it here for model comparison. However, we calculate the modified for modal regression using a kernel‐based objective function, and the results indicate good fit performance of modal regression.
Modal prediction results
| Predictions of modal regression 24 August 2020–21 December 2020 | Predictions of modal regression 24 August 2020–21 December 2020 | ||||
|---|---|---|---|---|---|
| 09/30 10/31 11/30 12/31 | 09/30 10/31 11/30 12/31 | 09/30 10/31 11/30 12/31 | 09/30 10/31 11/30 12/31 | ||
| State/region | Total new cases | Total new deaths | State/region | Total new cases | Total new deaths |
| AL | 45570/46151/52293/61847 | 600/538/545/583 | AK | 2759/2735/3025/3503 | 3/3/3/4 |
| AZ | 2772/712/737/945 | 456/528/634/789 | AR | 38586/45110/56629/73506 | 288/272/290/326 |
| CA | 296260/380190/446670/530800 | 2295/3086/3740/4693 | CO | 10461/73826/63510/58704 | 9/0/0/0 |
| CT | 83/0/0/0 | 0/0/0/0 | DE | 2360/1704/1450/1329 | 0/0/0/0 |
| DC | 1569/1107/946/881 | 0/0/0/0 | FL | 270170/323480/384240/473570 | 2738/2860/3256/3870 |
| GA | 101620/108200/125330/151120 | 1315/1137/1121/1170 | HI | 8046/7494/8178/9367 | 1/1/1/1 |
| ID | 8692/9657/11487/14158 | 45/41/43/47 | IL | 45842/14954/4759/1301 | 46/0/0/0 |
| IN | 21824/16657/14641/13683 | 65/0/0/0 | KS | 7895/6849/6749/7039 | 55/35/24/17 |
| KY | 17470/17932/20270/23934 | 134/85/61/44 | LA | 18705/15715/15186/15651 | 385/230/167/131 |
| IA | 19712/16834/16233/16550 | 93/27/0/0 | ME | 508/300/206/148 | 0/0/0/0 |
| MD | 15219/8544/5695/4027 | 18/0/0/0 | MA | 1050/0/0/0 | 134/0/0/0 |
| MI | 17657/13613/12608/12549 | 20/0/0/0 | MN | 32478/32085/35103/40228 | 63/0/0/0 |
| MS | 34407/35406/40023/47283 | 649/592/599/642 | MO | 35374/35535/39682/46351 | 123/52/18/0 |
| MT | 12332/15629/17901/21389 | 12/12/13/15 | NE | 4193/2000/1059/636 | 122/115/121/135 |
| NV | 22775/24441/28663/34930 | 279/249/250/267 | NH | 295/85/19/0 | 0/0/0/0 |
| NJ | 3104/1123/523/264 | 0/0/0/0 | NM | 8305/7397/7328/7681 | 85/31/5/0 |
| NY | 12006/6420/4858/4076 | 68/0/0/0 | NC | 82554/81715/88581/100270 | 538/528/579/665 |
| ND | 5588/5533/6095/7036 | 34/28/26/25 | OH | 45069/47630/52908/61855 | 311/142/69/28 |
| OK | 24683/26744/31587/38723 | 122/97/87/83 | OR | 13887/14217/15953/18724 | 113/101/101/108 |
| PA | 15625/10309/8261/7179 | 20/0/0/0 | PR | 16332/18541/22792/29017 | 87/75/77/84 |
| RI | 82/0/0/0 | 0/0/0/0 | SC | 70597/91851/120560/163670 | 847/815/871/977 |
| SD | 2663/2052/1859/1799 | 64/63/70/80 | TN | 73717/77510/89298/107170 | 672/669/735/847 |
| TX | 994910/1679200/2255200/3082600 | 3222/3464/4222/5328 | UT | 19034/19507/20285/22113 | 90/87/92/104 |
| VA | 35771/30929/30338/31505 | 474/371/340/332 | WA | 44721/63229/81864/108910 | 498/473/492/540 |
| WV | 4456/4308/4623/5217 | 14/7/4/2 | WI | 35719/36597/40896/47835 | 75/29/8/0 |
| VT | 89/53/37/28 | – | WY | 1509/1448/1555/1756 | – |
Notes: The results represent the total number of modal predicted new cases and new deaths between 24 August and 30 September, between 1 October and 31 October, between 1 November and 30 November, and between 1 December and 31 December, separately.
FIGURE 5Visualization of the Total Number of Modal Predicted New Cases and New Deaths across the US. (a) Predicted new cases 24 August–30 September; (b) Predicted new deaths 24 August–30 September; (c) Predicted new cases 1 October–31 October; (d) Predicted new deaths 1 October–31 October; (e) Predicted new cases 1 November–30 November; (f) Predicted new deaths 1 November–30 November; (g) Predicted new cases 1 December–31 December; (h) Predicted new deaths 1 December–31 December
FIGURE 6Visualization of the Total Number of Modal Predicted New cases and New Deaths across the US after Removing CA, TX and FL. (a) Predicted new cases 24 August–30 September; (b) Predicted new deaths 24 August–30 September; (c) Predicted new cases 1 October–31 October; (d) Predicted new deaths 1 October–31 October; (e) Predicted new cases 1 November–30 November; (f) Predicted new deaths 1 November–30 November; (g) Predicted new cases 1 December–31 December; (h) Predicted new deaths 1 December–31 December