| Literature DB >> 35042866 |
Abstract
The emergence and rapid spread of novel variants of concern (VOC) of the coronavirus 2 constitute a major challenge for spatial disease surveillance. We explore the possibility to use close to real-time crowdsourced data on reported VOC cases (mainly the Alpha variant) at the local area level in Germany. The aim is to use these data for early-stage estimates of the statistical association between VOC reporting and the overall COVID-19 epidemiological development. For the first weeks in 2021 after international importation of VOC to Germany, our findings point to significant increases of up to 35-40% in the 7-day incidence rate and the hospitalization rate in regions with confirmed VOC cases compared to those without such cases. This is in line with simultaneously produced international evidence. We evaluate the sensitivity of our estimates to sampling errors associated with the collection of crowdsourced data. Overall, we find no statistical evidence for an over- or underestimation of effects once we account for differences in data representativeness at the regional level. This points to the potential use of crowdsourced data for spatial disease surveillance, local outbreak monitoring and public health decisions if no other data on new virus developments are available.Entities:
Mesh:
Year: 2022 PMID: 35042866 PMCID: PMC8766449 DOI: 10.1038/s41598-021-04573-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Temporal and spatial distribution of VOC cases in Germany. Panel (A) counts NUTS-3 regions with confirmed VOC cases over time. Panel (B) compares the 7-day incidence rate (by day of reporting) for Germany, Schleswig-Holstein, and Flensburg. Panel (C) shows the spatial spread of VOC cases by February 4, 2021. Flensburg and Cologne, Düren and Leverkusen report most confirmed cases.
Figure 2SCM estimates for the relative percentage increase in the 7-day incidence rate for Flensburg (Panel A) and NRW regions (Panel B) and the hospitalization rate for Flensburg (Panel C) and NRW regions (Panel D) vis-à-vis their synthetic control groups. Treatment start is set to January 5, 2021 in all cases. 90% confidence intervals are constructed on the basis of pseudo P values (see “Methods” section for details).
Difference-in-difference estimations for association between first reporting of variants of concern (VOC) and 7-day incidence rates at the local area level in Germany.
| Treatment start | (1) | (2) | (3) | (4) |
|---|---|---|---|---|
| Day first reporting | 7 days before | 14 days before | 21 days before | |
| VOC Reporting | 13.14 | 7.63 | 3.61 | 0.08 |
| [4.2,22.1] | [ | [ | [ | |
| N | 22441 | 22441 | 22441 | 22441 |
| NUTS-3 regions | 401 | 401 | 401 | 401 |
| VOC Reporting | 15.09 | 17.83 | 17.74 | 13.65 |
| [ | [2.7,32.9] | [2.6,32.8] | [ | |
| N | 12712 | 12712 | 12712 | 12712 |
| NUTS-3 regions | 227 | 227 | 227 | 227 |
| VOC Reporting | 35.00 | 30.65 | 24.63 | 17.61 |
| [17.9,52.1] | [17.3,44.0] | [11.6,37.7] | [4.8,30.4] | |
| N | 13552 | 13552 | 13552 | 13552 |
| NUTS-3 regions | 242 | 242 | 242 | 242 |
| Fixed Effects & Controls | Yes | Yes | Yes | Yes |
Across columns, the dependent variable is the 7-day incidence rate in a NUTS-3 region at a given day. We always consider the time period between November 15, 2020 and February 4, 2021. In Panel A, we take into account all 401 NUTS-3 regions in Germany. By February 4, 2021, 204 NUTS-3 regions reported a VOC case and 197 did not (Panel B: 30 NUTS-3 regions with at least one reported VOC case before January 22, 2021; Panel C: 45 NUTS-3 regions belonging to the top 10% of sample regions in terms of VOC count). In Panel A, VOC Reporting is a dummy variable defined as one if a VOC case has been reported in a NUTS-3 region, else zero (Panel B: VOC Reporting is one if a VOC case has been reported before January 22; Panel C, VOC Reporting is one if a regions belongs to the top-10% regions in terms of VOC count). Some observations have been dropped because we lack information on some control variables (e.g., on Daily mobility change in relation to 2019 for December 4–7, 2020). We include NUTS-3 and day fixed effects and a linear and a squared trend for four different NUTS-3 region types. We also control for a 1 day, a 7-days and a 14-days lag of the following variables: first, the reported SARS-CoV-2 cases within the previous 7 days, second, the reported SARS-CoV-2 cases within the previous 7 days in neighboring NUTS-3 regions, third, the daily mean temperature at 2 m above ground in , and fourth, the daily mobility change in relation to 2019. We include (but do not show) a constant in all regressions. 95% confidence interval based on clustered SE (on NUTS-2 level) in parentheses; , , .
Difference-in-difference estimations for association between first reporting of variants of concern and number of COVID-19 patients in intensive care at the local area level in Germany.
| (1) | (2) | (3) | |
|---|---|---|---|
| Baseline | First VOC reported before Jan22, 2021 | Top-10% regions in terms of VOC count | |
| VOC reporting | 0.37 | 0.50 | 1.30 |
| [ | [ | [0.5,2.1] | |
| N | 22217 | 12656 | 13496 |
| NUTS-3 regions | 397 | 226 | 241 |
| VOC reporting | 0.14 | 0.71 | |
| [ | [ | [0.2,1.2] | |
| N | 22217 | 12656 | 13496 |
| NUTS-3 regions | 397 | 226 | 241 |
| Fixed effects & Controls | Yes | Yes | Yes |
In Panel A, across columns, the dependent variable is the number of COVID-19 patients in intensive care in a NUTS-3 region at a given day. Panel B is similar but refers to patients in intensive care with artificial ventilation. We always consider the time period between November 15, 2020 and February 4, 2021. In Panel A, we report results for 397 NUTS-3 regions in Germany, for which we have daily data on the number of patients in intensive care. By February 4, 2021, 204 NUTS-3 regions reported a VOC case and 197 did not (Panel B: 30 NUTS-3 regions with at least one reported VOC case before January 22, 2021; Panel C: 45 NUTS-3 regions belonging to the top 10% of sample regions in terms of VOC count). In Panel A, VOC Reporting is a dummy variable defined as one if a VOC case has been reported in a NUTS-3 region, else zero (Panel B: VOC Reporting is one if a VOC case has been reported before January 22; Panel C, VOC Reporting is one if a regions belongs to the top-10% regions in terms of VOC count). Some observations have been dropped because we lack information on some control variables (e.g., on Daily mobility change in relation to 2019 for December 4–7, 2020). We include NUTS-3 and day fixed effects and a linear and a squared trend for four different NUTS-3 region types. We also control for a 1 day, a 7-days and a 14-days lag of the following variables: first, the reported SARS-CoV-2 cases within the previous 7 days, second, the reported SARS-CoV-2 cases within the previous 7 days in neighboring NUTS-3 regions, third, the daily mean temperature at 2 m above ground in , and fourth, the daily mobility change in relation to 2019. We include (but do not show) a constant in all regressions. 95% confidence interval based on clustered SE (on NUTS-2 level) in parentheses; , , .
Figure 3Panel event study estimates for trend development in the 7-day incidence rate (Panel A and B) and the hospitalization rate (Panel C and D) in regions with confirmed VOC cases around the day of first VOC reporting. Hollow squares show estimated daily point estimates; grey squares report cumulative estimates beyond the maximum number of leads (10) and lags (20) around the start of the first VOC reporting in treated regions. The dashed vertical line indicates that the last day before treatment start (− 1) serves as benchmark period for the estimated daily lead and lag coefficients.
Figure 4Comparison of cumulative VOC counts covered by different data sources in the first calendar weeks (CW) 2021. Panel (A) shows time-series for the development of cumulative VOC cases in Germany reported by the virus variant-tracking crowdsourcing project by[15] and those reported by German health authorities and documented in[14,24,25] later on. The time period shown covers the first 9 calendar weeks in 2021 (January 4 to March 7, 2021). Bars in Panel (A) measure the percentage share of VOC cases (Alpha, Beta and Gamma variants, jointly) in all submitted genomes for Germany to the Global Initiative On Sharing All Influenza Data (GISAID) hCoV-19Tracking of Variants project. Data are obtained from[10]. Panel (B) shows box plots for the regional distribution of post-sampling ratio (PSR) weights by calendar weeks. Details on the calculation of PSR weights exploiting differences between VOC cases covered in the crowdsourced database and the RKI bulletins for individual German federal states as shown in this figure are given in the main text.
Figure 5Regional variation of VOC coverage in crowdsourced data. The individual panels correlate the cumulative development of VOC cases covered in the close to real-time virus variant-tracking crowdsourcing project by[15] to those reported by German health authorities later on and documented in[14,24,25] at the German federal state (NUTS-1) level. Dashed lines in each panel indicate the linear fit between crowdsourced and RKI data.
VOC effect differences and tests for equality of regression coefficients between original near-time and weighted estimates using post-sampling ratio weights.
| (1) | (2) | (3) | |
|---|---|---|---|
| Baseline | Reported before Jan 22, 2021 | Top-10% regions in terms of VOC count | |
| 7-day Incidence rate | 3.88 | ||
| (0.48) | (0.24) | (0.50) | |
| Hospitalization rate | 0.09 | ||
| (0.04) | (0.03) | (0.07) | |
| Hospitalization rate (with art. ventilation) | 0.05 | ||
| (0.02) | (0.02) | (0.14) | |
| 7-day Incidence rate | 7.19 | ||
| (0.91) | (0.38) | (0.98) | |
| Hospitalization rate | 0.21 | ||
| (0.12) | (0.06) | (0.48) | |
| Hospitalization rate (with art. ventilation) | 0.10 | ||
| (0.09) | (0.10) | (0.28) | |
Results report the difference in the estimated VOC coefficients between the original near-time estimates reported in Tables 1 and 2 and the re-weighted specification using post-sampling ratio weights (and their squared values) for the 7-day incidence rate, the hospitalization rate (all patients in intensive care) and the hospitalization rate for patients with artificial ventilation. The reported z Values are the resulting test statistic from a z-test for coefficient equality across different regression models defined as , where is the standard error of [26,27]. A statistically significant positive coefficient difference would point to an overestimation of the association between VOC reporting and the development of an epidemiological indicator at the local area level in Germany for the crowdsourced VOC data. Both the original and the re-weighted estimation specification use the same set of regressors as reported in the footnotes of Tables 1 and 2 and in the SI appendix.
Figure 6Difference in estimated daily PES coefficients and statistical tests for coefficient equality across models. The time-series plot in Panel (A) shows the development in the coefficient difference for the dynamic effect of VOC reporting on the 7-day incidence rate as , where are the daily PES coefficients from Panel A of Fig. 3 and are the corresponding re-estimated coefficients using post-sampling ratio weights. Both specifications follow the estimation setup described in the footnote of Fig. 3 and Panel A plots coefficient differences for lags (20) around the start of the first VOC reporting in a treated regions. Bars report the z Values obtained from a z-test of coefficient equality across specification as described in the notes of Table 3. Panels (B) shows coefficient differences for the effect on the 7-day incidence rate and corresponding z Values for the subsample of treated regions belonging to the top 10% in terms of VOC count. Panels (C,D) report the corresponding overall and subsample results for the hospitalization rate (all patients in intensive care).