| Literature DB >> 33746458 |
Frank Nielsen1, Gautier Marti2, Sumanta Ray3, Saumyadipta Pyne4.
Abstract
Social distancing and stay-at-home are among the few measures that are known to be effective in checking the spread of a pandemic such as COVID-19 in a given population. The patterns of dependency between such measures and their effects on disease incidence may vary dynamically and across different populations. We described a new computational framework to measure and compare the temporal relationships between human mobility and new cases of COVID-19 across more than 150 cities of the United States with relatively high incidence of the disease. We used a novel application of Optimal Transport for computing the distance between the normalized patterns induced by bivariate time series for each pair of cities. Thus, we identified 10 clusters of cities with similar temporal dependencies, and computed the Wasserstein barycenter to describe the overall dynamic pattern for each cluster. Finally, we used city-specific socioeconomic covariates to analyze the composition of each cluster. © Indian Statistical Institute 2021.Entities:
Keywords: COVID-19.; Clustering; Mobility; Optimal transport; Time series; Wasserstein distance
Year: 2021 PMID: 33746458 PMCID: PMC7961163 DOI: 10.1007/s13571-021-00255-0
Source DB: PubMed Journal: Sankhya B (2008) ISSN: 0976-8386
Figure 1The dendrograms show 3 hierarchical clusterings of cities aHC1 (N, M, t), b HC2 (N, ΔM, t), and c HC3 (N, , t) using Ward’s linkage. Based on visual inspection of the seriated distance matrix, 10 clusters were identified in each case, as shown on the heatmaps
Figure 2HCMapper is used for comparing 3 hierarchical clusterings: HC1(N, M, t), HC2(N, ΔM, t) and HC3(N, , t). The cluster sizes and divergences across the clusterings are shown with blue rectangles and grey edges respectively
Figure 3The geographic distribution of the 10 clusters of COVID-19 affected U.S. cities as identified by HC3 are shown. The county corresponding to each city is mapped in its cluster-specific color
Details of the 10 clusters of COVID-19 affected U.S. cities as identified by HC3
| Id | Size | Members (FIPS code of the corresponding counties) |
|---|---|---|
| 1 | 9 | 4027, 6001, 6013, 6019, 1073, 4019, 1097, 1101, 4013 |
| 2 | 11 | 12011, 10003, 11001, 9003, 9009, 9001, 8059, 8123, 8005, 8031, 8041 |
| 3 | 14 | 6029, 6037, 6059, 6065, 6073, 6067, 6071, 6075, 6081, 6083, 6085, 8001, 6107, 6111 |
| 4 | 16 | 55101, 55059, 55079, 53053, 55009, 53061, 53077, 49035, 49049, 53033, 51510, 51760, 48201, 48375, 48439, 48453 |
| 5 | 26 | 39153, 40109, 41051, 39095, 39099, 39049, 39061, 46099, 47037, 47149, 47157, 42101, 44003, 44007, 45079, 42095, 42069, 42077, 42003, 42011, 48157, 48113, 48121, 48141, 48029, 48085 |
| 6 | 16 | 34021, 34031, 34039, 35001, 36001, 36029, 36061, 39035, 37183, 38017, 37119, 37067, 37081, 36067, 36119, 37063 |
| 7 | 19 | 22071, 23005, 24510, 25005, 25009, 25013, 25017, 25021, 25023, 26049, 25025, 25027, 27123, 26163, 27053, 26125, 26161, 26081, 26099 |
| 8 | 10 | 33011, 34007, 34013, 34017, 28049, 29510, 31055, 31109, 32003, 32031 |
| 9 | 13 | 12031, 12057, 12095, 12099, 12071, 12086, 12103, 12105, 17043, 13089, 13095, 13121, 17031 |
| 10 | 17 | 22033, 22051, 19153, 19193, 20209, 21111, 22017, 17089, 17201, 17097, 17197, 18003, 18057, 18089, 18097, 18141, 19013 |
Figure 4The boxplots show the differences across the identified 10 clusters of cities identified by HC3 in terms of the values of the 8 most significant covariates: a Reaction Time (RT), b Hispanic percent, c Black percent, d population size, e senior percent, f population density 2010, g persons per household, and h SVI ses. We jittered the overlapping RT points for easy visualization
Figure 5The overall temporal pattern of dependency between normalized measures of mobility and COVID-19 incidence for each cluster of cities identified by HC3 is shown along 3 dimensions (N, , t). The Wasserstein barycenters of the 10 clusters are depicted within the unit cube with the darker dots representing later points in time (z-axis)
Figure 6The most significant of the static city-specific covariates in discrimination of the 10 clusters identified by HC3. The contributions towards each cluster are measured by a the embedded method of RF classifier (MDI), and b the mean Shapley values for each covariate