| Literature DB >> 36060216 |
Md Yusuf Sarwar Uddin1, Rezwana Rafiq2.
Abstract
The spread of the COVID-19 pandemic is observed to follow the shape of "waves" (i.e., the rise and fall of population-adjusted daily new infection cases with time). Different geographic regions of the world have experienced different position and span of these waves over time. The presence and strength of these waves broadly characterize the dynamics of the pandemic spread in a given area, so their characterization is important to draw meaningful intervention and mitigation plans tailored for that area. In this paper, we propose a novel technique to represent the trend of COVID-19 spread as a sequence of a fixed-length text string defined on three symbols: R (rise), S (Steady), and F (fall). These strings, termed as trend strings, enabled us searching for specific patterns in them (such as for waves). After analyzing county-level infection data, we observe that, US counties-despite their wide variation in trend strings-can be grouped into a number of heterogeneous classes each of which might have a representative COVID spread pattern over time (in terms of presence and propensity of waves). To this end, we conduct a latent class analysis to cluster 3142 US counties into four distinct classes based on their wave characteristics for one year pandemic data (January 2020 to January 2021). We observe that counties in each class have distinct socio-demographics, location, and human mobility characteristics. In short summary, counties have differing number of waves (class 1 counties have only one wave and class 3 counties have three) and their positions also vary (class 1 had the wave later in the year whereas class 3 had waves throughout the year). We believe that this way of characterizing pandemic waves would provide better insights in understanding the complex dynamics of COVID-19 spread and its future evolution, and would, therefore, help in taking class-specific policy interventions.Entities:
Keywords: COVID-19 pandemic; Human mobility; Latent class analysis; Pandemic waves; Spread patterns
Year: 2022 PMID: 36060216 PMCID: PMC9428116 DOI: 10.1016/j.patrec.2022.08.017
Source DB: PubMed Journal: Pattern Recognit Lett ISSN: 0167-8655 Impact factor: 4.757
US County-level Summary Statistics (Number of counties, N = 3,142)
| Variable | Source | Description |
|---|---|---|
| COVID-19 Infection rate | Number of daily new cases per 1000 people | |
| Test completed | Number of COVID-19 tests completed per 1000 people | |
| Imported cases | Number of daily external trips by infectious persons from out of county | |
| Social distancing index (An integer from 0–100 measuring the extent of social distancing) | Weighted sum of % of people staying home and reduction of human movement in different aspects, such as work trips, non-work trips, and distance traveled. ‘0’ indicates no social distancing; ‘100’ denotes all residents are staying at home and no visitors are entering the county | |
| Percentage change in non-workplace visits | Percentage change in visits to non-workplace (retail, recreation, grocery and pharmacy) with respect to the baseline (Jan 3 - Feb 6, 2020) | |
| Age above 60 | % of people older than 60 years | |
| African-Americans | % of African-Americans | |
| Population Density | Population density as total number of people/total land area (sqmile) | |
| Metro status | Metro status based on 2013 NCHS urban-rural classification; 1=metro, 0=nonmetro | |
Fig. 1(a) Daily infection cases over the entire US, (b) Fraction of counties exceeding a certain infection count.
Notations used in this paper
| Defining Pandemic Waves | |
| Index to denote a county | |
| Number of counties | |
| Time interval index | |
| Daily infection rate at time | |
| Moving average of daily infections | |
| The slope of the trend line | |
| The cutoff threshold of slopes for RISE and FALL | |
| Pattern string defined on alphabet {R, F, S} | |
| Index to denote a class/group (total | |
| Attribute of a county | |
| Possible values of attribute | |
| Binary indicator if county | |
| Covariates of county | |
| Class probabilities or mixing probabilities | |
| Class-conditional probabilities | |
| Coefficients of covariates for class |
Fig. 2Daily infection trend for Worcester county, Maryland (FIPS code 24047). The corresponding pattern string (SSRFR) appears at the bottom.
Fig. 3Conceptual diagram of the Latent Class Analysis
Fig. 4Median daily new cases per class and density plot of wave heights.
Class conditional probabilities of indicator variables (-table)
| Class 1 | Class 2 | Class 3 | Class 4 | |
|---|---|---|---|---|
| Early wave? | ||||
| Yes | 0.00 | 0.00 | 0.00 | |
| No | 0.00 | 1.00 | ||
| Mid wave? | ||||
| Yes | 0.00 | 1.00 | ||
| No | 0.00 | 0.53 | 0.00 | |
| Late wave or rise? | ||||
| Had wave (rise & fall) | 0.00 | |||
| Had only rise | 0.20 | 0.13 | 0.45 | |
| No wave or rise | 0.03 | 0.00 | 0.00 | 0.00 |
| Height of early wave | ||||
| Wave does not exist | 0.00 | 1.00 | ||
| Low | 0.00 | 0.00. | 0.33 | 0.00 |
| Medium | 0.00 | 0.00 | 0.33 | 0.00 |
| High | 0.00 | 0.00 | 0.34 | 0.00 |
| Height of mid wave | ||||
| Wave does not exist | 0.00 | 0.00 | ||
| Low | 0.00 | 0.16 | 0.24 | |
| Medium | 0.00 | 0.30 | 0.16 | 0.39 |
| High | 0.00 | 0.31 | 0.16 | |
| Height of late wave | ||||
| Neither wave nor rise | 0.03 | 0.00 | 0.00 | 0.00 |
| Low | 0.27 | |||
| Medium | 0.28 | 0.35 | 0.33 | 0.33 |
| High | 0.33 | 0.30 | 0.22 | |
| Number of waves | ||||
| Zero | 0.22 | 0.00 | 0.00 | 0.00 |
| One | 0.00 | 0.17 | ||
| Two | 0.01 | 0.11 | ||
| Three or more | 0.00 | 0.08 | 0.01 | |
| Metropolitan status | ||||
| Metro area | 0.31 | 0.27 | ||
| Non-metro area | 0.56 | 0.45 |
Fig. 5Pandemic wave patterns of four classes. (a) Class 1 has one late wave, (b) Class 2 has mid and late waves, (c) Class 3 have mixture of waves in all windows, (d) Class 4 has mid wave and late rise.
Coefficients of active covariates with respect to class 1 (-table)
| Active covariates | Class 2 | Class 3 | Class 4 |
|---|---|---|---|
| Intercept | 3.016 | 2.316 | -0.681 |
| Frac. of people aged 60+ | -4.390*** | -5.268*** | -3.239** |
| Frac. of African-Americans | 2.385*** | 6.789*** | 6.159*** |
| Social distance index | |||
| In early wave | -0.039** | 0.037** | 0.022 |
| In mid wave | 0.003 | 0.045* | 0.164*** |
| In late wave | 0.009 | -0.061** | -0.172*** |
| Test done per 1K (in log) | |||
| In early wave | 1.923*** | 1.087 | -0.244 |
| In mid wave | -1.571 | -1.990 | -7.231*** |
| In late wave | -0.064 | 0.637 | 5.690*** |
| Imported cases (in log) | |||
| In early wave | -0.733*** | 1.133*** | -0.437* |
| In mid wave | 2.933*** | 0.828** | 7.339*** |
| In late wave | -2.178*** | -1.734*** | -5.793*** |
| *, **, *** mean 10%, 5% and 1% level of significance respectively | |||
Class-wise probability-weighted summary for covariates
| Active covariates | Class 1 | Class 2 | Class 3 | Class 4 |
|---|---|---|---|---|
| People aged 60+ (%) | 27.0 | 25.0 | 24.0 | 24.0 |
| African-Americans (%) | 3.0 | 6.0 | 13.0 | 19.0 |
| Social distance index | 27.3 | 26.8 | 28.3 | 28.1 |
| Imported cases (early) | 3151 | 2649 | 17649 | 4881 |
| Population density | 85.7 | 101.6 | 547.7 | 260.5 |
| Cumulative inf. cases | 81.9 | 108.3 | 115.9 | 101.3 |
| Work % change | -22.5 | -22.6 | -24.9 | -24.3 |
| Non-work % change (early) | -11.8 | -7.8 | -16.3 | -11.3 |
| Non-work % change (mid) | 12.3 | 4.3 | -1.6 | -7.4 |
| Non-work % change (late) | -12.9 | -15.6 | -18.9 | -16.8 |
Fig. 6Distribution of four latent classes across the US
Distribution of latent classes across four US regions
| Region | Class 1 | Class 2 | Class 3 | Class 4 |
|---|---|---|---|---|
| (#counties) | (972) | (891) | (701) | (578) |
| Northeast | 109 (PA) | 8 (PA) | 94 (NY) | 6 (NY) |
| Midwest | 445 (MO) | 334 (KS) | 255 (IN) | 21 (IN) |
| South | 227 (KY) | 407 (TX) | 292 (GA) | 496 (TX) |
| West | 191 (MT) | 142 (CA) | 60 (CO) | 55 (CA) |
US County-level Summary Statistics (Number of counties, N = 3,142)
| Variable | Source | Description | Window 1 | Window 2 | Window 3 |
|---|---|---|---|---|---|
| Jan 22–May 30 | Jun 1–Sep 18 | Sep 19–Jan 21 | |||
| COVID-19 Infection rate | Number of daily new cases per 1000 people | 0.02 | 0.12 | 0.49 | |
| COVID-19 status | |||||
| Test completed | Number of COVID-19 tests already completed per 1000 people | 47.20 | 316.96 | 798.73 | |
| Imported cases | Number of daily external/inter-regional trips by infectious persons from out of state/county | ||||
| Human Mobility | |||||
| Social distance index (An integer from 0–100 measuring the extent of social distancing) | A weighted sum of percentage of people staying home and reduction of human movement in different aspects, such as work trips, non-work trips, and distance traveled. ‘0’ indicates no social distancing; ‘100’ denotes all residents are staying at home and no visitors are entering the county | 29.37 | 25.29 | 28.75 | |
| Percentage change in non-workplace visits | Percentage change in visits to non-workplace (retail, recreation, grocery and pharmacy) with respect to the baseline (Jan 3 - Feb 6, 2020) | -11.70 | 1.96 | -15.99 | |
| Socio-demographic and Location Characteristics | |||||
| Age older than 60 | Percentage of people older than 60 years | 25.27 | 25.27 | 25.27 | |
| African-Americans | Percentage of African-Americans | 8.92 | 8.92 | 8.92 | |
| Population Density | Population density as total number of people/total land area (sqmile) | 224.44 | 224.44 | 224.44 | |
| Metropolitan status | Metropolitan status indicator based on 2013 NCHS urban-rural classification scheme; 1=metro, 0=nonmetro | 0.37 | 0.37 | 0.37 | |