| Literature DB >> 32611104 |
Abstract
This paper proposes a cluster-based method to analyze the evolution of multivariate time series and applies this to the COVID-19 pandemic. On each day, we partition countries into clusters according to both their cases and death counts. The total number of clusters and individual countries' cluster memberships are algorithmically determined. We study the change in both quantities over time, demonstrating a close similarity in the evolution of cases and deaths. The changing number of clusters of the case counts precedes that of the death counts by 32 days. On the other hand, there is an optimal offset of 16 days with respect to the greatest consistency between cluster groupings, determined by a new method of comparing affinity matrices. With this offset in mind, we identify anomalous countries in the progression from COVID-19 cases to deaths. This analysis can aid in highlighting the most and least significant public policies in minimizing a country's COVID-19 mortality rate.Entities:
Mesh:
Year: 2020 PMID: 32611104 PMCID: PMC7328914 DOI: 10.1063/5.0013156
Source DB: PubMed Journal: Chaos ISSN: 1054-1500 Impact factor: 3.642
FIG. 1.Smoothed number of clusters as a function of time, defined in Sec. II A. In (a), the blue and orange curves track the number of clusters for cases and deaths, respectively, from 12/31/2019 to 04/30/2020. In (b), the curves are shown after translation by the optimal series evolution offset, defined in Sec. III, computed to be . There is a strong similarity between the two curves up to this offset: both peak at 17 clusters before declining, suggesting reduced spread in the data.
FIG. 2.Heat maps track the changing cluster membership of the 15 most severely impacted countries with respect to their counts of (a) cases and (b) deaths, respectively. Cluster membership, determined by Ckmeans.1d.dp, depicts COVID-19 severity relative to the rest of the world. Clusters are ordered with 1 being the worst impacted at any time. Darker and lighter colors correspond to smaller and greater numbered cluster labels and represent worse and less affected clusters, respectively.
FIG. 3.Cluster evolution dendrograms, defined in Section II B for (a) cases and (b) deaths. These apply hierarchical clustering to the distance between adjacency matrices at varying times , thereby grouping different dates according to the cluster structures at these times. The -axis excludes the first 50 days for cases and 66 days for deaths, as the cluster structure of counts is trivial before these periods, respectively. Each cluster is an unbroken interval of dates. There is a clear break in the cluster structure between 03/01 and 03/02 for cases, and 03/18 and 03/19 for deaths, with a 17-day difference.
Cluster consistency offset for various adjacency and affinity matrices at different starting dates. These are determined by minimizing the normalized total offset difference in Eq. (3), as well as its analog for Gaussian and adjacency matrices. The parameter m is defined in Eq. (2).
| Optimal cases vs deaths offset | |||||
|---|---|---|---|---|---|
| Start date | Gaussian | Gaussian | Gaussian | Adj | Aff |
| 12/31/2019 | 16 | 16 | 16 | 20 | 16 |
| 01/13/2020 | 12 | 13 | 14 | 20 | 15 |
| 01/21/2020 | 12 | 13 | 14 | 19 | 15 |
| 01/31/2020 | 12 | 13 | 14 | 19 | 15 |
FIG. 4.The normalized total offset difference as a function of the offset , defined in Eq. (3). The convex nature of this plot indicates that is a globally optimal value.
The ten most anomalous countries in progression from cases to deaths as defined by their anomaly score from Sec. IV and a lag of τ = 16. AE: United Arab Emirates, AT: Austria, AU: Australia, BD: Bangladesh, BE: Belgium, BY: Belarus, CA: Canada, CN: China, DE: Germany, DO: Dominican Republic, ES: Spain, FR: France, ID: Indonesia, IE: Ireland, IL: Israel, IN: India, IR: Iran, IT: Italy, JP: Japan, KR: South Korea, ME: Mexico, MY: Malaysia, NL: Netherlands, NO: Norway, QA: Qatar, SG: Singapore, SW: Sweden, TR: Turkey, UA: Ukraine, UK: United Kingdom, US: United States, ZA: South Africa.
| Ten most anomalous countries: inconsistency matrix analysis | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Date | A1 | A2 | A3 | A4 | A5 | A6 | A7 | A8 | A9 | A10 |
| 01/28/2020 | US | UK | IT | IL | IE | IR | ID | IN | DE | FR |
| 02/07/2020 | US | DO | IT | IL | IE | IR | ID | IN | DE | FR |
| 02/17/2020 | SG | JP | KR | AU | MY | US | DE | FR | AE | CA |
| 02/27/2020 | IR | SG | MY | IT | AU | US | DE | UK | AE | CA |
| 03/08/2020 | IT | IR | SG | MY | DE | AE | CA | JP | ES | US |
| 03/18/2020 | ES | SG | IT | IR | AE | UK | NL | FR | US | KR |
| 03/28/2020 | QA | ES | TR | UK | SG | KR | AE | BY | US | IT |
| 04/07/2020 | QA | SG | KR | UK | CN | UA | NO | ZA | AU | TR |
| 04/17/2020 | BD | QA | SG | UK | AU | KR | BE | ZA | AT | FR |
| 04/27/2020 | QA | SG | BD | ME | AU | UK | SW | BE | DE | IL |
FIG. 5.(a) depicts the affinity matrix for case counts at 04/27/2020, (b) depicts the deaths affinity matrix for 04/11/2020, and (c) depicts the inconsistency matrix with an offset of from Table I. Only countries with greater than 5000 cases at 04/30 are included and ordered alphabetically along the axes. The more prominent the respective row and column in the inconsistency matrix, the more anomalous the country. The three most prominent anomalies in (c) are Qatar, Singapore, and Bangladesh.
Cluster-based evolution analysis
Mathematical objects and definitions.
| Mathematical objects glossary | |
|---|---|
| Object | Description |
| Distance matrix between log counts | |
| Aff( | Standard affinity matrix |
| Gaussian affinity matrix | |
| Unsmoothed number of clusters obtained as average of six methods | |
| Smoothed number of clusters | |
| Adj( | Adjacency matrix coding cluster outputs for |
| Frobenius distance between adjacency matrix of various dates | |
| Series evolution offset with respect to number of clusters | |
| Cluster consistency offset with respect to cluster membership | |
| Inc( | Lag-adjusted inconsistency matrix |
| Anomaly score of country | |
| Lag-adjusted death rate of country | |