| Literature DB >> 32707989 |
Andrea Maugeri1, Martina Barchitta1, Antonella Agodi1,2.
Abstract
While several efforts have been made to control the epidemic of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in Italy, differences between and within regions have made it difficult to plan the phase two management after the national lockdown. Here, we propose a simple and immediate clustering approach to categorize Italian regions working on the prevalence and trend of SARS-CoV-2 positive cases prior to the start of phase two on 4 May 2020. Applying both hierarchical and k-means clustering, we identified three regional groups: regions in cluster 1 exhibited higher prevalence and the highest trend of SARS-CoV-2 positive cases; those classified into cluster 2 constituted an intermediate group; those in cluster 3 were regions with a lower prevalence and the lowest trend of SARS-CoV-2 positive cases. At the provincial level, we used a similar approach but working on the prevalence and trend of the total SARS-CoV-2 cases. Notably, provinces in cluster 1 exhibited the highest prevalence and trend of SARS-CoV-2 cases. Provinces in clusters 2 and 3, instead, showed a median prevalence of approximately 11 cases per 10,000 residents. However, provinces in cluster 3 were those with the lowest trend of cases. K-means clustering yielded to an alternative cluster solution in terms of the prevalence and trend of SARS-CoV-2 cases. Our study described a simple and immediate approach to monitor the SARS-CoV-2 epidemic at the regional and provincial level. These findings, at present, offered a snapshot of the epidemic, which could be helpful to outline the hierarchy of needs at the subnational level. However, the integration of our approach with further indicators and characteristics could improve our findings, also allowing the application to different contexts and with additional aims.Entities:
Keywords: COVID-19; clustering; epidemiology; positive cases
Mesh:
Year: 2020 PMID: 32707989 PMCID: PMC7432053 DOI: 10.3390/ijerph17155286
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Characteristics of the Italian regions.
| Regions | Residents | Prevalence of Positive Cases (Per 10,000 Residents) a | Trend of Positive Cases (%) b | Number of Tests (Per 10,000 Residents) a |
|---|---|---|---|---|
| Abruzzo | 1,315,196 | 14.2 | −8.00% | 309.5 |
| Apulia | 4,048,242 | 7.3 | 1.50% | 164.1 |
| Basilicata | 567,118 | 3.4 | −10.60% | 250.6 |
| Bolzano c | 527,750 | 12.6 | −29.30% | 838.3 |
| Calabria | 1,956,687 | 3.6 | −10.20% | 198.5 |
| Campania | 5,826,860 | 4.7 | −5.30% | 148.5 |
| Emilia-Romagna | 4,452,629 | 20.3 | −26.00% | 442.6 |
| Friuli Venezia Giulia | 1,215,538 | 8.9 | −13.60% | 616.9 |
| Lazio | 5,896,693 | 7.4 | −3.90% | 255.9 |
| Liguria | 1,556,981 | 22.8 | −0.80% | 350 |
| Lombardy | 10,036,258 | 36.8 | 4.20% | 409.4 |
| Marche | 1,531,753 | 20.9 | −3.40% | 420.5 |
| Molise | 308,493 | 5.9 | −9.50% | 229.3 |
| Piedmont | 4,375,865 | 35.7 | 0.80% | 393.5 |
| Sardinia | 1,648,176 | 4.2 | −11.20% | 168.3 |
| Sicily | 5,026,989 | 4.4 | 3.80% | 170.9 |
| Trento c | 539,898 | 23.1 | −27.00% | 761.2 |
| Tuscany | 3,736,968 | 14.3 | −11.00% | 403.8 |
| Umbria | 884,640 | 2.1 | −36.20% | 438.9 |
| Valle d’Aosta | 126,202 | 8.6 | −53.60% | 641.8 |
| Veneto | 4,905,037 | 14.9 | −17.60% | 771.0 |
a Data are referred to 3 May 2020. b Weekly trend from 27 April to 3 May 2020. c Autonomous provinces.
Figure 1Description and correlation of regional indicators: (A) the distribution of Italian regions by prevalence of SARS-CoV-2 positive cases, their trend, and the number of tests performed; and (B) the correlations between the prevalence of SARS-CoV-2 positive cases, their trend, and the number of tests performed; the results are reported as Spearman’s correlation coefficient from −1 (in red) to 1 (in green).
Figure 2Scatter plots of the relationships of the number of tests performed with (A) the prevalence of SARS-CoV-2 positive cases and (B) their trend. Indicators are reported as log-transformed values.
Figure 3Dendrogram of the hierarchical clustering of regions based on Ward’s criterion.
Average silhouette width for the hierarchical clustering of regions.
| Number of Clusters | Average Silhouette Width |
|---|---|
| 2 | 0.566 (0.166) |
| 3 | 0.632 (0.099) |
| 4 | 0.493 (0.197) |
| 5 | 0.530 (0.248) |
Figure 4Scatter plot illustrating how the egional clusters were distributed on the prevalence of SARS-CoV-2 positive cases and their trend. Clustering solution was obtained by the hierarchical clustering and consolidated by the k-means algorithm.
Comparisons of the prevalence of SARS-CoV-2 positive cases, their trend and the number of tests between regional clusters.
| Clusters | Prevalence of Positive Cases Per 10,000 Residents a | Trend of Positive Cases b | Number of Tests Per 10,000 Residents a |
|---|---|---|---|
| Cluster 1 | 29.3 (15.2) | 0% (6.1) | 401.5 (56.9) |
| Cluster 2 | 7.4 (9.9) | −10.6% (12.3) | 255.9 (446.0) |
| Cluster 3 | 5.4 (6.5) | −44.9% (17.4) | 540.4 (202.9) |
| 0.011 | 0.007 | 0.356 |
Results are reported as the median (interquartile range), with p-values based on the Kruskal–Wallis test. a Data are referred to 3 May 2020 b Weekly trend from 27 April to 3 May 2020.
Figure 5Dendrogram of the hierarchical clustering of the provinces based on Ward’s criterion.
Average silhouette width for the hierarchical clustering of provinces.
| Number of Clusters | Average Silhouette Width |
|---|---|
| 2 | 0.399 (0.169) |
| 3 | 0.402 (0.229) |
| 4 | 0.368 (0.214) |
| 5 | 0.377 (0.225) |
Hierarchical clusters’ composition of provinces.
| Clusters | Provinces |
|---|---|
| Cluster 1 | Alessandria; Aosta; Asti; Belluno; Bergamo; Biella; Bologna; Brescia; Como; Cremona; Cuneo; Fermo; Firenze; Forlì-Cesena; Genova; Imperia; La Spezia; Lecco; Lodi; Mantova; Massa Carrara; Milano; Modena; Monza e della Brianza; Novara; Parma; Pavia; Pesaro e Urbino; Pescara; Piacenza; Reggio nell’Emilia; Rimini; Savona; Sondrio; Torino; Trento; Trieste; Varese; Venezia; Verbano-Cusio-Ossola; Vercelli; Verona; Vicenza |
| Cluster 2 | Arezzo; Avellino; Bari; Barletta-Andria-Trani; Benevento; Brindisi; Cagliari; Caltanissetta; Catania; Chieti; Enna; Ferrara; Foggia; Gorizia; Latina; Lecce; Livorno; Lucca; Macerata; Matera; Messina; Napoli; Nuoro; Palermo; Pistoia; Pordenone; Potenza; Prato; Ragusa; Rieti; Roma; Siracusa; Taranto; Terni; Treviso; Vibo Valentia; Viterbo |
| Cluster 3 | Agrigento; Ancona; Ascoli Piceno; Bolzano; Campobasso; Caserta; Catanzaro; Cosenza; Crotone; Frosinone; Grosseto; Isernia; L’Aquila; Oristano; Padova; Perugia; Pisa; Ravenna; Reggio di Calabria; Rovigo; Salerno; Sassari; Siena; Sud Sardegna; Teramo; Trapani; Udine |
Figure 6Scatter plot illustrating how provincial clusters were distributed on the prevalence of SARS-CoV-2 cases and their trend: (A) the clustering solution obtained by hierarchical clustering; and (B) the clustering solution obtained using the k-means algorithm.
Comparisons of the prevalence of SARS-CoV-2 cases and their trend between clusters of provinces.
| Clusters | Prevalence of Total Cases Per 10,000 Residents | Trend of Total Cases (%) |
|---|---|---|
|
| ||
| Cluster 1 | 61.0 (31.0) | 5.7% (4.4) |
| Cluster 2 | 11.2 (14.0) | 3.9% (2.4) |
| Cluster 3 | 11.6 (12.7) | 1.5% (0.9) |
| <0.001 | <0.001 | |
|
| ||
| Cluster 1 | 53.0 (38.1) | 3.2% (1.1) |
| Cluster 2 | 42.7 (44.0) | 6.7% (3.3) |
| Cluster 3 | 7.4 (8.5) | 2.2% (2.0) |
| <0.001 | <0.001 |
Results are reported as the median (interquartile range), with p-values based on the Kruskal–Wallis test. Data are referred to 3 May 2020. Weekly trend from 27 April to 3 May 2020.
K-means clusters’ composition of provinces.
| Clusters | Provinces |
|---|---|
| Cluster 1 | Ancona; Aosta; Bergamo; Biella; Bolzano; Brescia; Cremona; Enna; Ferrara; Forlì-Cesena; La Spezia; Lecco; Lodi; Lucca; Macerata; Mantova; Massa Carrara; Modena; Padova; Parma; Pesaro e Urbino; Pordenone; Prato; Ravenna; Reggio nell’Emilia; Rieti; Rimini; Sondrio; Treviso; Trieste; Vercelli |
| Cluster 2 | Alessandria; Arezzo; Asti; Avellino; Belluno; Bologna; Brindisi; Caltanissetta; Chieti; Como; Cuneo; Fermo; Firenze; Foggia; Genova; Gorizia; Imperia; Matera; Milano; Monza e della Brianza; Novara; Palermo; Pavia; Pescara; Piacenza; Pistoia; Roma; Savona; Terni; Torino; Trento; Varese; Venezia; Verbano-Cusio-Ossola; Verona; Vicenza |
| Cluster 3 | Agrigento; Ascoli Piceno; Bari; Barletta-Andria-Trani; Benevento; Cagliari; Campobasso; Caserta; Catania; Catanzaro; Cosenza; Crotone; Frosinone; Grosseto; Isernia; L’Aquila; Latina; Lecce; Livorno; Messina; Napoli; Nuoro; Oristano; Perugia; Pisa; Potenza; Ragusa; Reggio di Calabria; Rovigo; Salerno; Sassari; Siena; Siracusa; Sud Sardegna; Taranto; Teramo; Trapani; Udine; Vibo Valentia; Viterbo |