| Literature DB >> 35270551 |
Lisa Bauleo1, Simone Giannini2, Andrea Ranzi2, Federica Nobile1, Massimo Stafoggia1, Carla Ancona1, Ivano Iavarone3.
Abstract
The large availability of both air pollution and COVID-19 data, and the simplicity to make geographical correlations between them, led to a proliferation of ecological studies relating the levels of pollution in administrative areas to COVID-19 incidence, mortality or lethality rates. However, the major drawback of these studies is the ecological fallacy that can lead to spurious associations. In this frame, an increasing concern has been addressed to clarify the possible role of contextual variables such as municipalities' characteristics (including urban, rural, semi-rural settings), those of the resident communities, the network of social relations, the mobility of people, and the responsiveness of the National Health Service (NHS), to better clarify the dynamics of the phenomenon. The objective of this paper is to identify and collect the municipalities' and community contextual factors and to synthesize their information content to produce suitable indicators in national environmental epidemiological studies, with specific emphasis on assessing the possible role of air pollution on the incidence and severity of the COVID-19 disease. A first step was to synthesize the content of spatial information, available at the municipal level, in a smaller set of "summary indexes" that can be more easily viewed and analyzed. For the 7903 Italian municipalities (1 January 2020-ISTAT), 44 variables were identified, collected, and grouped into five information dimensions a priori defined: (i) geographic characteristics of the municipality, (ii) demographic and anthropogenic characteristics, (iii) mobility, (iv) socio-economic-health area, and (v) healthcare offer (source: ISTAT, EUROSTAT or Ministry of Health, and further ad hoc elaborations (e.g., OpenStreetMaps)). Principal component analysis (PCA) was carried out for the five identified dimensions, with the aim of reducing the large number of initial variables into a smaller number of components, limiting as much as possible the loss of information content (variability). We also included in the analysis PM2.5, PM10 and NO2 population weighted exposure (PWE) values obtained using a four-stage approach based on the machine learning method, "random forest", which uses space-time predictors, satellite data, and air quality monitoring data estimated at the national level. Overall, the PCA made it possible to extract twelve components: three for the territorial characteristics dimension of the municipality (variance explained 72%), two for the demographic and anthropogenic characteristics dimension (variance explained 62%), three for the mobility dimension (variance explained 83%), two for the socio-economic-health sector (variance explained 58%) and two for the health offer dimension (variance explained 72%). All the components of the different dimensions are only marginally correlated with each other, demonstrating their potential ability to grasp different aspects of the spatial distribution of the COVID-19 pathology. This work provides a national repository of contextual variables at the municipality level collapsed into twelve informative factors suitable to be used in studies on the association between chronic exposure to air pollution and COVID-19 pathology, as well as for investigations on the role of air pollution on the health of the Italian population.Entities:
Keywords: COVID-19; air pollution; contextual factors; data synthesis techniques; epidemiology; principal component analysis (PCA)
Mesh:
Year: 2022 PMID: 35270551 PMCID: PMC8910469 DOI: 10.3390/ijerph19052859
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Characteristics # of the contextual continuous variables for each dimension under study and air pollution exposure across the 7903 Italian municipalities.
| Variables | Label | Mean | SD | Min | p5 | p25 | p50 | p75 | p95 | Max |
|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||
| Area (km2) as of 1 January 2020 | area | 38.2 | 50.8 | 0.12 | 4.37 | 11.5 | 22.4 | 44.6 | 125.7 | 1287.4 |
| Altitude (m above sea level) | elevation | 355 | 296 | 0 | 12 | 114 | 289 | 520 | 920 | 2035 |
|
| ||||||||||
| Resident population as of 31 December 2019 (number) | population_2019 | 7622 | 42,801 | 30 | 284 | 1005 | 2459 | 6317 | 25,090 | 2,837,332 |
| Population density ratio (population over area) | pop_density | 303.7 | 649.7 | 0.80 | 12.6 | 43.5 | 105.5 | 281.5 | 1189 | 12,178 |
| Population density (cell 1 km2) | pop_maximun | 1888 | 2444 | 12 | 165 | 509 | 1079 | 2,316 | 6429 | 35,271 |
| Percentage of population over-65 years as of 31 December 2019 (%) | over 65 | 25.5 | 5.40 | 8.64 | 17.7 | 21.9 | 24.9 | 28.3 | 34.8 | 62.3 |
| Impervious Surface Areas (cell 1 km2) | ISA | 58.6 | 42.7 | 0 | 0 | 0 | 79 | 100 | 100 | 101 |
| Value of the night brightness index (cell 1 km2) | LAN | 23.1 | 41.9 | 0 | 2.22 | 7.28 | 15.2 | 28.5 | 63.6 | 1013 |
| Percentage of urban coverage (cell 1 km2) | pcturb | 41.2 | 27.6 | 0 | 0 | 22.2 | 37.6 | 59.7 | 94.4 | 100 |
| Length of the roads (cell 1 km2) | l_roads | 12,332 | 5551 | 1673 | 5426 | 8382 | 11,337 | 15,134 | 22,477 | 51,711 |
|
| ||||||||||
| Attraction index (mean 2014–2015) * | attraction_index | 23.2 | 12.2 | 0 | 6.69 | 13.8 | 21.4 | 30.7 | 45.7 | 83.1 |
| Self-containment index (mean 2014–2015) ** | self_cont_index | 34.7 | 13.3 | 2.48 | 15.5 | 25.3 | 33.2 | 42.5 | 59.7 | 89.1 |
| Extra-municipal movements *** | mov_extra | 1439 | 2633 | 1 | 79 | 267 | 638 | 1654 | 5,127 | 90,063 |
| Intra-municipal movements **** | mov_intra | 2214 | 18,293 | 0 | 24 | 163 | 483 | 1382 | 6793 | 1,284,994 |
| Total movements: number of individuals who move for work or study | mov_tot | 3653 | 20,008 | 5 | 118 | 457 | 1181 | 3126 | 11,836 | 1,340,818 |
|
| ||||||||||
| Household income (average 2014–2015 in €) ***** | income | 13,000 | 3124 | 3796 | 8037 | 10,285 | 13,453 | 15,245 | 17,612 | 29,985 |
| Entrepreneurship rate (2014–2015 average): number of companies per 100,000 res. | entrepr_rate | 62.6 | 24.2 | 9.64 | 33.3 | 47.8 | 59.5 | 73.1 | 99.6 | 407.4 |
| Cardiovascular diseases hospitalization rate (annual average 2013–2018 per 100 residents) | rate_R_cir | 1.18 | 0.31 | 0.17 | 0.80 | 0.97 | 1.12 | 1.31 | 1.76 | 4.37 |
| Respiratory diseases hospitalization rate (annual average 2013–2018 per 100 residents) | rate_R_res | 0.69 | 0.18 | 0 | 0.45 | 0.58 | 0.67 | 0.78 | 1.00 | 2.82 |
| All causes hospitalization rate (annual average 2013–2018 per 100 residents) | rate-R_tot | 4.95 | 0.67 | 1.06 | 4.03 | 4.51 | 4.88 | 5.29 | 6.14 | 12.3 |
| Cardiovascular diseases mortality rate (annual average 2013–2017 per 100 residents) | rate_M_cir | 0.46 | 0.22 | 0 | 0.20 | 0.31 | 0.42 | 0.55 | 0.85 | 2.37 |
| Respiratory diseases mortality rate (annual average 2013–2017 per 100 residents) | rate_M_res | 0.09 | 0.06 | 0 | 0.02 | 0.05 | 0.08 | 0.11 | 0.20 | 1.06 |
| All causes mortality rate (annual average 2013–2017 per 100 residents) | rate_M_tot | 1.19 | 0.43 | 0 | 0.66 | 0.89 | 1.12 | 1.38 | 1.97 | 5.62 |
|
| ||||||||||
| Minimum distance between the municipality (centroid) and a health facility (meters) | dist_healthcare_facility | 9403 | 5987 | 65 | 1753 | 5,228 | 8428 | 12,635 | 20,122 | 152,024 |
| Minimum distance between the municipality (centroid) and an emergency room (meters) | dist_er | 10,751 | 6382 | 57 | 2470 | 6274 | 9733 | 14,050 | 22,206 | 151,546 |
| Number workers in healthcare residences | workers_heacareres | 217 | 566 | 2 | 13 | 34 | 71 | 172 | 736 | 5940 |
|
| ||||||||||
| Population weighted exposure of PM2.5 (annual mean 2016–2019 µg/m3) | pm25_2016_2019_pop | 14.6 | 4.98 | 6.11 | 8.48 | 10.5 | 12.9 | 19.1 | 23.4 | 27.4 |
| Population weighted exposure of PM10 (annual mean 2016–2019 µg/m3) | pm10_2016_2019_pop | 21.1 | 6.46 | 6.62 | 11.8 | 16.0 | 20.1 | 26.1 | 32.6 | 37.5 |
| Population weighted exposure of NO2 (annual mean 2016–2019 µg m3) | no2_2016_2019_pop | 14.5 | 6.74 | 4.23 | 6.38 | 8.73 | 13.0 | 19.3 | 26.1 | 46.3 |
# mean, standard deviation (SD), percentiles (p5, p25, p50, p75, p95), minimum (min) and maximum (max) value; * number of non-resident individuals who carry out work or study activities in the municipality over the total mobility flows (active residents plus outgoing flows of residents); ** number of resident individual who carry out work or study activities in the municipality over the total mobility flows (active residents plus outgoing flows of residents); *** number of individuals who travel outside the municipality of residence for work or study reasons; **** number of individuals who move within the municipality of residence for work or study; ***** ratio between the total gross income of registered households and the total number of members of registered households.
Distribution of the 7903 Italian municipalities with respect to contextual categorical variables for each dimension under study.
| Variable | Label | Number of Municipalities | % |
|---|---|---|---|
|
| |||
| Coastal municipality | coastal | 642 | 8.12 |
| Island municipality | island | 34 | 0.43 |
| Coastal area * | coastal_area | 1165 | 14.7 |
| Degree of urbanization: | |||
| Cities or “densely populated areas” | urbanizzaztio_1 | 255 | 3.23 |
| Small towns and suburbs or “intermediate population density areas” | 2607 | 33.0 | |
| Rural areas or “sparsely populated areas” | 5041 | 63.8 | |
|
| |||
| Number of airports within 30 km from the municipality boundaries | airports_30 km | ||
| 1 | 2891 | 36.6 | |
| 2 or more | 386 | 4.88 | |
| Number of railway stations in the municipality | n_railway_station | ||
| 1 | 1296 | 16.4 | |
| 2–3 | 387 | 4.89 | |
| 4–5 | 57 | 0.72 | |
| 6 or more | 29 | 0.33 | |
|
| |||
| Socio economic position (SEP) ** | SEP_cat | ||
| Low | 1550 | 19.6 | |
| Middle-low | 1582 | 20.0 | |
| Middle | 1610 | 20.4 | |
| Middle-high | 1592 | 20.1 | |
| High | 1569 | 19.9 | |
|
| |||
| Number of teaching hospitals | num_polyclinics | ||
| 1 | 27 | 0.34 | |
| 2–3 | 12 | 0.45 | |
| 3 or more | 6 | 0.12 | |
| Number of general hospitals | num_hosp | ||
| 1 | 518 | 6.55 | |
| 2–3 | 22 | 0.28 | |
| 3 or more | 7 | 0.09 | |
| Number of public or private foundations | num_ircss | ||
| 1 | 38 | 0.48 | |
| 2–3 | 5 | 0.07 | |
| 3 or more | 3 | 0.03 | |
| Number of accredited private nursing homes | num_nh | ||
| 1 | 220 | 2.78 | |
| 2–3 | 48 | 0.61 | |
| 3 or more | 17 | 0.19 | |
| Number of acute care beds | nbeds_acute_ord | ||
| 1–10 | 19 | 0.24 | |
| 10–50 | 127 | 1.61 | |
| 51–150 | 235 | 2.97 | |
| 150 or more | 269 | 3.40 | |
| Number of long-term hospital beds | nbeds_lstay_ord | ||
| 1–10 | 99 | 1.25 | |
| 10–50 | 183 | 2.32 | |
| 51–150 | 31 | 0.39 | |
| 150 or more | 5 | 0.06 | |
| Number of rehabilitation beds | nbeds_rehab_ord | ||
| 1–10 | 61 | 0.77 | |
| 10–50 | 214 | 2.71 | |
| 51–150 | 114 | 1.44 | |
| 150 or more | 30 | 0.38 | |
| Number of intensive care beds | nbeds_ICU_ord | ||
| 1–10 | 254 | 3.21 | |
| 10–50 | 83 | 1.05 | |
| 51–150 | 14 | 0.18 | |
| 150 or more | 3 | 0.04 | |
| Number of emergency department | n_ps | ||
| 1 | 219 | 2.77 | |
| 2–3 | 7 | 0.09 | |
| 3 or more | 4 | 0.05 | |
| Number of family counseling | n_fam_counseling | ||
| 1 | 436 | 5.52 | |
| 2–3 | 123 | 1.56 | |
| 3 or more | 30 | 0.38 | |
| Number of nursing residences | n_healthcare_residences | ||
| At least 1 | 481 | 6.09 |
* municipalities with at least 0% of the surface at a maximum distance of 10 km from the sea; ** data from the 2011 Italian Census and calibrated on a regional basis; *** on 31 December 2019.
Figure 1Spearman correlation matrices among the variables within each dimension: geographic characteristic (a), demographic and anthropogenic characteristics (b), mobility (c), socio-economic-health characteristics (d), availability of health care (e). Variable labels are listed in Table 1.
Figure 2Results of the principal component analysis (PCA); the contributions of individual contextual covariates on the selected components are displayed for the five dimensions (geographic characteristic (a), demographic and anthropogenic characteristics (b), mobility (c), socio-economic and health status of the population (d), availability of health care (e)). Variable labels are listed in Table 1.
Figure 3Correlation matrix among the components resulted from the principal component analysis.
Spearman correlation coefficients among pollution variables and components resulted from the principal component analysis for the five dimensions (geographic characteristic, demographic and anthropogenic characteristics, mobility, socio-economic and health status of the population, availability of health care).
| PCA Dimension | PM2.5 | PM10 | NO2 |
|---|---|---|---|
|
| |||
| First component | 0.48 | 0.62 | 0.50 |
| Second component | −0.80 | −0.79 | −0.77 |
| Third component | 0.05 | 0.01 | −0.11 |
|
| |||
| First component | 0.42 | 0.51 | 0.58 |
| Second component | −0.35 | −0.36 | −0.33 |
|
| |||
| First component | 0.37 | 0.43 | 0.51 |
| Second component | 0.47 | 0.46 | 0.58 |
| Third component | −0.20 | −0.15 | −0.20 |
|
| |||
| First component | −0.44 | −0.46 | −0.53 |
| Second component | 0.29 | 0.17 | 0.36 |
|
| |||
| First component | 0.45 | 0.45 | 0.57 |
| Second component | −0.47 | −0.45 | −0.57 |