| Literature DB >> 32471302 |
Raffaele Fronza1, Marina Lusic2,3, Manfred Schmidt1, Bojana Lucic2,3.
Abstract
The global outbreak of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection causing coronavirus disease 2019 (COVID-19) has reached over five million confirmed cases worldwide, and numbers are still growing at a fast rate. Despite the wide outbreak of the infection, a remarkable asymmetry is observed in the number of cases and in the distribution of the severity of the COVID-19 symptoms in patients with respect to the countries/regions. In the early stages of a new pathogen outbreak, it is critical to understand the dynamics of the infection transmission, in order to follow contagion over time and project the epidemiological situation in the near future. While it is possible to reason that observed variation in the number and severity of cases stems from the initial number of infected individuals, the difference in the testing policies and social aspects of community transmissions, the factors that could explain high discrepancy in areas with a similar level of healthcare still remain unknown. Here, we introduce a binary classifier based on an artificial neural network that can help in explaining those differences and that can be used to support the design of containment policies. We found that SARS-CoV-2 infection frequency positively correlates with particulate air pollutants, and specifically with particulate matter 2.5 (PM2.5), while ozone gas is oppositely related with the number of infected individuals. We propose that atmospheric air pollutants could thus serve as surrogate markers to complement the infection outbreak anticipation.Entities:
Keywords: PM2.5; SARS-CoV-2; infection dynamics; ozone; viral outbreak
Mesh:
Substances:
Year: 2020 PMID: 32471302 PMCID: PMC7354543 DOI: 10.3390/v12060588
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
The correlation vectors between population density (PD), PM2.5, PM10, NH3, O3, dew-point temperature (TD), temperature (T), relative humidity (RH), wind (W), pressure (P) and number of SARS-CoV-2 cases in Italian provinces (Cas).
| PD | Cas | |
|---|---|---|
| PD | 1.000 | 0.034 |
| PM10 | 0.377 | 0.586 |
| PM2.5 | 0.354 | 0.597 |
| NH3max | 0.084 | 0.693 |
| O3max | −0.244 | −0.444 |
| TD | 0.045 | −0.327 |
| T | 0.094 | −0.284 |
| RH | −0.108 | −0.253 |
| WIND | 0.006 | −0.327 |
| P | 0.188 | −0.025 |
The correlation matrix among the pollutant variables (PM2.5, PM10, NH3, O3) and the atmospheric variables (TD, T, RH, and P) in the Italian provinces.
| TD | T | RH | WIND | P | |
|---|---|---|---|---|---|
| PM10 | −0.209 | −0.136 | −0.282 | −0.441 | 0.269 |
| PM2.5 | −0.300 | −0.221 | −0.336 | −0.520 | 0.192 |
| NH3 | −0.227 | −0.168 | −0.253 | −0.377 | 0.164 |
| O3 | 0.471 | 0.351 | 0.502 | 0.439 | 0.025 |
Figure 1Correlation between SARS-CoV-2 cases in 107 Italian provinces using PM2.5, PM10 and O3. The scatterplots display the values of three atmospheric factors and the number of cases per million in 107 provinces (left panels) and selected provinces with regional capitals (middle panels). (A) PM2.5; (B) PM10; (C) O3. Different colors represent Italian provinces at different latitudes. Red dots: provinces with a latitude higher than 44.84N; black dots: provinces with a latitude comprised between 41.50N and 44.86N; green dots: provinces with a latitude lower than 41.50N. (D–F) The scatterplots display the concentration of PM2.5, PM10, and O3. Red dots represent Italian provinces, blue dots FGS regions.
Figure 2Evaluation of the cross-influence between PM2.5 and O3. Lower panel: the scatter plots between the selected factor and the number of cases per million. The LOESS curve is computed in each scatterplot and shown in red. Each plot is restricted to show datapoints that belong to the provinces that fall in the corresponding range of the conditioning factor. Upper plot: the range of the values that define each level of conditioning. The overlap among the levels is 0.1. (A) the scatterplot of PM2.5 conditioned to PM2.5; (B) the scatterplot of O3 conditioned to O3; (C) the scatterplot of O3 conditioned to PM2.5.
Figure 3ANN performance assessment. (A) Performance values of the ANN on the 107 Italian provinces (left) and FGS (right) data. SE sensitivity, SP specificity, ACC accuracy, PRC precision. The dots represent the ANN average performance based on 100 Monte Carlo cross-validations. Bars represent the standard deviation. Red dots Italian provinces, blue dots FGS regions and black dots random dataset. (B) The histogram represents the number of the escalated Italian provinces (107) considering the PM2.5 and O3 concentrations in four months, March, June, September, and December, representative of the four seasons: Spring, Summer, Autumn, and Winter. Red bars, number of escalated provinces, grey bars, remaining non escalated provinces. Statistical analysis were performed using multiple t test corrected with the Sidak–Boneferroni method for multiple comparisons (p < 0.001). Black asterisk indicates that the classifier performs better than the null classifier. Red asterisk indicates that the classifier performs worse than the null classifier. (C–F) The spatial administrative maps representing PM2.5 (μg/m3, red), O3 (ppm, blue), prediction (number of positive predictions, orange) and actual reported cases (number of cases per million, black) for the Italian provinces. (E) The color intensity on the map represents the number of times that provinces in the test dataset were positive for the outbreak in one hundred Monte Carlo cross-validations. (F) The number of actual reported cases was limited to 3000 per million to increase the dynamic range of the map.
Figure 4Correlation between hospitalized and not hospitalized cases in 21 Italian regions using PM2.5 and O3. (A) Scatter plot of the hospitalized (red points) and not hospitalized (black points) versus the concentration of PM2.5 in 21 Italian regions. Red line, positive correlation (R2 = 0.4891, p = 0.0004); black line, no significant correlation (R2 = 0.01154, p = 0.6430). The labels represent the four regions with the highest number of cases per million. (B) Scatter plot of the hospitalized (red points) and not hospitalized (black points) versus the concentration of ozone in 21 Italian regions. Red line, negative correlation (R2 = 0.2355, p = 0.0257); black line negative correlation (R2 = 0.1590, p = 0.0734). The labels represent the four regions with the highest number of cases per million.