| Literature DB >> 32886309 |
Marco Mele1, Cosimo Magazzino2.
Abstract
This study uses two different approaches to explore the relationship between pollution emissions, economic growth, and COVID-19 deaths in India. Using a time series approach and annual data for the years from 1980 to 2018, stationarity and Toda-Yamamoto causality tests were performed. The results highlight unidirectional causality between economic growth and pollution. Then, a D2C algorithm on proportion-based causality is applied, implementing the Oryx 2.0.8 protocol in Apache. The underlying hypothesis is that a predetermined pollution concentration, caused by economic growth, could foster COVID-19 by making the respiratory system more susceptible to infection. We use data (from January 29 to May 18, 2020) on confirmed deaths (total and daily) and air pollution concentration levels for 25 major Indian cities. We verify a ML causal link between PM2.5, CO2, NO2, and COVID-19 deaths. The implications require careful policy design.Entities:
Keywords: COVID-19; Economic growth; India; Machine learning; Pollution; Time series
Mesh:
Substances:
Year: 2020 PMID: 32886309 PMCID: PMC7472938 DOI: 10.1007/s11356-020-10689-0
Source DB: PubMed Journal: Environ Sci Pollut Res Int ISSN: 0944-1344 Impact factor: 4.223
List of the variables
| Variable | Description | Source |
|---|---|---|
| PCGDP | GDP per capita in 2000 US$ (converted at Geary Khamis PPPs) | FRED |
| CO2 | CO2 emissions (metric tons per capita) | OGD |
| NO2 | NO2 emissions (metric tons per capita) | World Bank |
| PM2.5 | Primary particulate matter (metric tons per capita) | OGD |
Exploratory data analysis
| Variable | Mean | SD | Minimum | Maximum | Ex. kurtosis | 10-Trim | IQR |
|---|---|---|---|---|---|---|---|
| PCGDP | 3.1942 | 1.9372 | 2.1931 | 8.1946 | 0.8745 | 3.95 | 1.6910 |
| CO2 | 0.9164 | 0.1911 | 0.1975 | 1.9782 | 0.1794 | 0.86 | 0.4937 |
| NO2 | 5.1946 | 0.1943 | 5.1416 | 6.5946 | − 0.9634 | 5.10 | 0.5795 |
| PM2.5 | 0.8512 | 0.1845 | 0.1867 | 1.9245 | 0.1864 | 0.84 | 0.4765 |
Fig. 1The ML process
Results for unit roots and stationarity tests
| ADF | ERS | PP | KPSS | |
|---|---|---|---|---|
| Level | ||||
| PCGDP | − 2.765 (− 3.124) | − 1.008 (− 3.120) | − 2.120 (− 3.040) | 0.450*** (0.145) |
| CO2 | − 2.790 (− 3.005) | − 1.142 (− 3.150) | − 2.150 (− 3.680) | 0.375*** (0.145) |
| NO2 | − 1.195 (− 3.086) | − 1.790 (− 3.710) | − 3.746 (− 3.785) | 0.150*** (0.145) |
| PM2.5 | − 2.456 (− 3.142) | − 1.158 (− 3.145) | − 2.125 (− 1.580) | 0.350*** (0.145) |
| First differences | ||||
| PCGDP | − 6.350*** (− 2.008) | − 5.522*** (−2.950) | − 6.488*** (− 2.960) | 0.314* (0.460) |
| CO2 | − 3.052*** (− 2.378) | − 1.792 (−2.622) | − 8.350*** (− 2.644) | 0.240 (0.460) |
| NO2 | − 3.480** (− 1.900) | − 2.378*** (−2.005) | − 8.005*** (− 2.916) | 0.110 (0.460) |
| PM2.5 | − 3.050** (− 2.250) | − 1.850 (−2.422) | − 8. 125** (− 2.486) | 0.250 (0.460) |
*p < 0.1; **p < 0.05; ***p < 0.01. 5% Critical values are given in parentheses
Toda-Yamamoto causality tests results
| Dep. variable | PCGDP | CO2 | NO2 | PM2.5 |
|---|---|---|---|---|
| PCGDP | - | 2.125 (0.250) | 2.265 (0.300) | 2.150 (0.450) |
| CO2 | 11.450*** (0.000) | - | 0.607 (0.450) | 0.845 (0.350) |
| NO2 | 11.159*** (0.000) | 2.305 (0.380) | - | 3.402 (0.450) |
| PM2.5 | 10.260*** (0.000) | 3.120 (0.280) | 0.848 (0.500) | - |
*p < 0.1; **p < 0.05; ***p < 0.01
Rank of predictor and significant causality results
| Rank of predictor | Number of repetitions | Percentage (%) | AC | AUPRC |
|---|---|---|---|---|
| PM2.5 → deaths | 17,945 | 0.87 | 4.948 | True |
| NO2 → deaths | 17,168 | 0.86 | 4.795 | False |
| CO2 → deaths | 18,191 | 0.92 | 4.741 | False |
| lnPM2.5 → lnDeaths | 18,178 | 0.91 | 4.124 | False |
| dPM2.5 → dDeaths | 18,197 | 0.74 | 4.172 | False |
| dNO2 → dDeaths | 18,189 | 0.74 | 4.124 | False |
| d.lnPM2.5 → d.lnDeaths | 21,419 | 0.73 | 4.189 | False |
| d.lnNO2 → d.lnDeaths | 21,444 | 0.70 | 4.144 | False |
| PM2.5(s) → deaths(s) | 16,973 | 0.70 | 4.277 | False |
| NO2s → deaths(s) | 16,211 | 0.74 | 4.211 | False |
In Appendix Table 6: D2C core commands
AC: Average Causality value. AUPRC: Area Under the Precision Recall Curve. True P-val. < 0.05. False P-val. ≥ 0.05. ln is the logarithmic transformation; (s) is the square of the considered values; there are 7 unused variables
D2C core commands
| numpy import | |
| _scipy import poly | |
| _scipy.linalg import | |
| _def d2c (sys,method='zoh'): | |
| sycs: continous system ss or tf | |
| if is intance (sys,TransferFunction): | |
| sys=tf2ss(sys) | |
| flag=1 | |
| n=shape (a) [0] | |
| nb=shape (b) [1] | |
| nc=shape (c) [0] | |
| nd=shape (d) [1] | |
| nf=shape (f) [0] | |
| ng=shape (g) [1] | |
| nh=shape (h) [0] | |
| ni=shape (i) [1] | |
| nl=shape (l) [0] | |
| nm=shape (m) [1] | |
| if b[0,0]==1: | |
| A=0 | |
| B=b/sys.dt | |
| C=c | |
| D=d | |
| E=e | |
| F=f | |
| H=h | |
| I=i | |
| L=l | |
| M=m | |
| elif method== 'foh': | |
| A=(2/Ts)*(a-I)*inv(a+i) | |
| B=tk*iab | |
| c=tk*(c*inv(I+a)) | |
| D=d-(c*iab) |
Fig. 2Importance test results. Source: our elaboration with BGML
Fig. 3Predictive linear regression test. Sources: our elaboration with NN designer
Fig. 4Perform training test. Sources: our elaborations with NN designer
Fig. 5Concentration (μg/m3) PM2.5 in Delhi. Sources: our elaboration on hourly data. https://openaq.org/