Literature DB >> 35035123

Real-time CO2 emissions estimation in Spain and application to the COVID-19 pandemic.

Luis F S Merchante1,2, Delia Clar1, Alberto Carnicero1,2, Francisco J Lopez-Valdes1,2, Jesús R Jimenez-Octavio1,2.   

Abstract

CO2 emissions are one of the major contributors to global warming. The variety of emission sources and the nature of CO2 hinders estimating its concentration in real time and therefore to adopt flexible policies that contribute to its control and, ultimately, to reduce its effects. Spain is not exempted from this challenge and CO2 emissions are published only at the end of the year and as an aggregated value for the whole country, without recognising the existing differences between the regions (the so-called, Autonomous Communities). The recent COVID-19 pandemic is a clear example of the need of accurate and fast estimation methods so that policies can be tailored to the current status and not to a past one. This paper provides a method to estimate monthly emissions of CO2 for each AACC in Spain based on data that are published monthly by the relevant administrations. The paper discusses the approximations needed in the development of the method, predicts the drop in emissions due to the reduced industrial activity during the pandemic in Spain and provides the estimation of future emissions under three recovery scenarios after the pandemic.
© 2021 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  00–02; 62–07; CO2; COVID-19; Emissions; Forecast

Year:  2021        PMID: 35035123      PMCID: PMC8743041          DOI: 10.1016/j.jclepro.2021.126425

Source DB:  PubMed          Journal:  J Clean Prod        ISSN: 0959-6526            Impact factor:   9.297


Introduction

Measuring greenhouse gas emissions has been one of the main concerns of many governments for the last decades. Even if methane has the most potential in contribution to global warming, carbon dioxide (CO2) currently ranks first in affecting global warming due to its abundance in the atmosphere. In addition, CO2 is the primary greenhouse gas (GHG) originated from human activities (United States Environmental Protection Agency, 2019). The Environmental Protection Agency of the United States and the most recent report from the Mitigation of Climate Change working group of the Intergovernmental Panel on Climate Change (IPCC) of the United Nations identified that the vast majority of anthropogenic carbon dioxide emissions come from combustion of fossil fuels (principally coal, oil, and natural gas), with additional contributions coming from deforestation, changes in land use, soil erosion and agriculture (including livestock) (United States Environmental Protection Agency, 2019; Mitigation of Climate Change working group of the Intergovernmental Panel on Climate Change, 2018). Since 1970, CO2 emissions have increased by about 90%, with emissions from fossil fuel combustion and industrial processes contributing to about 78% of the total GHG emissions increase from 1970 to 2011. The variety of sources and the nature of the non-anthropogenic CO2 that is naturally present on the atmosphere difficult estimating its concentration in real time (Le Quéré et al., 2020). Consequently, CO2 emission values are normally released in an aggregated manner at the end of each year. When instantaneous concentration of CO2 emission are required, estimations must be made using proxy data which could be available almost at real time. These estimations are usually based on satellite images (Doll et al., 2000; Meng et al., 2014; Ghosh et al.; Shi et al., 2016; Nassar et al., 2017) but other approaches use proxy variables such as the fractional change in activity levels for each sector (Le Quéré et al., 2020) or other socio-economic variables (Begum et al., 2015; Hong et al., 2018) to estimate the instantaneous concentration of CO2 emissions. In this regard, a great variety of short-term and long-term forecasting techniques have been used to estimate GHG emissions. A useful review can be found in Table 1 from reference Hong et al. (2018). Most of those techniques are based on Evolutionary Algorithms (Karabulut et al., 2008; Mousavi et al., 2014; Fang et al., 2018), although Artificial Networks are also very popular on this domain (Behrang et al., 2011; Kankal et al., 2011; Ardakani and Ardehali, 2014; Guo et al., 2018; Heydari et al., 2019). Fewer references can be found on Support Vector Machines (Sun and Liu, 2016; Saleh et al., 2016; Ahmadi et al., 2019) or Regressions (Köne and Büke, 2010; Azadeh et al., 2017; Hosseini et al., 2019). Not many studies have been found testing other Machine Learning techniques, especially those based on ensemble methods (Dietterich, 2000) like Random Forest (Wei et al., 2018), Adaboost (Zhou et al., 2019) or voting of Multi-layer Perception Classifiers (Khan and Awasthi, 2019).
Table 1

Main features of regression techniques used for forecasting emissions.

CommunityDescriptionMain features
Linear RegressorFits a linear model to minimize the quadratic mean squared errorStates a simple base line of accuracy
K-Nearest Neighbors RegressorThe value is predicted by local interpolation of the nearest data in the k-neighbourhoodSimple, non-parametric, robust to noisy data
Decision Trees RegressorNon-parametric method by learning decision rules resulting in local linear regressionsNon-parametric, interpretable
Random Forest RegressorEnsemble of Decision Trees to improve generalizabilityMore resistant to overfitting, very stable
Gradient Boosting RegressorEnsemble of weak decision trees models. New predictors are fitted with mistakes committed by previous predictorsReduce bias and variance
Epsilon Support Vector RegressorFit a hyper-plane to the data transformed by a RBF kernel. Some error controlled by epsilon is toleratedComputational complexity does not depend on the input data dimensionality
Kernel Ridge RegressorCombines ordinary Least Squares with L2 penalty on the coefficients with kernel trickEfficient non-linear fitting
Main features of regression techniques used for forecasting emissions. An alternative to proxy variables are time series of CO2 emissions to be used with Grey system theory and ARIMA models (Lin et al., 2011; Pao et al., 2012; Lotfalipour et al., 2013; García-Martos et al., 2013; Yuan et al., 2016; Pao and Tsai, 2011). Unfortunately, methods that estimate CO2 emissions based on trends of the historical time series are not reliable when extraordinary events, such as the COVID-19 pandemic, arrive. Indeed, few months ago, the outbreak of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2 or COVID-19) and the global health crisis announced by the World Health Organization on March 11th, 2020 shattered many of these estimations. COVID-19 has spread around the planet and many governments have imposed lock-downs with different restrictions that affect severely the mobility and the industrial and economical activity (Bert et al., 2020; IHS Markit, 2020; Ozili & Arun, 0000; Bartik et al., 2020; del Rio-Chanona et al., 2020). These restrictions met the objective of reducing the virus spread, but they also had a significant impact in other areas, like in the environmental field, where emissions decreased to unknown levels in the last years. Thus, this unexpected scenario revealed the limitations of the aforementioned methods based on historical data to estimate CO2 emissions in real time. This study focuses on the development of a method to estimate almost realtime CO2 emission levels in Spain. According with the data from the Emission Database for Global Atmospheric Research (Joint Research Centre, 2020) and the International Energy Agency (International Energy Agency, 2020), the historical trend of CO2 emission for Spain is far from the figures from top CO2 emitting countries (Shirmohammadi et al., 2020). Spain ranks in position in contribution to the global CO2 emission, producing a 0.7% of the global share (Worldometer, 2020). The main goal of this work is to determine whether proxy variables related to energy consumption can be considered as sufficiently robust metrics for almost real time CO2 emission estimations. If the model works, it can be used to predict the effect of future policies on the generation of pollutants. To show the robustness of the model, it will be applied to the case of modeling the CO2 emissions of the Autonomous Communities1 (AACC) of Spain and to forecast these emissions under three potential scenarios of economic activity recovery after COVID-19.

Data and methodology

Data

Publicly available data per each Spanish AACC in the period 2011–2018 were retrieved from several sources (see Table 2 ). Feature selection techniques were applied, so that the CO2 emissions model for each AACC could use different predictors, in order to contribute to reveal the nature of CO2 emissions within each one. The list of variables that were explored to model the CO2 emissions in the different Autonomous Communities are listed in Table 2.
Table 2

Variables used to train the model.

VariableFrequencyAbsent AACCSource
CO2 global anthropogenicmonthlyCeutaMITECO1
CO2 non renewable energy generationmonthlyCeutaREE2
Meteorology (average temperature and precipitations)monthlyAEMET3
GBPannualINE4
GBP chained indexquarterlyCeuta and MelillaAIREF5
PopulationannualINE4
GBP per capita (computed as the division of GBP and Population)anual
Number of accidents with victims on intercity roadsmonthlyMelillaDGT6
Number of victims (deaths, severe and minor)monthlyMelillaDGT6
Energy demandmonthlyREE2
Number of passengers in urban bus tripsmonthlyBaleares, Cantabria, La Rioja, Ceuta and MelillaINE4
Number of real state operationsquarterlyMITMA7
Number of mortgagesmonthlyINE4
Overnight stays in hotelmonthlyINE4
Retail sales indexmonthlyINE4
Services sector activity indicatormonthlyCeuta and MelillaINE4
Industry sector activity indicatormonthlyCeuta and MelillaINE4
Registered workers to Social Security SystemmonthlySS8
Consumption of petrol products for transportationmonthlyCORE9
Consumption of petrol products for home heating and industrymonthlyCORE9
Consumption of petrol productsmonthlyCORE9
Public administration and other resident sectors creditsmonthlyBE10
Public administration and other resident sectors depositsmonthlyBE10
DeathsmonthlyINE4

Spanish Ministry of Environment.

Red Eléctrica Española.

State Metereological Agency.

Statistic National Institute.

Independent Authority of Fiscal Responsibility.

Directorate General for Traffic and Department of Security of Basque Government.

Ministry of Transports and Mobility.

Social Security System.

Strategic Oil Products Reserves Public Corporation.

Central Bank of Spain.

Variables used to train the model. Spanish Ministry of Environment. Red Eléctrica Española. State Metereological Agency. Statistic National Institute. Independent Authority of Fiscal Responsibility. Directorate General for Traffic and Department of Security of Basque Government. Ministry of Transports and Mobility. Social Security System. Strategic Oil Products Reserves Public Corporation. Central Bank of Spain.

Apportioning yearly reported variables into monthly figures

The most recent information about emissions was retrieved from reports from the Ministry for Ecological Transition and Demographic Challenge (Ministerio para la Transición Ecológica y el Reto Demográfico, 2020), where equivalent emissions for AACC are provided from 1990 to 2018. It should be noted that the volume of CO2 is provided only annually. Since the model to be developed in this study seeks to predict the emissions on a monthly basis (to be able to detect the impact of rapid events such as the COVID-19 crisis), this variable needed to be transformed into monthly figures before being able to include it in the model. To this end, an approach based on monthly reported energy indicators was followed. These indicators are published by Red Eléctrica Española (REE, the only high voltage electric transport operator in Spain) (Red Eléctrica Española, 2020). REE reports on the monthly emissions produced by non-renewable energy generation in Spain and on the energy produced monthly by every AACC, desegregated by energy type. The approximation is that the monthly distribution found on the emissions from non-renewable energy generation applies also to the global anthropogenic CO2 monthly distribution. This hypothesis is supported by a very similar behaviour between both annual time series as depicted in Fig. 1 . If their annual values are closely correlated, it has sense finding the same behaviour on monthly distributions.
Fig. 1

kTmCO2eq anthropogenic vs TmCO2eq from non-renewable generation (data standardized).

kTmCO2eq anthropogenic vs TmCO2eq from non-renewable generation (data standardized). The approach followed to apportion in monthly figures the yearly amount of CO2 emissions is detailed in Algorithm 1. Approach to apportion yearly reported emissions of CO2 in monthly figures

Model construction, training and validation

In this study, it was considered that regressions techniques and, more precisely, regression meta-algorithms based on ensembles could provide useful models without sacrificing accuracy. Several of those regression techniques were tested and compared by means of the coefficient, thanks to recent implementations of Machine Learning libraries that allowed the authors to train numerous modeling techniques with few efforts (Pedregosa et al., 2011; Pytorch, 2020; TensorFlow, 2020). The interpretation of the coefficient is that the closer the value is to one, the more accurate the prediction is. In this context, negative values mean that the average of the data provides a better fit to the outcomes than the predicted values. Before training, data were standardized (removing the mean and scaling to unit variance), as many regression techniques assume a Gaussian distribution of the attributes. This process is not required for regression algorithms that do not make this Gaussian assumption (e.g. Decision Trees), but it is mandatory for several others. The regression techniques used in this stud ywere: K-Nearest Neighbors Regressor, Decision Trees, Random Forest and Gradient Boosting Regressors, Epsilon-Support Vector Regression, Linear and Kernel Ridge Regressors. Table 1 shows the main features for each technique. Modelling was combined with feature selection (by means of Sequential Forward Floating Selection (Pudil et al., 1994)) and hyper-parameter searching techniques to find the best fit. All the models were trained with data from years 2011–2016, and validated with data from 2017 to 2018. The processed described above is resumed in Algorithm 2. It is only after training and validation that the model can be used to predict the emissions in 2019 and 2020. Method followed for modelling CO2 emissions

Scenarios used to predict future CO2 emissions

Finally, the model obtained was used to predict the change in emission levels in three hypothetical scenarios that can potentially occur associated to the recovery after the COVID-19 pandemic. At the time of writing this document, the Statistic National Institute of Spain had estimated a −21.5% Gross Domestic Product (GDP) fall for the quarter of 2020 (flash estimate) (Instituto Nacional de Estadística, 2020). Based on this prediction, this study hypothesized that, due to the pandemic, the GDP in Spain remained at this level until the end of 2020. Under this assumption, three potential recovery scenarios were simulated, as follows: Scenario 1, V-shape recovery: economic activity will get back to the level of January 2020 in a linear fashion by January 2022. Scenario 2, Slow V-shape recovery: similar to the previous one, but full recovery will be reached by January 2023. Scenario 3, U-shape recovery: in which the economic activity in January 2022 would be similar to that of January 2021, rising back to the values of January 2020 by January 2023.

Results

Model selection and results validation

As aforementioned, all the algorithms from Table 1 were trained and tested using 50 repetitions of a 5-fold cross-validation process. Each cross-validated training returned five whose mean and standard deviation were averaged over the 50 repetitions. Table 3 shows the average and standard deviation values of the coefficient. Best values were achieved by Random Forest and Gradient Boosting Regressors. Eventually a Gradient Boosting Regressor (Friedman, 1999, 2000) was chosen because its implementation allowed us to obtain a measure of uncertainty using lower and upper prediction intervals (Scikit-learn, 2020). As mentioned before, this technique combined with feature selection allowed the authors to identify the most relevant factors that explained CO2 emissions within each AACC. These variables are shown in Table 4 .
Table 3

Average and standard deviation values of the coefficient for the different models tested.


linear
knn
decisionTrees
randomForest
gradientBoosting
svr
krr
avgstdavgstdavgstdavgstdavgstdavgstdavgstd
Andalucia−1.041.460.170.190.120.340.600.130.510.20−0.200.23−26.2410.22
Aragon−31.2660.04−0.330.79−0.631.100.010.450.020.39−0.250.25−15.7114.45
Cantabria−1.631.77−0.280.38−1.001.12−0.180.22−0.370.39−0.130.12−26.9412.50
Castilla la Mancha−2.874.92−0.040.32−1.171.19−0.020.17−0.040.28−0.240.28−21.1614.05
Castilla y Leon−2.233.180.070.44−1.842.08−0.280.71−0.631.02−0.170.20−7.049.21
Cataluña−1.853.150.140.30−0.440.820.140.310.110.35−0.200.17−54.2621.47
Pais Vasco−1.031.82−0.000.38−0.460.690.260.170.110.10−0.120.10−16.256.54
Principado de Asturias−2.143.280.180.31−0.721.100.240.280.210.49−0.110.12−10.315.72
Comunidad de Madrid−6.866.12−0.070.41−0.400.430.100.210.150.31−0.140.16−90.5244.94
Comunidad de Navarra−3.095.39−0.030.30−1.000.56−0.060.85−0.130.75−0.080.09−18.3812.18
Comunidad Valenciana−4.197.320.000.47−0.620.660.020.270.040.39−0.070.09−29.3918.45
Extremadura−1.421.32−0.220.83−1.491.31−0.420.53−0.770.71−0.100.12−15.918.55
Galicia−2.595.340.240.24−0.030.240.420.120.390.14−0.130.16−10.492.18
Islas Baleares−3.035.380.450.470.280.310.580.250.490.31−0.310.40−36.954.01
Islas Canarias−8.7013.210.150.28−0.490.510.250.220.080.36−0.140.25−66.5216.03
La Rioja−11.9015.68−0.040.29−0.770.400.050.32−0.140.47−0.080.10−1.390.48
Region de Murcia−3.796.53−0.180.47−0.330.630.190.250.180.23−0.140.14−21.1713.26
Table 4

Relevant variables per model.

AACCvar1var2var3var4var5var6var7var8var9var10var11
Andaluciayeargbpgbp chainedpopulationgbp p.c1depositstCO2eq2
AragongbpcreditstCO2eq
Cantabriagbppopulationgbp p.ctCO2eq
Castilla la ManchayeargbppopulationtCO2eq
Castilla y Leonyearmonthgbpgbp chainedpopulationgbp p.ctrx_inmob3services idx4creditsdepositstCO2eq
CataluñagbppopulationtCO2eq
Pais Vascogbpgbp chainedpopulationgbp p.ctrx_inmobtCO2eq
Principado de Asturiasgbpgbp p.ctCO2eq
Comunidad de Madridyearpopulationgbp p.ctCO2eq
Comunidad de Navarrayeargbp p.ctCO2eq
Comunidad ValencianayeargbptCO2eq
Extremadurayearpopulationgbp p.ctCO2eq
GaliciapopulationmortgagestCO2eq
Islas BalearesyeargbppopulationtCO2eq
Islas CanariasyeargbptCO2eq
La Riojagbppopulationgbp p.cindexretailtCO2eq
Region de Murciayeargbppopulationgbp p.ctCO2eq

gdp p.c. stands for gdp per capita.

tCO2eq stands for tCO2eq from non-renewable electric generation.

trx inmob stands for number of real state operations.

Services idx stands for Index of activity of service sector.

Average and standard deviation values of the coefficient for the different models tested. Relevant variables per model. gdp p.c. stands for gdp per capita. tCO2eq stands for tCO2eq from non-renewable electric generation. trx inmob stands for number of real state operations. Services idx stands for Index of activity of service sector. As shown in Table 4, the models are from different complexity as intended with the proposed feature selection methodology. The results also show that emissions from non-renewable electric generation are possibly the best predictor of CO2 emissions because the former variable (tCO2eq) was selected by all models. The accuracy of the model can be quantified by means of the coefficient as shown in Table 5 .
Table 5

R-squared values per model (from best to worse accuracy).

CommunityAccuracy (R2)
Galicia0.929011
Castilla la Mancha0.902679
Cataluña0.858921
Andalucía0.798672
Principado de Asturias0.796600
Aragón0.772803
Castilla y Leon0.753828
Islas Baleares0.749662
Islas Canarias0.735272
Comunidad de Madrid0.684010
Comunidad de Navarra0.631332
La Rioja0.559210
Extremadura0.459273
Comunidad Valenciana0.394295
Pais Vasco0.211449
Región de Murcia−0.218722
Cantabria−2.792710
R-squared values per model (from best to worse accuracy).

Predicted CO2 emissions in each AACC in Spain in 2019 and 2020

Eventually, given the accuracy obtained in the validation, the model can be used to forecast CO2 values for 2019 and 2020. These estimations are shown in Fig. 2, Fig. 3 . These figures show validation data from 2017 to 2018 and forecast data for 2019 and 2020. The red lines are the values predicted by the models and the shaded regions are bounded by the lower and upper limits representing the and percentiles.
Fig. 2

Actual data (kTmCO2eq) along with predicted data for 2017 and 2018 and forecast values for 2019 and 2020 for every AACC models.

Fig. 3

Aggregated actual data (kTmCO2eq) and predicted data for Spain.

Actual data (kTmCO2eq) along with predicted data for 2017 and 2018 and forecast values for 2019 and 2020 for every AACC models. Aggregated actual data (kTmCO2eq) and predicted data for Spain. CO2 emission values estimated for 2019 and 2020 cannot be validated since no official figures have been published to date. However, the model predicts the decrease of emissions during 2019 since 58,6% of the electricity generated in Spain in 2019 did not emit CO2 because it came from renewable sources. The prediction also shows a decrease in emissions in the first semester of 2020 that can possibly be linked to the COVID-19 pandemic.

Required pre-analysis for scenarios forecasting

The good fit of the model is strongly associated to the predictor quantifying the emissions from non-renewable electric generation. Thus, the estimation of the anthropogenic CO2 emitted in future scenarios requires computing the predictors based on the assumptions taken for each scenario, including estimating the amounts of non-renewable electric generation in these scenarios. To overcome this difficulty, a new set of models to predict the amount of CO2 emitted from non-renewable energies per every AACC was developed using the same techniques as the ones described in the Data and methodology section. To validate the accuracy of these new models per AACC, the model used to predict the emissions of CO2 was run again using the prediction of CO2 in the generation of non-renewable energy per AACC instead of the available reported data. As it could be expected, less precise predictions were obtained than with the first set of models. Table 6 shows the variables that entered in each model. More variables needed to be considered to get to a reasonable level of results. Fig. 4, Fig. 5 show the estimation obtained.
Table 6

Relevant variables per model in the estimation of emitted CO2 on electric non-renewable generation.

AACCvar1var2var3var4var5var6var7var8var9var10
Andaluciamonthtrx_inmobss affilspetrol_movpetrol_indstrdeposits
Aragonmonthgbppopulationss affilscredits
Cantabriagbpgbp chainedpopulationtrx_inmobindexretailss affilspetrol_indstrcredits
Castilla la Manchamonthprecpopulationenergy_demandss affilscredits
Castilla y Leonyeargbpgbp chainedgbp p.cservices idxss affilscredits
Cataluñamonthpopulationcreditsdeposits
Pais Vascomonthgbpgbp chainedpopulationgbp p.chotelnightsss affils
Principado de Asturiasmonthgbpindexindustrypetrol_indstr
Comunidad de Madridmonthgbppopulationmortgagescredits
Comunidad de Navarrayearmonthgbp chainedindexretailservices idxss affilspetrol_movpetrol_indstrcreditsdeaths
Comunidad Valencianayearmonthpopulationservices idx
Extremadurayearmonthgbpgbp chainedpopulationhotelnightscredits
Galiciamonthgbppopulationss affilsdeposits
Islas Balearesgbppopulationenergy_demandtrx_inmobcreditsdeposits
Islas Canariasyearmonthgbppopulationgbp p.ctrx_inmobcredits
La Riojayearmonthpopulationmortgagesservices idxpetrol_indstrcredits
Region de Murciayearmonthtmedgbp chainedpopulationgbp p.cindexindustrypetrol_mov
Fig. 4

Actual data (TmCO2eq) on electric non-renewable generation along with predicted data for 2017 and 2018 and forecasted values for 2019 and 2020 for every AACC models.

Fig. 5

Aggregated actual data (TmCO2eq) on electric non-renewable generation and predicted data for Spain.

Relevant variables per model in the estimation of emitted CO2 on electric non-renewable generation. Actual data (TmCO2eq) on electric non-renewable generation along with predicted data for 2017 and 2018 and forecasted values for 2019 and 2020 for every AACC models. Aggregated actual data (TmCO2eq) on electric non-renewable generation and predicted data for Spain. Even if this new set of models are not extremely accurate, it was considered that they can provide a reasonable approximation to the monthly trends of CO2 emissions for every AACC.

Scenarios

After the validation explained in the previous subsection, the trained model was used to predict the amount of CO2 (kTmCO2eq) that will be emitted to the atmosphere for each of the three scenarios described above until January 2023. Results are displayed in Fig. 6 . Values from January 2019 until the moment of writing this document are colored in grey (estimations produced with actual values of the predictors). Then, and until January 2023, the values have also been produced by the above method using hypothetical values of the predictors according to the different scenarios. Scenarios 1, 2 and 3 are colored in blue, orange or green respectively.
Fig. 6

Aggregated estimated data (kTmCO2eq) on anthropogenic emissions for every scenario.

Aggregated estimated data (kTmCO2eq) on anthropogenic emissions for every scenario. The scenarios described in section Scenarios used to predict future emissions differed in the hypothesized recovery timeline of the GDP. The starting point is the second quarter of 2020 were the GDP was estimated to have fallen by 21.5% with respect the same quarter of 2019. Scenario 1 is the most optimistic of the three with a V-shape recovery; this scenario would reach by January 2022 the same levels of economic activity than those from January 2020. The remaining scenarios 2 and 3 predict V-shaped and U-shaped respectively slower recoveries, reaching the same economic activity values observed in January 2020 three years later, that is, in January 2023. From the emissions point of view, the best scenario would be scenario 3 (green curve, slow U-shaped recovery) since the integral of its curve accumulates the least amount of kTmCO2eq. Unfortunately, this scenario is also the worst from an economical perspective, as it assumes 12 months of recession at its lowest values and it is not until 2022 that the economy begins its recovery. On the contrary, the desired scenario for the economical recovery of Spain would be scenario 1 (blue curve) that is also the one with the largest amount of CO2 emitted.

Discussion

The COVID-19 crisis has caused a decrease in pollutant emissions in many places in the world. It would be expected that once the crisis are over, emissions will return to their original levels. However, it can be argued that this situation can also lead to substantial shifts in energy efficiency and to the development of alternative, cleaner, energy sources (Le Quéré et al., 2020; Peters et al., 2011). In the case of COVID-19, for instance, it has been observed that the precautions taken to avoid infection had caused a decrease in the use of public transportation associated to an increase in the use of new means of clean personal mobility such as e-scotters or e-bikes. It should be noted that restrictions to contain the propagation of COVID-19 change very rapidly depending on the growth of the infection, even on a weekly or monthly basis. That means that the human and industrial activity may change dramatically in a very short period of time, making more difficult to estimate the CO2 emissions using traditional methods. This is why the approximation shown in this work, even if it is only an approximation, can assist policy makers in predicting the impact in emissions of these restrictive policies. Since AACC in Spain can implement policies to control CO2 emissions in their territory, this paper used data retrieved at the AACC level so that the models developed here could be used by the regional authorities to design custom emission regulation policies that could be optimized for their individual characteristics. This approach resulted in training 19 different prediction models. From a modeling perspective, it is not a big issue. But having different models for each AACC complicated drawing conclusions about their similarities and differences regarding CO2 emissions. Focusing on the variables that entered in the anthropogenic CO2 and non-renewable CO2 estimation models as shown in Table 4, Table 6, it was expected that the inclusion of the levels of CO2 coming from non-renewable generation would be the key predictor for the 19 anthropogenic models. But the relevance of GDP related variables was also significant and somewhat less predictable. It seems that for those AACC with a quite predictable emission model of non-renewable CO2, GDP and time of the year variables are enough to get an accurate prediction. For those communities with less predictable emissions, several extra variables related to financial products, state agent transactions or service sector levels were required in addition. It should also be noted that the estimation of non-renewable generation emissions was more challenging, as shown by the increased number of variables required per model. Variables related to energy demand, the number of mortgages or affiliations to the social security become more relevant then. The information coded on those two tables could be of great value when designing emission reduction plans per AACC. Two major difficulties were found: apportioning in monthly figures the yearly amount of CO2 emissions and the lack of mobility data available in Spain. To cope with the first one, a model to estimate these monthly figures was also developed. Unfortunately, despite the evident influence of traffic related emissions in the global CO2 emissions, the availability of information about mobility patterns per month and per AACC in Spain was very limited and did not allow us to establish any sensible method of estimating them. Initially, data related to the number of yearly crashes and victims were used as proxy variables for mobility, as there is literature pointing to the existing relationship between exposure and injuries (Segui-Gomez et al., 2011). However even if this data is available up to 2018 it did not end up contributing to the significance of the models. Some local administrations report the actual figures of private and public transportation use, unfortunately the number of informed cities is still too low to include this predictor in the models. As mentioned above, one of the most significant predictors used in the model of anthropogenic CO2 emissions was the CO2 emissions produced in the generation of non-renewable electric energy. Recent publications describe energy generation as the second source of CO2 emissions in Spain (Ministry of Environment, 2020). However, generation values per AACC are not available and therefore published values could not be directly used as variables in the prediction models. To overcome this difficulty, we followed the approach described in the Data and methodology section that required the assumption of two hypotheses. The first one considers that the same distribution of energy per AACC applies to the CO2 emissions from the generation process. The second one considers that the monthly distribution of non-renewable emissions per AACC also applies to the monthly distribution of anthropogenic emissions. The first hypothesis seems quite reasonable. The second one is only supported on the similarity of the annual time-series that are the only actual data available. Even if this approach might present certain limitations, in the absence of other more detailed sources of data, it is the only way of estimating these values. Although it is not possible to validate these assumptions, the model predicts the decrease of emissions due to new policies on contaminant energy production during 2019 and also due COVID-19 restrictions during 2020. These results were therefore consistent with reality. The strength of the approach is that, contrary to the total level of emissions that are only reported in a yearly bases, the exact amount of CO2 produced in generation is reported monthly by the relevant authority in Spain, and therefore can be used to predict the total amount of CO2 monthly emissions, which was the goal of this paper. Last, one of the main contributions of this study is that the developed method allows quantifying economical recovery and anthropogenic emissions variables at the same time. The manuscript includes three hypothesized scenarios to show the feasibility of the method, but the method can be applied to other scenarios with different recovery lengths or more complex hypothesis affecting predictors to find an optimal solution between economic growth and keeping CO2 emissions at an affordable level, an issue that has been already identified in previous literature (Linares and Romero, 2000; Guerra et al., 2016; Lopez-Pena et al., 2012). It should be noted that the three studied scenarios were designed based on the estimation of the GDP reduction in Spain provided by the Statistic National Institute of Spain (Instituto Nacional de Estadística, 2020). The scenarios could be updated for other values of GDP change and the method developed here would serve as a tool to forecast interactions between economic recovery and CO2 emissions. For the proposed GDP reduction value, scenario 2 (orange curve in Fig. 6) might be a good compromise between reasonable recovery of economic activity and controlled emissions of CO2. However, when the economic survival of an entire country is at stake, economic recovery is likely a priority.

Conclusion

This study has proposed a method that estimates almost real time (monthly) CO2 emissions based on proxy variables related to energy production and comsumption for each AACC in Spain. The method is flexible enough so that each AACC can forecast the short-term effect of implementing diverse energy policies in the CO2 emission levels. The method has been shown capable of capturing the effects of changes in regulation and of sudden events such as the COVID-19 pandemic. The model can also be used to product a set of economic recovery scenarios after the pandemic, in which the participating variables can be estimated, to benchmark the effects of different energy policies in the emissions of CO2. The advantage of a data-centric policy definition is that, if the hypotheses about the predictors are revealed to be wrong, the scenarios might be updated with actual values whenever possible and policies can be dynamically improved. Future work should include improving the complexity of the modeling techniques introducing theory of Recurrent Neural Networks that might capture better the correlation of the predictors and their evolution in time. For instance, the possibility of including mobility data at the AACC level and for a number of years can contribute largely to improving the predicting capabilities of the model.

CRediT authorship contribution statement

Luis F.S. Merchante: Conceptualization, Funding acquisition, Data curation, Supervision, Formal analysis. Delia Clar: Data curation. Alberto Carnicero: Conceptualization, Supervision, Formal analysis. Francisco J. Lopez-Valdes: Funding acquisition, Supervision, Formal analysis. Jesús R. Jimenez-Octavio: Conceptualization, Funding acquisition, Supervision, Formal analysis.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  5 in total

1.  Exposure to traffic and risk of hospitalization due to injuries.

Authors:  Maria Segui-Gomez; Francisco J Lopez-Valdes; Francisco Guillen-Grima; Ernesto Smyth; Javier Llorca; Jokin de Irala
Journal:  Risk Anal       Date:  2010-10-12       Impact factor: 4.000

2.  Growth in emission transfers via international trade from 1990 to 2008.

Authors:  Glen P Peters; Jan C Minx; Christopher L Weber; Ottmar Edenhofer
Journal:  Proc Natl Acad Sci U S A       Date:  2011-04-25       Impact factor: 11.205

3.  Can China fulfill its commitment to reducing carbon dioxide emissions in the Paris Agreement? Analysis based on a back-propagation neural network.

Authors:  Daoyan Guo; Hong Chen; Ruyin Long
Journal:  Environ Sci Pollut Res Int       Date:  2018-07-24       Impact factor: 4.223

4.  Forecasting CO2 emissions in Hebei, China, through moth-flame optimization based on the random forest and extreme learning machine.

Authors:  Sun Wei; Wang Yuwei; Zhang Chongchong
Journal:  Environ Sci Pollut Res Int       Date:  2018-08-14       Impact factor: 4.223

5.  The impact of COVID-19 on small business outcomes and expectations.

Authors:  Alexander W Bartik; Marianne Bertrand; Zoe Cullen; Edward L Glaeser; Michael Luca; Christopher Stanton
Journal:  Proc Natl Acad Sci U S A       Date:  2020-07-10       Impact factor: 11.205

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.