Ryo Iwama1, Koji Takizawa2, Kenichi Shinmei2, Eisuke Baba2, Noritoshi Yagihashi2, Hiromasa Kaneko1. 1. Department of Applied Chemistry, School of Science and Technology, Meiji University, 1-1-1 Higashi-Mita, Tama-ku, Kawasaki-shi, Kanagawa-ken 214-8571, Japan. 2. Sekisui Chemical Co., Ltd., 2-4-4 Nishitennma, Kita-ku, Osaka-shi, Osaka-fu 530-8565, Japan.
Abstract
We aim to achieve resource recycling by capturing and using CO2 generated in a chemical production and disposal process. We focused on CO2 conversion to CO by the reverse water gas shift-chemical looping (RWGS-CL) reaction. This reaction proceeds in two steps (H2 + MO x ⇆ H2O + MO x-1; CO2 + MO x-1 ⇆ CO + MO x ) via a metal oxide that acts as an oxygen carrier. High CO2 conversion can be achieved owing to a low H2O concentration in the second step, which causes an unwanted back reaction (H2 + CO2 ⇆ CO + H2O). However, the RWGS-CL process is difficult to control because of repeated thermochemical redox cycling, and the CO2 and H2 conversion extents vary depending on the metal oxide composition and experimental conditions. In this study, we developed metal oxides and simultaneously optimized experimental conditions to satisfy target CO2 and H2 conversion extents by using machine learning and Bayesian optimization. We used transfer learning to improve the prediction accuracy of the mathematical models by incorporating a data set and knowledge of oxygen vacancy formation energy. Furthermore, we analyzed the RWGS-CL reaction based on the prediction accuracy of each variable and the feature importance of the random forest regression model.
We aim to achieve resource recycling by capturing and using CO2 generated in a chemical production and disposal process. We focused on CO2 conversion to CO by the reverse water gas shift-chemical looping (RWGS-CL) reaction. This reaction proceeds in two steps (H2 + MO x ⇆ H2O + MO x-1; CO2 + MO x-1 ⇆ CO + MO x ) via a metal oxide that acts as an oxygen carrier. High CO2 conversion can be achieved owing to a low H2O concentration in the second step, which causes an unwanted back reaction (H2 + CO2 ⇆ CO + H2O). However, the RWGS-CL process is difficult to control because of repeated thermochemical redox cycling, and the CO2 and H2 conversion extents vary depending on the metal oxide composition and experimental conditions. In this study, we developed metal oxides and simultaneously optimized experimental conditions to satisfy target CO2 and H2 conversion extents by using machine learning and Bayesian optimization. We used transfer learning to improve the prediction accuracy of the mathematical models by incorporating a data set and knowledge of oxygen vacancy formation energy. Furthermore, we analyzed the RWGS-CL reaction based on the prediction accuracy of each variable and the feature importance of the random forest regression model.
A wide variety of chemicals
are made from fossil fuels, such as
crude oil, coal, and natural gas, through to petrochemical-based materials,
such as ethylene and propene. Some of these chemicals are recycled
after use, but most are incinerated and landfilled. A large quantity
of carbon dioxide (CO2) is released during the chemical
manufacturing process and incineration of the chemicals. CO2 is considered a cause of global warming.One option to reduce
CO2 emissions is by carbon capture
and storage (CCS),[1] which is a process
to separate and capture CO2 generated during combustion
and store it in a suitable place. CCS has the disadvantage that it
requires a lot of energy to store CO2. Carbon capture and
usage (CCU)[2] has therefore been attracting
attention. CCU is a method of manufacturing petrochemical-based materials
using captured CO2 as a raw material. CCU reduces CO2 emissions to the atmosphere and use of chemicals derived
from fossil fuels; consequently, it is expected to achieve resource
recycling. In addition, CO2 conversion products can be
sold as petrochemical-based materials, and thus, the CCU cost can
be recovered. However, high thermodynamic stability is an obstacle
to the use of CO2, and therefore, hydrogen (H2) is frequently used for CO2 conversion because of its
high energy content. CO2 reacts with H2 to form
a wide variety of petrochemical-based materials, such as methane,[3] methanol,[4] carboxylic
acids,[5] and carbon monoxide (CO). In this
work, we focused on the process of manufacturing CO, which can be
used for production of chemicals, such as acetic acid, or as a synthesis
gas with hydrogen to produce methanol or dimethyl ether.The
reverse water gas shift (RWGS) reaction[6] produces CO from CO2. CO2 and H2 react on a catalytic surface with CO and H2O, as shown
in the following reactionGoguet et al.[7] investigated the reactivity
of the surface species present
over a 2%Pt/CeO2 catalyst during the RWGS reaction by a
detailed operando spectrokinetic analysis. Chen et al.[8] studied the reaction mechanism over a Cu catalyst by CO2 hydrogenation. Wu et al.[9] studied
a Ni/SiO2 catalyst and focused on the production of CO
and CH4. The RWGS reaction is characterized by a simple
reaction mechanism and has been extensively studied; however, several
disadvantages exist, such as the reaction efficiency being limited
by equilibrium, the need for separation of the outlet gas, methanation,
and the need for excess H2.In this study, we focused
on the reverse water gas shift–chemical
looping (RWGS-CL) reaction.[10] The RWGS
reaction proceeds via the following two steps that are either spatially
or temporally separated via a metal oxide (MO), which acts as an oxygen carrierHence, high CO2 conversion can be achieved owing to low
H2O concentration
in the second step, which causes an unwanted backward reaction in eq . Daza et al.[11−13] investigated the effects of Cu and Sr doping on perovskites used
as the metal oxide catalyst. Siriwardane et al.[14] presented data on conversion for two different coals with
a chemical looping oxygen carrier, CuO–Fe2O3–alumina, over a range of conditions, including steam
and various levels of reduction of the oxygen carrier. Hare et al.[15−17] combined a perovskite with a wide variety of supports, such as CeO2, ZrO2, Al2O3, SiO2, and TiO2, at industrial scale. Maiti et al.[18] achieved 100% selectivity of CO generation at
low temperature (450–500 °C) using La- and Ca-based perovskite
oxides. Sun et al.[19] proposed a one-pot
method to synthesize dual-function materials that contained a sorbent
coupled with a catalyst component. Ramos et al.[20] discovered material properties that promote CO2 conversion in a perovskite by doping with Co and Mn. Ma et al.[21] presented several Co- and Mn-codoped ferrites
for a midtemperature RWGS process. Kellar et al.[22] searched for the best metal oxide from a process analysis
based on thermodynamic data. Although the RWGS-CL reaction has been
actively researched, it remains difficult to control because the reaction
involves repeated thermochemical redox cycling, and the CO2 and H2 conversions vary greatly depending on the metal
oxide composition and experimental conditions. Therefore, metal oxides
with high CO2 and H2 conversions have not yet
been developed. The objective of this study was to design metal oxides
and simultaneously optimize experimental conditions to satisfy target
CO2 and H2 conversion extents.In conventional
development of metal oxides, the next experimental
candidate, including metal oxides and experimental conditions, is
selected based on the experience and intuition of researchers. For
example, if design variables for the RWGS-CL process are assumed to
be 5 parameters, such as temperature and reactor size, with 10 candidates
assumed for each parameter, and the metal oxides are assumed to have
15 candidates, the total number of combinations is 15 × 105 (= 1 500 000), which requires a large number
of experiments and is unrealistic. Therefore, we focused on machine
learning to efficiently determine optimal metal oxides for this application.
Mathematical models of Y = f(X)
were constructed for CO2 and H2 conversions
(Y) and descriptors for metal oxides and experimental
conditions (X) using experimental results. Y-values were estimated by inputting X-values
for metal oxides and experimental conditions into the models. By using
models and estimating Y-values, it is possible to
realistically handle 1 500 000 combinations of candidates.
By selecting the next experimental candidates from estimation results,
then an X that leads to target performances can be
selected from a huge number of combinations of metal oxides and experimental
conditions. In a typical mathematical model, the applicability domain
(AD) is set, and candidates with desirable Y-values
are selected within the AD. In the case of Gaussian process regression
(GPR),[23] the candidates considered include
variability of the estimated Y-values, called Bayesian
optimization.[24] AD is useful for reliable
prediction of X-values in interpolation regions,
while Bayesian optimization is useful for exploring extrapolation
in repeated experiments.In addition to machine learning, we
used transfer learning to improve
the prediction accuracy of the mathematical models by incorporating
a data set and knowledge of oxygen vacancy formation energy (OVFE).
Method
Data
The data comprised 520 samples
that were investigated by Sekisui Chemical Co., Ltd. Test measurements
were carried out at atmospheric pressure in a quartz tube microreactor,
placed in an electric furnace. Typically, 200 mg of sample was packed
between quartz wool plugs. First, the samples were heated to 650 °C
under He. After this, the samples were reduced for 5 min in 100% H2, flushed for 5 min under He, and then 100% CO2 was flowed for 5 min, followed by another 5 min of He flushing.
The total flow rate of the feed gas into the reactor was kept constant
at 5.0 mL·min–1. This process was repeated
one time for a total of two cycles.There were 161 metal oxides.
Experimental conditions included the metal, molar rates, precipitation
method (sol–gel, coprecipitation, or solid phase), sintering
temperature, heat-up time, temperature retention time, hydrogen excess,
and reaction temperature. The outlet products were analyzed by a gas
chromatograph–mass spectrometer (GC/MS). Consumption of H2 and CO2 is connected in that each molecule consumes
and provides one oxygen atom. Hence, the molar amount of hydrogen
consumption during reduction corresponds with the molar amount of
carbon monoxide produced during CO2 reoxidation. Additionally,
the CO selectivity was almost 100% in this reaction. Therefore, H2 conversion was calculated from the ratio of the amount of
CO produced and the total amount of the inlet of H2.For the purposes of this paper, the term of the CO2 conversion
was taken to mean the instantaneous conversion ratio of CO2 gas to CO. Accordingly, the CO2 conversion was calculated
from the ratio of the amount of CO and CO2 produced in
1 min after the starting of the reaction. For this reason, the conversion
of CO2 and H2 was not the same. The CO selectivity
was almost 100% in all experiments, which was because separating carbon
and hydrogen prevents the generation of CH4.
Descriptor Calculation
Data from
the periodic table, Python Materials Genomics (pymatgen),[25] and the Materials Project[26] were used to represent structural information for the metal
oxides. Periodic table data were collected from the Chemistry Handbook
and papers[27,28] and included atomic mass, electron
count, peripheral electron count, atomic radius, ionic radius, ionization
energy, electronegativity, work function, and surface energy. Pymatgen
includes data for atomic mass, atomic orbitals, atomic radius, boiling
point, density of solid, liquid range, melting point, molar volume,
thermal conductivity, and electronegativity for metallic elements.
The Materials Project data includes energy, energy per atom, volume,
formation energy per atom, band gap, and density. Periodic table data
and pymatgen descriptors were obtained for each atom, and weighted
averages were determined using the molar rates; similarly, Materials
Project descriptors were obtained for each metal oxide, and weighted
averages determined using the molar rates.
Regression
Model
Partial least-squares
(PLS),[29] ridge regression (RR),[30] least absolute shrinkage and selection operator
(LASSO),[31] elastic net (EN),[32] linear support vector regression (LSVR),[33] nonlinear support vector regression (NLSVR),
linear Gaussian process regression (LGPR), random forest (RF),[34] light gradient boosting machine (light-GBM),[35] and GPR methods were used to build the mathematical
models.The prediction accuracy was evaluated by double cross-validation
(DCV).[36] The sample was first divided by
the number of outer folds. One group was used for validation, and
the remaining groups were used for cross-validation (CV) (inner folds)
to optimize the hyperparameters and then predict the data for validation.
This was done for all groups. The prediction accuracy of the mathematical
model was evaluated by comparing the actual and predicted Y-values of the outer CV. For comparison, the coefficient
of determination (R2DCV), mean
absolute error (MAEDCV), and root-mean-square error (RMSEDCV) were used, as respectively given bywhere y( is the Y-value of the i-th sample, yDCV( is the predicted Y-value of i-th sample by DCV, yaverage is the average of the Y-values, and n is the number of samples.
Transfer Learning
Transfer learning
is a method to improve the prediction accuracy and efficient learning
of a mathematical model by transferring knowledge from other data.
To improve the prediction accuracy of the mathematical model in this
study, we used OVFE values, which are considered to be correlated
with the extents of CO2 and H2 conversion. OVFE
is the energy released when oxygen atoms are released from metal oxides
and is typically calculated by density functional theory calculations.
In this study, we used the OVFE values of 45 compounds reported by
Deml et al.[37]In several transfer
learning algorithms, we focused on the frustratingly easy domain adaptation,[38] which can transfer knowledge by expanding data
sets. The expanded data sets XC ∈
R( and yc ∈
R( are shown
as followswhere Xexp ∈ R is the experimental condition term, XA ∈ R is the structural information term of the metal oxides from
CO2 and H2 conversion data, XB ∈ R is the structural information term of the metal oxides from
OVFE data, yA ∈ R is the CO2 or H2 conversion
extent, and yB ∈ R is the OVFE value. Furthermore, mA is the number of CO2 and H2 conversion samples, mB is number
of OVFE samples, l is the number of experimental
conditions, and n is the number of structural information
variables. Knowledge of the OVFE data was transferred to the CO2 and H2 conversion data by expanding the original
data set.
Inverse Analysis
By inputting candidates
for X-values that have not been experienced, Y-values of experimental results can be estimated without
the need for experiments. Therefore, metal oxides that have desirable Y-values are expected to be searched with only a small number
of experiments by selecting metal oxides based on prediction results.Candidates for X were generated in three parts:
metal oxides, metal molar ratio, and experimental conditions. Some
2300 metal oxides were generated by selecting 3 metals from the 25
metals that have been used in metal oxides for this application. From
this, 4707 candidates were generated in 0.01 increments so that the
sum of the molar ratios of the three metals was unity. Three types
of precipitation method (coprecipitation, sol–gel, and solid-phase)
and two types of hydrogen excess condition (hydrogen-deficient and
hydrogen-rich) were generated, resulting in a total of six candidates
for the experimental conditions. In addition, the sintering temperature
was fixed at 700 °C, the heat-up time was fixed at 180 min, the
temperature retention time was fixed at 180 min, and the reaction
temperature was fixed at 650 °C.After the estimated Y-values were obtained, the
candidates for X were evaluated. If the mathematical
model with the highest prediction accuracy was PLS, RR, LASSO, EN,
LSVR, NLSVR, RF, or light-GBM, the AD was set, and the X-candidate that had desirable Y-values within the
AD was selected. If the mathematical model with the highest prediction
accuracy was LGPR or GPR, the X-candidate that had
the highest probability of improvement (PI) value, i.e., the probability
of exceeding the maximum Y-values in the training
data, was selected.
Results and Discussion
Descriptor Calculations
The correlation
coefficients between X and Y are
shown in Figure after
calculating the structural information for the metal oxides. There
are no variables that highly correlated with Y. The
maximum correlation coefficient was 0.37 for the atomic orbital energy
and melting point, as shown in Figure (c). In addition, the data set has X variables that are highly correlated with each other, such as sintering
temperature, heat-up time, and reaction temperature, shown in Figure (a).
Figure 1
Correlation coefficients
between X and Y where X is (a) experimental conditions,
(b) periodic table data, (c) pymatgen descriptors, and (d) Materials
Project descriptors.
Correlation coefficients
between X and Y where X is (a) experimental conditions,
(b) periodic table data, (c) pymatgen descriptors, and (d) Materials
Project descriptors.
Regression
Analysis
Mathematical
models were constructed between the X and Y values calculated in Section , and each was evaluated by DCV. The number
of DCV inner folds was set to 5 and the number of outer folds was
set to 161, which was the number of metal oxides. The mathematical
models were validated by assuming that they would predict new metal
oxides. The DCV-predicted results for each variable set are shown
in Table . R2DCV in eq , MAEDCV in eq , and RMSEDCV in eq were calculated for each variable
and mathematical model, and the model with the highest prediction
accuracy is listed. For both CO2 and H2 conversions,
the prediction accuracy using nonlinear regression models was higher
than that using linear regression models. GPR gave the best prediction
ability, which is attributed to learning of the complex relationships
between X and Y by the kernel functions.
For CO2 conversion, the GPR model using the experimental
conditions, pymatgen descriptors, and Materials Project descriptors
had the highest prediction accuracy, with R2DCV of 0.546. It is considered that the prediction accuracy
could be improved by adding the electronic information in the pymatgen
descriptors and structural information for the metal oxides in the
Materials Project descriptors to the GPR model. For H2 conversion,
the GPR model with X comprising the experimental
conditions and pymatgen descriptors had the highest prediction accuracy,
with R2DCV of 0.733. H2 conversion seems to depend on the experimental conditions,
such as hydrogen excess, metal, and metal composition, because the
prediction accuracy was almost identical for each variable.
Table 1
Prediction Accuracies of CO2 and H2 Conversions for Each Model Input
CO2 conversion
H2 conversion
model inputs
method
R2DCV
MAEDCV
RMSEDCV
method
R2DCV
MAEDCV
RMSEDCV
periodic table
GPR
0.433
13.990
18.286
GPR
0.660
7.487
11.566
pymatgen descriptors
GPR
0.504
13.159
17.096
GPR
0.733
6.683
10.233
Materials Project descriptors
GPR
0.367
14.576
19.320
GPR
0.689
7.268
11.045
periodic table + pymatgen descriptors
GPR
0.441
13.701
18.158
GPR
0.713
6.888
10.626
periodic table + Materials Project descriptors
GPR
0.506
12.685
17.072
GPR
0.697
7.253
10.908
pymatgen + Materials Project
descriptors
GPR
0.546
12.445
16.359
GPR
0.714
7.092
10.604
all
RF
0.542
12.397
16.428
RF
0.682
7.676
11.186
Figure shows plots
of the measured and DCV-predicted conversions. Prediction errors were
mainly caused by samples in which the metal combination was the same
and metal compositions were slightly different, but the conversion
changed significantly: for example, the measured CO2 conversion
of Cu0.05·Ce0.37·Zr0.57 is 47% and predicted CO2 conversion is 83%; the measured
CO2 conversion of Cu0.17·Ce0.33·Zr0.50 is 91%, and predicted is 85%. Y values of other samples could be accurately predicted. The same
trend was observed for H2 conversion.
Figure 2
Measured and estimated
values of CO2 and H2 conversion by double cross-validation.
(a) Y =
CO2 conversion, X = experimental conditions,
periodic table, and pymatgen descriptors; (b) Y =
H2 conversion, X = experimental conditions and pymatgen
descriptors.
Measured and estimated
values of CO2 and H2 conversion by double cross-validation.
(a) Y =
CO2 conversion, X = experimental conditions,
periodic table, and pymatgen descriptors; (b) Y =
H2 conversion, X = experimental conditions and pymatgen
descriptors.
Transfer
Learning
We conducted transfer
learning using the X with the highest accuracy, as
determined in Section . Transfer learning Method 1 added the predicted OVFE values
to X; Methods 2 and 3, described in Section , were used in this study.
The method of autoscaling after expanding the data set as per eqs and 8 is defined as Method 2; the method of autoscaling each term, such
as Xexp, XA, and XB, before expanding the data set
as per eqs and 8 is defined as Method 3.The OVFE model was
first constructed between X, calculated as in Section , and OVFE and
then evaluated for each mathematical model by DCV. The number of DCV
inner folds was set to 5 and the number of outer folds was set to
45, which is the number of metal oxides. The mathematical model was
constructed as described in Section . The results of prediction accuracy for
each variable by DCV are shown in Table . The GPR model between X with the periodic table and pymatgen descriptors and OVFE had the
highest prediction accuracy, with R2DCV of 0.886, because there are many variables with a relatively
high correlation between OVFE and X, such as atomic
mass, solid density, melting point, and boiling point.
Table 2
Prediction Accuracy of Oxygen Vacancy
Formation Energy (OVFE) Values for Each Variable
model inputs
method
R2DCV
MAEDCV
RMSEDCV
periodic table
PLS
0.555
0.677
0.882
pymatgen
descriptors
GPR
0.832
0.451
0.593
Materials Project descriptors
LSVR
0.840
0.398
0.530
periodic table + pymatgen descriptors
GPR
0.886
0.335
0.446
periodic table + Materials Project descriptors
PLS
0.830
0.422
0.546
pymatgen + Materials Project descriptors
GPR
0.832
0.425
0.543
all
PLS
0.852
0.393
0.510
Figures and 4 show the DCV-predicted OVFE. In Figure , most samples appear
on the
diagonal, and thus, the prediction accuracy is quite high. Figure shows is no relationship
between CO2 conversion and predicted OVFE values, but metal
oxides with predicted OVFE values of 3 to 4 eV tended to have high
H2 conversion.
Figure 3
Measured and estimated values of oxygen vacancy
formation energy
(OVFE) by double cross-validation. y = OVFE, X = periodic table and pymatgen descriptors.
Figure 4
CO2 and H2 conversions and estimated values
of oxygen vacancy formation energy (OVFE) by double cross-validation.
Measured and estimated values of oxygen vacancy
formation energy
(OVFE) by double cross-validation. y = OVFE, X = periodic table and pymatgen descriptors.CO2 and H2 conversions and estimated values
of oxygen vacancy formation energy (OVFE) by double cross-validation.We then evaluated the prediction accuracies by
DCV for each transfer
learning method. The results are shown in Table . For both CO2 and H2 conversions, the prediction accuracy was improved by using transfer
learning Methods 1, 2, or 3. For CO2 conversion, Method
3 had the highest prediction accuracy. Method 2 was not able to distinguish
between the zero matrix and zero with no values; by excepting the
zero matrix for autoscaling, Method 3 was able to make this distinction
and therefore showed improved prediction accuracy. For H2 conversion, Method 1 had the highest prediction accuracy, because
it is considered to reflect a direct relationship between OVFE and
H2 conversion.
Table 3
Prediction Accuracy
of CO2 and H2 Conversions for Transfer Learning
Methods
CO2 conversion
H2 conversion
variable
method
R2DCV
MAEDCV
RMSEDCV
method
R2DCV
MAEDCV
RMSEDCV
Section 3.2
GPR
0.546
12.445
16.359
GPR
0.733
6.683
10.233
Method 1
GPR
0.550
12.428
16.290
GPR
0.737
6.712
10.160
Method 2
GPR
0.522
12.827
16.791
GPR
0.723
6.798
10.434
Method 3
GPR
0.566
12.167
16.000
GPR
0.730
6.771
10.296
Figure shows the
CO2 and H2 conversions with highest DCV-predicted
accuracy. Figure shows
the feature importance of RF. Comparing Figure with Figure , by utilizing knowledge of OVFE, the prediction accuracy
is slightly improved in the region where the value of Y is large. In CO2 conversion, the feature importance of XB in eq is high. In Method 2, the feature of XB is not high, and thus, the data set with high feature importance
of XB is considered to be suitable for
transfer learning. In H2 conversion, the feature importance
of the predicted OVFE is high, and this variable may have increased
the prediction accuracy.
Figure 5
Measured and estimated values of CO2 and H2 conversions by double cross-validation using transfer
learning.
(a) Y = CO2 conversion; transfer learning
Method 3; (b) Y = H2 conversion; transfer
learning Method 1.
Figure 6
Random forest feature
importance using transfer learning. (a) Y = CO2 conversion; transfer learning Method
3; (b) Y = H2 conversion; transfer learning
Method 1.
Measured and estimated values of CO2 and H2 conversions by double cross-validation using transfer
learning.
(a) Y = CO2 conversion; transfer learning
Method 3; (b) Y = H2 conversion; transfer
learning Method 1.Random forest feature
importance using transfer learning. (a) Y = CO2 conversion; transfer learning Method
3; (b) Y = H2 conversion; transfer learning
Method 1.
Inverse
Analysis
We created 65 232 600
candidate metal oxides as described in Section . The created candidates were input to
the regression model with the highest prediction accuracy and the
corresponding Y-values predicted. For CO2 conversion, the GPR model was constructed between X, with experimental conditions, pymatgen descriptors, and Materials
Project descriptors, and Y using Method 3. For H2 conversion, the GPR model used was constructed between X, with experimental conditions and pymatgen descriptors
and Y using Method 1. Figure shows the predicted and experimental Y-values. We were able to find a candidate with predicted Y-values that exceeded the trade-off relationship in the
experimental Y-values.
Figure 7
Comparison between experimental
(blue) and predicted (gray) Y-values.
Comparison between experimental
(blue) and predicted (gray) Y-values.Both the CO2 and H2 conversion models
used
GPR, from which we calculated the PI using the predicted Y-values and their variances. Figure shows the PI values. The 53 red samples are the Pareto
optimal solution: six representative samples are shown in Table . The Pareto optimal
solution is divided into two groups: one with high CO2 conversion
and the other with high H2 conversion. Table shows that these values are
affected by the hydrogen excess value: when the hydrogen flow is insufficient,
the extent of reaction between CO2 and MO in eq is small because of low reaction between H2 and MO in eq ; when the hydrogen flow is excessive,
the reaction between CO2 and MO in eq proceeds well,
but the H2 conversion decreases, and thus, PI values for
H2 conversion will be low. In Table , Cu and Ga, which are known to facilitate
high H2 conversion, were selected, and their molar composition
was approximately 0.4. From the high RF feature importance shown in Figure , the melting point
of Cu is larger than that of other metals and the atomic orbitals
of both metals are relatively low, accounting for the high CO2 and H2 conversions.
Figure 8
Probability of improvement
(PI) results. Red: Pareto optimal solution
(53 samples); gray: all candidates (65 232 600 samples).
Table 4
Samples of Representative Pareto Optimal
Solutions
metal oxide
log PI [-]
PICO2 [-]
PIH2 [-]
ratio of metal 1 [-]
ratio of metal 2 [-]
ratio of metal 3 [-]
precipitation method
hydrogen excess
Mg·Cu·Ga
–12.85
0.30
8.70 × 10–6
0.16
0.41
0.43
coprecipitation
excess
Mg·Cu·Ga
–12.20
0.29
1.78 × 10–5
0.18
0.48
0.34
coprecipitation
excess
K·Cu·Ga
–13.80
0.31
3.24 × 10–6
0.17
0.49
0.34
coprecipitation
excess
K·Cu·Ga
–13.93
0.32
2.84 × 10–6
0.19
0.5
0.31
coprecipitation
excess
K·Cu·Ga
–2.86
0.22
0.26
0.18
0.46
0.36
coprecipitation
less
K·Cu·Ga
–2.99
0.18
0.28
0.2
0.44
0.36
coprecipitation
less
Table 5
Samples with the Six Highest Log PI
Values
metal oxides
log PI [-]
PICO2 [-]
PIH2 [-]
ratio of metal 1 [-]
ratio of metal 2 [-]
ratio of metal 3 [-]
precipitation method
hydrogen excess
K·Cu·Ga
–2.86
0.22
0.26
0.18
0.46
0.36
coprecipitation
less
Mg·Cu·Ga
–3.22
0.22
0.18
0.17
0.41
0.42
coprecipitation
less
Ca·Cu·Ga
–4.02
0.18
0.10
0.17
0.38
0.45
coprecipitation
less
Cu·Ga·Sr
–4.48
0.17
0.07
0.42
0.42
0.16
coprecipitation
less
Mn·Cu·Ga
–6.69
0.11
0.01
0.08
0.39
0.53
coprecipitation
less
Probability of improvement
(PI) results. Red: Pareto optimal solution
(53 samples); gray: all candidates (65 232 600 samples).PI values are probabilities, and therefore, we calculated the probability
(log PI) that all Y values exceed the existing Y-values by taking the logarithm for each Y and adding these together. Figure shows the histogram of log PI. Table shows the five metal oxides with highest
log PI values. As in Table , Cu and Ga were selected; the remaining metals tended to
be alkali and alkaline-earth metals, which have a low number of electrons
in the outermost electron shell. The state of electrons on the surface
of the metal oxide is thought to be important for thermochemical redox
cycling to occur. The candidates selected in Tables and 5 were not in
the training data. Experiments with the selected candidates are expected
to develop metal oxides that exceed the Y-values
in the training data set.
Figure 9
Histogram of logarithm of probability of improvement
(log PI).
Histogram of logarithm of probability of improvement
(log PI).
Conclusion
In this study, we constructed mathematical models using machine
learning to predict Y-values, where Y is CO2 and H2 conversions. Based on combinations
of variables with the highest prediction accuracy and RF (random forest)
feature importance, the prediction accuracy for CO2 conversion
was improved by adding electronic information and structural information
for the metal oxides; H2 conversion seems to depend on
experimental conditions, such as hydrogen excess, metal, and metal
composition. Furthermore, we focused on OVFE (oxygen vacancy formation
energy) values, which are considered to be correlated with the Y-values, and conducted transfer learning. For CO2 conversion, the prediction accuracy was improved by considering
the relationship with X; for H2 conversion,
which is directly correlated with the predicted OVFE values, the prediction
accuracy was improved by using the OVFE values as X. After constructing the mathematical models, we input 65 232 600
candidates as X-values that have not been experienced,
and the corresponding Y-values were estimated without
experiments. From the predicted Y-values, we were
able to find candidates with predicted Y-values that
exceed the trade-off relationship in the experimental Y-values. By selecting candidates for the next experiments from the
estimated results based on Bayesian optimization and updating the
mathematical models with the experimental results, the candidate X metal oxides that will achieve the target Y-values are expected to be found with a small number of experiments.Although there are lots of experimental results published in the
literature regarding CO2 conversion using metal oxides,
even when the experimental results are similar in terms of CO2 conversion and metal oxides, the experimental results cannot
be compared, the prediction accuracy of models cannot be compared,
and the samples cannot be merged if experimental systems and experimental
conditions are different. The use of samples from different experimental
systems with techniques such as transfer learning would be a challenge
for the future.
Authors: Ann M Deml; Aaron M Holder; Ryan P O'Hayre; Charles B Musgrave; Vladan Stevanović Journal: J Phys Chem Lett Date: 2015-05-11 Impact factor: 6.475