Literature DB >> 33644549

Comparative Analysis of Four Neural Network Models on the Estimation of CO₂-Brine Interfacial Tension.

Xiaojie Liu¹, Meiheriayi Mutailipu¹, Jiafei Zhao¹, Yu Liu¹.

Abstract

During the CO2 injection of geological carbon sequestration and CO2-enhanced oil recovery, the contact of CO2 with underground salt water is inevitable, where the interfacial tension (IFT) between gas and liquid determines whether the projects can proceed smoothly. In this paper, three traditional neural network models, the wavelet neural network (WNN) model, the back propagation (BP) model, and the radical basis function model, were applied to predict the IFT between CO2 and brine with temperature, pressure, monovalent cation molality, divalent cation molality, and molar fraction of methane and nitrogen impurities. A total of 974 sets of experimental data were divided into two data groups, the training group and the testing group. By optimizing the WNN model (I_WNN), a most stable and precise model is established, and it is found that temperature and pressure are the main parameters affecting the IFT. Through the comparison of models, it is found that I_WNN and BP models are more suitable for the IFT evaluation between CO2 and brine.

Entities: Chemical Disease

Year: 2021 PMID： 33644549 PMCID： PMC7906582 DOI： 10.1021/acsomega.0c05290

Source DB: PubMed Journal: ACS Omega ISSN： 2470-1343

Introduction

The CO2 content in the atmosphere is increasing year by year. According to the latest report released by the International Energy Agency[1] in March 2019, the global energy-related carbon dioxide emissions increased by 1.7% compared with last year, reaching 3.31 billion tons, and it is at a record high. The greenhouse effect caused by high CO2 content has become an urgent problem to be solved. In order to combat climate change, researchers have found that CO2 can be injected into gas reservoirs, coal seams that cannot be exploited, and deep saline alkali aquifers, or CO2-enhanced oil displacement can be used to store CO2 in the reservoir,[2] so as to reduce the CO2 content in the atmosphere. On the other hand, the interface property between CO2 and liquid is the key to control multiple flow behaviors in the CO2 geological storage process.[3] The interfacial tension (IFT) of CO2 and brine determines whether CO2 can break through capillary force and damage reservoir safety.[4] Therefore, it is very important to know exactly the CO2–brine IFT in underground CO2 storage. The IFT is mainly obtained by experimental methods, empirical formula methods, and some theoretical model methods. At present, there are a large number of experimental CO2–brine IFT data provided by researchers.[5−19] These experiments are mainly carried out by using the pendant drop method, covering a wide range of temperature and pressure, comprehensively considering other variables that may cause IFT changes, including cations (K+, Ca2+, Na+, Mg2+, etc.) in liquid and components of surrounding gas environment (nitrogen, methane, and carbon dioxide). However, the experimental method is usually time- and labor-consuming, and the experimental apparatus costs a lot of money. The theoretical methods,[20−31] including molecular dynamics simulation and theoretical calculation, give theoretical guidance for understanding the influence of temperature, pressure, salinity, and the gas environment on IFT. The molecular dynamics simulation method can observe the microcosmic characteristics of the two-phase interface more intuitively, and theoretical calculation is convenient to evaluate the influence degree of variable parameters. However, because of the huge amount of theoretical calculation, it is necessary to be proficient in molecular characteristics. The method is easy to lead to the difference of results between different groups in terms of temperature, and the prediction results are too high under high-pressure conditions. The molecular dynamics simulation method needs to be carried out under ideal conditions, and therefore, the application is limited. Many researchers try to find a reliable and convenient method to predict the IFT under various conditions. Zhang et al.[4] summarized the prediction models of Hebach et al.,[14] Chalbaud et al.,[11] and Li et al.[15,17] and evaluated the performance of pure CO2–water and pure/impure CO2–brine from the mean absolute error (MAE), mean absolute relative error (MARE), mean squared error (MSE), and determination coefficient (R2), although Li et al.’s[17] model ranks the highest among these methods, but when the IFT is greater than 60 mN/m, the model fails to guaranty its accuracy of prediction. Then, Zhang summarized up to 1716 data points and proposed an artificial neural network (ANN) model with the topology 6-10-20-1, which obtained prediction results well. Yasser et al.[32] reviewed the application of square gradient theory, linear gradient theory, density functional theory, drop shape analysis, axisymmetric drop-shape analysis, density gradient theory, and other methods for the prediction of IFT. They collected 576 data sets and established water-based binary and ternary (CO2, CH4, and N2) IFT models under elevated pressure by using neuroevolutionary technology. The MAE and R2 of the predicted data of the model are 3.34% and 0.999, respectively, which are compared with the relevant literatures,[4,11,14,15,17,33] and highest accuracy was achieved. However, these models are not suitable for complex environments because of their few parameters. In recent years, Niroomand-Toomaj et al.,[34] Partovi et al.,[35] and Chen and Yang[36] have adopted a multilayer perceptron model, least squares support vector machine, adaptive neurofuzzy inference system, radial basis function network optimized by particle swarm optimization method, adaptive neurofuzzy inference system trained by a hybrid method, and mutual solubility model; the models are established for temperature, pressure, salinity/molality concentration, and gas mole fraction, all of which have achieved high prediction accuracy. This work aims to establish the IFT model of temperature and pressure, mass molality concentration of monovalent cation, mass molality concentration of divalent cation, methane and nitrogen fraction from the existing IFT database, and to analyze the prediction effect of wavelet neural network (WNN), back propagation (BP), and radical basis function (RBF) neural network model in predicting IFT of CO2–brine. By optimizing the WNN model and comparing the BP model, the reasons for the differences were analyzed, and the advantages and disadvantages of various network models and their applicability to specific situations were compared.

Basics of Models

By simulating the information processing mode between brain neurons, the ANN conducts parallel processing and nonlinear transformation of information to establish the relationship between input data and output data. The schematic diagram of the neural network is shown in Figure . The performance of the ANN is mainly determined by the topological structure, excitation function, learning rules, and sample quality. The BP, WNN, and RBF model achieve different prediction effects by changing one or two performance conditions.

Figure 1

ANN structure.

BP Model

The BP neural network is a kind of multilayer forward feedback neural network, and the BP algorithm is used to adjust the network weight. The core of the BP algorithm is the theory of negative gradient descent, that is, the error adjustment direction is always along the direction with the fastest descent. For the n-layer network, the initial weight w(0) and threshold B(0) between each layer of the network are given a [−1,1] random matrix, i = 1...n – 1, j = i + 1. The input and output of the first layer are I1 and O1, I1 = O1= X, and X is the input data. The input and output of j-th are I = W × O + B × ones, O = F(I)(I). The activation function f( used between the input layer, the hidden layer, and the output layer is usually different, and generally are of the following two types The sigmoid function Linear function Energy error functionwhere Y is the actual output and E is the total error value. The weight and threshold learning function of gradient descent driven by quantity is adjusted by the momentum gradient descent method.[37] For t + 1, the following relation is satisfiedwhere η is the learning rate, η ∈ [0,1].

WNN Model

A wavelet is a waveform with a finite length and an average of 0. The difference between the wavelet network and BP network is that the activation function of the wavelet network is the wavelet basis function. The definition of the wavelet basis function can be seen in ref[38] Thus, the hidden layer outputa is the expansion factor, and b is the translation factor. In this paper, wavelet basis function f is selected as Morlet function WNN uses the wavelet function as the node activation function. The neural network combines the time-frequency localization function of the wavelet transform and the self-learning function of the neural network to solve the problem of poor convergence and divergence.

RBF Model

The network structure of RBF has three layers. The activation functions of the first layer and the third layer are linear functions. The Gaussian function used in the hidden layer is as followswhere x = (x1, x2, x3, ..., x)T, j is the number of input variables, c denotes the center vector, and σ is the radius or width of the hidden neuron.

Data Acquisition

The IFT between CO2 and brine is affected by many factors. In summary of previous experiments,[5,6,8,10−16,18,19] the independent variables commonly used in IFT between CO2 and brines are temperature, pressure, molality (Na+, K+, Ca2+, Mg2+), molar fraction (CH4, N2), and density difference, all of which cover all the variables that affect the changes of IFT. A total of 974 data sets[5,6,16] measured by the suspension drop method were collected in this paper, including a pure/impure CO2 gas environment, impure gas containing CH4 and N2, salt water solutes such as NaCl, KCl, CaCl2, and MgCl2. The data range is shown in Table . Studies in literature[15,16] measured the changes in IFT between CO2 and brine containing Na+, K+, Ca2+, and Mg2+; according to the experimental results, cations with identical valency obtained in the same environment had similar effect on CO2–brine IFT, so cations with identical valency were taken as the same variable.

Table 1

Ranges of CO2–Brine IFT Used in the Models

	pressure (MPa)	temperature (°C)	monovalent cation molality (mol/kg)	divalent cation molality (mol/kg)	CH₄ (mol %)	N₂ (mol %)	measured IFT (mN/m)
max	175	60.05	2.75	2.7	100	100	74
min	5.25	0.1	0	0	0	0	16.1

Model Development

In order to establish a reliable model to characterize the CO2–brine IFT and evaluate its prediction effect, the input variables include temperature T (°C), pressure P (MPa), molality of monovalent cation m1 (mol/kg), molality of divalent cation m2 (mol/kg), and mole fraction of CH4 and N2 (%). The following functional relationships can be summarized First, the target and result in the input data are normalized to make the distribution between [−1,1] Then, the collected database was randomly divided into the training set (70%) and testing set (30%). The BP, WNN, and RBF were used to establish the models and test the prediction ability. For BP and WNN models, 15 neurons are used in the hidden layer, which was the number of best prediction results for both BP and WNN models obtained by the trial and error method. In order to determine the influence of the activation function, the initial transfer vector of the WNN model is optimized as follows: 1000 transfer vectors are randomly generated, each vector carries out 200 iterations, and the transfer vector with the smallest energy error value is selected as the optimal transfer vector. Take this vector as the initial transfer vector to calculate the predicted value and final prediction error of the WNN model. The RBF model uses the newrb function of the MATLAB toolkit. The hidden layer increases by 25 neurons at a time; the final number of neurons adopts the default value, that is, the number equal to the number of training sets. The data width increases by 0.1 each time from 0.2 to 0.7; it is found that 0.5 data width can get the highest accuracy.

Results and Discussion

Result Processing

Different norms are used to evaluate the data results. Currently, norms commonly used in literature[4,39] include MAE, MARE, MSE, and R2. (mea means measured IFT, est means estimated IFT) The following norms are provided for further verification.[39] The WNN, BP, and RBF methods about the norms of the calculation results are shown in Table ; the unreliable estimated data sets of the RBF model are deleted (too high or too low). Because the initial transfer vector is randomly generated, the results of WNN and BP methods will change with the change of the initial transfer vector; this paper lists the WNN and BP methods’ single run results. The operation result will not change when the initial parameters of RBF methods are determined. It can be seen from Table that all the parts of BP and WNN and all the parameters in the training set of RBF meet the above criteria, indicating that these prediction methods are reliable and effective. In addition, regardless of whether in the training set or in the test set, it can be seen from various parameters that the BP model has the highest stability. The WNN model as a whole is second to BP, and the test of the RBF model in the training set is the closest to the actual value, which is determined by the nature of the RBF model. For the RBF model, the closer the sample is to the data center, the more likely it is to be activated. The more it deviates from the training sample in the data center, the less the influence of the RBF method on it will be.

Table 2

Different Norms Computed to Assess the BP, WNN, and RBF Models for Predicting CO2–Brine IFT

	all			training set			testing set
	BP	WNN	RBF	BP	WNN	RBF	BP	WNN	RBF
MAE	1.1557	2.6642	6.8018	1.1088	2.5956	0.4213	1.2652	2.8242	22.0167
MARE	3.2020	7.4214	17.3754	3.1115	7.2569	1.2546	3.4133	7.8057	55.8174
MSE	3.4115	13.2312	1709.1726	3.2375	12.5932	1.2357	3.8178	14.7214	5781.9453
R²	0.9797	0.9211	–9.2015	0.9811	0.9263	0.9928	0.9761	0.9080	–35.2507
k	1.0013	1.0170	0.5427	1.0007	1.0140	1.0001	1.0028	1.0241	0.2647
k′	0.9970	0.9771	1.0108	0.9977	0.9802	0.9993	0.9954	0.9698	1.0382
R_o²	1.0000	0.9965	0.5829	1.0000	0.9977	1.0000	0.9999	0.9924	0.2841
R_o^′2	0.9999	0.9937	0.9986	0.9999	0.9954	1.0000	0.9997	0.9886	0.9814
m	–0.0207	–0.0819	1.0633	–0.0193	–0.0771	–0.0073	–0.0243	–0.0930	1.0081
n	–0.0207	–0.0788	1.1085	–0.0192	–0.0745	–0.0073	–0.0242	–0.0887	1.0278
R_m²	0.8400	0.6681	19.5808	0.8461	0.6788	0.9084	0.8257	0.6442	174.8824

In order to directly observe the evaluation performance of the models, the comparison diagram of the measured value and the estimated value in the two sets of training set and testing set is drawn in Figure of the WNN, BP, and RBF models. It can be seen from Figure a–d that the reference line of the WNN and BP model is very close to the least square line, but according to Figure e,f, the data points obtained by the BP model are closer to the reference line, that is, the simulated value is closer to the actual value. The training set data sets of the RBF model almost all fall on the reference line; the closer the data is to the data center, the more accurate is the prediction. For the prediction data with large deviation, values are far from the data center or do not fall within the width of the data center.

Figure 2

Comparison between the results obtained by the models studied (i.e. BP, WNN, and RBF) in current research and the actual data of CO2–brine IFT. (m is the slope of the least-squares line).

Comparison between the results obtained by the models studied (i.e. BP, WNN, and RBF) in current research and the actual data of CO2–brine IFT. (m is the slope of the least-squares line). Figure describes the distribution of the absolute error between the measured IFT and the estimated IFT. It can be seen from it that most of the error values of the BP model are distributed within 2%, and few of them are larger than 10%. The error distribution of the WNN model is relatively average, and the RBF model error presents polarization distribution. The RBF performance is better than the WNN model in the range of low absolute error, but the performance is opposite in the range of high absolute error.

Figure 3

Error distribution diagram of CO2–brine IFT of the models (i.e. BP, WNN, and RBF).

Error distribution diagram of CO2–brine IFT of the models (i.e. BP, WNN, and RBF). From the above analysis, it can be determined that the prediction ability of the WNN model is weaker than that of the BP model when the initial transfer vector is randomly selected. In order to further determine ability of the WNN model compared with the BP model, namely, to compare the difference between the two activation functions, the IFT comparison chart of the improved WNN (I_WNN) model is shown in Figure after optimizing the WNN model of the initial transfer vector. The norms calculated are shown in Table . Compared with Table , it can be seen that the norms of the improved WNN model are still not as good as those obtained by the BP model. Therefore, from the essence of the two models, the performance of the sigmoid activation function is better than that of the wavelet basis function.

Figure 4

Comparison between the results obtained by the improved WNN model and the actual data of CO2–brine IFT. (m is the slope of the least-squares line).

Table 3

Different Norms Computed to Assess the Improved WNN Model for Predicting CO2–Brine IFT

	all	training set	testing set
MAE	2.0788	2.0592	2.1246
MARE	5.4612	5.4330	5.5272
MSE	7.3825	7.2286	7.7419
R²	0.9560	0.9577	0.9516

Comparison between the results obtained by the improved WNN model and the actual data of CO2–brine IFT. (m is the slope of the least-squares line).

Influence of Variables

The IFT is determined by the relevant parameters. Because each parameter has a different impact on IFT, it is important to ensure the weight of each parameter. In this paper, the weight and threshold value of each layer calculated by the optimized WNN model is used to determine the correlation coefficient of each parameter, as shown in Figure (the correlation coefficient of pressure is set as 1). It can be seen from the figure that pressure is the most important factor affecting IFT, followed by temperature; both of them have a high proportion, and similar conclusions can be obtained in refs.[3,17,39] The reason for this result should be the phase change of the surrounding environment. Under the critical temperature and pressure of CO2, CO2 will remain liquid, but once it exceeds this value, it will become gaseous. At this time, a sudden change of IFT will occur. Figure is drawn according to ref (15); it can be observed from Figure a that the slope of the IFT curve changes dramatically after nodes a, b, and c; as shown in Figure b, the change in IFT is also obvious when the temperature changes but not as much as the change in pressure. Therefore, in the process of the CO2 brine IFT experiment, whether the temperature and pressure can be accurately controlled is the most important condition.

Figure 5

Correlation coefficient of the independent variables on the CO2–brine IFT.

Figure 6

CO2–brine IFT scatter plots of temperature and pressure.

Correlation coefficient of the independent variables on the CO2–brine IFT. CO2–brine IFT scatter plots of temperature and pressure.

Conclusions

In order to estimate the IFT between impure CO2 and brine a in a wide range of thermodynamic properties, 974 data sets from the literature are integrated carefully and divided into the training set (70%) and testing set (30%). The WNN, BP, RBF, and improved WNN model based on the ANN model are used to measure the performance of prediction of CO2–brine IFT. The BP, RBF, and improved WNN models have their own advantages, which are suitable for different situations. The overall evaluation of the BP model is the best; the RBF model is suitable for large database models, and the I_WNN model is the most stable and reliable model, for which R2 is above 0.95 in both the training set and testing set. As a general drawback of neural networks, the prediction performance is greatly influenced by the quality of samples and the initial transfer vector; the results of each run of the BP and RBF models will be different. The I_WNN model improves the prediction stability by improves the initial transfer vector. As the main parameters of IFT, pressure and temperature have great influence on the results of IFT. Therefore, the disturbance of the surrounding environment caused by the instability of these two parameters should be avoided in the process of the IFT experiment.

9 in total

1. The interface between water and a hydrophobic gas.

Authors: Stewart K Reed; Robin E Westacott
Journal: Phys Chem Chem Phys Date: 2008-06-17 Impact factor: 3.676

2. Monte Carlo calculation of the methane-water interfacial tension at high pressures.

Authors: F Biscay; A Ghoufi; V Lachet; P Malfreyt
Journal: J Chem Phys Date: 2009-09-28 Impact factor: 3.488

3. Thermodynamic properties of methane/water interface predicted by molecular dynamics simulations.

Authors: Ryuji Sakamaki; Amadeu K Sum; Tetsu Narumi; Ryo Ohmura; Kenji Yasuoka
Journal: J Chem Phys Date: 2011-04-14 Impact factor: 3.488

4. Simultaneous application of the gradient theory and Monte Carlo molecular simulation for the investigation of methane/water interfacial properties.

Authors: Christelle Miqueu; José M Míguez; Manuel M Piñeiro; Thomas Lafitte; Bruno Mendiboure
Journal: J Phys Chem B Date: 2011-07-15 Impact factor: 2.991

5. Predicting CO₂-H₂O Interfacial Tension Using COSMO-RS.

Authors: A Silvestri; S L S Stipp; M P Andersson
Journal: J Chem Theory Comput Date: 2017-01-03 Impact factor: 6.006

6. Molecular dynamics computations of brine-CO2 interfacial tensions and brine-CO2-quartz contact angles and their effects on structural and residual trapping mechanisms in carbon geo-sequestration.

Authors: S Iglauer; M S Mathew; F Bresme
Journal: J Colloid Interface Sci Date: 2012-07-14 Impact factor: 8.128

7. Impact of pressure and temperature on CO2-brine-mica contact angles and CO2-brine interfacial tension: Implications for carbon geo-sequestration.

Authors: Muhammad Arif; Ahmed Z Al-Yaseri; Ahmed Barifcani; Maxim Lebedev; Stefan Iglauer
Journal: J Colloid Interface Sci Date: 2015-10-22 Impact factor: 8.128

8. Molecular Dynamics Simulation Study of Carbon Dioxide, Methane, and Their Mixture in the Presence of Brine.

Authors: Yafan Yang; Arun Kumar Narayanan Nair; Shuyu Sun
Journal: J Phys Chem B Date: 2017-10-10 Impact factor: 2.991

9. Modeling the Interfacial Tension of Water-Based Binary and Ternary Systems at High Pressures Using a Neuro-Evolutive Technique.

Authors: Yasser Vasseghian; Alireza Bahadori; Alireza Khataee; Elena-Niculina Dragoi; Masoud Moradi
Journal: ACS Omega Date: 2019-12-24

9 in total

1 in total

1. Pore Structure Characterization of Eocene Low-Permeability Sandstones via Fractal Analysis and Machine Learning: An Example from the Dongying Depression, Bohai Bay Basin, China.

Authors: Yan Lu; Keyu Liu
Journal: ACS Omega Date: 2021-04-23

1 in total