Literature DB >> 36061681

Novel Correlation for Calculating Water Saturation in Shaly Sandstone Reservoirs Using Artificial Intelligence: Case Study from Egyptian Oil Fields.

Abstract

The accurate determination of water saturation in shaly sandstone reservoirs has a significant impact on hydrocarbons in place estimation and selection of possible hydrocarbon zones. The available numerical equations for water saturation estimation are unreliable and depend on laboratory core analysis. Therefore, this paper attempts to use artificial intelligence methods in developing an artificial neural network model (ANN) for water saturation (Sw) prediction. The ANN model is developed and validated by using 2700 core measured points from the fields located in the Gulf of Suez, Nile Delta, and Western Desert of Egypt, with inputs including the formation depth, the caliper size, the sonic time, gamma rays (GRs), shallow resistivity (Rxo), neutron porosity (NPHI), the photoelectric effect (PEF), bulk density, and deep resistivity (Rt). The study results show that the optimization process for the ANN model is achieved by distributing the collected data as follows: 80% for training and 20% for testing processes, with an R 2 of 0.973 and a mean square error (MSE) of 0.048. In addition, a mathematical equation is extracted out of the ANN model that is used to estimate the formation water saturation in a simple and direct approach. The developed equation can be used incorporating with the existing well logs commercial software to increase the accuracy of water saturation prediction. A comparison study is executed using published correlations (Waxman and Smits, dual water, and effective models) to show the robustness of the presented ANN model and the extracted equation. The results show that the proposed correlation and the ANN model achieved outstanding performance and better accuracy than the existing empirical models for calculating the formation water saturation with a high correlation coefficient (R 2) of 0.973, lowest mean-square error (MSE) of 0.048, lowest average absolute percent relative error (AAPRE) of 0.042, and standard deviation (SD) of 0.24. To the best of our knowledge, the current study and the proposed ANN model establish a novel base in the estimation of formation water saturation.

Entities: Chemical

Year: 2022 PMID： 36061681 PMCID： PMC9435039 DOI： 10.1021/acsomega.2c01945

Source DB: PubMed Journal: ACS Omega ISSN： 2470-1343

Introduction

Clays are considered significant among the constituents of rocks by log analysis, as they form 40–50% of the mineral components of shale. Clay minerals are structured as sheets of silica tetrahedron lattices. Within the clay sheets, there are usually excessive negative electrical charges. This resulted in a local electrical imbalance within clay particles. The Archie[1] water saturation equation considers the formation water as the only electrically conductive material in the formation. The presence of conductive clay materials requires modifying the Archie equation or generating new models to incorporate the rock resistivity. Over the years, different models have been proposed to relate the fluid saturation and resistivity, assuming that shale exists in specific geometric forms (laminated, dispersed, and structural). These model parameters contain a clean sand term defined by Archie[1] and a clay term, as shown in eq .where Ro is the rock resistivity saturated with water, ohm m; F is the formation factor, dimensionless; and Rw is the water resistivity, ohm m. All these models follow the clean sand Archie equation when the clay fraction is zero; for small amounts of shale (5–10%), most models yield quite similar results. The presence of clay minerals in a sandstone reservoir leads to reduction in the reservoir’s storage and reduces the reservoir’s ability to transmit fluids by reducing the porosity and permeability, respectively. The clay minerals’ occurrence in sandstone with freshwater leads to an overly pessimistic water saturation value unless corrections are made. Moreover, the presence of clay minerals on salty formation water makes the recorded deep resistivity (Rt) too low, and this led to an increase in the values of formation water saturation. Both cases lead to bypassed production as these zones will be considered erroneous, as noncommercial. Therefore, to determine the accurate water saturation in shaly sand oil reservoirs, the Archie water saturation equation[1] must be modified, but unfortunately no adequate model exists that accounted for the fundamental electrical behavior in shaly sands. Consequently, the entire water saturation equations available for shaly sands are of empirical nature. The well-known shaly sand models for water saturation are the Waxman and Smits model[2] dual water model,[3] effective medium model,[4] and Simandoux model.[4] These models are selected to be used in the comparative study in this work based on the data collected from Egypt oil fields, as these models are widely used by petrophysicists in water saturation calculation in such fields. The Simandoux model is based on laboratory experiments for conductivity estimation.where Csh is the shale conductivity, and Cw is the water conductivity. The main drawback of this model is that it yields water saturation values quite low in shaly sand formation containing clay minerals. The Waxman and Smits model[2] is based on the laboratory measurements of cation exchange capacity (CEC), and the main drawback of this model is not available during the log analysis. This model has been modified by Juhasz.[5] Juhasz used the formation density log to estimate porosity, while the dual water model uses the neutron density cross plot. The type of clay is included in the Waxman and Smits model in the estimation of effective porosity. The general equation for water-saturated sands is then obtainedwithwhere Co is the specific conductance of sand, Cw is the specific conductance of the brine, λNae is the maximum equivalent ionic conductance of sodium exchange ions, γ is the empirical constant, and Qv is the effective concentration of clay exchange cations. A simple method to determine Qv: The dual water model[3] is indirectly based on the cation exchange capacity. Basically, the pore volume is divided into bound water (Swb) or shale water and free water, also called sand water (Swf). Bound and free waters contributed to the conductivity of the shaly sand. Both these waters have their own resistivity, Rwb and Rwf, respectively. The amount of bound water is directly related to the volume of clay in the shaly sand formation. With an increase in clay volume, more bound water will occupy the total pore space (Φt). Archie’s formula[1] can be written as follows:where Ct = 1/Rt is the formation conductivity, and CwM is the conductivity of mixed water (bound and free).where ΦT is calculated from LDT-CNL [Φql = (ΦD + ΦN)/2]; SwB and SwB ≈ Vcl from clay indicators; RwF from the water zone; and RwB from the 100% water zone. The effective medium model[4] is a theoretical one used to calculate water saturation based on porosity and resistivity measurements. The model can be used for different types of clay minerals including laminated and dispersed shale distributions. The model based on the assumption that the matrix and hydrocarbons can be treated together is as follows:where Rd is the dispersed phase resistivity (the combination of the matrix and hydrocarbons). The literature review showed that several water saturation models are developed to estimate the formation water saturation in the presence of clay minerals. These models have drawbacks, and none of them has the ability to predict precisely the water saturation in the hydrocarbon zones. The main weakness in the published correlations is their strong dependency on the experimental core analyses that requires a long experimental time. Hence, this study aims to develop a new ANN model with a supervised algorithm to accurately estimate the water saturation profile in shaly sand oil fields located in Egypt from conventional well log data. In order to fulfill this purpose, more than 2700 datasets of core points from numerous Egyptian oil fields located in Western Desert, Nile Delta, and Gulf of Suez are collected. The data include the measured petrophysical properties to be used in the developed model. The new presented correlation incorporates new parameters including gamma rays, the caliper size, and photoelectric effect (PEF).

Artificial Neural Network

In this study, the back-propagation algorithm (BP) is used as a learning process to supervise the neural net. The forward step is used to send signals to the input layers, and the backward one is used to calculate the proposal error between the field and target outputs. Weights are used for each layer to adjust the mean-square error (MSE) during the backward operation,[6] as followswhere n1 and n2 are the number of training and output neurons, respectively, and xp and yp are the target and estimated outputs, respectively. The employed function used in the back-propagation algorithm in this study is a sigmoid curve called the logistic function. The network speed of the convergence process can be improved by adding an acceleration technique[7] as follows:where α is the energy constant, w is the weight, and Δw is the weight difference. The learning and momentum constants are set in the range of 0 and 1.[8,9]

Application of ANN To Predict Water Saturation

Neural network systems have become increasingly popular in engineering applications. This is partly due to the fact that intelligent animals can solve problems which are impossible for even the most powerful modem computers and partly because of the desire by engineers and computer scientists to explore and exploit parallel hardware systems and apply them to solve practical problems. In petroleum engineering, successful applications include drill bit diagnosis,[10] seismic processing,[11] identification of well test interpretation model,[12] flow measurements,[13] identification of well productivity,[14] and wireline log analysis.[15] Also, McCormack and Day[16] and Fogelman-Soulie[17] provided some introductory articles on the use of neural networks in the petroleum industry. Artificial intelligence provides numerous benefits for petrophysical evaluation. Several researchers[18,19] have used ANN models, particularly feed-forward back-propagation neural networks (FFNNs), to develop more accurate predictions of the reservoir rock properties that include water saturation and porosity. An AI algorithm[20] is developed for shaly sandstone reservoirs to predict water saturation, with the mean-square error (MSE) of 0.064. Hamada et al.,[21] used PSONN as an optimization algorithm for water saturation estimation for clean sandstone formations, and their results show that the new hybrid PSONN model outperforms some available methods with a lower root-mean-square error of 0.009 and an R2 of 0.95. Aydin et al.[22] proposed a model for forecasting coal consumption in Turkey. The data used in Aydin’s model are divided into two groups for training and testing processes, and the results show MSE achieved of 0.025. An artificial intelligence model is proposed[23] to calculate water saturation for two reservoirs in the Middle East. The model is based on a three-layer neural net to predict saturation in the formation and yields a correlation factor of 0.91 with an error of 0.025. Amiri et al. (2015)[24] proposed an ANN model with different well log data. The results showed that the model is precise for the forecasting process with the correlation factor R2 of 0.97. An artificial neural network model is proposed[25] to evaluate the saturation of water in gas reservoirs based on competitive algorithm. A total of 2200 data points taken from 12 wells have been collected to build the model. The results indicated that the model is efficient. The developed ANN framework consists of four different structures based on the tan sigmoid function to predict water saturation from the well log data. The results show that the proposed model is more robust that the dual water model. Kamalyar et al. (2011)[26] proposed an ANN methodology for water saturation prediction for oil wells in southern Iran. Permeability, porosity, and other well logs were used as the input data, and saturation of water profile was the target. Helle and Bhatt (2002)[27] presented an ANN model based on nine trained neural networks. The input data used in their model include density, resistivity, and sonic logs, and saturation was the target. Numerous studies have been presented for water saturation calculation based on the regression method.[28−30] Gomaa et al. (2022)[31] proposed an ANN model for SW estimation using 383 core samples. The input data used in the model include porosity, permeability, and resistivity index. The results show that the ANN model gives a coefficient of determination of 0.99 and an average relative error of 0.13, and MSE = 0.066. Most of the previous attempts rarely provide a single equation to determine water saturation from well logs to overcome the drawbacks in the existing empirical models. Consequently, the main aim of this study is to provide a novel correlation using the ANN model. Furthermore, in this study, a new convergence technique is provided to rapidly predict the target data by adding two new parameters in the ANN methodology: step size and momentum.[30] The traditional ANN works that have been presented in the literature exhibit a drawback in the convergence process. Thus, this study presents two new parameters used to speed up and overcome the abovementioned problems, namely, step size and momentum. Furthermore, the learning rate is incorporated with the BP algorithm.

Collected Data Analysis

Description

Groups of datasets from different fields in Egypt[19] were used in developing the ANN model. The data comprise nine inputs used for the training: the formation depth (DEPTH), the caliper size (CALI), the sonic time (DTR), gamma rays (GRs), shallow resistivity (LLS), neutron porosity (NPHI), the photoelectric effect (PEF), bulk density (RHOB), and deep resistivity (Rt). Data are normalized in a range of 0 and 1 (see Table ).

Table 1

Statistical Analysis of the Data Used To Validate the ANN Model

parameter	formation depth (DEPTH), ft	caliper size (inch) CALI	sonic time (μs/ft) DTR	gamma ray (GR), API	shallow resistivity (LLS), ohm m	neutron porosity (NPHI)	photo electric effect (PEF)	bulk density (RHOB), g/cc	deep resistivity (Rt), ohm m
min	7870	8.04	45.364	7.88	0.3756	0.006	0.036	1.321	0.33
max	9430	25.836	109.78	138.25	255.8	65.14	10	5.70	231.01
standard deviation	460.179	2.6786	12.755	29.460	17.708	3.99	1.94	0.5491	15.941
skewness	1.1862	3.0877	–0.1815	0.0967	6.956	7.84	0.794	1.729	7.470
mean	8267	8.642	74.715	56.696	3.7466	0.209	3.1718	2.602	2.8117

Data Acquisition and Analysis

Distribution of Input Data

The data are divided into training (80%) and testing groups (20%) by using a randomization function. The distribution is used to train the ANN model to create a correlation between inputs and formation water saturation. The learning algorithm that optimizes the training data by reducing the error between the target and actual water saturation is back-propagation (BP). The BP learning algorithm provides exceptional results with an R2 of 0.973 and MSE = 0.048 compared to other algorithms, including scaled conjugate gradient (SCG) and one-step secant (OSS), as shown in Figure .

Figure 1

Correlation factor R2 (a) and mean-square error (b) for three different algorithms.

Optimization for ANN Neurons

Levernberg Marquardt algorithm[32] is used to optimize the number of neurons and hidden layers. It can be seen from Figure that the optimal neuron number is 15 with MSE = 0.048.

Figure 2

Mean-square error versus number of neurons tested during the training and testing processes.

Mean-square error versus number of neurons tested during the training and testing processes. Figure shows a flowchart of the steps involved in the ANN model used in this study to estimate the water saturation. First, the data are collected from conventional well logs. Next, numerous ratios are tested for the training and testing processes. Moreover, the ANN model parameters are optimized, including the number of hidden layers, number of neurons, learning constant, and training functions. Table summarizes the optimized parameters used in this study.

Figure 3

ANN model developing flowchart.

Table 2

Optimized Parameters for the ANN Model

parameter	tested range	optimized parameters
number of neurons	2–35	15
hidden layer number	1–3	1
algorithm function	tan sig/log sig	tan sig
learning rate	0.001–0.8	0.03

ANN model developing flowchart. Another test is executed to test how strongly output data (Sw) are related to input data (including depth, caliper size, sonic time, GR, LLS, NPHI, PEF, RHOB, and Rt) by using the correlation coefficient (CC). Figure shows that Sw is strongly dependent on GR, NPHI, RHOB, PEF, and Rt, with CC = 0.578, 0.465, 0.385, 0.294, and −0.142, respectively. It can be seen from Figure that Sw has a direct relationship with GR, NPHI, RHOB, and PEF and an inverse relationship with Rt. At high GR, which indicates shale zones, water saturation increases with low Rt values.

Figure 4

Correlation coefficients of water saturation vs inputs.

Results and Discussion

To optimize the ANN model parameters, a sensitivity analysis is performed including different runs to show the deviation between the actual measured core data and the predicted water saturation data. The nine inputs mentioned before give minimum errors with the highest correlation factor R2 = 0.973. Approximately 2000 points are used for the training process, and 700 data points are used for testing. During the training process, an iterative operation is executed using different number of neurons. The results show that using 10 neurons in the neural net gives an R2 of 0.91 (see Figure a), while using 15 neurons gives an optimum R2 of 0.973 (see Figure b), with MSE = 0.021. The results of the testing process show that the optimal R2 of 0.965 and MSE = 0.035 are found with the use of 15 neurons, while 10 neurons give an R2 of 0.91, as shown in Figure a,b, with MSE = 0.068. Figure shows an exceptional match between the measured saturation of core points and ANN water saturation at the same depth for the testing process, which confirms the consistency of the created neural model and water saturation correlation.

Figure 5

Actual core Sw vs targets using training data (a) with 10 neurons and (b) with 15 neurons.

Figure 6

SW distribution using testing data (a) with 10 neurons and (b) with 15 neurons.

Figure 7

Core Sw vs ANN-predicted values for testing data.

Actual core Sw vs targets using training data (a) with 10 neurons and (b) with 15 neurons. SW distribution using testing data (a) with 10 neurons and (b) with 15 neurons. Core Sw vs ANN-predicted values for testing data. Perceiving the encouraging outcomes out of the ANN operations, a mathematical equation is extracted to be used with a very simple approach to calculate the saturation of water at different depths. The weights and biases for the generated equation are given in Table .

Table 3

Weights Used in the Extracted Correlation Eq

neuron number	input (w₁)									weights (w₂)	bias (b₁)	bias (b₂)
neuron number	depth	caliper size	sonic time	gamma ray	LLS	NPHI	PEF	RHOB	Rt	weights (w₂)	bias (b₁)	bias (b₂)
1	8.60 × 10^–2	–1.64366	1.364198	4.488277	0.828984	–3.76761	–0.71577	6.80 × 10^–2	2.37 × 10^–2	–2.45044	–3.51022	8.922018
2	–0.33533	0.69407	–0.37064	2.452354	–1.78967	0.410602	0.570176	–21.7014	–0.46693	0.322973	3.251176
3	–0.72663	–0.7733	1.05871	2.158347	0.124696	–2.74917	0.236778	–4.02574	0.399239	3.860151	–10.325
4	–1.28712	–1.36386	4.136454	–7.21623	–1.53894	1.860875	–0.74278	0.471668	3.147989	–10.0577	–1.16598
5	–0.30975	1.913667	–1.04754	5.228027	3.262796	–3.79047	–4.28611	1.29184	–0.35187	–2.22 × 10^–3	–2.07675
6	–4.70811	–0.19207	1.949784	–5.15739	1.252633	2.135902	5.58273	1.59766	11.8251	4.390237	5.96349
7	–4.2743	2.073048	–5.45202	–2.09825	2.974399	–1.36232	–1.17286	–0.57869	–2.07078	7.388246	15.15375
8	–2.78686	0.850972	0.604677	0.37774	–3.36826	3.01271	3.059444	0.905125	–1.20279	9.32 × 10^–2	–4.54714
9	0.756154	2.578654	–1.50889	–3.33637	1.529586	0.134132	0.751849	–2.48792	–3.57177	–10.7205	–15.1273
10	–0.76985	0.816966	1.435553	3.299932	–1.25607	–16.8058	–0.18043	–1.91115	–2.22478	5.523283	–0.55698
11	0.28348	1.90887	–0.57044	–0.90216	2.530426	–2.54324	–1.48602	–0.46482	1.13 × 10^–2	–1.23776	5.067025
12	1.503564	–0.40778	–0.81572	–0.37021	–0.96029	–0.35082	–2.22131	–2.99481	–3.32748	–0.13705	3.599462
13	1.370686	4.117492	–4.08873	3.742478	1.837794	18.52603	–1.79264	–0.77339	1.265298	–5.68593	7.51383
14	2.113377	–1.51161	15.37353	–22.0491	1.23213	–2.77885	1.126326	23.15521	–1.62084	–0.66437	3.364801
15	1.076056	0.507737	0.161874	1.27 × 10^–2	–1.39666	3.772604	–1.12231	2.538367	0.196369	–7.24734	–2.56502

The novel correlation generated using ANN for water saturation estimation in shaly sand reservoirs is given bywhere Sw is the normalized water saturation, (w2,) is the vector weight between the hidden layer and the output layer, b1 and b2 are the bias vectors for the input and output layers, respectively, DEPTH is the formation depth, CALI is the caliper size, GR is the gamma ray, LLS is the shallow resistivity, NPHI is the neutron porosity, PEF is the photoelectric effect, RHOB is the bulk density, and Rt is the deep resistivity. The extraction of Sw is achieved by denormalizing Sw as follows:

Validation of the Developed ANN Model

In order to validate the newly proposed correlation for water saturation, unseen data sets are used for the training process. First, the correlation is used to predict the water saturation values at different formation depths, and then a comparison against well-known correlations is performed. The published correlations used in this comparison study are those of Waxman and Smits, dual water model, and effective model. The log data for a well including porosity and resistivity measurements are shown in Figure a,b.

Figure 8

(a) Porosity logs and (b) resistivity logs for the selected dataset used for validation operation.

(a) Porosity logs and (b) resistivity logs for the selected dataset used for validation operation. Figure shows the cross plots of the calculated water saturation using the proposed ANN model, and empirical correlations include Waxman and Smits, dual water, and effective models. The generated correlation is able to predict accurately the water saturation with an R2 of 0.93, as shown in Figure a. Proceeding to validate the generated correlation in this study, the available models are used to estimate Sw as well. The results show that the match between the predicted and actual Sw is poor using the Waxman and Smits model, with an R2 of 0.64 (see Figure b) and AAPRE = 0.13. The dual water model gives an R2 of 0.457, while the effective model provides an R2 of 0.065 (see Figure c,d). Out of the presented results, it has been resolved that the published correlations are not able to predict Sw precisely compared to the newly generated correlation. Figure presents the performance of an ANN empirical correlation versus the published empirical correlations. From Figure , it can be seen that the ANN empirical correlation is able to detect the changes of water saturation with depth, with AAPRE of 0.042, while the Waxman and Smits model gives AAPRE = 0.13, The dual water model gives AAPRE of 0.60, and the effective model provides AAPRE of 0.14. Table shows a comparison of the statistical information of the proposed correlation with other empirical correlations. It can be seen from Table that the proposed ANN empirical correlation gives AAPRE of 0.048 and MSE of 0.042, less than that obtained from Sayed et al. (2022)[31] (AAPRE of 0.13 and MSE of 0.066) and Hamada et al.[19] (AAPRE of 0.15 and MSE of 0.064). Statistically, the proposed ANN model is consistent and robust, based on the presented statistical analysis in Table .

Figure 9

Scatter diagram comparing the predicted Sw and actual Sw using (a) ANN model, (b) Waxman and Smits model, (c) dual water model, and (d) effective model.

Figure 10

Core Sw vs ANN-predicted values for validation data.

Table 4

Statistical Analysis for the Used Correlations and Neural Model

correlation	AAPRE[33]	MSE	SD[33]	correlation coefficient
Waxman and Smits	0.1303	0.95	0.35	0.64
dual water	0.6012	1.52	1.51	0.457
effective model	0.1419	3.52	0.324	0.065
Hamada et al. 2020	0.15	0.064	1.32	0.92
Sayed et al. 2022	0.13	0.066	21.5	0.99
this study: ANN	0.042	0.042	0.24	0.973

Scatter diagram comparing the predicted Sw and actual Sw using (a) ANN model, (b) Waxman and Smits model, (c) dual water model, and (d) effective model. Core Sw vs ANN-predicted values for validation data. Figure shows the MSE values between the real and predicted water saturation for the new ANN correlation: it shows the lowest MSE of 0.048 compared to that of the Waxman and Smits (MSE =0.95), dual water (MSE = 1.52), and effective (MSE = 3.52) models. The results show that the proposed ANN model is able to model a complex nonlinear relationship between the input and output variables. Based on the published literature and information collected from the Egypt fields, it is observed that the Waxman and Smits model is the most significant correlation used to estimate the saturation of water in shaly sand formation. Therefore, on testing the strength of the developed ANN correlation against the Waxman and Smits model, it is observed that using the ANN empirical correlation makes the selection of hydrocarbon zones and the accurate estimation of hydrocarbons in place more efficient.

Figure 11

MSE for the ANN correlation, Waxman and Smits model, dual water model, and effective model.

Conclusions

In this study, the ANN technique is presented to propose a new correlation for water saturation estimation with an accuracy of 97–98% in shaly oil reservoirs without using the ANN software. This correlation is generated for the reservoirs located in Gulf of Suez, Western Desert, and Nile Delta of Egypt, and other fields in the same locations having the same data range can use the generated correlation. The ANN technique in this study uses the back-propagation learning algorithm with the new acceleration method for improving the convergence scheme. ANN results provide the optimum water saturation values with the lowest error and highest R2 value. The comparison performed in this study shows that literature correlations display severe lags in predicting the measured core water saturation. Accordingly, this study provides a solution for oil companies in Egypt to forecast precisely water saturation and fluid in place consequently. The presented ANN model in this study is promising, and it should be evaluated further using large number of oil fields with various lithologies.

3 in total

1. Computational properties of networks of synchronous groups of spiking neurons.

Authors: Judith E Dayhoff
Journal: Neural Comput Date: 2007-09 Impact factor: 2.026

2. Avoiding false local minima by proper initialization of connections.

Authors: L A Wessels; E Barnard
Journal: IEEE Trans Neural Netw Date: 1992

3. New Correlation for Calculating Water Saturation Based on Permeability, Porosity, and Resistivity Index in Carbonate Reservoirs.

Authors: Sayed Gomaa; Ahmed Ashraf Soliman; Abdulrahman Mohamed; Ramadan Emara; Attia Mahmoud Attia
Journal: ACS Omega Date: 2022-01-20

3 in total