Literature DB >> 34963953

Reservoir Production Prediction Model Based on a Stacked LSTM Network and Transfer Learning.

Yukun Dong¹, Yu Zhang¹, Fubin Liu¹, Xiaotong Cheng¹.

Abstract

Gas injection and water injection are common and effective methods to improve oil recovery. To ensure its production effect, it is necessary to simulate the oilfield production process. However, traditional composition simulation runs a large number of calculations and takes a long time. Through the analysis of relevant data, we found that production is affected by many factors and has a strong sequential character. Therefore, this paper proposes a deep learning model for reservoir production prediction based on stacked long short-term memory network (LSTM). It is applied to other well patterns with a short production time and a few samples in the same oilfield block by transfer learning. The model achieves an effective combination with the actual reservoir production process. At the same time, it uses the knowledge learned from the well pattern with sufficient historical data to assist in the establishment of the model of the well pattern with limited data. This can obtain accurate prediction results and save the model training time, thus getting more effective application effects than composition simulation. This paper verifies the effectiveness of the proposed method through the data and multiple different injection combinations of the Tarim oilfield.

Entities: Chemical

Year: 2021 PMID： 34963953 PMCID： PMC8697399 DOI： 10.1021/acsomega.1c05132

Source DB: PubMed Journal: ACS Omega ISSN： 2470-1343

Introduction

Gas injection and water injection are the most common exploitation methods used in the process of oilfield development, but the cost of injection, water treatment, and liquid extraction is relatively large. Therefore, the selection and reasonable construction of injection and production parameters are particularly important for the development effect of injection. Due to the complexity and unpredictability of the oil production process, it is necessary to adjust the injection strategy and the corresponding working system in time according to the problems encountered in the implementation of the oilfield exploitation process. Due to the difference in the well pattern, injection volume, and the development stage in the oilfield, there are many different working systems. The application of reservoir composition simulation to study the influence of various working systems on the effect of injection is a commonly used method at present.[1−6] But, because of the large amount of calculation in composition simulation, it is impossible to return an optimization result in a short time. The simulation of a large amount of data using composition simulation is relatively imperfect, the time effectiveness cannot meet the business needs, and the feasibility is not high. As a result, the oil production prediction model based on an agent model comes into being. Using agent models to replace composition simulations is an important idea in many fields, and it is also suitable for oilfield production predictions. The agent model in this paper learns rules from the existing data and finally achieves fast prediction and gets good results. This is of vital importance to scientific decision-making and reasonable guidance of oilfield production. With the rapid development and wide application of machine learning technology, machine learning algorithms have been used in oil reservoir production prediction. Negash et al.[7] and Wu et al.[8] used simple neural networks to learn the characteristics of historical data from oil wells to achieve simple production prediction. These prediction models can be used to predict production accurately and quickly instead of numerical simulation after training with a large amount of actual data, but the majority of them cannot predict the oil production of new wells, nor can they obtain the dynamic effects of different systems; therefore, they still have certain limitations. With the development of a recurrent neural network (RNN), a variety of algorithms based on the RNN have begun to be used in reservoir development. Liu et al.[1] combined an LSTM network, a traditional machine learning algorithm, and mean decrease impurity (MDI) feature selection to realize oil production prediction. Tatsipie et al.[9] used the RNN to create a data-driven model capable of generating logging curves. Zhang et al.[10] proposed a method using the LSTM network to establish a prediction model to predict the distribution of reservoir water saturation and oil production. Ki et al.[11] proposed a data-driven method based on the LSTM network to recover the lost pressure data in gas wells. Li et al.[12] used a bidirectional gated recurrent unit (Bi-GRU) and sparrow search algorithm (SSA) to improve the prediction accuracy of oil production. Zhou et al.[13] achieved high-precision fluid identification using a bidirectional long short-term memory (Bi-LSTM) network. Although the abovementioned neural network models can achieve good performance in the prediction of oil production, the superior performance of these methods largely depends on sufficient historical data to train the model, and not every well pattern can establish the deep learning model because there is insufficient data for training. Traditional machine learning is only a small part of the research, with the above experience of these traditional machine learning methods, deep learning methods are naturally derived, and it solves some problems of machine learning. Therefore, methods should be explored to overcome or alleviate the problems of insufficient historical data and long training duration, and this paper hopes to provide a deep learning model that combines them to promote the understanding and research of production prediction. In the development process of an oilfield block, there may be different existing situations of injection wells and oil production wells in the oilfield block, and there are different combinations of injection strategies and timings; therefore, it is very important to accurately predict the production under different conditions. If the trained production prediction deep learning model can be applied to other well patterns in the same oilfield block, the time cost can be saved to a great extent. Considering that the data of different well patterns may have differences, therefore, the trained deep learning model cannot be directly applied in other well patterns of the same oilfield block. This paper applies dimension alignment to solve the data differences between them. Aiming at the accurate prediction of oil production with different well patterns in the same oilfield block, this paper proposes a method based on transfer learning to solve the problem when the data does not meet the requirements of model construction. When the data of well patterns is limited, training the model by transfer learning can improve the model performance and save model training time. To sum up, this paper proposes a deep learning model of oil production in reservoirs based on a stacked LSTM network and applies this model to other well patterns in the same oilfield block through transfer learning. The method proposed in this paper has high prediction precision and good efficiency, and through transfer learning, the model of training time can be saved. For well patterns with insufficient historical data, using transfer learning can improve the prediction accuracy and save time cost.

Establishment of the Deep Learning Prediction Model

For any reservoir, economic evaluation is necessary at its development stage, and then the optimal plan is selected for investment. In the process of injection development, because of the need for frequent injection and production operations, the remaining oil will inevitably change in a certain form, which makes the production capacity and net present value (NPV) change. In fact, the value of NPV is the most important factor affecting the development, and the calculation and analysis of the NPV will determine the priorities of different strategies. The calculation formula of NPV can be expressed aswhere b is the basic discount rate, N is the calculation period of the project, k represents the Kth period, Np and Ni are the number of production wells and the number of gas injection wells, respectively, rgpqgp is the gas sales revenue, ropqop is the oil sales revenue, cgiqgi is the gas injection cost, cwiqwi is the water injection cost, and cdwpqwp is the water treatment cost. The optimization problem is considered as shown in eq where δ is the parameter to be optimized. According to the characteristics and geological situation of a certain oilfield block, different wells will be drilled, and different well pattern deployment strategies will be designed, that is, the distribution mode of oil wells, gas wells, and water injection wells. For a certain well pattern, different values of its parameters correspond to different working systems, that is, the working method of the oil well, such as the flowing bottom-hole pressure, and the amount of liquid injection of the oil well, which is the source of the above parameter δ. Finally, the well pattern and working system that can maximize the oil fields’ income[14−16] and give full play to its reasonable productivity are explored. Through the comprehensive study of injection development, it can be found that after the gas drive operations, the saturation value of the remaining oil in the stratum will change to a certain extent. With the gradual increase of the development time, the cumulative production value of the oilfield will gradually increase, and it is bound to have a certain law. To study the influence of various factors in the gas injection reservoir on the production, it is necessary to make sure of the production influencing factors. There are many factors that affect oilfield production, such as the injection rate,[2] the number of producing wells,[17] injection–production ratio,[18] etc. This paper further screened the main influencing factors of oilfield injection development, and eventually, the injection volume of injection wells,[19] flowing bottom-hole pressure of production wells,[3] and production time[20] are selected as several indicators that have a direct impact on production changes, that is, the injection and production parameters are expressed as , and the dimensions of gas injection and water injection correspond to the number of gas injection and water injection wells in each well pattern, corresponding to qgi and qwi in eq , and their values correspond to the injection rate. The injection and production parameter matrix corresponding to water injection and gas injection arewhere, m, n, and z are the number of production, gas injection, and water injection wells, respectively; T and Qgi are the timing and speed of gas injection, respectively, i = 1,2,...,n; Q is the water injection rate, j = 1,2,...,z; and P is the flowing bottom-hole pressure of the production well, k = 1,2,...,m. Moreover, Qgi, Qwj, and P meet the following constraintsObviously, these injection and production parameters are easy to obtain and quantify. An injection and production working system have a great influence on the development effect of an oilfield block, and for an oilfield block, the controllable parameters are also mainly injection and production parameters. Therefore, the injection and production parameters, the important factors that affect the development effect of the gas drive oilfield, are analyzed first. The parameters that affect the development of gas drive oil fields mainly include the injection rate and flowing bottom-hole pressure. Table is an example of the injection and production parameters of an oilfield block in the Tarim oilfield, and the corresponding values mentioned above are expressed as ⟨(Qg1, Qg2,Qg3, Qg4, Qg5), ⌀, (P1, P2, P3, P4, P5, P6), T⟩.

Table 1

Example Table of Injection and Production Parameter Values

working system for 5 gas injection wells					working system for 6 producing wells						time
I1	I2	I3	I4	I5	P1	P2	P3	P4	P5	P6	T
0	0	0	0	0	27 000	11 000	22 000	25 000	18 000	22 000	30
0	0	0	0	0	27 000	11 000	22 000	25 000	18 000	22 000	60
0	0	57 779	0	0	27 000	11 000	22 000	25 000	18 000	22 000	90
0	0	57 779	0	0	27 000	11 000	22 000	25 000	18 000	22 000	120
0	0	57 779	46 438	0	27 000	11 000	22 000	25 000	18 000	22 000	150
0	0	57 779	46 438	0	27 000	11 000	22 000	25 000	18 000	22 000	180
0	0	57 779	46 438	0	27 000	11 000	22 000	25 000	18 000	22 000	210
58 359	0	57 779	46 438	0	27 000	11 000	22 000	25 000	18 000	22 000	240
58 359	0	57 779	46 438	53 934	27 000	11 000	22 000	25 000	18 000	22 000	270
58 359	80 333	57 779	46 438	53 934	27 000	11 000	22 000	25 000	18 000	22 000	300
58 359	80 333	57 779	46 438	53 934	27 000	11 000	22 000	25 000	18 000	22 000	330
58 359	80 333	57 779	46 438	53 934	27 000	11 000	22 000	25 000	18 000	22 000	360

Table is the working system data established when five gas injection wells and six production wells are used to simulate the development, according to the actual situation of the well pattern of an oilfield block in the Tarim oilfield, on the premise of ensuring compliance with the current production status and taking into account the actual demand. A total of 11 wells are used in the model, among which I1, I2, I3, I4, and I5 are five gas injection wells, and the rest P1, P2, P3, P4, P5, and P6 are six production wells. The values of each parameter are shown in Table . Among them, the working system parameters of the gas injection well are partly derived from the gas injection rate and those of the production well is derived from the flowing bottom-hole pressure. For example, the injection volume of I1 is 58 359 m3/day, and the well would be opened from the 8th month. The injection volume of I2 is 80 333 m3/day. From the 10th month, the well was opened and maintained upto the 12th month, the flowing bottom-hole pressure of P1 is 27 000 kPa, and that of P2 is 11 000 kPa. To generate sufficient and diverse data as well as keep the difference between different working systems in the process of generating the database, that is, to make the parameter values of each well under each system evenly distributed, Latin hypercube sampling[21] is selected in this paper. Compared with the pure stratified sampling method, its biggest advantage is that any number of samples can be easily produced. The deep learning model of this paper is to study a rule of oil production change during a certain period, and then use the trained model to predict the production of the new injection and production strategy because the production dynamics slowly changes as the injection development progresses. Therefore, if the development time is short, the oil production will not change obviously, and the knowledge learned by the model will be very limited. A well can be productive for more than 10 years, 20–30 years, or even longer; this paper chooses to use 10 years of development data. It is obtained by recording the cumulative oil production every other month. The data in Table only represents one kind of working system. The Latin hypercube sampling method described above should be carried out to generate data samples to obtain hundreds of working systems, and then the numerical simulation is used to obtain the final samples. Using composition simulation to simulate the above data and analyze the monthly production of the reservoir is an important task. Similarly, the composition simulation is also of great value to the method presented in this paper. When the well patterns with no historical data are predicted, a small number of samples are generated using composition simulation for transfer. But, the reservoir composition simulation also has its limitations, mainly embodied in the simulation of a working system will need 10 min. Since the time of composition simulation is related to whether the development work can be carried out efficiently and directly affects the final injection and production strategy, it is of great significance to analyze the existing problems, study how to predict oil production efficiently in an oilfield, and form an effective production prediction method. In this paper, the production prediction deep learning model takes the oil production prediction as the research object, and the influencing factors are the injection and production parameters. The constructed model is based on the stacked LSTM network and transfer learning, and it mainly includes three parts: Based on the initial historical data or the data generated by the composition simulation, a reservoir oil production prediction model that meets the current accuracy requirements is formed. The actual output value of the component numerical simulation is compared with the predicted value obtained by the constructed prediction model, and the accuracy of the latter is calculated. If the accuracy shows that the accuracy of the prediction model is sufficient to reflect the true dynamics of the current reservoir, the model structure and parameters can be transferred to the new well pattern in the same oilfield block, and the new well pattern data can be used to train the model. Affected by the working system, the investment and recovery factor of the oil reservoir will be different, and the NPV will also be different. To draw the most economic benefit of the development plan, it is necessary to compare the NPV of different development plans. Through the constructed deep learning model, we can achieve rapid production prediction, so as to select the most optimal working system. The prediction model can be used to simulate oil reservoir production, so the model can provide guidance for production.

Dimension Alignment of Heterogeneous Samples Based on the Domain Knowledge

The dimensions of injection and production parameters of the samples in the actual new well pattern may not be strictly consistent with those of the training samples. To solve the problem of inconsistency, dimension alignment is introduced, and then the stacked LSTM network is used to predict the aligned data to complete the transfer learning. The alignment idea is as follows: the source well pattern sample has Gs gas injection wells and Ws water injection wells; a new well pattern sample has Gt gas injection wells and Wt water injection wells. If the same prediction network model is used, the above samples need to be aligned as Gr gas injection wells and Wr water injection wells. The injection volume of the expanded injection well is zero. With the dimension alignment, the prediction network can adapt to the mapping of different dimension parameters, so as to prepare for transfer learning. The formal description of the process is expressed by eqs and 8among them, the source well pattern and the target well pattern are located in the same oilfield block of a certain oilfield. This paper verifies through experiment, and it shows that this method has no effect on the prediction accuracy. For example, the source well pattern that has two gas injection wells and three water injection wells and the target well pattern that has one gas injection well and six water injection wells are the research objects. As mentioned above, the source structure is expressed as ⟨(Qg1, Qg2), (Qw1, Qw2, Qw3), ⌀, T⟩, and the target structure is expressed as ⟨(Qg1), (Qw1, Qw2, Qw3, Qw4, Qw5, Qw6), ⌀, T⟩. The number of gas injection wells and water injection wells of the two structures is obviously different. The values of the gas injection dimension are 2 and 1, respectively, and the values of the water injection dimension are 3 and 6, respectively. The data of the two well patterns need to be manually aligned to perform dimensional alignment based on domain knowledge and unify the dimensions of the data. The consequences of using the above formula to align the data of the two structures are that they are both aligned as the data of two gas injection and six water injection wells, i.e., ⟨(Qg1, Qg2), (Qw1, Qw2, Qw3, Qw4, Qw5, Qw6), ⌀, T⟩. For the source well pattern, the injection volume of the expanded water injection wells IW4, IW5, and IW6 is zero, and for the target well pattern, the injection volume of the expanded gas injection well IG2 is also zero.

Deep Learning Model Based on the Stacked LSTM Network

Through a comprehensive analysis of the characteristics of the injection and production data in this paper, we found that it is difficult to characterize the effects of various parameters on the oil production in the reservoir with traditional models. In addition, the data is time dependent, and it is necessary to analyze the effects of last month’s production on the next month’s production. In the model training stage, the traditional neural network cannot use the information obtained in the previous time step in the current step, which is the main disadvantage of the traditional neural network. However, the RNN tries to pass the information in one step of the network to the next step by using recursions, so that the timing information can be retained, so as to solve this problem. Therefore, this model has advantages in processing time-series data. However, the traditional RNN can only remember short-term information, and it does not have the ability to remember information over a long distance. The LSTM network has solved the problem of gradient disappearance and explosion of the RNN, it can deep mine the information contained in the historical data, at the same time, take into account the scheduling of the data, which meets the requirements of the constructed model. In addition, simple neural networks cannot transfer from one well pattern to another without sufficient data. Thus, the models based on the LSTM network have advantages. To avoid the limitation of the traditional neural network, this paper chooses to construct a deep learning model based on the stacked LSTM network. There are three types of gates in the LSTM unit: the forget gate (f), the input gate (i), and the output gate (o).[22] The LSTM unit uses the input X at the current time, the hidden state H, and the cell state C at the previous time as the input of the LSTM unit at the current time,[23] and the output H and C are used as the input at the next time. The specific details of the LSTM unit at time t are shown in Figure . The specific calculation of the LSTM unit is as followsAmong them, C̃ represents the candidate vector; σ represents the Sigmoid activation function, and the output is between 0 and 1; ° represents the Hadamard product; Wf, Wi, and Wo represent the gate weight matrix; bf, bc, bi, and bo represent the bias; and it, ft, and ot control the information flow across the memory state C.

Figure 1

Schematic diagram of the LSTM unit structure.

Schematic diagram of the LSTM unit structure. A multilayer LSTM network can remove some of the constraints of a single-layer LSTM network, which can be viewed as the output of the upper layer and as the input of the next layer. This kind of network uses stacking to stack the layers of the LSTM network.[24] A hierarchical feature representation of the input data can be created through this structure, which can subsequently be used for prediction. The stacked LSTM network belongs to deep learning,[25] which is a representative optimization to improve the efficiency of training and obtain higher accuracy by adding the depth of the network. In many fields, such as human motion recognition,[26] fault diagnosis,[27] prediction research,[28] the detection algorithm,[29] and so on, stacked networks are more effective than single-layer networks. The success of a deep neural network (DNN) in predicting classification problems is usually attributed to the depth of the network. Given that the LSTM network operates on sequence data, this means that increasing the layers of the network increases the time to extract the features of the input signal. At present, the stacked LSTM network is a good model for sequence prediction problems. A stacked LSTM network architecture can be defined as an LSTM network model consisting of multiple LSTM layers. The LSTM layer above provides a sequential output, rather than a single value being the output to the LSTM layer below. Figure shows the model of the stacked LSTM network in this paper.

Figure 2

Stacked LSTM network model.

Stacked LSTM network model. To better extract features, the LSTM units shown in Figure are stacked to build a stacked LSTM network with stronger representation ability. The stacked LSTM network in this paper is composed of four LSTM layers. Specifically, the LSTM layer is used to extract the features first, and then the output feature vector is sent to the FC layer for prediction, so as to obtain the predicted value of the oil production.

Transfer Learning Based on the Pretraining Model

Oil Production Prediction Algorithm Based on Transfer Learning

In the same oilfield block, the development strategy can be optimized for a more economical and efficient development by adjusting the well pattern. In the transfer learning theory, the learning domain with enough labeled samples is called the source domain,[30] and the target domain usually has limited data. Given a specific data set Ds of the source well pattern and data set D of the target well pattern, suppose that the labeled data sets are Ds = {Xs, Ys} and D = {X, Y}. Among them, X represents the injection and production parameters, and Y represents the corresponding oil production. Therefore, the problem of transfer learning production prediction is to learn an accurate regression to predict the production of the corresponding oilfield represented by the data in the target domain. The algorithm is based on the constructed stacked LSTM network model, uses the well pattern with abundant available information as the source domain, and that of a small number of samples as the target domain. The prediction of transfer learning is carried out by the method of network parameter transfer. Compared with the previous prediction methods, this method only needs a small amount of labeled target domain data to achieve good prediction performance. The corresponding tasks are expressed in eqs and 16Among them, fs represents the function mapping from Xs to the predicted value of the source well pattern Ys, f represents the function mapping from X to the predicted value of the target well pattern Y, and δs and δ are the model parameters of the source well pattern and the target well pattern, respectively. Transfer learning is to find the relevant features in the source well pattern data {Xs,Ys} and obtain the mapping f. Then, after the mapping f is transferred to the target well pattern, the mapping f of the target task is learned from the target well pattern data {X, Y}. Specifically, it can be expressed in eqs and 18where the subscript i represents the Ith sample, and ns and nt are the sample numbers of source well pattern data and target well pattern data, respectively. L is the loss function, and the model training process is to minimize the loss function. x represents a training sample, and y refers to the label of the sample. The optimal model parameter δs′ can be obtained by fully training the source well pattern model. δ and δ are the model parameters of the source well pattern and the target well pattern, respectively. The purpose of transfer learning is to use the optimal model parameter δs′ when training the model in the target task and obtain the optimal model parameter δ′ after training in the target well pattern.

Stacked LSTM Network Training Combined with Transfer Learning

The stacked LSTM network uses a large number of sample data to train the parameters of the model. The trained model can be used for the sample prediction but it must meet two requirements. First, the source data set must be labeled sample data.[31] Second, the source data set and the target data set must have the same or similar distribution.[32] Therefore, it takes a lot of computing resources to simulate the sample library. In addition, the number of parameters that need to be trained for the stacked LSTM network is very large, and the training process consumes a lot of time and computing resources. Training these parameters requires a large number of sample data, which is very time-consuming in practical application. If the samples are insufficient, it is easy to lead to overfitting problems and the generalization of the trained model is poor. In view of the above problems, this paper adopts the parameter-based transfer learning strategy to improve the training method of the stacked LSTM network, so as to reduce the requirement of the amount of sample data. The stacked LSTM network training method combined with transfer learning refers to retraining a deep learning model again with the data of the current well pattern, rather than training from the beginning, so as to obtain the model suitable for the current well pattern. Due to the feature extraction part of the stacked LSTM network in different well patterns being interlinked, therefore when using transfer learning, the structure and parameters of the network layer can be retained, and the weights of the model can be initialized and set as a trainable state, and then use the data of the current well pattern to retrain the parameters. Using transfer learning, only a small amount of data is needed to train a model suitable for the current well pattern, which effectively solves the problem of a lack of enough labeled data and the need for a large number of computing resources to train a new model. In this paper, transfer learning and the LSTM network are combined to build a deep learning model for the oilfield prediction of the same type. The purpose of neural network training is to minimize the loss function L through backpropagation and gradient descent. In this paper, mean-square-error (MSE) is used as the loss function of the LSTM networkwhere L is the network loss, y is the expected output of the network, f(x) is the actual output of the network, and n is the number of samples. L gets closer and closer to the minimum by updating the weights and bias parameters in the direction of gradient descent. Each layer of the stacked LSTM network has its own weight and bias parameters. The weight and bias parameters are obtained by training the source well pattern model, and then the determined parameter δs′ is directly transferred to the target well pattern by transfer learning. In this paper, the trained parameters of the source well pattern model are used as the initial values of the parameters of the target well pattern model. The LSTM layer in the stacked LSTM network is used as a feature extractor to extract the features of the input time-series data, while the information each layer of the neural network extracted is not the same, but the information is universal, so the parameters of each layer are transferred. The target well pattern data is used for training for all layer parameters so that the accuracy and time cost of the prediction results of the target well pattern are optimal. The entire process of applying transfer learning is shown in Figure , and the specific process is as follows:

Figure 3

Flow chart of production prediction based on transfer learning.

First, align the dimension of the data of the source well pattern, construct the model of the source well pattern, train and debug it, find the best prediction model of the source well pattern, and save the model and parameters of the source well pattern for the target well pattern; Second, load the saved source well pattern model, transfer the weight parameters of the source well pattern model as the initial value of the parameters of the target well pattern, transfer all layer parameters, align the dimension of the data of the target well pattern and use the data of the target well pattern to train all layers of the network, and get the prediction model of the target well pattern. Flow chart of production prediction based on transfer learning.

Results and Discussion

The reservoir data used in this paper is from the Tarim oilfield, and the Donghe sandstone member in Tarim Basin is taken as the research object. The Donghe sandstone member is a typical marine clastic reservoir rich in oil and gas in China.[33] Limited by the “three highs” of high temperature, high salinity, and high pressure, appropriate injection and production systems need to be formulated during the development. After tracking and analyzing the oil and gas production in the process of gas injection and oil displacement, it is found that the oil and gas production of gas injection and oil displacement technology is directly related to the injection volume of gas injection wells, water injection wells, and the flowing bottom-hole pressure of production wells. Therefore, the sample database constructed in this paper mainly includes the working systems of different wells, including thousands of different working systems. To ensure the effectiveness of the work, Latin hypercube sampling is used to design an injection and production parameter suitable for marine clastic reservoirs. Taking an oilfield in Tarim Basin as an example, there are altogether 37 oil wells in the oilfield, including 21 production wells and 16 injection wells. According to the actual exploitation needs, the four water injection wells, INJ1, INJ2, INJ3, and INJ4, can be adjusted. For example, we can choose 1, 2, 3, or 4 wells for gas injection, and the rest for water injection, so there are C41 + C42 + C43 + C44 = 15 different injection and production well patterns, as shown in Table . If the model training is carried out from the beginning for each well pattern, a lot of time cost will be lost, and the model trained in a variety of well patterns can also have very good effects in any kind of well pattern. It extracts general information from various well patterns with a large amount of available data and then transfers it to a certain well pattern with different data. The prediction can be made by training the model with a small number of samples.

Table 2

Corresponding Table of the Gas Injection Well and Well Pattern

well pattern	gas injection well number
1	INJ1
2	INJ2
3	INJ3
4	INJ4
5	INJ1, INJ2
6	INJ1, INJ3
7	INJ1, INJ4
8	INJ2, INJ3
9	INJ2, INJ4
10	INJ3, INJ4
11	INJ1, INJ2, INJ3
12	INJ1, INJ2, INJ4
13	INJ1, INJ3, INJ4
14	INJ2, INJ3, INJ4
15	INJ1, INJ2, INJ3, INJ4

Using modern reservoir numerical simulation technology to run the sample library, the oil production and gas production of the reservoir in different well patterns and development periods are predicted, and the above data is recorded and organized to obtain a rich data set with 300 systems for each well pattern. Since the recorded data is daily data, data cleaning is required. The final data provided to the model consisted of a monthly record of 10 years of data for different working systems, that is, gas injection volume, water injection volume, flowing bottom-hole pressure, time, and cumulative oil production of the reservoir. The time interval is 1 month, where the first 80% is used as a training set and the last 20% is used as a test set. Taking into account the complexity of sample generation, taking 50 working systems as an example, the final sample contains a total of 6000 lines of data, so it is estimated that hundreds of systems need to write nearly tens of thousands of lines of data. Such a huge workload requires high-quality completion in a short time. The traditional line-by-line manual design method cannot meet the requirements of the project, and in the repetitive and complicated design, it is prone to make mistakes and errors. Therefore, the python high-level programming language is used to design the program and generate the user interface to generate the injection and production parameters. The user inputs the number of gas injection wells, the number of production wells, the number of production months, and the number of working systems, and the program automatically designs and writes the data. Instead of a manual design, it can greatly reduce the difficulty and workload of the sample generation.

Results before and after the Dimension Expansion of a Certain Well Pattern

Considering the influence of data preprocessing, the accuracy of yield prediction cannot be affected in the dimension alignment stage, so its influence should be considered before selecting the data alignment scheme. At the same time, because the alignment methods of different well patterns are also different, considering that evaluating the suitability of the method as accurately as possible before the transfer, this paper selects the injection and production data provided by the well pattern 5 to extend its dimensions. Based on a dimension alignment strategy, and considering the real environment, the two gas injection wells INJ3 and INJ4 and one water injection well were extended, and their impacts were studied as follows: Figure compares the predicted oil production before and after the alignment with the corresponding simulated data; see Table for details. Before and after the expansion of the data dimension, the production prediction accuracy rates of the well pattern are 95.3 and 94.7%, respectively, which are both within the acceptable range. After the alignment, the accuracy decrease is minimal, and it does not affect the final prediction results.

Figure 4

Production fitting before and after the dimension alignment.

Table 3

Comparison of the Accuracy of Prediction Results before and after the Dimension Expansion of a Certain Well Pattern

		before the alignment		after the alignment
predict time	simulated value	predicted value	accuracy (%)	predicted value	accuracy (%)
Month_1	134 759	137 972	97.62	132 759	97.16
Month_2	143 738	146 668	97.96	141 738	97.02
Month_3	152 560	155 284	98.21	150 560	96.96
Month_4	161 239	163 820	98.40	159 239	96.95
Month_5	169 784	172 276	98.53	167 784	96.99
Month_6	178 202	180 651	98.63	176 202	97.05
Month_7	186 505	188 946	98.69	184 505	97.14
Month_8	194 690	197 159	98.73	192 690	97.25
Month_9	202 761	205 292	98.75	200 761	97.37
Month_10	210 723	213 344	98.76	208 723	97.50
Month_11	218 585	221 314	98.75	216 585	97.63
Month_12	226 351	229 205	98.74	224 351	97.77
average			98.48		97.23

Production fitting before and after the dimension alignment.

Source Well Pattern Model and Prediction Results

The establishment process of the prediction model is as follows. The first part is data preprocessing, considering the well location distribution and incorporating well location coordinates into the features. Performing the correlation analysis on the injection and production data and production of each well, we then obtain the importance of the features of all injection and production wells and use the injection and production data of different injection and production wells multiplied by the corresponding weight as the current input values, and then the data is normalized and reshaped. Consequently, the data set is transformed into the input form of the LSTM network model (batch size, time step, input size). In the second part, the prediction model of the source well pattern is generated and debugged, and the structure of the model and the trained weights are saved to facilitate the subsequent transfer learning in the target well pattern. The simulation is implemented using Python 3.7 in the Ubuntu system, using the Keras deep learning modeling environment in Python, which supports existing common structures, such as RNN, fully connected neural network, etc. The LSTM, dense, and dropout layers can be imported from Keras, and these layers can be put into the sequence model to build the required model in any reasonable way. In this paper, four LSTM layers and one dense layer are selected. The number of hidden units in the four layers of the LSTM network is (50,50,50,50), and Adam and MSE are selected as the optimizer and loss function, respectively. The abovementioned source well pattern prediction model is used to train the data. ModelCheckPoint is used to store the optimal model, and callbacks are used to control the training model. Figure shows the fitting diagram after 10 years of production, and there is little difference between the predicted and simulated production. When the production cycle reaches 7 years, the prediction error is the largest, and after 7 years, the prediction error decreases. In contrast, the prediction accuracy in the early stage is much higher than that in the middle and later stages.

Figure 5

Oil production predicted in 10 years.

Oil production predicted in 10 years. We analyzed and compared three models: a stacked LSTM model, single-layer LSTM model without stacking, and FCNN model. As shown in Figure , it can be seen that even if it is predicted for 10 years, the prediction results of the stacked LSTM network prediction model are close to the actual ones. To make the experiment scientific and accurate, compare it with a single-layer LSTM network and fully connected neural network (FCNN), transform the LSTM layer to the dense layer, and retrain the network after adjusting the structure. It can be seen from Table that the prediction results of the stacked LSTM network model are more accurate than those of the single-layer LSTM network and FCNN. The results on the test set show that comparing and analyzing all data, the accuracy of the stacked LSTM network model is 6.2 and 4.7% higher than those of the two models, respectively.

Figure 6

Fitting of oil production under three conditions.

Table 4

Comparison of Accuracy of Prediction Results of Three Models

		stacked LSTM		single-layer LSTM		FCNN
predict time	simulated value	predicted	accuracy (%)	predicted	accuracy (%)	predicted	accuracy (%)
Month_1	134 759	137 972	97.62	123 000	91.27	160 689	80.76
Month_2	143 738	146 668	97.96	131 390	91.41	166 630	84.07
Month_3	152 560	155 284	98.21	139 731	91.59	172 571	86.88
Month_4	161 239	163 820	98.40	148 022	91.80	178 512	89.29
Month_5	169 784	172 276	98.53	156 263	92.04	184 453	91.36
Month_6	178 202	180 651	98.63	164 454	92.29	190 394	93.16
Month_7	186 505	188 946	98.69	172 594	92.54	196 335	94.73
Month_8	194 690	197 159	98.73	180 683	92.81	202 276	96.10
Month_9	202 761	205 292	98.75	188 722	93.08	208 218	97.31
Month_10	210 723	213 344	98.76	196 709	93.35	214 159	98.37
Month_11	218 585	221 314	98.75	204 646	93.62	220 100	99.31
Month_12	226 351	229 205	98.74	212 531	93.89	226 041	99.86
average			98.48		92.47		92.60

Fitting of oil production under three conditions.

Prediction Effect of Different Well Patterns When Using Transfer Learning

To verify the transfer learning ability of the designed predictive model, the conditions of different source well patterns and the same target well pattern are considered. Among them, the source well pattern uses several different data such as 1, 14, and 15 in Table , and the target well pattern uses the data of the well pattern 8 for model training. The data of the target well pattern and the source well pattern are processed by the dimension alignment. Transfer learning is using the pretraining model to train the data of different working systems of the target well pattern 8, 2 gas injection, and 14 water injection, and the following results are obtained. To study the influence of the pretraining model and the number of samples on the prediction accuracy, different values are used for analysis, Table compares the prediction accuracy rate of oil production using 5, 10, 15, and 20 working systems and different pretraining models during transfer. The numbers in brackets correspond to the well patterns in Table , representing the sources of the pretraining samples, and 100, 200, and 300 represent the number of working systems of the training source models. It can be seen in Table that the prediction results obtained when using 400 kinds of systems and four kinds of well patterns data is the best. As shown in Figure a–d, the results obtained when using four kinds of well patterns with 200 kinds of systems, data is not as good as when using two kinds of structures with 200 kinds of systems. The reason is that the data of the four well patterns is averaged, and there are only 50 systems for each type, the number decreases and the types increase, and the regularity becomes worse, so the prediction precision is lower. Compared with the 100 systems using two types of samples (1 and 14), the accuracy rate of the 5, 10, 15, and 20 systems in the 200 systems using four samples has increased by 3.0, 3.1, 3.5, and 3.2%, respectively. Similarly, the accuracy rate of the 400 systems with four kinds of structures was higher than that of the 200 systems with two kinds of structures. The results show that when the number of samples for each well pattern is the same, the more abundant the source type, the higher the accuracy of the prediction results after transfer.

Table 5

Comparison of the Prediction Results between the Different Number of Samples and Different Pretraining Models

			number of working systems
source of pretraining samples	number	training time (s)	5	10	15	20
1 well pattern (15)	100	240	79.7%	80.3%	80.6%	81.3%
	200	476	84.4%	85.4%	86.2%	87.0%
	300	711	88.2%	88.8%	89.2%	89.5%
2 well patterns (1 and 14)	100	240	79.7%	80.1%	80.4%	81.1%
	200	477	88.8%	90.0%	90.3%	90.7%
	300	709	89.1%	90.1%	90.5%	91.0%
2 well patterns (1 and 5)	100	238	80.0%	80.5%	80.9%	81.3%
	200	554	88.3%	88.6%	89.3%	89.7%
	300	709	89.0%	89.3%	89.9%	90.9%
4 well patterns (3, 5, 12, and 14)	100	246	82.4%	82.8%	83.3%	83.6%
	200	476	82.7%	83.2%	83.9%	84.3%
	300	714	88.5%	89.5%	90.4%	91.0%
	400	951	90.1%	90.4%	90.7%	91.6%

Figure 7

Influence of the source of pretraining samples on the prediction accuracy (a) 15; (b) 1 and 14; (c) 1 and 5; and (d) 3, 5, 12, and 14.

Influence of the source of pretraining samples on the prediction accuracy (a) 15; (b) 1 and 14; (c) 1 and 5; and (d) 3, 5, 12, and 14. Table is the comparison of the results of prediction when the source model is trained using 300 working systems from two kinds of samples, and the target well pattern is selected with different well patterns. It is the same as the well pattern 8, several different cases are compared. From Table , it can be intuitively seen that the prediction effect of using transfer learning is better when there are more training samples. Table shows the comparison of the prediction results. It can be seen from the prediction results in Table that the accuracy of the prediction results in the target well pattern 2 are 89.2, 89.6, 90.1, and 90.7%, respectively, for the 5, 10, 15, and 20 systems. Compared with other target well patterns, the biggest difference in the four cases is only 0.3% when 10 systems are used. The prediction results of transfer learning are not much different for different target well patterns, indicating that the generalization ability of the used model is good, and it is suitable for the prediction of different well patterns.

Table 6

Comparison of Prediction Results of Transferring to Different Well Patterns

	number of working systems
target well pattern	5	10	15	20
2	89.2%	89.6%	90.1%	90.7%
3	89.4%	89.8%	90.2%	90.6%
5	89.0%	89.5%	90.2%	90.6%
12	89.4%	89.9%	90.3%	90.7%
15	89.2%	89.6%	90.3%	90.8%

It can be seen from the comparison of the above results that the deep learning model based on the stacked LSTM network proposed in this paper has better prediction performance. For the construction of other well pattern prediction models, the prediction precision and the model of training time are comprehensively compared, and the proposed transfer learning saves the model training time and improves the prediction precision, and also receives the recognition from the fields.

Conclusions

In this paper, a prediction model based on the stacked LSTM network is proposed to improve the predictive effect of oil production. The actual experimental results show that this method has good practicability and accuracy for oil well production prediction and can be used for oil production prediction of oil wells in oil fields. At the same time, this deep learning model is applied to other well patterns through transfer learning to solve the prediction problem of well patterns with a small number of samples, which can provide a reference for the optimization of the working system, and play an auxiliary role in decision making. Therefore, this algorithm has a good application prospect and certain research value. The conclusions of this paper can be mainly summarized as follows: A deep learning framework for the production prediction is proposed. Taking into account the sequential characteristics of the data, a model is constructed through the stacked LSTM network. Compared with composition simulation software, the deep learning model can achieve fast prediction without a significant decrease in accuracy; Through transfer learning, the trained deep learning model is applied to other well patterns. Considering that the training of the neural network needs a large amount of data and the available data of the new well patterns are less, the trained parameters of the network of the source well pattern model is chosen to be used as the initial value of the target well pattern, and the pretraining model is used to train the network with the data of the target well pattern. The result of the case analysis shows that this method saves the training time of the model and achieves fast prediction.

3 in total

1 in total

1. Dense Residual LSTM-Attention Network for Boiler Steam Temperature Prediction with Uncertainty Analysis.

Authors: Zheming Tong; Xin Chen; Shuiguang Tong; Qi Yang
Journal: ACS Omega Date: 2022-03-22