| Literature DB >> 35664599 |
Ramez Abdalla1, Hanin Samara1, Nelson Perozo1, Carlos Paz Carvajal1, Philip Jaeger1.
Abstract
Electrical submersible pumps (ESPs) are considered the second-most widely used artificial lift method in the petroleum industry. As with any pumping artificial lift method, ESPs exhibit failures. The maintenance of ESPs expends a lot of resources, and manpower and is usually triggered and accompanied by the reactive process monitoring of multivariate sensor data. This paper presents a methodology to deploy the principal component analysis and extreme gradient boosting trees (XGBoosting) in predictive maintenance in order to analyze real-time sensor data to predict failures in ESPs. The system contributes to an efficiency increase by reducing the time required to dismantle the pumping system, inspect it, and perform failure analysis. This objective is achieved by applying the principal component analysis as an unsupervised technique; then, its output is pipelined with an XGBoosting model for further prediction of the system status. In comparison to traditional approaches that have been utilized for the diagnosis of ESPs, the proposed model is able to identify deeper functional relationships and longer-term trends inferred from historical data. The novel workflow with the predictive model can provide signals 7 days before the actual failure event, with an F1-score more than 0.71 on the test set. Increasing production efficiencies through the proactive identification of failure events and the avoidance of deferment losses can be accomplished by means of the real-time alarming system presented in this work.Entities:
Year: 2022 PMID: 35664599 PMCID: PMC9161246 DOI: 10.1021/acsomega.1c05881
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
Summary of the Most Relevant Studies Related with This Paper
| author, year | relevant work |
|---|---|
| (Zhao et al., 2006); (Li et al., 2008); (Zhang, 2017) | ESP fault tree diagnosis
through a proposed qualitative and
quantitative method.[ |
| (Xi, 2008) | the use of a traditional mechanical fault diagnosis and wavelet analysis |
| realization of excessive shaft thrust
and wear fault characteristic
extraction to investigate the fault diagnostics of the centrifugal
pump[ | |
| (Wang, 2004) | use of Neuro-Fuzzy Petri nets and extracted features
for the
identification of eccentric wear of both the impeller and bearing
as well as the sand plug of the impeller[ |
| (Zhao, 2011) | ESP vibration
signal analysis, feature extraction, and establishment
of typical fault vibration mechanical models[ |
| (Tao, 2011) | data analysis
and application of vibration signals based on
wavelet analysis and wavelet transform in the ESP.[ |
| (Guo et al., 2015) | utilization of the support vector method in the prediction
of anomalous operation[ |
| (Wang, 2013) (Peng, 2016) | utilization of back
propagation (BP) neural networks for ESP
diagnosis.[ |
| (Jansen Van Rensburg, 2019) | exploration of surveillance-by-exception
on ESP using a train
model with normal yet good quality data[ |
| (Andrade Marin et al., 2019) | analysis of random forest to obtain a high value of accuracy
and recall of ESP failure prediction in 165 cases[ |
| (Adesanwo et al., 2016); (Adesanwo et al., 2017); (Gupta et al., 2016); (Abdelaziz et al., 2017); (Bhardwaj et al., 2019); (Sherif et al., 2019); (Peng et al., 2021); (Zhang et al., 2017); (Yang et al., 2021) | application of principal component analysis
(PCA) for anomaly
detection and failure prediction for the identification of correlations
in the dynamic ESP parameters such as intake pressure and temperature,
discharge pressure, vibrations, motor and system current and frequency
measured by means of a variable speed drive (VSD) at regular time
intervals[ |
Figure 1Knowledge discovery in database workflow.
Figure 2Data box plot before outlier detection.
Figure 3Box plot after outlier removal.
Figure 4Box plot after outlier removal, normalization, and use of moving difference signals.
Data Exploration
| FRQ (Hz) | PDP (Psi) | PIP (Psi) | WHP (Psi) | WHT (F) | MT (F) | CHP (Psi) | CURRENT (A) | |
|---|---|---|---|---|---|---|---|---|
| mean | 52.68 | 1522.96 | 855.89 | 642.20 | 62.38 | 137.11 | 922.13 | 349.65 |
| std | 4.29 | 187.46 | 211.08 | 608.61 | 11.49 | 32.39 | 1883.03 | 86.90 |
| min | 35.00 | 1086.51 | 610.45 | 194.06 | 60.54 | 120.62 | 0 | 101.59 |
| 25% | 49.70 | 1506.13 | 660.90 | 213.38 | 63.23 | 122.20 | 91.54 | 317.00 |
| 50% | 52.76 | 1543.80 | 721.80 | 243.58 | 65.99 | 154.19 | 261.38 | 354.00 |
| 75% | 56.58 | 1595.04 | 778.89 | 547.33 | 67.65 | 160.60 | 405.28 | 394.73 |
| max | 64.96 | 1893.23 | 1578.28 | 845.86 | 122.84 | 169.30 | 986.74 | 598.21 |
Data Points Classification
| condition | reported data points |
|---|---|
| normal | 339 089 |
| preworkover | 1728 |
| workover | 288 |
Figure 5Geometric meaning of PCA.
Figure 6Principal component analysis of ESP wells.
Loading for Input Parameters
| PC1 | PC2 | PC3 | PC4 | PC5 | PC6 | PC7 | PC8 | |
|---|---|---|---|---|---|---|---|---|
| FRQ | –0.90 | –0.05 | 0.06 | 0.05 | –0.05 | –0.26 | 0.05 | 0.01 |
| PDP | –0.54 | –0.14 | –0.70 | 0.32 | –0.07 | –0.08 | 0.04 | 0.05 |
| PIP | 0.03 | –0.17 | –0.47 | 0.14 | –0.05 | 0.81 | –0.18 | –0.04 |
| WHP | 0.10 | –0.02 | –0.79 | 0.37 | –0.01 | –0.15 | 0.23 | –0.28 |
| WHT | –0.63 | 0.00 | 0.35 | –0.30 | 0.18 | 0.22 | 0.12 | –0.32 |
| MT | –0.79 | –0.12 | –0.07 | 0.04 | –0.12 | 0.25 | –0.16 | 0.22 |
| CHP | –0.85 | –0.15 | –0.07 | 0.15 | –0.03 | –0.27 | 0.07 | 0.12 |
| CURRENT | –0.84 | –0.10 | 0.21 | –0.17 | 0.07 | 0.21 | –0.06 | –0.19 |
| diff_FRQ | –0.14 | 0.78 | 0.03 | 0.27 | 0.09 | 0.02 | –0.22 | –0.17 |
| diff_PDP | –0.05 | 0.09 | –0.45 | –0.54 | 0.21 | –0.19 | –0.40 | 0.28 |
| diff_PIP | 0.06 | –0.35 | –0.34 | –0.67 | –0.21 | 0.06 | 0.18 | 0.11 |
| diff_WHP | –0.03 | 0.11 | –0.42 | –0.50 | 0.36 | –0.15 | –0.05 | –0.42 |
| diff_WHT | –0.10 | 0.43 | –0.08 | –0.11 | 0.37 | 0.23 | 0.69 | 0.26 |
| diff_MT | –0.15 | 0.84 | –0.10 | –0.10 | –0.34 | 0.03 | –0.03 | –0.03 |
| diff_CHP | 0.03 | –0.29 | 0.03 | 0.30 | 0.85 | 0.01 | –0.14 | 0.11 |
| diff_CURRENT | –0.13 | 0.80 | –0.14 | –0.04 | 0.16 | 0.05 | –0.09 | 0.20 |
Figure 7Explained variance of the proposed model.
Hyperparameter Tuning
| parameter | reference to | sampling type | range |
|---|---|---|---|
| max_depth | control of overfitting, higher depth facilitates such that the model learns relations that are specific to a particular sample | suggest integer value | 2, 10 |
| min_child_weight | a minimum sum of weights is defined for all observations required in a child | log uniform | 1e–10, 1e10 |
| colsample_bytree | the subsample ratio of columns when constructing each tree | uniform | 0, 1 |
| learning_rate | overfitting prevention through step size shrinkage in updates | uniform | 0, 0.1 |
| gamma | specification of the minimum loss reduction required to make a split | suggest integer value | 0, 5 |
Figure 8ROC for the proposed model.
Figure 9Precision recall curve.
Precision, Recall, and F1-Score
| precision | recall | F1-score | support | |
|---|---|---|---|---|
| normal | 0.99 | 1.00 | 1.00 | 101 726 |
| 7 days or less pre-event | 0.80 | 0.60 | 0.71 | 604 |