Literature DB >> 25938760

A Fault Diagnosis Methodology for Gear Pump Based on EEMD and Bayesian Network.

Zengkai Liu1, Yonghong Liu1, Hongkai Shan2, Baoping Cai1, Qing Huang3.   

Abstract

This paper proposes a fault diagnosis methodology for a gear pump based on the ensemble empirical mode decomposition (EEMD) method and the Bayesian network. Essentially, the presented scheme is a multi-source information fusion based methodology. Compared with the conventional fault diagnosis with only EEMD, the proposed method is able to take advantage of all useful information besides sensor signals. The presented diagnostic Bayesian network consists of a fault layer, a fault feature layer and a multi-source information layer. Vibration signals from sensor measurement are decomposed by the EEMD method and the energy of intrinsic mode functions (IMFs) are calculated as fault features. These features are added into the fault feature layer in the Bayesian network. The other sources of useful information are added to the information layer. The generalized three-layer Bayesian network can be developed by fully incorporating faults and fault symptoms as well as other useful information such as naked eye inspection and maintenance records. Therefore, diagnostic accuracy and capacity can be improved. The proposed methodology is applied to the fault diagnosis of a gear pump and the structure and parameters of the Bayesian network is established. Compared with artificial neural network and support vector machine classification algorithms, the proposed model has the best diagnostic performance when sensor data is used only. A case study has demonstrated that some information from human observation or system repair records is very helpful to the fault diagnosis. It is effective and efficient in diagnosing faults based on uncertain, incomplete information.

Entities:  

Mesh:

Year:  2015        PMID: 25938760      PMCID: PMC4418566          DOI: 10.1371/journal.pone.0125703

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

As the key element of hydraulic system, a pump is responsible for the mechanical to hydraulic energy conversion process. Gear pumps are widely applied in modern industry owing to its advantages such as small and compact design, high efficiency and low manufacturing cost. The working status of a gear pump greatly affects the performance of the whole hydraulic system, thus it is necessary to develop its fault diagnosis technique. The condition of mechanical equipment is closely associated with vibration signals, which come about during rotating process for rotating machinery [1]. Therefore, fault diagnosis of gear pumps can be performed with the characteristic information extracted by the signal processing techniques such as short time Fourier transform [2], wavelet transform [3], blind source separation [4], sparse decomposition method [5] and empirical mode decomposition (EMD) [6]. EMD is a time-frequency signal processing technique. Compared with other signal processing methods, EMD is self-adaptive and especially suits for the non-stationary and non-linear signals. For sparse decomposition method, the algorithm is much more demanding and complex compared to EMD [7]. Based on the local characteristic time scales of a signal, the original vibration signal can be decomposed into several intrinsic mode functions (IMFs). Due to the adaptive analysis and high robustness nature, EMD has been widely applied in fault diagnosis of rotating machinery [8, 9]. However, EMD suffers from the mode mixing problem, which means a single IMF either consisting of widely disparate scales, or a signal of a similar scale residing in different IMF components [10]. In order to alleviate the problem of mode mixing in EMD, Wu and Huang [11] propose ensemble empirical mode decomposition (EEMD) method. Essentially, white noise of finite amplitude is added to the original signal during the EEMD decomposition process. The ensemble means of the corresponding IMFs generated from each trial are defined as the true IMFs of the EEMD [12]. Lei et al [13] employ EEMD in diagnosing rub-impact faults in a power generator and a heavy oil catalytic cracking machine set. Compared with EMD method, it is demonstrated that EEMD has superiority in fault diagnosis of rotating machinery. Caesarendra et al. [14] apply EEMD method in two real cases of slow speed slewing bearing with natural bearing fault damage and the results show that EEMD is better than FFT in identifying fault frequencies. Mahgoun et al. [15] present the application of EEMD in purpose to detect localized faults of damage at an early stage. Although previous research jobs on rotating machinery have produced significant outcomes, only information from sensor measurement is used for fault diagnosis. However, other sources of information besides sensor measurement could be useful for fault diagnosis, which are obviously not fully utilized. For example, the observation information from human or the maintenance record would make the diagnosis results more reliable. Obviously, the more evidences are used, the more accurate the diagnostic results will be. Recently, multi-source information fusion fault diagnosis systems based on Bayesian networks have been developed in some fields. To take advantage of all useful information, a three-layered Bayesian network has been presented for fault diagnosis of a chiller [16] and variable air volume terminals [17]. Xu [18] develops an intelligent expert system of rotating flexible rotors based on Bayesian network by fully incorporating human experts’ knowledge, machine faults and faults symptoms as well as machine running conditions. Cai et al. [19] propose a multi-source information fusion based fault diagnosis methodology for ground-source heat pump by making use of sensor information and observation information. Oukhellou et al. [20] present a hybrid diagnosis system based on the combination of local sensor data information and global structural knowledge information for the detection of broken rail. Bayesian network is an acyclic directed graph consisting of a set of variables with directed edges between the variables. It is a powerful tool for modeling complex problems in probabilistic knowledge representation and reasoning [21]. It has been widely used for fault diagnosis in a variety of fields. Sun et al. [22] develop a Mild Congnitive Impairment (MCI) expert system based on Bayesian network to address MCI’s prediction and inference question and the experimental results indicate that the proposed model achieved better results than some existing methods in most instances. Barco et al. [23] propose a discrete Bayesian network for diagnosis of radio access networks of cellular systems and the research shows that the developed model outperform traditional Bayesian network when there is inaccuracy in the model parameters. Sahin et al. [24] develop a Bayesian network for fault diagnosis of airplane engines and the results show that the proposed model can detect the anomalies or faults in the sensor readings. Kariv et al. [25] develop a computerized decision support system for the diagnosis of infections among solid organ transplant recipients based on Bayesian network. In this paper, a fault diagnosis methodology for gear pumps based on EEMD method and Bayesian network is proposed to make full use of multi-source information. One of the weak points of Bayesian network is that there is no specific semantic to guide the model development [26].This paper presents a three-layered diagnostic Bayesian network for model development, which is composed of fault layer, fault feature layer and multi-source information layer. With the proposed framework, Bayesian network for fault diagnosis can be easily developed. Vibration signals from sensor measurement are decomposed by the EEMD method and the features of IMFs are extracted. The obtained fault features and other multi-sources information are entered into fault feature layer and multi-source information layer, respectively. The reminder of this paper is organized as follows. In Section 2, EEMD method and Bayesian network are introduced. Section 3 proposes the fault diagnosis methodology and applies it to a gear pump. In Section 4, fault diagnosis based on the developed model is performed. Section 5 summarizes the paper.

EEMD and Bayesian Network

EEMD algorithm and feature extraction method

EEMD is proposed to overcome the mode mixing problem, which is defined as a single IMF including oscillations of dramatically disparate scales or a component of a similar scale residing in different IMFs. Essentially, white noise of finite amplitude is added to the original signal during the EEMD decomposition process. In fact, to make the EEMD effective, the amplitude of the added noise should not be too small. In most cases, white noise of an amplitude that is about 0.2 standard deviation of that of the data is suggested. However, when the data is dominated by high-frequency signals, the noise amplitude may be smaller, and when the data is dominated by low-frequency signals, the noise amplitude may be increased. Generally speaking, the range of standard deviation is 0.1–0.4 [11]. The ensemble means of the corresponding IMFs generated from each trial are defined as the true IMFs of the EEMD. The EEMD performance of overcoming the mode mixing problem has been demonstrated [13, 27]. The EEMD algorithm can be given as follows [15]. Determine the number of ensemble M and initialize the amplitude of the added white noise, and m = 1. Perform the mth trial on the investigated signal added white noise. Add a white noise series with the given amplitude to the original signal. where n (t) represents the mth added white noise series and x (t) denotes the noise-added signal of the mth trial. With the EMD method [28], the noise-added signal x (t) is decomposed into N IMFs c (t)(i=1,2,…,I), where c (t) represents the nth IMF of the mth trial, and N is the number of IMFs. If n < M then let m = m + 1. Repeat steps (2) and (3) again and again with different white noise series each time until n = M. Calculate the ensemble mean c (t) of the M trials for each IMF Report the mean a (t) (i = 1, 2,…, N) of each of the N IMFs as the final IMFs. According to the above steps, a vibration signal measured from a gear pump is decomposed and the decomposition result is given in Fig 1. It shows 8 IMFs in different frequency bands decomposed by EEMD algorithm. It can be seen from the figure that the original signal is very complicated and the decomposed IMFs are hard to use for fault diagnosis. Hence, features of the signals need to extract.
Fig 1

The decomposed components with EEMD and original signal.

Feature extraction is an important step for fault diagnosis. IMFs decomposed by EEMD contain valid information for fault diagnosis. The analysis results from EEMD energy of different vibration signals indicate that the energy of a vibration signal will change in different frequency bands when a fault occurs. It means that for the same faults, the decomposed IMFs are similar in the corresponding frequency band. Therefore, the energy of the decomposed IMFs could be used as features for fault diagnosis. E is the energy of the ith IMF. Then the feature vector of the investigated signal T = [E 1, E 2, E 3, … E ] is obtained.

Bayesian network

Bayesian network is a directed acyclic graph that is composed of a set of variables {X 1, X 2, …, X } and a set of directed edges between the variables [29, 30]. A variable has several possible states, such as true and false. Bayesian networks are very successful in probabilistic knowledge representation and reasoning. In Bayesian networks, the joint probability distribution function of all nodes can be calculated by Eq. (4). Where Pa is the set of random variables whose corresponding nodes are parent nodes of X . A Bayesian network contains two elements, namely structure and parameters. An example shown in Fig 2 is used to illustrate the basic idea of Bayesian networks. In Fig 2, the nodes (X1, X2, X3, X4) represent random variables and arcs means dependence relationships among them. Each arc starts from a parent node and ends at a child node. Pa(X) represents the parent nodes of node X, therefore, Pa(X2) = {X1}, Pa(X3) = {X1}, Pa(X4) = {X2, X3}. X1 is the root node, because it has no input arcs. Each node has two states: state0 and state1. Root nodes have prior probabilities. Each child node has conditional probabilities based on the combination of states of its parent nodes.
Fig 2

A simple Bayesian network.

The Proposed Fault Diagnosis Methodology and Its Application

The proposed methodology

Fault diagnosis methods in previous research are mainly based on fault features extracted by some signal processing techniques. In this paper a fault diagnosis methodology for gear pump based on EEMD and Bayesian network is proposed. Flow chart of the methodology is shown in Fig 3. To establish the Bayesian network, the investigated faults, fault features from sensor data and multi-source information are integrated into the diagnostic model. According to the evidences obtained from the fault features and multi-source information, a fault of the gear pump could be diagnosed.
Fig 3

Flow chart of the proposed fault diagnosis methodology.

The proposed methodology consists of fault detection, signal processing and fault diagnosis. For fault detection, a sensor is responsible for monitoring the vibration signal of the gear pump, and then the data is decomposed by EEMD method to obtain its IMFs. Fault feature extraction is accomplished by calculating the energy of IMFs according to Eq. (3). Actually, the diagnostic model includes three layers: fault layer, fault feature layer and multi-source information layer. Features decomposed by EEMD method will be entered into the fault feature layer. Fault layer includes the common faults to identify. Obviously, the more faults to diagnose, the more complicated the diagnosis system will be. For multi-source information, all the factors such as human observation information, system maintenance information or abnormal operation records are directly related to the probability of occurrence of the faults. For example, tooth face wear in fault layer is less possible to appear if the gear has been replaced by a new one during the recent maintenance.

Experiment and feature extraction

The proposed methodology is applied to a gear pump with the type of WCB-50. In this paper, four common fault reasons including tooth face wear (TFW), cavitation (CA), oil pollution (OP) and wear of internal surface of shaft sleeve (WISSS) are investigated. To obtain the data sets, these four fault reasons are simulated respectively. TFW is simulated by grinding one of the meshing surfaces of driving gear. CA is simulated by loosening the oil pump inlet. By adding pollutants into the working oil, OP is simulated. WISSS is simulated by grinding the internal surface of shaft sleeve. A piezoelectric acceleration sensor is used for collecting the vibration signal and it is connected to a dynamic test and analysis system with the type of DH5923. The sampling frequency is set as 10 kHz. Each fault has 100 training and 50 testing instances. There are 400 training and 200 testing samples in total. Vibration signals of the gear pump with different faults and in normal condition are plotted in Fig 4.
Fig 4

Vibration signals of the gear pump in different conditions.

According to the EEMD algorithm and feature extraction process described in Section 2.1, the vibration signals from different conditions are decomposed. Because the last few IMFs contain very little energy, which are useless for fault diagnosis, only the first 8 IMFs are selected for each signal. Therefore 8 features based on the energy of IMFs are calculated, which are used to identify the faults. Before a feature is entered into the Bayesian network, it is discretized according to the range of values of the data samples. Although increasing the number of intervals can improve the accuracy, it will increase the burden for building the Bayesian network. To balance the accuracy and difficulty of developing the Bayesian network, 6 intervals are determined. After discretization, the extracted feature can be denoted by one of the six numbers (1, 2,…, 6). Table 1 is the training samples of discretized features of four faults. In the table, feature i is denoted by Feai (i = 1,2,…,8). The testing samples can be obtained in the same way.
Table 1

Training samples of discretized features of four faults.

SamplesFea1Fea2Fea3Fea4Fea5Fea6Fea7Fea8Conditions
143552241TFW
243661351TFW
343662342TFW
39861121121WISSS
39961221111WISSS
40061121121WISSS

Bayesian network structure

The structure of the Bayesian network is a graphic illustration about the qualitative relationships nodes in different layers. The Bayesian diagnostic network based on the proposed methodology is shown in Fig 5. The developed Bayesian network has three layers: fault feature layer, fault layer and multi-source information layer. The directed arc denotes that each parent node will cause changes of the child nodes.
Fig 5

Developed Bayesian network for gear pump.

The fault layer includes four nodes, indicating the investigated faults. After the nodes are determined, the states of each node should be defined. In the fault layer, each node has two states, namely present and absent, indicating the presence and absence of the fault, respectively. The fault feature layer consists of eight child nodes, indicating eight features extracted from sensor signals using EEMD method. Each feature node has six states (state1-state6), representing its interval that the energy value of IMF belongs to. Multi-source information useful to diagnose the gear pump could be added into the information layer. In this paper, human observation and repair service information of the gear pump are adopted. Five casual factors are selected, namely gear replacement (GR), oil pipe folding (OPF), oil replacement (OR), shaft sleeve replacement (SSR) and noise level (NL).The events that the nodes represent are listed in Table 2. Each node has two states: yes and no.
Table 2

Nodes and their states in the multi-source information layer.

NodeEventStatePrior probability
GRIs the wear replaced by a new one in the last maintenance?Yes5%
No95%
OPFIs the oil pipe of the gear pump folded?Yes6%
No94%
ORIs the oil replaced by new one in the last maintenance?Yes5%
No95%
SSRIs the shaft sleeve replaced by a new one in the last maintenance?Yes3%
No97%
NLIs the noise level high?Yes3%
No97%

Bayesian network parameters

When the structure of Bayesian network is established, the prior probabilities and conditional probabilities are required to specify. A prior probability is the probability that an event occurs without new evidence or information. Usually, prior probabilities are determined by the experts or statistical analysis of historical data. Since historical data is hardly available, prior probabilities are often obtained according to the expert knowledge [16, 17]. It is obvious that the higher the prior probability of an event, the more likely it is to occur. In this paper, prior probabilities of the nodes in the information layer are determined in Table 2. A conditional probability is the probability that an event occurs for the given new evidence. The conditional probabilities among the nodes in the multi-source information layer and the nodes in the fault layer are set according to the knowledge and experience of authors. They are shown in Tables 3–6.
Table 3

Conditional probability table of node TFW.

FaultGRYESNO
ORYESNOYESNO
TFWPresent0.010.030.050.1
Absent0.990.970.950.9
Table 6

Conditional probability table of node WISSS.

FaultSSRYESNO
ORYESNOYESNO
WISSSPresent0.010.030.050.1
Absent0.990.970.950.9
The conditional probabilities of a child node depend on all the possible combination of states of its parents. For instance, Feature 1 in the fault feature layer has four parent nodes. Therefore, it has 96 (6*24) conditional probabilities and the 8 feature nodes need 768 parameters in total. To reduce the number of parameters need to specify conditional probabilities, Noisy-MAX node is applied [31]. The nodes in the fault feature layer are set as Noisy-MAX nodes. Hence, conditional probabilities calculated statistically using the training samples are used as parameters for those Noisy-MAX nodes. They are listed in Table 7.
Table 7

The conditional probability among nodes in the fault layer and feature layer.

FaultStateFea1Fea2Fea3Fea4Fea5Fea6Fea7Fea8
TFW10.010.010.010.010.170.30.010.76
20.010.010.010.010.730.610.010.2
30.130.610.080.070.070.060.110.01
40.750.350.510.610.010.010.590.01
50.090.010.330.250.010.010.260.01
60.010.010.060.050.010.010.020.01
CA10.010.050.260.760.760.950.950.95
20.010.860.70.20.20.010.010.01
30.010.060.010.010.010.010.010.01
40.010.010.010.010.010.010.010.01
50.650.010.010.010.010.010.010.01
60.310.010.010.010.010.010.010.01
OP10.010.010.020.030.040.890.950.81
20.030.010.690.680.590.070.010.15
30.30.210.260.260.320.010.010.01
40.570.580.010.010.030.010.010.01
50.080.180.010.010.010.010.010.01
60.010.010.010.010.010.010.010.01
WISSS10.010.70.680.010.870.930.080.76
20.010.260.280.60.090.030.880.2
30.010.010.010.360.010.010.010.01
40.010.010.010.010.010.010.010.01
50.010.010.010.010.010.010.010.01
60.950.010.010.010.010.010.010.01

Fault Diagnosis and Discussion

Fault diagnosis only using fault features

Take a feature set of TFW fault as a testing sample, T = {4, 4, 4, 5, 2, 1, 4, 1} and perform diagnosis only using these features. The diagnostic results are shown in Fig 6. It indicates that the most suspected fault is TFW (98.2%). The diagnostic result is accurate. It demonstrates that the developed Bayesian network only using the sensor data has good performance on identifying the faults.
Fig 6

Diagnostic results of a testing sample using only fault features.

To test the effectiveness of the model only using evidences from the fault feature layers like other researchers usually did, 200 testing instances are used. Each type of fault has 50 samples. In order to reflect the model superiority, it is necessary to build other models to compare with the proposed model. Recently, some intelligent classification algorithms, such as artificial neural network (ANN) and support vector machine (SVM) have been successfully applied to the intelligent fault diagnosis of mechanical equipment [32-34]. Features of the investigated signals are dealt with ANN or SVM to recognize the health conditions of the objects. In this paper, ANN and SVM are applied to train and test the same samples as the Bayesian network did. The test results are shown in Table 8. It demonstrates that the proposed method based on Bayesian network achieves the best diagnostic performance. Besides, the average diagnostic accuracy of ANN, SVM and Bayesian network is 94%, 95%,98.5%, respectively. The developed Bayesian network improves the average diagnosis accuracy by 4.5% and 3.5%, respectively, compared with ANN and SVM. The comparison result indicates that the proposed method outperforms the other two common methods in diagnosing different categories of gear pump faults.
Table 8

Testing accuracy ANN, SVM and Bayesian network.

FaultSamplesANNSVMBayesian network
TFW5090%100%100%
CA5096%100%98%
OP5098%98%100%
WISSS5092%82%96%

Fault diagnosis using fault features and multi-source information

Take a feature set as an example, T = {5, 3, 3, 1, 1, 2, 6, 1} and perform diagnosis using the developed Bayesian diagnostic network. The steps are shown in Figs 7–9. At step 1 in Fig 7, only fault features are applied for fault diagnosis. Probabilities of the presence of TFW, CA, OP, WISSS is 64%, 46.5%, 3.83%, 0.017%, respectively. So, it’s hard to tell whether TFW or CA occurs without other useful information. At step 2 in Fig 8, a new evidence (GR is yes) in the information layer is added. The most suspected fault is CA (48.3%). The fault probability of TFW (33.8%) is decreased under the new evidence. As shown in Fig 9, at step 3 (added evidence: OPF is yes), the fault probability of CA increases from 48.3% to 98.3%. It is because the new evidence is a unique cause for CA. It demonstrates that multi-source information is helpful to fault diagnosis.
Fig 7

Step 1 of the fault diagnosis for the case.

Fig 9

Step 3 of the fault diagnosis for the case.

Fig 8

Step 2 of the fault diagnosis for the case.

Conclusions

The main contribution of this paper is that a methodology based on Bayesian network and EEMD for fault diagnosis is presented. The advantages of Bayesian network and EEMD are integrated. Compared with the other conventional fault diagnosis methods, the presented methodology is able to make use of more useful information besides sensor signals. Essentially, the presented scheme based on EEMD and Bayesian network is a multi-source information fusion based methodology. With the proposed three-layered Bayesian network framework, some useful information (including naked eyes inspection, maintenance records, etc) can be helpful to identify the fault. The proposed method has been applied to the fault diagnosis of gear pump and it is effective and efficient based on vibration signals and other information. The proposed methodology is applied to fault diagnosis of a gear pump. The developed diagnostic Bayesian network has three layers, namely fault feature layer, fault layer and multi-source information layer. Sensor signals and other helpful information for diagnosis could be added into the networks. When fault features extracted from EEMD method are only used, the developed model has better diagnostic performance than ANN and SVM classification algorithms. It improves the average diagnosis accuracy by 4.5% and 3.5%, respectively, compared with ANN and SVM. Sometimes, it may be hard to distinguish the faults only based on the sensor signals. A case study has demonstrated that some information from human observation or system maintenance records is very helpful to the fault diagnosis.
Table 4

Conditional probability table of node CA.

FaultOPFYESNO
NLYESNOYESNO
CAPresent0.950.90.10.08
Absent0.050.10.90.92
Table 5

Conditional probability table of node OP.

FaultORYESNO
NLYESNOYESNO
OPPresent0.070.050.120.1
Absent0.930.950.880.9
  2 in total

1.  A model for diagnosis of pulmonary infections in solid-organ transplant recipients.

Authors:  Galia Kariv; Vered Shani; Elad Goldberg; Leonard Leibovici; Mical Paul
Journal:  Comput Methods Programs Biomed       Date:  2010-07-31       Impact factor: 5.428

2.  Risk analysis of a patient monitoring system using Bayesian Network modeling.

Authors:  I Maglogiannis; E Zafiropoulos; A Platis; C Lambrinoudakis
Journal:  J Biomed Inform       Date:  2005-11-15       Impact factor: 6.317

  2 in total
  4 in total

1.  Computational Effective Fault Detection by Means of Signature Functions.

Authors:  Przemyslaw Baranski; Piotr Pietrzak
Journal:  PLoS One       Date:  2016-03-07       Impact factor: 3.240

2.  Application of a Tabu search-based Bayesian network in identifying factors related to hypertension.

Authors:  Jinhua Pan; Huaxiang Rao; Xuelei Zhang; Wenhan Li; Zhen Wei; Zhuang Zhang; Hao Ren; Weimei Song; Yuying Hou; Lixia Qiu
Journal:  Medicine (Baltimore)       Date:  2019-06       Impact factor: 1.817

3.  Multi-Sensor Data Fusion Identification for Shearer Cutting Conditions Based on Parallel Quasi-Newton Neural Networks and the Dempster-Shafer Theory.

Authors:  Lei Si; Zhongbin Wang; Xinhua Liu; Chao Tan; Jing Xu; Kehong Zheng
Journal:  Sensors (Basel)       Date:  2015-11-13       Impact factor: 3.576

4.  Improving forecasting accuracy for stock market data using EMD-HW bagging.

Authors:  Ahmad M Awajan; Mohd Tahir Ismail; S Al Wadi
Journal:  PLoS One       Date:  2018-07-17       Impact factor: 3.240

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.