| Literature DB >> 35455173 |
Hosameldin O A Ahmed1, Asoke K Nandi2,3.
Abstract
As failures of rolling bearings lead to major failures in rotating machines, recent vibration-based rolling bearing fault diagnosis techniques are focused on obtaining useful fault features from the huge collection of raw data. However, too many features reduce the classification accuracy and increase the computation time. This paper proposes an effective feature selection technique based on intrinsic dimension estimation of compressively sampled vibration signals. First, compressive sampling (CS) is used to get compressed measurements from the collected raw vibration signals. Then, a global dimension estimator, the geodesic minimal spanning tree (GMST), is employed to compute the minimal number of features needed to represent efficiently the compressively sampled signals. Finally, a feature selection process, combining the stochastic proximity embedding (SPE) and the neighbourhood component analysis (NCA), is used to select fewer features for bearing fault diagnosis. With regression analysis-based predictive modelling technique and the multinomial logistic regression (MLR) classifier, the selected features are assessed in two case studies of rolling bearings vibration signals under different working loads. The experimental results demonstrate that the proposed method can successfully select fewer features, with which the MLR-based trained model achieves high classification accuracy and significantly reduced computation times compared to published research.Entities:
Keywords: compressive sampling (CS); feature selection; multinomial logistic regression (MLR); rolling bearing fault diagnosis; vibration-based condition monitoring
Year: 2022 PMID: 35455173 PMCID: PMC9027945 DOI: 10.3390/e24040511
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.738
Figure 1A typical roller bearing [4].
Figure 2Rolling element bearing geometry [6].
Figure 3The proposed method.
Figure 4Single measurement vector compressive sampling framework [37].
Figure 5The test rig used to collect the vibration data of bearings of the first case study [4].
The characteristics of bearings’ health conditions in the obtained bearing dataset.
| Condition | Characteristic |
|---|---|
| NO | The bearing was brand new and in perfect condition. |
| NW | The bearing was in service for some time but in good condition. |
| IR | Inner race fault. This fault was created by cutting a small groove in the raceway of the inner race. |
| OR | Outer race fault. This fault was created by cutting a small groove in the raceway of the outer race. |
| RE | Roller element fault. This fault was created by using an electrical etcher to mark the surface of the balls, simulating corrosion. |
| CA | Cage fault. This fault was created by removing the plastic cage from one of the bearings, cutting away a section of the cage so that two of the balls were not held in a regular space and had freedom to move. |
Figure 6Typical time domain vibration signals for the six different conditions [4].
Figure 7Example of the average loss values versus values computed from the reduced dimension of compressively sampled data with α = 0.2.
Figure 8Example of the selected features and their corresponding weights using α = 0.2 and NCA tolerance value = 0.02.
Examples of the computed values of the average intrinsic dimension, the dimension of the NCA-based selected features, least loss, and best values taken from 10 trials.
| NCA Tolerance Value | CS Sampling Rate
| Average | The Average Dimension of NCA-Based Selected Features ( | Average Least Loss | Average Best Lambda for NCA |
|---|---|---|---|---|---|
| 0.01 | 0.1 | 28 | 8 | 0.013 | 0.004 |
| 0.2 | 40 | 10 | 0.009 | 0.004 | |
| 0.3 | 26 | 8 | 0.010 | 0.003 | |
| 0.02 | 0.1 | 62 | 18 | 0.014 | 0.003 |
| 0.2 | 55 | 14 | 0.013 | 0.003 | |
| 0.3 | 33 | 11 | 0.013 | 0.003 |
Classification results with their corresponding RMSE and computational time for the automatically selected features (d refers to the intrinsic dimension, and f is the dimension of the NCA-based selected feature) using two values of NCA tolerance and three compressive sampling rates.
| NCA Tolerance Value | CS Sampling Rate
|
|
| MLR Classifier | |||
|---|---|---|---|---|---|---|---|
| Training Accuracy | Training | Testing Accuracy | Testing | ||||
| 0.01 | 0.1 | 28 | 8 | 99.8 ± 0.3 | 5.34 ± 1.7 | 99.5 ± 0.6 | 0.015 ± 0.002 |
| 0.2 | 40 | 10 | 99.9 ± 0.1 | 4.6 ± 2.3 | 99.7 ± 0.3 | 0.003 ± 0.00 | |
| 0.3 | 26 | 8 | 100 ± 0.0 | 3.3 ± 0.5 | 99.9 ± 0.1 | 0.003 ± 0.001 | |
| 0.02 | 0.1 | 62 | 18 | 99.9 ± 0.2 | 3.37 ± 0.8 | 99.7 ± 0.3 | 0.003 ± 0.001 |
| 0.2 | 55 | 14 | 99.9 ± 0.1 | 3.55 ± 0.9 | 99.8 ± 0.2 | 0.003 ± 0.001 | |
| 0.3 | 33 | 11 | 100 ± 0.0 | 4.6 ± 2.0 | 100 ± 0.0 | 0.004 ± 0.003 | |
Sample confusion matrices of the classification results of MLR classifier using selected features with tolerance value = 0.01 and a sampling rate of (a) , (b) , and (c) .
| NO | NW | IR | OR | RE | CA | |
|---|---|---|---|---|---|---|
|
| 80 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 80 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 80 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 80 | 0 | 0 |
|
| 0 | 0 | 1 | 0 | 79 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 80 |
| (a) | ||||||
|
|
|
|
|
|
| |
|
| 80 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 78 | 0 | 0 | 0 | 2 |
|
| 0 | 0 | 80 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 80 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 80 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 80 |
| (b) | ||||||
|
|
|
|
|
|
| |
|
| 80 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 80 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 80 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 80 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 80 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 80 |
| (c) | ||||||
A comparison with the classification results from the literature on the vibration bearing dataset of the first case study.
| Ref | Method | Testing Accuracy | Testing Time |
|---|---|---|---|
| [ | Raw vibration with entropic features + SVM | 98.9 ± 1.2 | _ |
| Compressed sampled with | 92.4 ± 0.5 | ||
| Compressed sampled with | 84.6 ± 0.41 | ||
| [ | GP generated feature sets (un-normalised data) | ||
| ANN | 96.5 | ||
| SVM | 97.1 | ||
| [ | FMM-RF SamEn | 99.7 ± 0.02 | _ |
| PS | 99.7 ± 0.50 | ||
| SamEn + PS | 99.8 ± 0.41 | ||
| [ | CPDC (with 6000 inputs from FFT) | 99.4 ± 0.5 | 64.9 |
| CS-CPDC | 99.8 ± 0.2 | 6.7 | |
| α = 0.2 | 99.9 ± 0.1 | 7.8 | |
| [ | With FFT, | _ | |
| CS-FS | 99.7 ± 0.4 | ||
| CS-LS | 99.5 ± 0.3 | ||
| CS-Relief-F | 99.8 ± 0.2 | ||
| CS-PCC | 99.8 ± 0.3 | ||
| CS-Chi-2 | 99.5 ± 0.5 | ||
| [ | Feature selection (with | _ | |
| α = 0.1 and feature dimension = 14 | 98.8 ± 2.4 | ||
| α = 0.2 and feature dimension = 13 | 99.9 ± 0.2 | ||
| α = 0.3 and feature dimension = 26 | 99.9 ± 0.1 | ||
| Our proposed method with | |||
| MLR classifier | 99.5 ± 0.6 | 0.015 | |
| SVM classifier | 99.5 ± 0.5 | 0.060 | |
| Our proposed method with | |||
| MLR classifier | 99.7 ± 0.3 | 0.003 | |
| SVM classifier | 99.8 ± 0.2 | 0.040 | |
| Our proposed method with | |||
| MLR classifier | 100 ± 0.0 | 0.003 | |
| SVM classifier | 100 ± 0.0 | 0.030 |
Description of the bearing health conditions of the bearing vibration dataset used in the second case study.
| Health Condition | Fault Width (mm) | Classification Label |
|---|---|---|
| NO | 0 | 1 |
| RE1 | 0.18 | 2 |
| RE2 | 0.36 | 3 |
| RE3 | 0.53 | 4 |
| RE4 | 0.71 | 5 |
| IR1 | 0.18 | 6 |
| IR2 | 0.36 | 7 |
| IR3 | 0.53 | 8 |
| IR4 | 0.71 | 9 |
| OR1 | 0.18 | 10 |
| OR2 | 0.36 | 11 |
| OR3 | 0.53 | 12 |
Examples of the computed values of the average intrinsic dimension, the dimension of the NCA-based selected features, least loss, and best values taken from 10 trials for datasets A, B, C, and D.
| Dataset | NCA Tolerance Value | CS Sampling Rate
|
|
| Average Least Loss | Average Best Lambda for NCA |
|---|---|---|---|---|---|---|
| A | 0.01 | 0.1 | 13 | 6 | 0.001 | 0.0011 |
| 0.2 | 15 | 7 | 0.000 | 0.0009 | ||
| 0.3 | 15 | 9 | 0.001 | 0.0006 | ||
| 0.02 | 0.1 | 18 | 10 | 0.000 | 0.0009 | |
| 0.2 | 21 | 12 | 0.000 | 0.0004 | ||
| 0.3 | 25 | 14 | 0.000 | 0.0003 | ||
| B | 0.01 | 0.1 | 17 | 7 | 0.000 | 0.0007 |
| 0.2 | 19 | 9 | 0.001 | 0.0006 | ||
| 0.3 | 24 | 9 | 0.000 | 0.0002 | ||
| 0.02 | 0.1 | 28 | 11 | 0.000 | 0.0009 | |
| 0.2 | 23 | 10 | 0.000 | 0.0005 | ||
| 0.3 | 26 | 12 | 0.000 | 0.0003 | ||
| C | 0.01 | 0.1 | 16 | 5 | 0.001 | 0.0010 |
| 0.2 | 17 | 7 | 0.000 | 0.0006 | ||
| 0.3 | 23 | 8 | 0.000 | 0.0006 | ||
| 0.02 | 0.1 | 21 | 11 | 0.000 | 0.0009 | |
| 0.2 | 22 | 12 | 0.000 | 0.0004 | ||
| 0.3 | 27 | 14 | 0.000 | 0.0003 | ||
| D | 0.01 | 0.1 | 15 | 4 | 0.003 | 0.0041 |
| 0.2 | 18 | 5 | 0.001 | 0.0041 | ||
| 0.3 | 20 | 7 | 0.001 | 0.0015 | ||
| 0.02 | 0.1 | 17 | 9 | 0.001 | 0.003 | |
| 0.2 | 21 | 10 | 0.001 | 0.003 | ||
| 0.3 | 23 | 9 | 0.001 | 0.003 |
Figure 9Example of the average loss values versus values computed from the reduced dimension of compressively sampled data with α = 0.2 and dataset A.
Figure 10Example of the selected features from dataset A and their corresponding weights using α = 0.2 and NCA tolerance value = 0.02.
Classification results with their corresponding RMSE and computational time for the automatically selected features (d refers to the intrinsic dimension, and f is the dimension of the NCA-based selected feature) using two values of the NCA tolerance and compressive sampling rates for datasets A, B, C, and D (all classification accuracies of 100% are in bold).
| Dataset | NCA | CS Sampling Rate
|
|
| MLR Classifier | |
|---|---|---|---|---|---|---|
| Testing Accuracy | Testing Time | |||||
| A | 0.01 | 0.1 | 13 | 6 | 98.5 ± 0.7 | 0.002 |
| 0.2 | 15 | 7 | 99.9 ± 0.1 | 0.003 | ||
| 0.3 | 15 | 9 | 99.9 ± 0.1 | 0.002 | ||
| 0.02 | 0.1 | 18 | 10 | 99.6 ± 0.2 | 0.003 | |
| 0.2 | 21 | 12 | 99.8 ± 0.2 | 0.006 | ||
| 0.3 | 25 | 14 |
| 0.003 | ||
| B | 0.01 | 0.1 | 17 | 7 | 99.2 ± 0.7 | 0.002 |
| 0.2 | 19 | 9 | 99.5 ± 0.5 | 0.003 | ||
| 0.3 | 24 | 9 |
| 0.009 | ||
| 0.02 | 0.1 | 28 | 11 | 99.9 ± 0.1 | 0.003 | |
| 0.2 | 23 | 10 | 99.9 ± 0.1 | 0.006 | ||
| 0.3 | 26 | 12 |
| 0.003 | ||
| C | 0.01 | 0.1 | 16 | 5 | 99.7 ± 0.2 | 0.002 |
| 0.2 | 17 | 7 | 99.9 ± 0.1 | 0.003 | ||
| 0.3 | 23 | 8 |
| 0.002 | ||
| 0.02 | 0.1 | 22 | 11 | 99.9 ± 0.1 | 0.003 | |
| 0.2 | 27 | 12 | 99.9 ± 0.1 | 0.006 | ||
| 0.3 | 15 | 14 |
| 0.003 | ||
| D | 0.01 | 0.1 | 18 | 4 | 92.7 ± 2.9 | 0.002 |
| 0.2 | 20 | 5 | 99.1 ± 0.8 | 0.002 | ||
| 0.3 | 17 | 7 | 99.9 ± 0.1 | 0.002 | ||
| 0.02 | 0.1 | 21 | 9 | 99.9 ± 0.1 | 0.002 | |
| 0.2 | 23 | 10 | 99.9 ± 0.1 | 0.002 | ||
| 0.3 | 13 | 9 | 99.9 ± 0.1 | 0.002 | ||
Sample confusion matrices of the classification results of MLR classifier using selected features with tolerance value = 0.01 and a sampling rate of (a) , (b) , and (c) with the dataset A.
|
|
|
|
|
|
|
|
|
|
| |
|
| 100 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 100 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 100 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 100 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 100 | 0 | 0 | 0 | 0 | 0 |
|
| 1 | 0 | 1 | 0 | 0 | 97 | 0 | 0 | 1 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 99 |
| (a) | ||||||||||
|
|
|
|
|
|
|
|
|
|
| |
|
| 100 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 100 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 100 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 100 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 100 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 97 | 3 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
| (b) | ||||||||||
|
|
|
|
|
|
|
|
|
|
| |
|
| 100 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 100 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 100 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 100 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 100 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 100 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 100 |
| (c) | ||||||||||
A comparison with the classification results from the literature on the vibration bearing datasets A, B, C, and D of the second case study.
| Ref | Dataset | Method | Testing | Testing Time |
|---|---|---|---|---|
| A | 99.3 ± 0.6 | 5.7 | ||
| [ | B | CS-DNN with | 99.7 ± 0.5 | 5.9 |
| C | 100 ± 0.0 | 5.7 | ||
| [ | D | With FFT, | _ | |
| CS-FS | 98.4 ± 1.6 | |||
| CS-LS | 99.1 ± 0.8 | |||
| CS-Relief-F | 99.3 ± 0.6 | |||
| CS-PCC | 99.2 ± 0.8 | |||
| CS-Chi-2 | 97.5 ± 2.6 | |||
| [ | OFS-FSAR-SVM | _ | ||
| Selected features = 25 | 91.46 | |||
| Selected features = 50 | 69.58 | |||
| OFS-FSAR-PCA-SVM | ||||
| Selected features = 25 | 91.67 | |||
| Selected features = 50 | 69.79 | |||
| OFS-FSAR-LDA-SVM | ||||
| Selected features = 25 | 86.25 | |||
| Selected features = 50 | 92.70 | |||
| OFS-FSAR-LFDA-SVM | ||||
| Selected features = 25 | 93.75 | |||
| Selected features = 50 | 94.38 | |||
| OFS-FSAR-(SM-LFDA)-SVM | ||||
| Selected features = 25 | 94.58 | |||
| D | Selected features = 50 | 95.63 | ||
| [ | A | 99.95 ± 0.06 | _ | |
| B | 99.61 ± 0.21 | |||
| C | DNN | 99.74 ± 0.16 | ||
| A | 62.20 ± 18.09 | |||
| B | 61.95 ± 22.09 | |||
| C | BPNN | 69.82 ± 17.67 | ||
| [ | A | MLP | _ | |
| Our proposed method with | ||||
| A | feature dimension = 10 | 99.6 ± 0.2 | 0.003 | |
| The proposed | B | feature dimension = 11 | 99.9 ± 0.1 | 0.003 |
| method | C | feature dimension = 11 | 99.9 ± 0.1 | 0.003 |
| D | feature dimension = 9 | 99.9 ± 0.1 | 0.002 |