| Literature DB >> 26393603 |
Mariela Cerrada1,2, René Vinicio Sánchez3,4, Diego Cabrera5, Grover Zurita6, Chuan Li7.
Abstract
There are growing demands for condition-based monitoring of gearboxes, and techniques to improve the reliability, effectiveness and accuracy for fault diagnosis are considered valuable contributions. Feature selection is still an important aspect in machine learning-based diagnosis in order to reach good performance in the diagnosis system. The main aim of this research is to propose a multi-stage feature selection mechanism for selecting the best set of condition parameters on the time, frequency and time-frequency domains, which are extracted from vibration signals for fault diagnosis purposes in gearboxes. The selection is based on genetic algorithms, proposing in each stage a new subset of the best features regarding the classifier performance in a supervised environment. The selected features are augmented at each stage and used as input for a neural network classifier in the next step, while a new subset of feature candidates is treated by the selection process. As a result, the inherent exploration and exploitation of the genetic algorithms for finding the best solutions of the selection problem are locally focused. The Sensors 2015, 15 23904 approach is tested on a dataset from a real test bed with several fault classes under different running conditions of load and velocity. The model performance for diagnosis is over 98%.Entities:
Keywords: fault diagnosis; feature selection; gearbox; genetic algorithms; neural networks; vibration signal
Year: 2015 PMID: 26393603 PMCID: PMC4610427 DOI: 10.3390/s150923903
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1General cycle of the GA.
Figure 2Basic architecture of an NN for classification.
Figure 3Vibration analysis laboratory at the Universidad Politécnica Salesiana in Cuenca, Ecuador.
Simulated gear faults.
| Label | Description |
|---|---|
| f0 | Normal |
| f1 | Gear crack 0.5 mm |
| f2 | Gear tooth breakage 10% |
| f3 | Pinion pitting |
| f4 | Pinion with face wear 0.5 mm |
| f5 | Gear misalignment |
| f6 | Gear tooth breakage 50% |
| f7 | Gear tooth breakage 100% |
Gear faults.
| Image | Description |
|---|---|
| Gear crack 0.5 mm | |
| Gear tooth breakage 10% | |
| Pinion pitting | |
| Pinion with face wear 0.5 mm | |
| Gear misalignment | |
| Gear tooth breakage 50% | |
| Gear tooth breakage 100% |
Figure 4Feature extraction process from the vibration signal.
Figure 5The wavelet packet decomposition.
Figure 6Multi-stage approach for feature selection in fault diagnosis.
Figure 7Individual encoding.
Sets of the initial population. db7, Daubechies; sym3, symlet; coif4, Coifier; bior6.8, biorthogonal; rbior6.8, reverse biorthogonal.
| Number of Stages | Number of Features | Condition Parameters |
|---|---|---|
| 1 | 39 | 20 RMS values from frequency bands + 10 from the frequency domain |
| + 9 from the time-domain | ||
| 2 | 320 | energy from wavelets coefficients (db7 + sym3 + coif4 + bior6.8 + rbior6.8) |
| 3 | 256 | energy from wavelets coefficients (db7 + sym3 + coif4 + bior6.8) |
Figure 8Case study: Final results.
Figure 9Fitness function performance.
Best individual by using multi-stage selection.
| Parameters | Total Number | Description |
|---|---|---|
| RMS from frequency bands | 12 | −183 Hz (1st band) |
| 733 Hz to 915 Hz (5th band) | ||
| 915 Hz to 1098 Hz (6th band) | ||
| 1281 Hz to 2702 Hz (8th band to 14th band) | ||
| 2745 Hz to 3111 Hz (16th band to 17th band) | ||
| Frequency | 2 | Standard deviation |
| Skewness | ||
| Time | 4 | Mean |
| Standard deviation | ||
| Root Mean Square | ||
| Crest Factor | ||
| Energy from Wavelets | 144 | 36 coefficients from db7 |
| coefficients | 37 coefficients from sym3 | |
| 35 coefficients from coif4 | ||
| 36 coefficients from bior6.8 |
Figure 10Confusion matrix for the test set, at each stage.
Classifier performance by using multi-stage selection.
| Number of | Number of | Number of Available | Number of | Number of | |
|---|---|---|---|---|---|
| 1 | 0 | 39 | 18 | 43 | 0.9444 |
| 2 | 18 | 320 | 274 | 30 | 0.9778 |
| 3 | 18 | 256 | 162 | 36 | 0.9889 |
Classifier performance by using classical feature selection with GA.
| Number of
| Number of
| Number of
| Precision | Sensibility | |
|---|---|---|---|---|---|
| 320 | 156 | 63 | 0.9611 | 0.9611 | 0.9611 |
| 359 | 193 | 50 | 0.9778 | 0.9778 | 0.9778 |
Figure 11(a) Importance variables computed by the RF algorithm; (b) out-of-bag error in the training phase.
Classification performance by using RF-based feature selection and the NN-based diagnoser.
| Number of Features | Precision with 40 Hidden Nodes | Precision with 60 Hidden Nodes |
|---|---|---|
| 6 | 0.5000 | 0.5185 |
| 31 | 0.8278 | 0.8284 |
| 88 | 0.9037 | 0.9111 |
| 162 | 0.9389 | 0.9593 |
| 193 | 0.9481 | 0.9585 |
| 359 | 0.9481 | 0.9593 |
Best 162 features selected by RF.
| Parameters | Total Number | Description | Number of Coincident Parameters |
|---|---|---|---|
| RMS from frequency bands | 12 | 2nd band | 0 |
| 5th band to 7th band | 2 | ||
| 9th band | 1 | ||
| 11th band to 17th band | 5 | ||
| Frequency | 1 | Standard deviation | 1 |
| Time | 0 | ——– | 0 |
| Energy from Wavelets | 149 | 39 coefficients from db7 | 22 |
| coefficients | 39 coefficients from sym3 | 22 | |
| 24 coefficients from coif4 | 22 | ||
| 24 coefficients from bior6.8 | 22 | ||
| 23 coefficients from rbior6.8 | 0 |