| Literature DB >> 29621131 |
Zhenyu Wu1, Yang Guo2, Wenfang Lin3, Shuyang Yu4, Yang Ji5,6.
Abstract
Predictive maintenance plays an important role in modern Cyber-Physical Systems (CPSs) and data-driven methods have been a worthwhile direction for Prognostics Health Management (PHM). However, two main challenges have significant influences on the traditional fault diagnostic models: one is that extracting hand-crafted features from multi-dimensional sensors with internal dependencies depends too much on expertise knowledge; the other is that imbalance pervasively exists among faulty and normal samples. As deep learning models have proved to be good methods for automatic feature extraction, the objective of this paper is to study an optimized deep learning model for imbalanced fault diagnosis for CPSs. Thus, this paper proposes a weighted Long Recurrent Convolutional LSTM model with sampling policy (wLRCL-D) to deal with these challenges. The model consists of 2-layer CNNs, 2-layer inner LSTMs and 2-Layer outer LSTMs, with under-sampling policy and weighted cost-sensitive loss function. Experiments are conducted on PHM 2015 challenge datasets, and the results show that wLRCL-D outperforms other baseline methods.Entities:
Keywords: PHM; class imbalance; cyber-physical system; deep learning; fault diagnosis; feature extraction; time series
Year: 2018 PMID: 29621131 PMCID: PMC5948747 DOI: 10.3390/s18041096
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Formulate machine fault diagnostic problems into classification problems: segment the multi-dimensional sensor signals into time windows with fault type labels.
Figure 2System model: pipeline overview of processing imbalanced fault diagnosis.
Figure 3Main architecture of the LR-ConvLSTM model.
Figure 4Descriptions of the datasets.
The ratio of faults and normal events in plant #1. PF means plant faulty samples and PN means plant normal samples.
| Event Type | PF1 | PF2 | PF3 | PF4 | PF5 | PF6 | PN |
|---|---|---|---|---|---|---|---|
| Ratio | 4.83% | 3.75% | 3.39% | 0.06% | 0.65% | 19.26% | 68.06% |
The suffix of methods used based on baseline classifiers.
| Suffix | Description | Parameters |
|---|---|---|
| -W | Cost-sensitive Weight Method | —– |
| -O | SMOTE-based Method | All faulty classes are over-sampled to 5000 |
| -D | Random Under-sampling Method | Normal class is under-sampled to 10,000 |
Comparisons of average precision, recall and F1 among different methods based on 24, 48 and 100 time window lengths.
| Method | Precision | Recall | F1 |
|---|---|---|---|
| XGBoost | 53.57% | 59.81% | 56.02% |
| LRCL | 51.92% | 64.31% | 55.36% |
| wLRCL | 66.80% | 75.04% | 69.87% |
| CNN-D | 80.86% | 80.88% | 80.81% |
| Easy-SMT | 84.19% | 84.38% | 84.0% |
| LRCL-O | 84.94% | 89.23% | 86.95% |
| DeepConvLSTM-D | 88.48% | 88.37% | 88.40% |
| LRCL-D | 95.51% | 97.30% | 97.29% |
| wLRCL-D | 98.42% | 98.46% | 98.46% |
Figure 5Confusion matrix of wLRCL-D (a); LRCL-D (b); DeepConvLSTM-D (c) and CNN-D (d).
Figure 6t-SNE cluster map of wLRCL-D (a); LRCL-D (b); LRCL-O (c) and wLRCL (d).
Figure 7The influence of window length on LRCL-D and wLRCL-D.
The influence of window length on wLRCL and wLRCL-D.
| Window_length | 24 | 48 | 100 |
|---|---|---|---|
| F1(wLRCL) | 56.66% | 99.51% | 54.42% |
| Recall(wLRCL) | 64.23% | 99.51% | 61.38% |
| F1(wLRCL-D) | 98.24% | 98.40% | 98.75% |
| Recall(wLRCL-D) | 98.24% | 98.40% | 98.75% |
Figure 8Comparisons of recall values on LRCL-D and and wLRCL-D with different window lengths.
Figure 9Comparisons of F1 score on classifications of faults 4 and 5 based on different methods.