| Literature DB >> 36010786 |
Xiaorong Zheng1,2, Zhaojian Gu1,2, Caiming Liu1,2, Jiahao Jiang1,2, Zhiwei He1,2, Mingyu Gao1,2.
Abstract
Domain adaptation-based bearing fault diagnosis methods have recently received high attention. However, the extracted features in these methods fail to adequately represent fault information due to the versatility of the work scenario. Moreover, most existing adaptive methods attempt to align the feature space of domains by calculating the sum of marginal distribution distance and conditional distribution distance, without considering variable cross-domain diagnostic scenarios that provide significant cues for fault diagnosis. To address the above problems, we propose a deep convolutional multi-space dynamic distribution adaptation (DCMSDA) model, which consists of two core components: two feature extraction modules and a dynamic distribution adaptation module. Technically, a multi-space structure is proposed in the feature extraction module to fully extract fault features of the marginal distribution and conditional distribution. In addition, the dynamic distribution adaptation module utilizes different metrics to capture distribution discrepancies, as well as an adaptive coefficient to dynamically measure the alignment proportion in complex cross-domain scenarios. This study compares our method with other advanced methods, in detail. The experimental results show that the proposed method has excellent diagnosis performance and generalization performance. Furthermore, the results further demonstrate the effectiveness of each transfer module proposed in our model.Entities:
Keywords: domain adaptation; fault diagnosis; rolling bearing; transfer leaning
Year: 2022 PMID: 36010786 PMCID: PMC9407131 DOI: 10.3390/e24081122
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.738
Figure 1Intelligent fault diagnosis: (a) without dynamic distribution adaptation; (b) with marginal distribution adaptation; (c) with conditional distribution adaptation; and (d) with dynamic distribution adaptation.
Figure 2Framework of the proposed model.
Figure 3Structure of the conditional feature extraction module.
Architecture of the conditional feature extraction module.
| Layer | Kernel Size | Kernel Number | Strides | Output Shape |
|---|---|---|---|---|
| CNN Block1 | (1, 1) | 64 | 1 | (32, 64) |
| Average pooling | (7, 1) | 64 | 1 | (26, 64) |
| CNN Block2 | (1, 1) | 48 | 1 | (32, 48) |
| CNN Block3 | (5, 1) | 64 | 1 | (32, 64) |
| Average pooling | (7, 1) | 64 | 1 | (26, 64) |
| CNN Block4 | (1, 1) | 64 | 1 | (32, 64) |
| CNN Block5 | (3, 1) | 96 | 1 | (32, 96) |
| CNN Block6 | (3, 1) | 96 | 1 | (32, 96) |
| Average pooling | (7, 1) | 96 | 1 | (26, 96) |
Architectures of the marginal and conditional domain discriminators.
| Layer | Output Shape |
|---|---|
| F1: Fully connected marginal domain discriminator layer | (Batch_size, 1024) |
| F2: Fully connected marginal domain discriminator layer | (Batch_size, 1024) |
| F3: Fully connected marginal domain discriminator layer with one sigmoid | (Batch_size, 1) |
| F4: Fully connected conditional domain discriminator layer | (Batch_size, 1024) |
| F5: Fully connected conditional domain discriminator layer | (Batch_size, 1024) |
| F6: Fully connected conditional domain discriminator layer with one sigmoid | (Batch_size, 1) |
Figure 4Experimental setup of PU: (a) electric motor; (b) torque measuring shaft; (c) rolling bearing test module; (d) flywheel; and (e) load motor.
The diagnostic tasks of PU dataset.
| Task Code | 0 | 1 | 2 | 3 |
|---|---|---|---|---|
| Load torque (Nm) | 0.7 | 0.7 | 0.1 | 0.7 |
| Radial force (N) | 1000 | 1000 | 1000 | 400 |
| Speed (rpm) | 1500 | 900 | 1500 | 1500 |
Training parameter settings during the experiments.
| Parameter | Value | Parameter | Value |
|---|---|---|---|
| Epochs | 300 | Sample length | 1024 |
| Batch size | 64 | Marginal feature dimension | 512 |
| Weight decay | 0.00001 | Fused conditional feature dimension | 256 |
| Learning rate | 0.001 | - | - |
Methods and corresponding transfer modules used for fault diagnosis.
| Method | Transfer Module | Adaptive Coefficient |
|---|---|---|
| Resnet | No transfer | No |
| DAN | MK-MMD | No |
| JAN | JMMD | No |
| DANN | Adversarial | No |
| CDAN | Condition-adversarial | No |
| MRAN | Multi-space | No |
| DDAN | MK-MMD and LMMD | Yes |
| Proposed | MK-MMD, LMMD and Multi-space | Yes |
Means of the test accuracies in different tasks with the PU dataset (%).
| Transfer Task | Resnet | DAN | JAN | DANN | CDAN | MRAN | DDAN | Proposed |
|---|---|---|---|---|---|---|---|---|
| 0→1 | 24.27 | 53.13 | 62.92 | 68.93 | 68.18 | 60.25 | 58.16 | 71.35 |
| 0→2 | 92.83 | 94.37 | 94.03 | 93.23 | 94.92 | 93.54 | 93.56 | 95.51 |
| 0→3 | 52.12 | 78.14 | 83.31 | 80.12 | 85.02 | 81.87 | 82.48 | 85.08 |
| 1→0 | 41.13 | 57.65 | 56.50 | 60.12 | 59.83 | 56.56 | 63.32 | 73.63 |
| 1→2 | 45.28 | 65.57 | 69.89 | 67.34 | 66.75 | 68.89 | 67.18 | 69.33 |
| 1→3 | 22.74 | 37.10 | 38.24 | 43.40 | 45.22 | 37.25 | 39.61 | 46.49 |
| 2→0 | 90.96 | 92.03 | 94.47 | 92.60 | 95.05 | 90.33 | 90.63 | 96.33 |
| 2→1 | 30.35 | 57.79 | 65.34 | 68.08 | 68.31 | 64.56 | 57.91 | 68.66 |
| 2→3 | 59.01 | 83.99 | 88.39 | 89.82 | 88.93 | 87.23 | 86.84 | 90.21 |
| 3→0 | 52.07 | 81.18 | 83.68 | 81.81 | 83.01 | 81.37 | 79.66 | 83.06 |
| 3→1 | 34.52 | 47.00 | 44.66 | 50.09 | 43.05 | 47.87 | 51.29 | 56.76 |
| 3→2 | 58.42 | 85.33 | 87.10 | 86.54 | 87.19 | 85.33 | 85.43 | 88.77 |
| Average | 50.31 | 69.44 | 72.38 | 73.51 | 73.79 | 71.25 | 71.34 | 77.10 |
Figure 5Training accuracy of source domain and test (Valid) accuracy of target domain for different models under the 2→0 transfer task: (a) DDAN; (b) JAN; (c) CDAN; and (d) proposed.
Figure 6Test (Valid) accuracy of different models on target domain under the 2→0 transfer task.
Figure 7Feature visualization for different models under the 2→0 transfer task: (a) DDAN; (b) JAN; (c) CDAN; and (d) proposed.
Figure 8Confusion matrix for different models under the 2→0 task: (a) DDAN; (b) JAN; (c) CDAN; and (d) proposed.
Figure 9RoC curves and AuC values of classifiers for different models under the 2→0 task.
Cases and test accuracies of ablation experiments.
| Case | Transfer Module | Adaptive Coefficient | Test Accuracy (%) |
|---|---|---|---|
| Case 1 | MK-MMD, LMMD and marginal feature extraction module | Yes | 55.21 |
| Case 2 | MK-MMD, LMMD and conditional feature extraction module | Yes | 52.76 |
| Case 3 | MK-MMD and two feature extraction modules | Yes | 39.26 |
| Case 4 | LMMD and two feature extraction modules | Yes | 56.44 |
| Case 5 | MK-MMD, LMMD and two feature extraction modules | No | 50.46 |
| Case 6 | Two feature extraction modules | No | 30.54 |
| Proposed | MK-MMD, LMMD and two feature extraction modules | Yes | 57.98 |