| Literature DB >> 34960262 |
Lihao Ye1, Xue Ma1, Chenglin Wen2.
Abstract
Aiming at the problem of fault diagnosis when there are only a few labeled samples in the large amount of data collected during the operation of rotating machinery, this paper proposes a fault diagnosis method based on knowledge transfer in deep learning. First, we describe the data collected during the operation as a two-dimensional image with both time and frequency-domain characteristics. Second, we transform the trained source domain model into a shallow model suitable for small samples in the target domain, and we train the shallow model with small samples with labels. Third, we input a large number of unlabeled samples into the shallow model, and the output result of the system is regarded as the label of the input sample. Fourth, we combine the original data and the data annotated by the shallow model to train the new deep CNN fault diagnosis model so as to realize the migration of knowledge from the expert system to the deep CNN. The newly built deep CNN model is used for the online fault diagnosis of rotating machinery. The FFCNN-SVM shallow model tagger method proposed in this paper compares the fault diagnosis results with other transfer learning methods at this stage, and its correct rate has been greatly improved. This method provides new ideas for future fault diagnosis under small samples.Entities:
Keywords: deep CNN; fault diagnosis; knowledge transferring; model transferring; transfer learning
Year: 2021 PMID: 34960262 PMCID: PMC8709426 DOI: 10.3390/s21248168
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1CNN network structure.
Figure 2Schematic diagram of transfer learning principle.
Figure 3FFCNN-SVM model fault diagnosis flowchart.
Figure 4Block diagram of the process of transferring knowledge from a shallow model to a deep learning model.
Figure 5ZHS-2 type multifunctional motor test bench.
Source/Target domain data classification table.
| Domain | Data Set | Number of Data |
|---|---|---|
| Source domain | Training set | 1200 |
| Target domain | Training set | 16 |
| Target domain | Unlabeled set | 1176 |
| Target domain | Test set | 8 |
Figure 6Flow chart of various model transfer methods.
Source domain model and target domain preliminary model network structure.
| Type | Source Domain Model | Target Domain Model | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Input Size | Filter | Number | Padding | Stride | Input Size | Filter | Number | Padding | Stride | |
| Conv1 | 32 × 128 × 8 | (3,3) | 32 | (1,1) | (1,1) | 32 × 128 × 8 | (3,3) | 32 | (1,1) | (1,1) |
| Pool1 | 32 × 128 × 32 | (4,8) | - | (4,8) | - | 32 × 128 × 32 | (4,8) | - | (4,8) | - |
| Conv2 | 8 × 16 × 32 | (3,3) | 32 | (1,1) | (1,1) | 8 × 16 × 32 | (3,3) | 32 | (1,1) | (1,1) |
| Pool2 | 8 × 16 × 32 | (4,8) | - | (4,8) | - | 8 × 16 × 32 | (4,8) | - | (4,8) | - |
| Conv3 | - | - | - | - | - | 2 × 2 × 32 | (3,3) | 32 | (1,1) | (1,1) |
| Pool3 | - | - | - | - | - | 2 × 2 × 32 | (2,2) | - | (1,1) | - |
| Fc1 | 2 × 2 × 32 | - | 64 | - | - | 1 × 1 × 32 | - | 16 | - | - |
| Fc2 | 64 | - | 4 | - | - | 16 | - | 4 | - | - |
Classification accuracy of different methods for unlabeled sample sets.
| Classification Method | Unlabeled Set Accuracy | Parameter Amount |
|---|---|---|
| SVM | 0.392 | - |
| CNN-FC | 0.71003 | 18,595 |
| CNN-SVM | 0.827 | - |
| FFCNN | 0.7253 | 9827 |
|
|
|
|
Final deep CNN structure diagram.
| Type | Input Size | Filter | Number | Padding | Stride |
|---|---|---|---|---|---|
| Conv1 | 32 × 128 × 8 | (3,3) | 32 | (1,1) | (1,1) |
| Pool1 | 32 × 128 × 32 | (4,8) | (4,8) | - | |
| Conv2 | 8 × 16 × 32 | (3,3) | 32 | (1,1) | (1,1) |
| Pool2 | 8 × 16 × 32 | (4,8) | - | (4,8) | - |
| Fc1 | 2 × 2 × 32 | 64 | |||
| Fc2 | 64 | 4 |
Accuracy of the models.
| Target Domain Test Set Accuracy | |
|---|---|
| CNN(Train) | 1 |
| CNN(Test) | 0.75 |
| CNN-ATS(Train) | 1 |
|
|
|
Figure 7The test platform of CRWU.
Source/Target domain data classification table.
| Domain | Data Set | Number of Data |
|---|---|---|
| Source domain | Training set | 1200 |
| Target domain | Training set | 16 |
| Target domain | Unlabeled set | 1176 |
| Target domain | Test set | 8 |
Source domain model and target domain preliminary model network structure.
| Type | Source Domain Model | Target Domain Model | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Input Size | Filter | Number | Padding | Stride | Input Size | Filter | Number | Padding | Stride | |
| Conv1 | 8 × 64 × 3 | (3,3) | 32 | (1,1) | (1,1) | 8 × 64 × 3 | (3,3) | 32 | (1,1) | (1,1) |
| Pool1 | 8 × 64 × 32 | (2,4) | - | (2,4) | - | 8 × 64 × 32 | (2,4) | - | (2,4) | - |
| Conv2 | 4 × 16 × 32 | (3,3) | 32 | (1,1) | (1,1) | 4 × 16 × 32 | (3,3) | 32 | (1,1) | (1,1) |
| Pool2 | 4 × 16 × 32 | (2,4) | - | (2,4) | - | 4 × 16 × 32 | (2,4) | - | (2,4) | - |
| Conv3 | - | - | - | - | - | 2 × 4 × 32 | (3,3) | 32 | (1,1) | (1,1) |
| Pool3 | - | - | - | - | - | 2 × 4 × 32 | (2,4) | - | (1,1) | - |
| Fc1 | 2 × 4 × 32 | - | 64 | - | - | 1 × 1 × 32 | - | 16 | - | - |
| Fc2 | 64 | - | 4 | - | - | 16 | - | 4 | - | - |
Classification accuracy of different methods for unlabeled sample sets.
| Classification Method | Unlabeled Set Accuracy | Parameter Amount |
|---|---|---|
| SVM | 0.5773 | - |
| CNN-FC | 0.7057 | 17,556 |
| CNN-SVM | 0.9846 | - |
| FFCNN | 0.72193 | 11,380 |
|
|
|
|
Accuracy of the models.
| Target Domain Test Set Accuracy | |
|---|---|
| CNN (Train) | 1 |
| CNN (Test) | 0.75 |
| CNN-ATS (Train) | 1 |
|
|
|