| Literature DB >> 32545489 |
Ta-Wei Tang1, Wei-Han Kuo1, Jauh-Hsiang Lan1, Chien-Fang Ding2, Hakiem Hsu3, Hong-Tsu Young1.
Abstract
Recently, researchers have been studying methods to introduce deep learning into automated optical inspection (AOI) systems to reduce labor costs. However, the integration of deep learning in the industry may encounter major challenges such as sample imbalance (defective products that only account for a small proportion). Therefore, in this study, an anomaly detection neural network, dual auto-encoder generative adversarial network (DAGAN), was developed to solve the problem of sample imbalance. With skip-connection and dual auto-encoder architecture, the proposed method exhibited excellent image reconstruction ability and training stability. Three datasets, namely public industrial detection training set, MVTec AD, with mobile phone screen glass and wood defect detection datasets, were used to verify the inspection ability of DAGAN. In addition, training with a limited amount of data was proposed to verify its detection ability. The results demonstrated that the areas under the curve (AUCs) of DAGAN were better than previous generative adversarial network-based anomaly detection models in 13 out of 17 categories in these datasets, especially in categories with high variability or noise. The maximum AUC improvement was 0.250 (toothbrush). Moreover, the proposed method exhibited better detection ability than the U-Net auto-encoder, which indicates the function of discriminator in this application. Furthermore, the proposed method had a high level of AUCs when using only a small amount of training data. DAGAN can significantly reduce the time and cost of collecting and labeling data when it is applied to industrial detection.Entities:
Keywords: anomaly detection (AD); automated optical inspection (AOI); defect detection; dual auto-encoder generative adversarial network (DAGAN); generative adversarial network (GAN)
Year: 2020 PMID: 32545489 PMCID: PMC7349725 DOI: 10.3390/s20123336
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Pipelines of GAN-based anomaly detection networks: (a) AnoGAN, (b) GANomaly, and (c) Skip-GANomaly.
Advantages and limitations of AnoGAN, GANomaly, and Skip-GANomaly.
| AnoGAN | GANomaly | Skip-GANomaly | |
|---|---|---|---|
| Advantages | Training without anomaly data. | Significant improvement in detection time. | Better ability of image reconstruction. |
| Limitations | Excessive time to detection. | Cannot reconstruct complex images. | Model collapse during training. |
Figure 2Pipeline of the proposed method (DAGAN).
Figure 3Detection process of the proposed method.
Figure 4MVTec AD dataset for industrial inspection.
Figure 5Production line mobile phone screen glass dataset.
Figure 6Production line wood surface dataset.
AUCs of each category in MVTec AD dataset using AnoGAN, GANomaly, Skip-GANomaly, proposed method (DAGAN), and U-Net auto-encoder.
| Category | AnoGAN | GANomaly | Skip-GANomaly | DAGAN | U-Net |
|---|---|---|---|---|---|
| Bottle | 0.800 | 0.794 | 0.937 | 0.983 | 0.863 |
| Cable | 0.477 | 0.711 | 0.674 | 0.665 | 0.636 |
| Capsule | 0.442 | 0.721 | 0.718 | 0.687 | 0.673 |
| Carpet | 0.337 | 0.821 | 0.795 | 0.903 | 0.774 |
| Grid | 0.871 | 0.743 | 0.657 | 0.867 | 0.857 |
| Hazelnut | 0.259 | 0.874 | 0.906 | 1.00 | 0.996 |
| Leather | 0.451 | 0.808 | 0.908 | 0.944 | 0.870 |
| Metal Nut | 0.284 | 0.694 | 0.79 | 0.815 | 0.676 |
| Pill | 0.711 | 0.671 | 0.758 | 0.768 | 0.781 |
| Screw | 0.10 | 1.00 | 1.00 | 1.00 | 1.00 |
| Tile | 0.401 | 0.72 | 0.85 | 0.961 | 0.964 |
| Toothbrush | 0.439 | 0.700 | 0.689 | 0.950 | 0.811 |
| Transistor | 0.692 | 0.808 | 0.814 | 0.794 | 0.674 |
| Wood | 0.567 | 0.920 | 0.919 | 0.979 | 0.958 |
| Zipper | 0.715 | 0.744 | 0.663 | 0.781 | 0.750 |
Figure 7AUC after testing the proposed method (DAGAN) and three other GAN-based anomaly detection models with the MVTec AD dataset.
Figure 8Heat maps of the MVTec AD dataset generated by the proposed method (DAGAN).
AUCs of glass and wood datasets generated using AnoGAN, GANomaly, Skip-GANomaly, the proposed method (DAGAN), and U-Net auto-encoder.
| Category | AnoGAN | GANomaly | Skip-GANomaly | DAGAN | U-Net |
|---|---|---|---|---|---|
| Glass | 0.543 | 0.600 | 0.618 | 0.853 | 0.828 |
| Wood | 0.716 | 0.915 | 0.797 | 0.925 | 0.886 |
Figure 9Heat maps of the glass and wood datasets generated by the proposed method (DAGAN).
AUCs of training the proposed method (DAGAN) with few data .
|
| Bottle | Tile | Glass | Wood |
|---|---|---|---|---|
| 0 | 0.790 | 0.958 | 0.882 | 0.906 |
| 1 | 0.886 | 0.964 | 0.883 | 0.902 |
| 2 | 0.882 | 0.966 | 0.863 | 0.893 |
| 3 | 0.933 | 0.984 | 0.865 | 0.921 |
| 4 | 0.731 | 0.984 | 0.892 | 0.919 |
| 5 | 0.891 | 0.981 | 0.856 | 0.903 |
| 6 | 0.736 | 0.943 | 0.881 | 0.915 |
| 7 | 0.760 | 0.961 | 0.846 | 0.902 |
Figure 10AUCs of training the proposed method (DAGAN) with few data .