Submitted:
23 September 2023
Posted:
25 September 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Performance Improvement: The proposed simple autoencoder architecture achieved a high level of performance comparable to that of State-of-the-Art (SOTA) methods. Despite its simplicity, the autoencoder demonstrated effectiveness in texture defect detection, proving that efficient defect detection can be achieved without the need for complex deep learning models.
- Integration of Techniques: This study employs a hybrid approach that combines deep learning and image processing methodologies to address texture defect detection. Specifically, deep learning was applied to achieve denoising and reconstruction tasks, whereas image processing methods were used to extract pertinent texture features and facilitate defect detection. This fusion of techniques yields notable benefits, including the elimination of the need for extensive data training. Consequently, this study proposes a streamlined and efficient methodology for texture defect detection achieved through the integration of deep learning and image processing techniques.
- Experimentation and Analysis: Detailed experiments and analyses were conducted using the necessary parameters. Various parameters were adjusted and compared to optimize texture defect detection performance. This provides insights into the parameters that impact the performance most significantly and offers practical guidelines for real-world applications.
2. Related Work
2.1. Anomaly Detection with Reconstruction
2.2. Defect Detection
3. Texture Defect Detection
3.1. Normal Texture Image Reconstruction
3.2. Application of Fourier Transform
3.3. Difference Images and Thresholding
4. Experimental Results
4.1. Datasets
4.2. Training Details
4.3. Performance Evaluation and Ablation Study
5. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Schlegl, T.; Seeböck, P.; Waldstein, S. M.; Schmidt-Erfurth, U.; Langs, G. Unsupervised anomaly detection with generative adversarial networks to guide marker discovery. In Proceedings of the International Conference on Information Processing in Medical Imaging, Boone. NC. USA, 25 June–30 June. 2017. 146-157. [Google Scholar]
- Schlegl, T.; Seeböck, P.; Waldstein, S. M.; Langs, G.; Schmidt-Erfurth, U. f-AnoGAN: Fast unsupervised anomaly detection with generative adversarial networks. Medical image analysis 2017, 54, 30–44. [Google Scholar] [CrossRef] [PubMed]
- Akçay, S.; Atapour-Abarghouei, A.; Breckon, T. P. GANomaly: Semi-supervised anomaly detection via adversarial training. In Proceedings of the Asian Conference on Computer Vision, Perth. Australia, 2 December–6 December 2018; pp. 622–637. [Google Scholar]
- Akçay, S.; Atapour-Abarghouei, A.; Breckon, T. P. Skip-GANomaly: Skip connected and adversarially trained encoder-decoder anomaly detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Budapest, Hungary, 14 July–19 July 2018; pp. 39–42. [Google Scholar]
- Gong, D.; Liu, L.; Le, V.; Saha, B.; Mansour, M. R.; Venkatesh, S.; Hengel, A. V. D. Memorizing normality to detect anomaly: Memory-augmented deep autoencoder for unsupervised anomaly detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Korea, 27 October–2 November 2019; pp. 1705–1714. [Google Scholar]
- Perera, P.; Nallapati, R.; Xiang, B. Ocgan: One-class novelty detection using gans with constrained latent representations. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Long Beach, CA, 16 June–20 June 2019; pp. 2898–2906. [Google Scholar]
- Zhao, Y.; Chen, Z.; Gao, X.; Song, W.; Xiong, Q.; Hu, J.; Zhang, Z. Plant disease detection using generated leaves based on DoubleGAN. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2021, 19, 1817–1826. [Google Scholar] [CrossRef]
- Si. J.; Kim. S. Chili Pepper Disease Diagnosis via Image Reconstruction Using GrabCut and Generative Adversarial Serial Autoencoder. arXiv. 2023, arXiv:2306.12057. [Google Scholar]
- Si. J.; Kim. S., Traffic Accident Detection in First-Person Videos Based on Depth and Background Motion Estimation. Journal of Korean Institute of Information Technology (JKIIT). 2021, 19, 25–34. [Google Scholar] [CrossRef]
- Tsai, D. M.; Huang, C. K. Defect detection in electronic surfaces using template-based Fourier image reconstruction. IEEE Transactions on Components, Packaging and Manufacturing Technology. 2018, 9, 163–172. [Google Scholar] [CrossRef]
- Si, J.; Kim., S. Surface Anomaly Detection of Wood Grain Image Using Fourier Transform : A Preliminary Study. Proceedings of Korean Institute of Information Technology Conference, Jeju, Korea, 1 December–3 December 2022; 2022; pp. 86–87. [Google Scholar]
- Zavrtanik, V.; Kristan, M.; Skočaj, D. Draem-a discriminatively trained reconstruction embedding for surface anomaly detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Virtual, 11 October–17 October 2021; pp. 8330–8339. [Google Scholar]
- Liang, Y.; Zhang, J.; Zhao, S.; Wu, R.; Liu, Y.; Pan, S. Omni-frequency channel-selection representations for unsupervised anomaly detection. arXiv. 2023, arXiv:2203.00259. [Google Scholar] [CrossRef] [PubMed]
- Jang, J.; Hwang, E.; Park, S. H. N-pad: Neighboring pixel-based industrial anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, 17 June–21 June 2023; pp. 4364–4373. [Google Scholar]
- Si, J.; Kim., S. Difference Image Alignment Technique of Reconstruction Method for Detecting Defects in Thermal Image of Solar Cells. Journal of Korean Institute of Information Technology (JKIIT). 2023, 21, 11–19. [Google Scholar]
- Tsai, C. C.; Wu, T. H.; Lai, S. H. Multi-scale patch-based representation learning for image anomaly detection and segmentation. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, Waikoloa, Hawaii, 4 January–8 January 2022; pp. 3992–4000. [Google Scholar]
- Liu, T.; Li, B.; Zhao, Z.; Du, X.; Jiang, B.; Geng, L. Reconstruction from edge image combined with color and gradient difference for industrial surface anomaly detection. arXiv. 2022, arXiv:2210.14485. [Google Scholar]
- Shi, Y.; Yang, J.; Qi, Z. Unsupervised anomaly segmentation via deep feature reconstruction. Neurocomputing. 2021, 424, 9–22. [Google Scholar] [CrossRef]
- Jinlei, H.; Yingying, Z.; Qiaoyong, Z.; Di, X.; Shiliang, P.; Hong, Z. Divide-and-assemble: Learning block-wise memory for unsupervised anomaly detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11 October–17 October 2021; pp. 8791–8800. [Google Scholar]
- Bergmann, P.; Fauser, M.; Sattlegger, D.; Steger, C. MVTec AD--A comprehensive real-world dataset for unsupervised anomaly detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Long Beach, CA, 16 June–20 June 2019; pp. 9592–9600. [Google Scholar]







| Train (Normal) | Test (Normal) | Test (Defect) | |
|---|---|---|---|
| Carpet | 280 | 84 | 89 |
| Grid | 264 | 63 | 57 |
| Leather | 245 | 96 | 92 |
| Tile | 230 | 99 | 84 |
| Wood | 247 | 57 | 60 |
| Carpet | 7 | 8 | 9 | 10 | 11 | 12 |
|---|---|---|---|---|---|---|
| 37 | 0.736 | 0.727 | 0.720 | 0.715 | 0.712 | 0.713 |
| 38 | 0.732 | 0.731 | 0.730 | 0.726 | 0.725 | 0.731 |
| 39 | 0.763 | 0.758 | 0.758 | 0.762 | 0.766 | 0.771 |
| 40 | 0.815 | 0.820 | 0.822 | 0.821 | 0.825 | 0.822 |
| 41 | 0.869 | 0.871 | 0.874 | 0.864 | 0.845 | 0.836 |
| 42 | 0.845 | 0.827 | 0.812 | 0.783 | 0.759 | 0.743 |
| Grid | 18 | 19 | 20 | 21 | 22 | 23 |
| 42 | 0.907 | 0.815 | 0.920 | 0.931 | 0.925 | 0.913 |
| 43 | 0.918 | 0.918 | 0.921 | 0.919 | 0.915 | 0.902 |
| 44 | 0.919 | 0.934 | 0.939 | 0.933 | 0.922 | 0.903 |
| 45 | 0.928 | 0.934 | 0.937 | 0.929 | 0.919 | 0.889 |
| 46 | 0.923 | 0.931 | 0.929 | 0.929 | 0.915 | 0.890 |
| 47 | 0.931 | 0.924 | 0.926 | 0.921 | 0.906 | 0.867 |
| Leather | 4 | 5 | 6 | 7 | 8 | 9 |
| 1 | 0.910 | 0.932 | 0.924 | 0.901 | 0.892 | 0.871 |
| 2 | 0.955 | 0.964 | 0.975 | 0.900 | 0.824 | 0.788 |
| 3 | 0.968 | 0.951 | 0.888 | 0.847 | 0.825 | 0.793 |
| 4 | 0.953 | 0.914 | 0.887 | 0.845 | 0.785 | 0.760 |
| 5 | 0.946 | 0.909 | 0.859 | 0.800 | 0.752 | 0.722 |
| 6 | 0.948 | 0.890 | 0.846 | 0.787 | 0.745 | 0.716 |
| Tile | 1 | 2 | 3 | 4 | 5 | 6 |
| 37 | 0.685 | 0.898 | 0.818 | 0.766 | 0.696 | 0.643 |
| 38 | 0.695 | 0.915 | 0.820 | 0.748 | 0.685 | 0.637 |
| 39 | 0.727 | 0.896 | 0.786 | 0.743 | 0.685 | 0.631 |
| 40 | 0.736 | 0.932 | 0.786 | 0.741 | 0.679 | 0.625 |
| 41 | 0.735 | 0.914 | 0.794 | 0.735 | 0.667 | 0.625 |
| 42 | 0.733 | 0.923 | 0.791 | 0.723 | 0.655 | 0.625 |
| Wood | 7 | 8 | 9 | 10 | 11 | 12 |
| 1 | 0.875 | 0.897 | 0.929 | 0.953 | 0.976 | 0.962 |
| 2 | 0.886 | 0.910 | 0.939 | 0.932 | 0.938 | 0.935 |
| 3 | 0.914 | 0.918 | 0.944 | 0.940 | 0.939 | 0.946 |
| 4 | 0.906 | 0.930 | 0.917 | 0.918 | 0.918 | 0.917 |
| 5 | 0.901 | 0.925 | 0.911 | 0.904 | 0.912 | 0.910 |
| 6 | 0.900 | 0.911 | 0.898 | 0.906 | 0.915 | 0.912 |
| AnoGAN [1] | memAE [5] |
OCGAN [6] |
GANomaly [3] | Skip-GANomaly [4] | DAAD [19] |
Ours | |
|---|---|---|---|---|---|---|---|
| Carpet | 0.337 | 0.386 | 0.348 | 0.699 | 0.795 | 0.866 | 0.874 |
| Grid | 0.871 | 0.805 | 0.855 | 0.708 | 0.657 | 0.957 | 0.939 |
| Leather | 0.451 | 0.423 | 0.624 | 0.842 | 0.908 | 0.862 | 0.975 |
| Tile | 0.401 | 0.718 | 0.806 | 0.794 | 0.850 | 0.882 | 0.932 |
| Wood | 0.567 | 0.954 | 0.959 | 0.834 | 0.919 | 0.982 | 0.976 |
| Average | 0.525 | 0.657 | 0.718 | 0.775 | 0.826 | 0.910 | 0.939 |
| Normal Reconstructed Template | X | O | O |
|---|---|---|---|
| Fourier Transform | O | X | O |
| Carpet | 0.791 | 0.659 | 0.874 |
| Grid | 0.885 | 0.508 | 0.939 |
| Leather | 0.344 | 0.634 | 0.975 |
| Tile | 0.819 | 0.505 | 0.932 |
| Wood | 0.923 | 0.867 | 0.976 |
| Average | 0.752 | 0.634 | 0.939 |
| Filters | Butterworth | Ideal (ours) |
|---|---|---|
| Carpet | 0.828 | 0.874 |
| Grid | 0.920 | 0.939 |
| Leather | 0.969 | 0.975 |
| Tile | 0.881 | 0.932 |
| Wood | 0.963 | 0.976 |
| Average | 0.912 | 0.939 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).