Submitted:
13 September 2024
Posted:
13 September 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
| Acronym | Abbreviation |
|---|---|
| AI | Artificial intelligence |
| AP | Average Precision |
| AUC | Area Under the Curve |
| CAD | Computer Aided Design |
| CM | Confusion Matrix |
| CNN | Convolutional Neural Network |
| COCO | Common Objects in COntext |
| CSPNet | Cross-Stage Partial Network |
| CT | Computed Tomography |
| CVAT | Computer Vision Annotation Tools |
| DL | Deep Learning |
| DOF | Degree of Freedom |
| F1 | Precision-recall score |
| FLOPS | Floating-point Operations Per Second |
| FPS | Frames per second |
| FPN | Feature Pyramid Networks |
| FN | False Negative |
| FP | False Positive |
| GPU | Graphics Processing Unit |
| HCPS | Human-Cyber-Physical Systems |
| HMI | Human Machine Interaction |
| IoU | Intersection over Union |
| mAP | Mean Average Precision |
| ML | Machine Learning |
| PANet | Pyramid Aggregation Network |
| ROC | Receiver Operating Characteristic |
| RPN | Region Proposal Network |
| SGD | Stochastic Gradient Descent |
| SPP | Spatial Pyramid Pooling |
| SSD | Single Shot Detector |
| SVM | Support Vector Machine |
| TN | True Negative |
| TP | True Positive |
| VGG | Visual Geometric Group |
| VOC | Visual Object Classes |
2. Machine Vision based Deep Learning Algorithm
2.1. Deep Learning Algorithm (VGG16, YOLOv5, YOLOv8)
2.1.1. VGG
2.1.2. YOLOv5
2.1.3. ResNet (Residual Network)
2.2. Performance Evaluation
2.2.1. Intersection over Union (IoU)
2.2.2. Precision and Recall
- -
- If IoU > 0.5, classify the item detection as true positive (TP).
- -
- If the IoU is less than 0.5, the detection is considered a false positive (FP).
- -
- A false negative (FN) occurs when an image contains a ground truth item that the model fails to detect.
| Actual | |||
| Positive | Negative | ||
| Predicted | Positive | True positive (TP) | False negative (FN) |
| Negative | False positive (FP) | True negative (TN) | |
2.2.3. F1-Score
2.2.4. Mean Average Precision
3. Methodology for Integrating Machine Vision and Robot Manipulator
3.1. Data Preprocessing
3.1.1. Annotation
3.1.2. Dataset Splitting
3.1.3. Augmentation
3.4. Hyperparameter Setting for Model Development
4. Results and Discussion
4.1. Model Performance
4.2. Mean Average Precision (mAP)
4.3. Loss Model
4.4. Precision-Recall Curve
4.5. Class-wise Performance
4.6. Confusion Matrix
5. Testing and Integration YOLOv5, Jetson Nano, and Mitsubishi RV-2F-1D1-S15 Robot Manipulator

| Edge Image Detection (input from Jetson Nano) |
Number of Passes for Grinding and Chamfering (robot manipulator action according to the Jetson Nano input) |
|---|---|
| Sharp | 5 |
| Chamfer | 3 |
| Burrs | 10 |
- The Mitsubishi Electric Melfa RV-2F-1D1-S15 manipulator was connected to the customized grinding equipment. The end effector was connected to the portable grinder.
- A selected metal workpiece with a sharp type was attached to the clamp close to the manipulator for testing of real-time machine vision method.
- A Logitech camera with 720p and 30 fps was set up to capture the tested metal workpiece with a sharp-end feature.
-
NVIDIA Jetson Nano with embedded YOLOv5 model was connected to a PC and camera. The following is a detail of the embedded system:To begin engagement with it. It is necessary to get the most recent operating system (OS) image known as Jetpack SDK. This software package encompasses the Linux Driver Package (L4T), which consists of the Linux operating system, as well as CUDA-X accelerated libraries and APIs. These components are specifically designed to facilitate Deep Learning, Computer Vision, Accelerated Computing, and Multimedia tasks. The website [45] provides comprehensive documentation, including all necessary materials and step-by-step instructions for utilizing the product. In this project, Ubuntu 20.04 has been utilized, encompassing the most recent iterations of CUDA, cuDNN, and TensorRT, which will be discussed in subsequent sections [46].
- A customized electrical board was designed as communication hardware between Jetson Nano and the robot manipulator. The communication was done through GPIO (General Purpose Input/Output) pins on the Jetson Nano device. This circuit board translates the output from GPIO inputs to the PLC controller of the manipulator. A detail of electrical circuit board in presented in Figure 13. The board has three inputs and three outputs. The inputs were from the Jetson Nano machine vision image detection result represented in GPIO input and the outputs for the Mitsubishi Electric Melfa RV-2F-1D1-S15 robot manipulator actuator. For example, if the NVIDIA Jetson Nano detect a sharp email, it will goes to GPIO input 1 and go through to output 1 in Mitsubishi Electric Melfa RV-2F-1D1-S15 robot manipulator.
- An image detection of the metal workpiece is shown. It was successfully detected as a sharp edge with a probability of 88%.
- The manipulator has been programmed initially as an action of the image detection from the Jetson Nano.
- The manipulator does the grinding process according to the result of the Jetson Nano that has been interpreted by the customized electrical circuit and the predetermined manipulator program.

7. Conclusions
- The present study used YOLOv5 for the embedded machine vision algorithm, and will try YOLOv8 in the further study.
- Three different edge condition were used in present study. As there are a number of edge condition in practice such as crack and chip, future study will use more than 3 different edge condition.
- A similar size of the metal workpiece is maintain in the present study. In the future different size (length, height, and weight) will be considered for the real-time machine vision system.
- The Mitsubishi Electric Melfa RV-2F-1D1-S15 robot manipulator, which is equipped with a chamfering tool attached to the tooltip, serves as the medium for automatic intervention. The present study has successfully deliver an experimental study. In the future, in the event of the presence of sharp edges or burrs, the manipulator employs a chamfering tool to modify the surface of the metal workpiece. This process is repeated until the edge of the metal workpiece becomes appropriately blunted, thereby effectively correcting the defect.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Park, M.; Jeong, J. Design and Implementation of Machine Vision-Based Quality Inspection System in Mask Manufacturing Process. Sustainability 2022, 14, 6009. [Google Scholar] [CrossRef]
- Psarommatis, F.; May, G.; Dreyfus, P.-A.; Kiritsis, D. Zero Defect Manufacturing: State-of-the-Art Review, Shortcomings and Future Directions in Research. International Journal of Production Research 2020, 58, 1–17. [Google Scholar] [CrossRef]
- Powell, D.; Magnanini, M.C.; Colledani, M.; Myklebust, O. Advancing Zero Defect Manufacturing: A State-of-the-Art Perspective and Future Research Directions. Computers in Industry 2022, 136, 103596. [Google Scholar] [CrossRef]
- Zhou, J.; Li, P.; Zhou, Y.; Wang, B.; Zang, J.; Meng, L. Toward New-Generation Intelligent Manufacturing. Engineering 2018, 4, 11–20. [Google Scholar] [CrossRef]
- Yang, X.; Han, M.; Tang, H.; Li, Q.; Luo, X. Detecting Defects With Support Vector Machine in Logistics Packaging Boxes for Edge Computing. IEEE Access 2020, 8, 64002–64010. [Google Scholar] [CrossRef]
- C. Hui; B. Xingcan; L. Mingqi Research on Image Edge Detection Method Based on Multi-Sensor Data Fusion. In Proceedings of the 2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA); June 27 2020; pp. 789–792.
- Yang, X.; Han, M.; Tang, H.; Li, Q.; Luo, X. Detecting Defects With Support Vector Machine in Logistics Packaging Boxes for Edge Computing. IEEE Access 2020, 8, 64002–64010. [Google Scholar] [CrossRef]
- Wittmann, J.; Herl, G.; Deggendorf Institute of Technology. Canny-Net: Known Operator Learning for Edge Detection. eJNDT 2023, 28. [Google Scholar] [CrossRef]
- S. Lei; Y. Guo; Y. Liu; F. Li; G. Zhang; D. Yang Detection of Mechanical Defects of High Voltage Circuit Breaker Based on Improved Edge Detection and Deep Learning Algorithms. In Proceedings of the 2022 6th International Conference on Electric Power Equipment - Switching Technology (ICEPE-ST), 15 March 2022; pp. 372–375.
- Zhou, L.; Zhang, L.; Konz, N. Computer Vision Techniques in Manufacturing 2022.
- González, M.; Rodríguez, A.; López-Saratxaga, U.; Pereira, O.; López De Lacalle, L.N. Adaptive Edge Finishing Process on Distorted Features through Robot-Assisted Computer Vision. Journal of Manufacturing Systems 2024, 74, 41–54. [Google Scholar] [CrossRef]
- Han, D.; Mulyana, B.; Stankovic, V.; Cheng, S. A Survey on Deep Reinforcement Learning Algorithms for Robotic Manipulation. Sensors 2023, 23, 3762. [Google Scholar] [CrossRef]
- Ichnowski, J.; Avigal, Y.; Satish, V.; Goldberg, K. Deep Learning Can Accelerate Grasp-Optimized Motion Planning. Sci. Robot. 2020, 5, eabd7710. [Google Scholar] [CrossRef]
- Qiu, S.; Yang, Z.; Huang, L.; Zhang, X. Design and Implementation of Temperature Verification System Based on Robotic Arm. J. Phys.: Conf. Ser. 2022, 2203, 012015. [Google Scholar] [CrossRef]
- Ban, S.; Lee, Y.J.; Yu, K.J.; Chang, J.W.; Kim, J.-H.; Yeo, W.-H. Persistent Human–Machine Interfaces for Robotic Arm Control Via Gaze and Eye Direction Tracking. Advanced Intelligent Systems 2023, 5, 2200408. [Google Scholar] [CrossRef]
- Sekkat, H.; Tigani, S.; Saadane, R.; Chehri, A. Vision-Based Robotic Arm Control Algorithm Using Deep Reinforcement Learning for Autonomous Objects Grasping. Applied Sciences 2021, 11, 7917. [Google Scholar] [CrossRef]
- Kijdech, D.; Vongbunyong, S. Pick-and-Place Application Using a Dual Arm Collaborative Robot and an RGB-D Camera with YOLOv5. IJRA 2023, 12, 197. [Google Scholar] [CrossRef]
- Aein, S.; Thu, T.; Htun, P.; Paing, A.; Htet, H. YOLO Based Deep Learning Network for Metal Surface Inspection System. In; 2022; pp. 923–929 ISBN 978-981-16-8128-8.
- Li, J.; Su, Z.; Geng, J.; Yin, Y. Real-Time Detection of Steel Strip Surface Defects Based on Improved YOLO Detection Network. IFAC-PapersOnLine 2018, 51, 76–81. [Google Scholar] [CrossRef]
- Xu, Y.; Zhang, K.; Wang, L. Metal Surface Defect Detection Using Modified YOLO. Algorithms 2021, 14, 257. [Google Scholar] [CrossRef]
- Ahmed, S.; Bhatti, M.T.; Khan, M.G.; Lövström, B.; Shahid, M. Development and Optimization of Deep Learning Models for Weapon Detection in Surveillance Videos. Applied Sciences 2022, 12, 5772. [Google Scholar] [CrossRef]
- Fahd Al-Selwi, H.; Hassan, N.; Ab Ghani, H.B.; Binti Amir Hamzah, N.A.; Bin, A.; Aziz, A. Face Mask Detection and Counting Using You Only Look Once Algorithm with Jetson Nano and NVDIA Giga Texel Shader Extreme. IJ-AI 2023, 12, 1169. [Google Scholar] [CrossRef]
- Abdul Hassan, N.F.; Abed, A.A.; Abdalla, T.Y. Face Mask Detection Using Deep Learning on NVIDIA Jetson Nano. IJECE 2022, 12, 5427. [Google Scholar] [CrossRef]
- D. -L. Nguyen; M. D. Putro; K. -H. Jo Facemask Wearing Alert System Based on Simple Architecture With Low-Computing Devices. IEEE Access 2022, 10, 29972–29981. [Google Scholar] [CrossRef]
- H. Feng; G. Mu; S. Zhong; P. Zhang; T. Yuan Benchmark Analysis of YOLO Performance on Edge Intelligence Devices. In Proceedings of the 2021 Cross Strait Radio Science and Wireless Technology Conference (CSRSWTC); October 11 2021; pp. 319–321.
- Saponara, S.; Elhanashi, A.; Gagliardi, A. Implementing a Real-Time, AI-Based, People Detection and Social Distancing Measuring System for Covid-19. J Real Time Image Process 2021, 18, 1937–1947. [Google Scholar] [CrossRef] [PubMed]
- H. -S. Son; D. -K. Kim; S. -H. Yang; Y. -K. Choi Real-Time Power Line Detection for Safe Flight of Agricultural Spraying Drones Using Embedded Systems and Deep Learning. IEEE Access 2022, 10, 54947–54956. [Google Scholar] [CrossRef]
- Qasim, M.; Ismael, O.Y. Shared Control of a Robot Arm Using BCI and Computer Vision. JESA 2022, 55, 139–146. [Google Scholar] [CrossRef]
- MIZUMOTO, Y.; ASAKAWA, N.; Morishige, K.; TAKEUCHI, Y. Automation of Chamfering with Industrial Robot : Application of Improved Intelligent Holder. The proceedings of the JSME annual meeting 2000, 3, 537–538. [Google Scholar] [CrossRef]
- Giuffrida, G.; Meoni, G.; Fanucci, L. A YOLOv2 Convolutional Neural Network-Based Human–Machine Interface for the Control of Assistive Robotic Manipulators. Applied Sciences 2019, 9, 2243. [Google Scholar] [CrossRef]
- Nantzios, G.; Baras, N.; Dasygenis, M. Design and Implementation of a Robotic Arm Assistant with Voice Interaction Using Machine Vision. Automation 2021, 2, 238–251. [Google Scholar] [CrossRef]
- Pranoto, K.A.; Caesarendra, W.; Petra, I. Burrs and Sharp Edge Detection of Metal Workpiece Using CNN Image Classification Method for Intelligent Manufacturing Application.
- Ibrahim, A.A.M.S.; Tapamo, J.R. Transfer Learning-Based Approach Using New Convolutional Neural Network Classifier for Steel Surface Defects Classification. Scientific African 2024, 23, e02066. [Google Scholar] [CrossRef]
- Jarkas, O.; Hall, J.; Smith, S.; Mahmud, R.; Khojasteh, P.; Scarsbrook, J.; Ko, R.K.L. ResNet and Yolov5-Enabled Non-Invasive Meat Identification for High-Accuracy Box Label Verification. Engineering Applications of Artificial Intelligence 2023, 125, 106679. [Google Scholar] [CrossRef]
- Chen, C.; Gu, H.; Lian, S.; Zhao, Y.; Xiao, B. Investigation of Edge Computing in Computer Vision-Based Construction Resource Detection. Buildings 2022, 12, 2167. [Google Scholar] [CrossRef]
- Roy, A.M.; Bhaduri, J. DenseSPH-YOLOv5: An Automated Damage Detection Model Based on DenseNet and Swin-Transformer Prediction Head-Enabled YOLOv5 with Attention Mechanism. Advanced Engineering Informatics 2023, 56, 102007. [Google Scholar] [CrossRef]
- Zhang, Y.-F.; Ren, W.; Zhang, Z.; Jia, Z.; Wang, L.; Tan, T. Focal and Efficient IOU Loss for Accurate Bounding Box Regression 2022.
- Yap, M.H.; Hachiuma, R.; Alavi, A.; Brüngel, R.; Cassidy, B.; Goyal, M.; Zhu, H.; Rückert, J.; Olshansky, M.; Huang, X.; et al. Deep Learning in Diabetic Foot Ulcers Detection: A Comprehensive Evaluation. Computers in Biology and Medicine 2021, 135, 104596. [Google Scholar] [CrossRef] [PubMed]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, June 2016; pp. 770–778. [Google Scholar]
- Veit, A.; Wilber, M.; Belongie, S. Residual Networks Behave Like Ensembles of Relatively Shallow Networks.
- Padilla, R.; Passos, W.L.; Dias, T.L.B.; Netto, S.L.; Da Silva, E.A.B. A Comparative Analysis of Object Detection Metrics with a Companion Open-Source Toolkit. Electronics 2021, 10, 279. [Google Scholar] [CrossRef]
- Chicco, D.; Jurman, G. The Advantages of the Matthews Correlation Coefficient (MCC) over F1 Score and Accuracy in Binary Classification Evaluation. BMC Genomics 2020, 21, 6. [Google Scholar] [CrossRef] [PubMed]
- Lindholm, A.; Wahlström, N.; Lindsten, F.; Schön, T.B. Machine Learning: A First Course for Engineers and Scientists, 1st ed.; Cambridge University Press: Cambridge, UK, 2022; ISBN 978-1-108-91937-1. [Google Scholar]
- Cai, W.; Xiong, Z.; Sun, X.; Rosin, P.L.; Jin, L.; Peng, X. Panoptic Segmentation-Based Attention for Image Captioning. Applied Sciences 2020, 10, 391. [Google Scholar] [CrossRef]
- Nvidia Developer JetPack SDK. Available online: https://developer.nvidia.com/embedded/jetpack (accessed on 9 September 2024).
- NVIDIA Corporation NVIDIA Jetson Nano. Available online: https://www.nvidia.com/en-us/autonomous-machines/embedded-systems/jetson-nano/product-development/ (accessed on 9 September 2024).



















| Total number of annotations | Sharp edge | Chamfer edge | Burrs edge |
|---|---|---|---|
| 300 | 100 | 100 | 100 |
| 100% | 33.33% | 33.33% | 33.33% |
| Number of images | Training | Validation | Testing |
|---|---|---|---|
| 300 | 210 | 45 | 45 |
| 100% | 70% | 15% | 15% |
| Hyperparameter | Value/Description |
|---|---|
| Batch size | 8 |
| Epochs | 24 |
| Image size | 224 x 224 |
| Learning rate (YOLOv5) | 0.01 |
| Learning rate (VGG16) | 0.001 |
| Learning rate (ResNet) | 0.001 |
| Optimizer | Adam |
| Model | Accuracy | mAP | Avg. Precision | Avg. Recall | Avg. F1-Score |
|---|---|---|---|---|---|
| VGG-16 | 0.92 | 0.95 | 0.91 | 0.93 | 0.92 |
| ResNet | 0.92 | 0.85 | 0.92 | 0.91 | 0.92 |
| Yolov5 | 0.97 | 0.98 | 0.98 | 0.97 | 0.97 |
| Model | Class | Precision | Recall | F1-Score |
|---|---|---|---|---|
| VGG-16 | Burrs | 0.90 | 0.89 | 0.89 |
| Sharp | 0.92 | 0.91 | 0.91 | |
| Chamfer | 0.91 | 0.90 | 0.90 | |
| ResNet | Burrs | 0.87 | 0.85 | 0.86 |
| Sharp | 0.89 | 0.88 | 0.88 | |
| Chamfer | 0.88 | 0.88 | 0.88 | |
| Yolov5 | Burrs | 0.94 | 0.93 | 0.93 |
| Sharp | 0.95 | 0.94 | 0.94 | |
| Chamfer | 0.93 | 0.92 | 0.92 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).