Submitted:
10 June 2025
Posted:
11 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- a)
- This study builds an HVAC temperature detection dataset containing 19 temperature categories and one category for device shutdown, supporting real-time recognition of set environmental temperatures.
- b)
- A YOLO-based detection network with Wavelet Pool enhances multi-scale feature fusion and improves accuracy in recognizing digital HVAC temperature values.
- c)
- Performance comparison experiments use the custom dataset and include several conventional lightweight YOLO models. The proposed network shows the best overall performance across all metrics.
- d)
- Ablation experiments examine the influence of each improved module on detection accuracy and stability.
2. Related Works
3. Data Resources and Platform
3.1. Inspection Robot Platform
3.2. HVAC Temperature Detection Dataset
3.3. Loss Function and Evaluation Metrics
4. Design of Detection Network
4.1. Model Enhancement Strategies
4.2. Architecture of WCA-YOLO
- (1)
- Wavelet Pool module
- (2)
- C3k2 Module
- (3)
- SPPF Module
- (4)
- C2PSA Module
- (5)
- Upsample Module
- (6)
- Detect
5. Experimental Results and Analysis
5.1. HVAC Temperature Image Detection
5.2. Performance Comparison of Different Detection Models
5.3. Ablation Experiment
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Ahmed, A; Imran, A S; Manaf, A, et al. Enhancing wrist abnormality detection with yolo: Analysis of state-of-the-art single-stage detection models. Biomedical Signal Processing and Control, 2024, 93: 106144. [CrossRef]
- Moussaoui, H; Akkad, N E; Benslimane, M, et al. Enhancing automated vehicle identification by integrating YOLO v8 and OCR techniques for high-precision license plate detection and recognition. Scientific Reports, 2024, 14(1): 14389. [CrossRef]
- Lu M, Sheng W, Zou Y, et al. WSS-YOLO: An improved industrial defect detection network for steel surface defects. Measurement, 2024, 236: 115060.
- Gündüz, M. Ş., & Işık, G. A new YOLO-based method for real-time crowd detection from video and performance analysis of YOLO models. Journal of Real-Time Image Processing, 2023, 20(1), 5.
- Shen, X., Shi, G., Ren, H., & Zhang, W. Biomimetic vision for zoom object detection based on improved vertical grid number YOLO algorithm. Frontiers in Bioengineering and Biotechnology, 2022, 10, 905583. [CrossRef]
- Agarwal, V., & Bansal, G. Automatic number plate detection and recognition using YOLO world. Computers and Electrical Engineering, 2024, 120, 109646.
- Mimura, K., Nakamura, K., Yasukawa, K., Sibert, E., Ohta, J., Kitazawa, T., & Kato, Y. Applicability of object detection to microfossil research: Implications from deep learning models to detect microfossil fish teeth and denticles using YOLO-v7. Earth and Space Science, 2024, 11(1), e2023EA003122. [CrossRef]
- Zhu, X.; Wang, R.; Shi, W.; Liu, X.; Ren, Y.; Xu, S.; Wang, X. Detection of Pine-Wilt-Disease-Affected Trees Based on Improved YOLO v7. Forests, 2024, 15, 691. https://doi.org/10.3390/f15040691.
- Zhou Z, Hu Y, Yang X, et al. YOLO-based marine organism detection using two-terminal attention mechanism and difficult-sample resampling. Applied Soft Computing, 2024, 153: 111291.
- Kusetogullari H, Yavariabdi A, Hall J, et al. Digitnet: a deep handwritten digit detection and recognition method using a new historical handwritten digit dataset. Big Data Research, 2021, 23: 100182. [CrossRef]
- Lu J, Yu R, Zhang H, et al. Highly reflective metal industrial parts surface detection with polarization vision and modified YOLO. Digital Signal Processing, 2025, 163: 105238.
- Zhang Y, Zhao H, Li X, et al. MERS-Net: A Lightweight And Efficient Remote Sensing Image Object Detector. IEEE Geoscience and Remote Sensing Letters, 2025. https://doi.org/10.1109/LGRS.2025.3569672.
- Cai S, Zhang X, Mo Y. A Lightweight underwater detector enhanced by Attention mechanism, GSConv and WIoU on YOLOv8. Scientific Reports, 2024, 14(1): 25797. [CrossRef]
- Qu Y, Zhang F. FWL-YOLO: a lightweight model for barcode detection and recognition in express delivery waybills. Signal, Image and Video Processing, 2025, 19(3): 261.
- Li W, Xiao L, Yao S, et al. ED-YOLO: an object detection algorithm for drone imagery focusing on edge information and small object features. Multimedia Systems, 2025, 31(3): 1-15.
- Lan X, Liu L, Wang X. Dal-yolo: a multi-target detection model for UAV-based road maintenance integrating feature pyramid and attention mechanisms. Journal of Real-Time Image Processing, 2025, 22(3): 105.
- Bhavana N, Kodabagi M M, Kumar B M, et al. POT-YOLO: Real-Time Road Potholes Detection using Edge Segmentation based Yolo V8 Network. IEEE Sensors Journal, 2024. [CrossRef]
- Pan W, Sun X, Wang Y, et al. Enhanced photovoltaic panel defect detection via adaptive complementary fusion in YOLO-ACF. Scientific Reports, 2024, 14(1): 26425.
- Han Y, Wang F, Wang W, et al. EDN-YOLO: Multi-scale traffic sign detection method in complex scenes. Digital Signal Processing, 2024, 153: 104615.
- Zhou S, Zhou H, Qian L. A multi-scale small object detection algorithm SMA-YOLO for UAV remote sensing images. Scientific Reports, 2025, 15(1): 9255.
- Lei Y, Wang X, An A, et al. Deeplab-YOLO: a method for detecting hot-spot defects in infrared image PV panels by combining segmentation and detection. Journal of Real-Time Image Processing, 2024, 21(2): 52. [CrossRef]
- Liu Z, Zhang Y, He J, et al. Enhancing Infrared Small Target Detection: A Saliency-Guided Multi-Task Learning Approach. IEEE Transactions on Intelligent Transportation Systems, 2025. 26, (3):3603-3618. [CrossRef]
- Zhang Q, Guo W, Lin M. LLD-YOLO: a multi-module network for robust vehicle detection in low-light conditions. Signal, Image and Video Processing, 2025, 19(4): 1-11. [CrossRef]
- Wang, Yu, and Xiaodong Xiang. GMS-YOLO: an enhanced algorithm for water meter reading recognition in complex environments. Journal of Real-Time Image Processing, 2024, 21(5): 173.
- Jiao B, Wang Y, Wang P, et al. RS-YOLO: An efficient object detection algorithm for road scenes. Digital Signal Processing, 2025, 157: 104889. [CrossRef]
- Karaca Aydemir B K, Telatar Z, Güney S, et al. Detecting and classifying breast masses via YOLO-based deep learning. Neural Computing and Applications, 2025: 1-28.
- Mela, Jose Luis, and Carlos García Sánchez. Yolo-based power-efficient object detection on edge devices for USVs. Journal of Real-Time Image Processing, 2025,22(3): 1-14. [CrossRef]
- Mao G, Wang K, Du H, et al. SRS-YOLO: Improved YOLOv8-Based Smart Road Stud Detection. IEEE Transactions on Intelligent Transportation Systems, 2025. https://doi.org/10.1109/TITS.2025.3545942.
- Zhao Y, Lv W, Xu S, et al. Detrs beat yolos on real-time object detection. //Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2024: 16965-16974.
- Chen J, Kao S, He H, et al. Run, Don’t Walk: Chasing Higher FLOPS for Faster Neural Networks. //Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2023: 12021-12031.







| Name | Version |
|---|---|
| OS | Ubuntu 16.04 |
| CPU | Intel(R) Core(TM) i5-12400F |
| RAM | 32GB |
| GPU | Nvidia RTX 3060 |
| Driver | 515.65.01 |
| CUDA | 11.3.1 |
| python | 3.9 |
| torch | 1.10.1+cu111 |
| torchvision | 0.11.2++cu111 |
| Args | Value | Args | Value |
|---|---|---|---|
| epochs | 500 | imgsz | 640 |
| Lr0 | 0.01 | lrf | 0.01 |
| box | 7.5 | batch | 16 |
| optimizer | SGD | mosaic | 1.0 |
| Algorithms | FPS | Precision(%) | recall(%) | mAP@0.5(%) | mAP@0.5:0.95(%) |
|---|---|---|---|---|---|
| YOLOv3-Tiny | 286 | 63.1 | 61.5 | 69.7 | 63.5 |
| YOLOv5n | 270 | 51.0 | 46.8 | 48.9 | 45.0 |
| YOLOv5s | 200 | 73.5 | 66.9 | 74.8 | 69.2 |
| YOLOv7-Tiny | 222 | 54.7 | 59.2 | 62.7 | 57.1 |
| YOLOv8n | 185 | 98.2 | 98.2 | 99.4 | 96.7 |
| WCA-YOLO | 161 | 99.4 | 99.3 | 99.5 | 96.7 |
| Modules used | FPS | Precision(%) | recall(%) | mAP@0.5(%) | mAP@0.5:0.95(%) |
|---|---|---|---|---|---|
| Base | 185 | 98.2 | 98.2 | 99.4 | 96.7 |
| HGNetv2 [29]+C3k2 | 147 | 98.6 | 98.4 | 99.5 | 96.6 |
| HGNetv2+C3k2+FasterBlock [30]+Wavelet Pool | 154 | 98.2 | 98.0 | 99.3 | 96.3 |
| C3k2+gCONV | 159 | 98.8 | 98.0 | 99.5 | 96.0 |
| C3k2+DynamicCONV+HGNetv2 | 141 | 98.1 | 98.8 | 99.5 | 96.5 |
| WCA-YOLO(Wavelet Pool+C3k2) | 161 | 99.4 | 99.3 | 99.5 | 96.7 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).