Submitted:
09 August 2023
Posted:
10 August 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Max-Pooling Operation
3. Rank Tracking Based Max-Pooling (RTB-MAXP) Engine
4. Cascaded Maximum Based Max-Pooling (CMB-MAXP) Engine
5. Implementations
6. Conclusions
References
- Zhao, Z.; Zheng, P.; Xu, S.; Wu, X. Object detection with deep learning: A review. IEEE trans. neural networks and learning systems 2019, 30, 3212–3232. [Google Scholar] [CrossRef] [PubMed]
- Lee, D.-H. Fully Convolutional Single-Crop Siamese Networks for Real-Time Visual Object Tracking. Electronics 2019, 8, 1084. [Google Scholar] [CrossRef]
- Shawahna, A.; Sait, S.; El-Maleh, A. FPGA-based Accelerators of Deep Learning Networks for Learning and Classification: A Review. IEEE Access 2018, 4, 1–41. [Google Scholar] [CrossRef]
- Huang, J.; Liu, X.; Guo, T.; Zhao, Z. A High-Performance FPGA-Based Depthwise Separable Convolution Accelerator. Electronics 2023, 12, 1571. [Google Scholar] [CrossRef]
- Xie, Y.; Majoros, T.; Oniga, S. FPGA-Based Hardware Accelerator on Portable Equipment for EEG Signal Patterns Recognition. Electronics 2022, 11, 2410. [Google Scholar] [CrossRef]
- Zhang, L.; Tang, X.; Hu, X.; Zhou, T.; Peng, Y. FPGA-Based BNN Architecture in Time Domain with Low Storage and Power Consumption. Electronics 2022, 11, 1421. [Google Scholar] [CrossRef]
- Lomas-Barrie, V.; Silva-Flores, R.; Neme, A.; Pena-Cabrera, M. A Multiview Recognition Method of Predefined Objects for Robot Assembly Using Deep Learning and Its Implementation on an FPGA. Electronics 2022, 11, 696. [Google Scholar] [CrossRef]
- Zhou, H.; Xiao, Y.; Zheng, Z.; Yang, B. YOLOv2-tiny Target Detection System Based on FPGA Platform. ICBAIE 2022, 289–292. [Google Scholar]
- Wang, C.; Bochkovskiy, A.; Liao, H. Scaled-yolov4: Scaling cross stage partial network. Proceedings of the IEEE/cvf conf. comp. vis. and patt. recog. 2021. [Google Scholar]
- Bochkovskiy, A.; Wang, C.; Liao, H. Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 2020. [Google Scholar]
- Rzaev, E.; Khanaev, A.; Amerikanov, A. Neural Network for Real-Time Object Detection on FPGA. ICIEAM, 2021; 719–723. [Google Scholar]
- Archana, V. An FPGA-Based Computation-Efficient Convolutional Neural Network Accelerator. IPRECON, 2022; Kollam, India; 1–4. [Google Scholar] [CrossRef]
- Wang, Z.; Xu, K.; Wu, S.; Liu, L.; Wang, D. Sparse-YOLO: Hardware/Software Co-Design of an FPGA Accelerator for YOLOv2. IEEE Access 8, 116569–116585. [CrossRef]
- Zhao, B.; Chong, Y.; Do, A. Area and Energy Efficient 2D Max-Pooling for Convolutional Neural Network Hardware Accelerator. IECON, 2020; 423–427. [Google Scholar]
- Zhao, D. F-CNN: An FPGA-based framework for training Convolutional Neural Networks. IEEE ASAP, 2016; London, UK; 107–114. [Google Scholar] [CrossRef]











| Multiplexer Switch (MS) | Multiplexer | ||||||
| Input | Output | Output | |||||
| mv-1 | mv | mv+1 | nv | cv[0] | cv[1] | rv | dv |
| 0 | 0 | X | 0 | 0 | 0 | rv | dv+1 |
| 0 | 1 | X | 0 | 0 | 1 | xp(i,j) | 0 |
| 0 | 1 | X | 1 | 0 | 0 | rv | dv+1 |
| 1 | 0 | X | 0 | 0 | 0 | rv | dv+1 |
| 1 | 1 | X | 0 | 1 | 0 | rv+1 | dv+1+1 |
| 1 | 1 | X | 1 | 0 | 0 | rv | dv+1 |
| X | 0 | 0 | 1 | 1 | 1 | rv-1 | dv-1+1 |
| X | 0 | 1 | 1 | 0 | 1 | xp(i,j) | 0 |
| RTB-MAXP | CMB-MAXP | |||
| UA | UP | UA | UP | |
| LUT | 158,515 | 13.4% | 28,765 | 2.43% |
| LUTRAM | - | - | 2,688 | 0.45% |
| FF | 99,342 | 4.2% | 76,906 | 3.25% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).