Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Puppis: Hardware Accelerator of Single-Shot Multibox Detectors for Edge-Based Applications

Version 1 : Received: 8 August 2023 / Approved: 9 August 2023 / Online: 10 August 2023 (10:25:43 CEST)

A peer-reviewed article of this Preprint also exists.

Vrbaski, V.; Josic, S.; Vranjkovic, V.; Teodorovic, P.; Struharik, R. Puppis: Hardware Accelerator of Single-Shot Multibox Detectors for Edge-Based Applications. Electronics 2023, 12, 4557. Vrbaski, V.; Josic, S.; Vranjkovic, V.; Teodorovic, P.; Struharik, R. Puppis: Hardware Accelerator of Single-Shot Multibox Detectors for Edge-Based Applications. Electronics 2023, 12, 4557.

Abstract

Object detection is a popular image processing technique, widely used in numerous applications for detecting and locating objects in images or videos. While being one of the fastest algorithms for object detection, Single-Shot Multibox Detection (SSD) networks are also computationally very demanding, which limits their usage in real-time edge applications. Even though the SSD post-processing algorithm is not the most complex segment of the overall SSD object detection network, it is still computationally demanding and can become a bottleneck with respect to processing latency and power consumption, especially in edge applications with limited resources. When using hardware accelerators to accelerate backbone CNN processing, the SSD post-processing step implemented in software can become critical for high-end applications where high frame rates are required, as this paper shows. To overcome this problem, we propose Puppis, an architecture for hardware acceleration of SSD post-processing algorithm. As experiments will show, our solution will lead to an average SSD post-processing speedup of 34.42 when compared with a software implementation. Furthermore, execution of a complete SSD network will be on average 45.39 times faster than software implementation when the proposed Puppis SSD hardware accelerator is used together with some existing CNN accelerators.

Keywords

Hardware acceleration; Convolutional Neural Networks; Single-Shot Multibox Detector

Subject

Engineering, Electrical and Electronic Engineering

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.