Sort by
A Unified GF(4)–Symplectic Framework for Quantum Error Correction: A Constructive, Pedagogical Derivation of the Steane [[7,1,3]] Code
Amir Hameed Mir
Posted: 04 December 2025
ESDM–SMTJ: An Entropic Semantic Dynamics Model for Classical Probabilistic Hardware with Superparamagnetic Tunnel Junctions
Ezequiel Lapilover
Posted: 02 December 2025
Design of an Energy-Efficient SHA-3 Accelerator on Artix-7 FPGA for Secure Network Applications
Abdulmunem A. Abdulsamad
,Sándor R. Répás
Posted: 28 November 2025
Large Pages, Large Leaks? Hugepage-Induced Side-Channels vs. Performance Improvements in Cryptographic Computations
Xinyao Li
,Akhilesh Tyagi
Posted: 20 November 2025
The Spike Processing Unit (SPU): An IIR Filter Approach to Hardware-Efficient Spiking Neurons
Hugo Puertas de Araújo
Posted: 18 September 2025
Cooling, Placement, and Virtualization for Sustainability
Pedro Ramos Brandao
Posted: 18 August 2025
An Open Chisel-Based Framework for Hardware Acceleration on High-Performance FPGA Cards
Robin Gay
,Tarek Ould-Bachir
Posted: 13 August 2025
Near-Optimal Multirun March Memory Tests for Neighborhood Pattern-Sensitive Faults in Random-Access Memories
Petru Cascaval
,Doina Cascaval
Posted: 09 July 2025
Extending a Moldable Computer Architecture to Accelerate DL Inference on FPGA
Mirko Mariotti
,Giulio Bianchini
,Igor Neri
,Daniele Spiga
,Diego Ciangottini
,Loriano Storchi
Posted: 27 May 2025
Plücker Conoid-Inspired Geometry for Wave-Based Computing Systems
Arturo Tozzi
Posted: 18 April 2025
Adaptive NVM Word Compression Based on Cache Line Dynamics on Micro-Architecture
Jialin Wang
,Zhen Yang
,Zhenghao Yin
,Yajuan Du
Posted: 15 April 2025
A Survey on Advancements in Scheduling Techniques for Efficient Deep Learning Computations on GPUs
Rupinder Kaur
,Arghavan Asad
,Seham Al Abdul Wahid
,Farah Mohammadi
Posted: 20 February 2025
Benchmarking Hyper-Breakpoints for Efficient Virtual Machine Introspection
Lukas Beierlieb
,Alexander Schmitz
,Christian Dietrich
,Raphael Springer
,Lukas Iffländer
Posted: 03 January 2025
Object Detection Post-Processing Accelerator Based on Co-Design of Hardware and Software
Dengtian Yang
,Lan Chen
,Xiaoran Hao
,Mao Ni
,Ming Chen
,Yiheng Zhang
Deep learning significantly advances object detection. Post process, a critical component of this process, selects valid bounding boxes to represent true targets during inference and assigns boxes and labels to these objects during training to optimize the loss function. However, post process constitutes a substantial portion of the total processing time for a single image. This inefficiency primarily arises from the extensive Intersection over Union (IoU) calculations required between numerous redundant bounding boxes in post-processing algorithms. To reduce the redundant IoU calculations, we introduce a classification prioritization strategy during both training and inference post processes. Additionally, post process involves sorting operations that contribute to inefficiency. To minimize unnecessary comparisons in Top-K sorting, we have improved the bitonic sorter by developing a hybrid bitonic algorithm. These improvements have effectively accelerated post process. Given the similarities between training and inference post processes, we unify four typical post-processing algorithms and design a hardware accelerator based on this framework. Our accelerator achieves at least 7.55 times the speed in inference post process compared to recent accelerators. When compared to the RTX 2080 Ti system, our proposed accelerator offers at least 21.93 times the speed for training post process and 19.89 times for inference post process, thereby significantly enhancing the efficiency of loss function minimization.
Deep learning significantly advances object detection. Post process, a critical component of this process, selects valid bounding boxes to represent true targets during inference and assigns boxes and labels to these objects during training to optimize the loss function. However, post process constitutes a substantial portion of the total processing time for a single image. This inefficiency primarily arises from the extensive Intersection over Union (IoU) calculations required between numerous redundant bounding boxes in post-processing algorithms. To reduce the redundant IoU calculations, we introduce a classification prioritization strategy during both training and inference post processes. Additionally, post process involves sorting operations that contribute to inefficiency. To minimize unnecessary comparisons in Top-K sorting, we have improved the bitonic sorter by developing a hybrid bitonic algorithm. These improvements have effectively accelerated post process. Given the similarities between training and inference post processes, we unify four typical post-processing algorithms and design a hardware accelerator based on this framework. Our accelerator achieves at least 7.55 times the speed in inference post process compared to recent accelerators. When compared to the RTX 2080 Ti system, our proposed accelerator offers at least 21.93 times the speed for training post process and 19.89 times for inference post process, thereby significantly enhancing the efficiency of loss function minimization.
Posted: 05 December 2024
Dynamic Key Replacement Mechanism for Lightweight IoT Microcontrollers to Resist Side-channel Attacks
Chung-Wei Kuo
,Wei Wei
,Chun-Chang Lin
,Yu-Yi Hong
,Jia-Ruei Liu
,Kuo-Yu Tsai
Posted: 29 November 2024
Redfish API And vSphere Hypervisor API: A Unified Framework For Policy-Based Server Monitoring
Vedran Dakić
,Karlo Bertina
,Jasmin Redžepagić
,Damir Regvart
Posted: 17 November 2024
Challenges of the QWERTY Keyboard for Quechua Speakers in the Puno Region in Perú
Henry Juarez Vargas
,Roger Mijael Mansilla Huanacuni
,Fred Torres Cruz
Posted: 09 October 2024
Container Based Electronic Control Unit Virtualisation: A Paradigm Shift Towards a Centralised Automotive E/E Architecture
NIcholas Ayres
,Lipika Deka
,Daniel Paluszczyszyn
Posted: 21 August 2024
Designing a Scalable and Area-Efficient Hardware Accelerator Supporting Multiple PQC Schemes
Heonhui Jung
,Hyunyoung Oh
Posted: 01 July 2024
A Comprehensive Review on Processing-in-Memory Architectures for Deep Neural Networks
Rupinder Kaur
,Arghavan Asad
,Farahnaz Mohammadi
Posted: 21 June 2024
of 4