Submitted:
15 April 2026
Posted:
16 April 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Following a systematic literature search protocol (Supplementary File S1: novelty_search_protocol.md), the first publicly documented IDS classifier deployment on a Cortex-M class MCU paired with a general-purpose NPU (Neural-ART), achieving 0.29–0.46 ms inference at an estimated 44–69 µJ per inference.
- A four-dataset multi-seed study (5–20 seeds per arm, see Table 3 caption) with paired Wilcoxon signed-rank tests under Holm–Bonferroni family-wise error control, covering NSL-KDD, UNSW-NB15, CICIDS2017, and IoT-23.
- A non-MLP TinyCNN baseline (Conv2D ) under the same Neural-ART operator constraints, enabling same-hardware comparison.
- Empirical validation of practical SNN–ANN approximation on commercial NPU silicon across four datasets, with 99% prediction agreement between FP32 and INT8 models on NSL-KDD.
- A QCFS sweep () justifying and quantifying Floor-triggered CPU fallback (+17.6% latency).
2. Related Work
3. System Design
3.1. SNN–ANN Equivalence
3.2. Neural-ART NPU
3.3. Model Architecture
3.4. Datasets and Training
3.5. NPU Compilation and Operator Mapping
4. Experiments
4.1. Classification Results
4.2. Energy, Latency, and Memory
4.3. INT8 Quantization Robustness

4.4. QCFS Hyperparameter Sweep
4.5. Statistical Significance
4.6. CICIDS2017 and IoT-23 Results
4.7. Model Capacity Is Not the Bottleneck
4.8. When INT8 Equivalence Breaks Down
5. Discussion and Conclusions
References
- Chehade, A.; Ragusa, E.; Gastaldo, P.; Zunino, R. Energy-Efficient Deep Learning for Traffic Classification on Microcontrollers. In Proceedings of the Proc. IEEE ISCC, 2025. [Google Scholar]
- Diab, A.; Chehade, A.; Ragusa, E.; Gastaldo, P.; Zunino, R.; Baghdadi, A.; Rizk, M. Intrusion Detection on Resource-Constrained IoT Devices with Hardware-Aware ML and DL. arXiv 2025, arXiv:2512.02272. [Google Scholar]
- Bu, T.; Fang, W.; Ding, J.; Dai, P.; Yu, Z.; Huang, T. Optimal ANN-SNN Conversion for High-accuracy and Ultra-low-latency Spiking Neural Networks. In Proceedings of the Proc. ICLR, 2022. [Google Scholar]
- Jiang, H.; Anumasa, S.; De Masi, G.; Xiong, H.; Gu, B. A Unified Optimization Framework of ANN-SNN Conversion: Towards Optimal Mapping from Activation Values to Firing Rates. Proceedings of the Proc. ICML 2023, Vol. 202, PMLR. [Google Scholar]
- Bu, T.; Li, M.; Yu, Z. Inference-Scale Complexity in ANN-SNN Conversion for High-Performance and Low-Power Applications. In Proceedings of the Proc. IEEE/CVF CVPR, 2025. [Google Scholar]
- Ngo, D.M.; Lightbody, D.; Temko, A.; Murphy, C.C.; Popovici, E. HH-NIDS: Hardware Heterogeneous Network Intrusion Detection System Using MAX78000. Future Internet 2022, 15, 9. [Google Scholar] [CrossRef]
- Zahm, W.; Nishibuchi, G.; Jose, A.; Chelian, S.; Vasan, S. Low-Power Cybersecurity Attack Detection Using Deep Learning on Neuromorphic Technologies. Technical report, CSIAC, 2024. [Google Scholar]
- Farooq, M.A.; Rafique, A.; Fahmy, S.A.; Arora, A. High Throughput Low Latency Network Intrusion Detection on FPGAs: A Raw Packet Approach. In Proceedings of the Proc. IEEE IPDPSW Edge-IIoT dataset, up to 1162M inferences/sec on Virtex Ultrascale+ FPGA, 2025; pp. 1201–1207. [Google Scholar] [CrossRef]
- Wang, Z.; et al. An efficient intrusion detection model based on convolutional spiking neural network. Scientific Reports 2024, 14, 7054. [Google Scholar] [CrossRef]
- Prajwalasimha, S.N.; et al. Event-Driven Intrusion Detection Systems using Spiking Neural Networks for Edge and IoT Security. In Proceedings of the Proc. IEEE ICSCSA, 2025. [Google Scholar] [CrossRef]
- Karthik, M.G.; et al. Energy-efficient intrusion detection with a protocol-aware transformer–spiking hybrid model. Scientific Reports 2026, 16, 7095. [Google Scholar] [CrossRef]
- STMicroelectronics. ST Neural-ART NPU Concepts UM3225 2025.
- Tavallaee, M.; Bagheri, E.; Lu, W.; Ghorbani, A.A. A Detailed Analysis of the KDD CUP 99 Data Set. In Proceedings of the Proc. IEEE CISDA, 2009; pp. 1–6. [Google Scholar] [CrossRef]
- Moustafa, N.; Slay, J. UNSW-NB15: A Comprehensive Data Set for Network Intrusion Detection Systems. In Proceedings of the Proc. MilCIS, 2015; pp. 1–6. [Google Scholar] [CrossRef]
- Sharafaldin, I.; Lashkari, A.H.; Ghorbani, A.A. Toward Generating a New Intrusion Detection Dataset and Intrusion Traffic Characterization. In Proceedings of the Proc. ICISSP, 2018; pp. 108–116. [Google Scholar] [CrossRef]
- Engelen, G.; Rimmer, V.; Joosen, W. Troubleshooting an Intrusion Detection Dataset: the CICIDS2017 Case Study. In Proceedings of the Proc. IEEE S&P Workshops (SPW), 2021; pp. 7–12. [Google Scholar] [CrossRef]
- Garcia, S.; Parmisano, A.; Erquiaga, M.J. IoT-23: A Labeled Dataset with Malicious and Benign IoT Network Traffic. Zenodo Dataset 2020. Dataset landing page. Available online: https://www.stratosphereips.org/datasets-iot23. [CrossRef]
- STMicroelectronics. ONNX Toolbox Support, 2025. ST Edge AI Developer Cloud Embedded Documentation.
- STMicroelectronics. Application Note AN5946; How to Optimize Low-Power Modes on STM32N6 MCUs. 2025.
- STMicroelectronics. STEDGEAI-DC: Artificial Intelligence Developer Cloud for ST Microcontrollers, Neural-ART Accelerator, Microprocessors, and Smart Sensors. Data Brief; 2025. [Google Scholar]
- STMicroelectronics. AI: Getting Started with ST Edge AI Developer Cloud. STM32 Wiki 2025. [Google Scholar]

| Work | Platform | Class | Task | Acc. | Lat. | Energy | NPU |
|---|---|---|---|---|---|---|---|
| Ngo [6] | MAX78000 | AI-MCU | bin. | 98.6% | — | 18 mW* | CNN eng. |
| Zahm [7] | Akida | ASIC | multi | 98.4% | — | ∼1 W* | Neurom. |
| Chehade [1] | STM32F7 | MCU | multi | 96.6% | 31 ms | 7.86 mJ | None |
| Diab [2] | RPi 3B+ | SBC | multi | 95.3% | 27 ms | ∼6.75 mJ‡ | None |
| Farooq [8]§ | Xilinx FPGA | FPGA | bin. | — | <1 µs | — | — |
| This work | STM32N6 | MCU | 5/10/15/5 | 78.6/64.7/91.9/75.6%† | 0.29 ms | 44 µJ | NPU |
| ONNX Operator | NPU Support | Epoch Type | Notes |
|---|---|---|---|
| Gemm (INT8) | ✔ | HW | Fused weight + bias |
| Relu | ✔ | HW | Fused with preceding Gemm |
| Clip | ✔ | HW | Used by QCFS |
| Floor | ✗ | SW (float) | CPU fallback, +0.08 ms |
| QuantizeLinear | ✔ | HW/Hyb | At model boundary |
| DequantizeLinear | ✔ | HW/Hyb | At model boundary |
| Dataset | Model | Overall | Macro F1 | NPU? |
|---|---|---|---|---|
| NSL-KDD | ReLU MLP | 78.57±1.28 | 58.91±2.80 | ✔ |
| QCFS MLP | 78.14±1.08 | 58.28±2.70 | Partial | |
| TinyCNN† | 80.32±0.48 | 60.69±0.77 | ✔ | |
| Rand. Forest | 73.84±0.19 | 47.13±0.33 | ✗ | |
| UNSW-NB15 | ReLU MLP | 64.67±0.55 | 40.18±1.02 | ✔ |
| QCFS MLP | 64.67±0.49 | 39.94±0.72 | Partial | |
| TinyCNN† | 63.28±0.23 | 38.12±0.47 | ✔ | |
| Rand. Forest | 69.46±0.10 | 48.63±0.38 | ✗ | |
| CICIDS2017 | ReLU MLP | 91.89±1.21 | 56.35±2.80 | ✔ |
| QCFS MLP | 90.99±0.48 | 57.65±2.38 | Partial | |
| TinyCNN† | 90.57±0.13 | 56.74±0.13 | ✔ | |
| IoT-23 | ReLU MLP | 75.59±2.71 | 66.41±1.50 | ✔ |
| QCFS MLP | 77.65±1.18 | 67.40±0.52 | Partial |
| Model | Dataset | Time | Energy | HW | Hyb | SW | Flash | RAM |
|---|---|---|---|---|---|---|---|---|
| (ms) | (µJ) | (KB) | (KB) | |||||
| ReLU FP32 | NSL-KDD | 1.24 | — | 0 | 0 | 11 | 466.4 | 2.17 |
| ReLU INT8 | NSL-KDD | 0.46 | 69 | 5 | 1 | 2 | 137.7 | 1.25 |
| ReLU FP32 | UNSW-NB15 | 1.23 | — | 0 | 0 | 11 | 461.9 | 2.14 |
| ReLU INT8 | UNSW-NB15 | 0.29 | 44 | 4 | 0 | 0 | 120.6 | 0.50 |
| ReLU INT8 | CICIDS2017 | 0.42 | 63 | 4 | 0 | 0 | 120.6 | 0.50 |
| ReLU INT8 | IoT-23 | 0.38 | 57 | 4 | 0 | 0 | 105.0 | 0.50 |
| QCFS INT8 | NSL-KDD | 0.54 | 81 | 13 | 1 | 14 | 138.0 | 2.00 |
| Layer | Cosine Sim | MAE | |
|---|---|---|---|
| Relu_0 (256) | 0.667 | 0.435 | 43.7 |
| Relu_1 (256) | 0.655 | 0.438 | 62.6 |
| Relu_2 (128) | 0.683 | 0.400 | 65.2 |
| Logits (5) | 0.978 | 0.103 | 19.8 |
| Dataset | ||||
|---|---|---|---|---|
| NSL-KDD OA | 75.48±0.92 | 75.03±0.63 | 75.14±0.74 | 75.22±1.07 |
| NSL-KDD MF1 | 52.40±1.59 | 51.17±0.83 | 50.94±1.45 | 51.07±1.86 |
| UNSW-NB15 OA | 64.85±0.20 | 65.76±0.19 | 65.59±0.14 | 65.56±0.54 |
| UNSW-NB15 MF1 | 40.64±0.20 | 41.36±0.25 | 41.23±0.36 | 41.28±0.29 |
| Dataset | Comparison | p | Reject? | ||
|---|---|---|---|---|---|
| NSL-KDD | ReLU vs. QCFS () | 0.227 | 0.227 | ✗ | |
| ReLU vs. TinyCNN () | 0.027 | 0.055 | ✗ | ||
| UNSW-NB15 | ReLU vs. QCFS () | 0.846 | 0.846 | ✗ | |
| ReLU vs. TinyCNN () | 0.002 | 0.004 | ✔ | ||
| CICIDS2017 | ReLU vs. QCFS () | 0.312 | 0.312 | ✗ | |
| ReLU vs. TinyCNN () | 0.063 | 0.125 | ✗ | ||
| IoT-23 | ReLU vs. QCFS () | 0.438 | 0.438 | ✗ |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).