Submitted:
18 June 2026
Posted:
23 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- A leak-safe within-user extrapolation protocol and a paired-metric reporting standard. Splitting by contiguous recording segment and testing on later repetitions removes the window-overlap leakage and within-session interpolation that inflate standard splits; reporting per-window and per-execution balanced accuracy with the rest false-activation rate exposes the onset-transient behaviour a single voted number hides.
- An architectural choice for edge decoding: rather than scaling a parameter-heavy network to learn temporal structure, we offload it into a zero-parameter, population-learned transition grammar combined online with per-window evidence by a causal forward filter. It recovers boundary windows by inference, requires no extra calibration, and keeps the model at a microcontroller-class footprint (28k params, kB int8).
- A controlled, honest benchmark: within-user and cross-subject (LOSO) results on NinaPro DB2 with per-subject variance and significance tests, a classical-baseline and component ablation (which transparently shows masked pretraining contributes little at this data scale and the grammar’s gain over voting is small but significant), and a hardware-agnostic compute-cost breakdown, positioned against recent embedded and foundation-model EMG decoders. Code, configurations, and a synthetic positive-control are released at https://github.com/seanb9/emg-leaksafe-benchmark.
- An acquisition and cross-population study. We isolate signal bandwidth as a significant, model-free accuracy lever ( per-window points from a wider analysis band, ); we validate the decoder as a causal real-time stream; and we extend the leak-safe benchmark to 12 amputees, where we report an honest negative result: we find no evidence that the deep model’s able-bodied advantage over a classical baseline transfers.
2. Related Work
3. Methods
3.1. Data and Preprocessing
3.2. Leak-Safe Windowing and Splits
3.3. Model
3.4. Training
3.5. Causal Sequence-Reasoning Decoder
3.6. Evaluation Metrics
4. Experiments and Results
4.1. Setup
4.2. Within-User Results

| Configuration | Per-window | Per-exec | False-act |
| Classical: TD + LDA | 66.9 ± 8.1 | 94.7 ± 7.8 | – |
| Causal TCN, raw (57k params) | 76.3 ± 7.8 | 95.2 ± 2.1 | – |
| CNN, raw evidence (no logic) | 72.6 ± 7.8 | 94.0 ± 11.4 | – |
| CNN + majority vote () | 73.1 ± 7.7 | 94.0 ± 11.4 | – |
| CNN + majority vote () | 73.7 ± 7.6 | 94.7 ± 9.3 | – |
| CNN + HMM, gate-free (ours) | 75.2 ± 7.8 | 92.7 ± 11.5 | 4.5 ± 2.0 |
| CNN + HMM, scaled-likelihood | 77.4 ± 7.1 | – | 52.3 ± 15.4 |
| ours, no masked pretraining | 75.0 ± 7.6 | 93.4 ± 11.5 | 4.6 ± 2.1 |
4.3. Effect of Sequence Reasoning
4.4. Operating Point



4.5. Cross-Subject (LOSO) Results
4.6. Baselines and Ablations
4.7. Online Behaviour and the Onset-Latency Floor
4.8. Acquisition Bandwidth
4.9. Amputee Benchmark and Cross-Population Transfer
4.10. Cross-Session Re-Donning Gap
4.11. Deployability
| Model | Params | Int8 size | Measured on-device |
| Ours | 28.2 k | 27.5 kB | compute only (MMAC/win) |
| Bioformers [4] | – | kB | GAP8: mJ, ms |
| TinyMyo [6] | M | – | GAP9: mJ |
5. Discussion
6. Conclusions
Reproducibility
Institutional Review Board Statement
Data Availability Statement
Author Contributions
Funding
Conflicts of Interest
Acknowledgments
References
- Atzori, M.; Gijsberts, A.; Castellini, C.; Caputo, B.; Mittaz Hager, A.G.; Elsig, S.; Giatsidis, G.; Bassetto, F.; Müller, H. Electromyography data for non-invasive naturally-controlled robotic hand prostheses. Scientific Data 2014, 1, 1–13. [CrossRef]
- Atzori, M.; Cognolato, M.; Müller, H. Deep learning with convolutional neural networks applied to electromyography data: A resource for the classification of movements for prosthetic hands. Frontiers in Neurorobotics 2016, 10, 9. [CrossRef]
- Jiang, B.; Wu, H.; Xia, Q.; Xiao, H.; Peng, B.; Wang, L.; Zhao, Y. An efficient surface electromyography-based gesture recognition algorithm based on multiscale fusion convolution and channel attention. Scientific Reports 2024, 14, 30867. [CrossRef]
- Burrello, A.; Bianco Morghet, F.; Scherer, M.; Benatti, S.; Benini, L.; Macii, E.; Poncino, M.; Jahier Pagliari, D. Bioformers: Embedding Transformers for Ultra-Low Power sEMG-based Gesture Recognition. In Proceedings of the Design, Automation & Test in Europe Conference (DATE), 2022. [CrossRef]
- Zanghieri, M.; Benatti, S.; Burrello, A.; Kartsch, V.; Conti, F.; Benini, L. Robust Real-Time Embedded EMG Recognition Framework Using Temporal Convolutional Networks on a Multicore IoT Processor. IEEE Transactions on Biomedical Circuits and Systems 2020, 14, 244–256. [CrossRef]
- Fasulo, M.; Spacone, G.; Ingolfsson, T.M.; Li, Y.; Benini, L.; Cossettini, A. TinyMyo: A Tiny Foundation Model for Flexible EMG Signal Processing at the Edge. arXiv preprint arXiv:2512.15729 2025.
- Yang, J.; Soh, M.; Lieu, V.; Weber, D.J.; Erickson, Z. EMGBench: Benchmarking Out-of-Distribution Generalization and Adaptation for Electromyography. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Datasets and Benchmarks Track, 2024. [CrossRef]
- Wang, D.; Shelhamer, E.; Liu, S.; Olshausen, B.; Darrell, T. Tent: Fully Test-Time Adaptation by Entropy Minimization. In Proceedings of the International Conference on Learning Representations (ICLR), 2021.
- Sun, Y.; Wang, X.; Liu, Z.; Miller, J.; Efros, A.; Hardt, M. Test-Time Training with Self-Supervision for Generalization under Distribution Shifts. In Proceedings of the International Conference on Machine Learning (ICML), 2020.
- Barachant, A.; Bonnet, S.; Congedo, M.; Jutten, C. Multiclass Brain–Computer Interface Classification by Riemannian Geometry. IEEE Transactions on Biomedical Engineering 2012, 59, 920–928. [CrossRef]
- Touko, N.; Ellis, M.O.A.; Capone, C.; Burrello, A.; Donati, E.; Manneschi, L. Lightweight Test-Time Adaptation for EMG-Based Gesture Recognition. arXiv preprint arXiv:2601.04181 2026.
- Cognolato, M.; Gijsberts, A.; Gregori, V.; Saetta, G.; Giacomino, K.; Mittaz Hager, A.G.; Gigli, A.; Faccio, D.; Tiengo, C.; Bassetto, F.; et al. Gaze, visual, myoelectric, and inertial data of grasps for intelligent prosthetics. Scientific Data 2020, 7, 43. [CrossRef]
- Betthauser, J.L.; Krall, J.T.; Bannowsky, S.G.; Levay, G.; Kaliki, R.R.; Fifer, M.S.; Thakor, N.V. Stable Responsive EMG Sequence Prediction and Adaptive Reinforcement With Temporal Convolutional Networks. IEEE Transactions on Biomedical Engineering 2020, 67, 1707–1717. [CrossRef]
- Simon, A.M.; Hargrove, L.J.; Lock, B.A.; Kuiken, T.A. A Decision-Based Velocity Ramp for Minimizing the Effect of Misclassifications During Real-Time Pattern Recognition Control. IEEE Transactions on Biomedical Engineering 2011, 58, 2360–2368. [CrossRef]
- Chan, A.D.C.; Englehart, K.B. Continuous Myoelectric Control for Powered Prostheses Using Hidden Markov Models. IEEE Transactions on Biomedical Engineering 2005, 52, 121–124. [CrossRef]
- Rabiner, L.R. A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE 1989, 77, 257–286. [CrossRef]
- Ortiz-Catalan, M.; Rouhani, F.; Brånemark, R.; Håkansson, B. Offline accuracy: A potentially misleading metric in myoelectric pattern recognition for prosthetic control. In Proceedings of the Proc. 37th Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. (EMBC), 2015. [CrossRef]
- Vujaklija, I.; Shalchyan, V.; Kamavuako, E.N.; Jiang, N.; Marateb, H.R.; Farina, D. Online mapping of EMG signals into kinematics by autoencoding. Journal of NeuroEngineering and Rehabilitation 2018, 15, 21. [CrossRef]
- Englehart, K.; Hudgins, B. A Robust, Real-Time Control Scheme for Multifunction Myoelectric Control. IEEE Transactions on Biomedical Engineering 2003, 50, 848–854. [CrossRef]
- Hudgins, B.; Parker, P.; Scott, R.N. A New Strategy for Multifunction Myoelectric Control. IEEE Transactions on Biomedical Engineering 1993, 40, 82–94. [CrossRef]
- He, K.; Chen, X.; Xie, S.; Li, Y.; Dollár, P.; Girshick, R. Masked Autoencoders Are Scalable Vision Learners. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022. [CrossRef]
- Khosla, P.; Teterwak, P.; Wang, C.; Sarna, A.; Tian, Y.; Isola, P.; Maschinot, A.; Liu, C.; Krishnan, D. Supervised Contrastive Learning. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), 2020.
- Bourlard, H.A.; Morgan, N. Connectionist Speech Recognition: A Hybrid Approach; Kluwer Academic Publishers, 1994.
- Bai, S.; Kolter, J.Z.; Koltun, V. An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling. arXiv preprint arXiv:1803.01271 2018.
- Page, E.S. Continuous Inspection Schemes. Biometrika 1954, 41, 100–115. [CrossRef]
- Krasoulis, A.; Kyranou, I.; Erden, M.S.; Nazarpour, K.; Vijayakumar, S. Improved prosthetic hand control with concurrent use of myoelectric and inertial measurements. Journal of NeuroEngineering and Rehabilitation 2017, 14, 71. [CrossRef]
- Palermo, F.; Cognolato, M.; Gijsberts, A.; Müller, H.; Caputo, B.; Atzori, M. Repeatability of grasp recognition for robotic hand prosthesis control based on sEMG data. In Proceedings of the 2017 International Conference on Rehabilitation Robotics (ICORR). IEEE, 2017, pp. 1154–1159. [CrossRef]





| Calib. ratio | Per-window | +sequence logic | Per-exec. |
| 0% | 38.6 ± 7.7 | 42.9 ± 8.5 | 50.6 ± 12.9 |
| 5% | 53.0 ± 7.7 | 59.8 ± 7.8 | 74.9 ± 11.5 |
| 10% | 59.3 ± 7.8 | 66.6 ± 6.9 | 84.3 ± 8.3 |
| 20% | 64.2 ± 8.2 | 71.1 ± 7.1 | 89.1 ± 7.7 |
| 30% | 66.4 ± 8.2 | 73.2 ± 7.6 | 91.3 ± 6.7 |
| Sampling | Band | Per-win. (raw) | Per-win. (+logic) | Per-exec. |
| Hz | 20–120 | 71.9 ± 8.1 | 74.8 ± 7.9 | 93.3 ± 11.5 |
| kHz | 20–120 | 72.4 ± 8.0 | 75.5 ± 7.6 | 92.6 ± 11.5 |
| kHz | 20–450 | 74.4 ± 7.5 | 77.4 ± 7.3 | 95.3 ± 7.9 |
| Method | Per-window | Per-execution |
| Classical TD+LDA | 54.6 ± 16.5 | 74.2 ± 24.6 |
| CNN, raw evidence | 55.0 ± 16.8 | 66.9 ± 26.5 |
| CNN + HMM grammar (ours) | 55.0 ± 18.4 | 63.8 ± 26.0 |
| Able-bodied (ours), reference | 77.4 ± 7.3 | 94.0 ± 11.4 |
| Test day | Day-1 decoder (no adapt.) | + per-don recalibration |
| D1 (within-session) | 93.4 ± 4.3 | 94.0 ± 4.0 |
| D2 | 47.9 ± 17.8 | 94.0 ± 3.8 |
| D3 | 49.4 ± 10.9 | 94.4 ± 2.6 |
| D4 | 47.7 ± 15.2 | 95.3 ± 3.8 |
| D5 | 38.8 ± 17.8 | 93.1 ± 5.3 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).