Submitted:
17 December 2023
Posted:
18 December 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
- We propose using the SRIF model for supervised training-based SNNs. By retaining the “residual” membrane potential, SRIF enables the networks to distinguish the differences among those membrane potentials that exceed the firing threshold via subtracting their spike values thus enhancing the information encoding capacity of supervised training-based SNNs.
- We present MPR to mitigate the quantization error. By utilizing a non-linear function to modulate the membrane potential close to 0/1 before firing activity triggers, the gap between the potential and its corresponding 0/1 spike value is minified while maintaining the sparse spike activation mechanism of SNNs. To our best knowledge, few works have noticed the quantization error in SNNs, and a simple but effective method for addressing this problem is presented.
- Extensive experiments on both static and dynamic datasets were conducted to verify our method. Results show that the SNN trained with the proposed method is highly effective and efficient compared with other state-of-the-art SNN models, e.g., 96.49% top-1 accuracy and 79.41% top-1 accuracy are achieved on the CIFAR-10 and CIFAR-100. These results of our models even outperform their DNN counterparts surprisingly, and it is very rare that SNNs may have a chance to surpass their DNN counterparts.
2. Related Work
2.1. Learning Methods of Spiking Neural Networks
2.2. Threshold-dependent Batch Normalization
3. Preliminary and Methodology
3.1. “Soft Reset" IF Model
3.2. Membrane Potential Rectificater
- Spike-approaching: the modulated membrane potential, should be closer to the spikes than the original membrane potential, H. This principle ensures quantization error reduction.
- Firing-invariance: for the H less than , the MPR should not produce the greater than and vice versa. This principle ensures the neuron output be consistent with or without using MPR.
| Algorithm 1: Feed-Forward procedures for the “soft reset" IF neuron with MPR. |
Input: the input current, X.
Output: the output spike train, O.
Feed-Forward:
|
4. Experiment
4.1. Datasets and Settings
4.2. Ablation Study for Different Neuron Models
4.3. Addition of MPR
4.4. Comparisons with Other Methods
5. Conclusions
References
- He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition; 2016; pp. 770–778.
- Ronneberger, O., Fischer, P., Brox, T.: U-net: Convolutional networks for biomedical image segmentation. nternational Conference on Medical image computing and computer-assisted intervention; 2015; pp. 234–241.
- Bewley, A., Ge, Z., Ott, L., Ramos, F., Upcroft, B.: Simple online and realtime tracking. 2016 IEEE international conference on image processing (ICIP); IEEE, 2016; pp. 3464–3468.
- Gong, R., Liu, X., Jiang, S., Li, T., Hu, P., Lin, J., Yu, F., Yan, J.: Differentiable soft quantization: Bridging full-precision and low-bit neural networks. In Proceedings of the IEEE/CVF International Conference on Computer Vision; 2019; pp. 4852–4861.
- Li, Y., Dong, X., Wang, W.: Additive powers-of-two quantization: An efficient non-uniform discretization for neural networks. 2019, arXiv:1909.13144.
- Li, Y., Gong, R., Tan, X., Yang, Y., Hu, P., Zhang, Q., Yu, F., Wang, W., Gu, S.: Brecq: Pushing the limit of post-training quantization by block reconstruction. 2021, arXiv:2102.05426.
- He, Y., Zhang, X., Sun, J.: Channel pruning for accelerating very deep neural networks. In Proceedings of the IEEE international conference on computer vision; 2017; pp. 1389–1397.
- olino, A., Pascanu, R., Alistarh, D.: Model compression via distillation and quantization. arXiv 2018, arXiv:1802.05668.
- Fang, W. ang, W., Yu, Z., Chen, Y., Masquelier, T., Huang, T., Tian, Y.: Incorporating learnable membrane time constant to enhance learning of spiking neural networks. In Proceedings of the IEEE/CVF International Conference on Computer Vision; 2021; pp. 2661–2661. [Google Scholar]
- Guo, Y., Huang, X., Ma, Z.: Direct learning-based deep spiking neural networks: a review. Frontiers in Neuroscience 2023, 17, 1209795. [CrossRef] [PubMed]
- Guo, Y., Liu, X., Chen, Y., Zhang, L., Peng, W., Zhang, Y., Huang, X., Ma, Z.: Rmp-loss: Regularizing membrane potential distribution for spiking neural networks. arXiv 2023, arXiv:2308.06787.
- Guo, Y. , Peng, W., Chen, Y., Zhang, L., Liu, X., Huang, X., Ma, Z.: Joint a-snn: Joint training of artificial and 37 spiking neural networks via self-distillation and weight factorization. Pattern Recognition p. 109639 ( 2023.
- Guo, Y., Tong, X., Chen, Y., Zhang, L., Liu, X., Ma, Z., Huang, X.: Recdis-snn: Rectifying membrane potential distribution for directly training spiking neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); 20 June 2022; pp. 326–335.
- Guo, Y., Zhang, Y., Chen, Y., Peng, W., Liu, X., Zhang, L., Huang, X., Ma, Z.: Membrane potential batch normalization for spiking neural networks. arXiv; 2023.
- Li, Y., Deng, S., Dong, X., Gong, R., Gu, S.: A free lunch from ann: Towards efficient, accurate spiking neural networks calibration. nternational Conference on Machine Learning; 2021; pp. 6316–6325.
- Li, Y., Guo, Y., Zhang, S., Deng, S., Hai, Y., Gu, S.: Differentiable spike: Rethinking gradient-descent for training spiking neural networks. Advances in Neural Information Processing Systems 2021, 34, 23426–2021.
- Wu, Y., Deng, L., Li, G., Zhu, J., Xie, Y., Shi, L.: Direct training for spiking neural networks: Faster, larger, better. In Proceedings of the AAAI Conference on Artificial Intelligence; 2019; vol. 33, pp. 1311–1318.
- Khan, M.M., Lester, D.R., Plana, L.A., Rast, A., Jin, X., Painkras, E., Furber, S.B.: Spinnaker: mapping neural networks onto a massively-parallel chip multiprocessor. 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence); 2008; pp. 2849–2856.
- Akopyan, F., Sawada, J., Cassidy, A., Alvarez-Icaza, R., Arthur, J., Merolla, P., Imam, N., Nakamura, Y., Datta, P., Nam, G.J., et al.: Truenorth: Design and tool flow of a 65 mw 1 million neuron programmable neurosynaptic chip. IEEE transactions on computer-aided design of integrated circuits and systems 2015, 34, 1537–1557. [CrossRef]
- Ma, D., Shen, J., Gu, Z., Zhang, M., Zhu, X., Xu, X., Xu, Q., Shen, Y., Pan, G.: Darwin: A neuromorphic hardware co-processor based on spiking neural networks. Journal of Systems Architecture 2017, 77, 43–51. [CrossRef]
- Pei, J., Deng, L., Song, S., Zhao, M., Zhang, Y., Wu, S., Wang, G., Zou, Z., Wu, Z., He, W., et al.: Towards artificial general intelligence with hybrid tianjic chip architecture. Nature 2019, 572, 106–111. [CrossRef]
- Davies, M., Srinivasa, N., Lin, T.H., Chinya, G., Cao, Y., Choday, S.H., Dimou, G., Joshi, P., Imam, N., Jain, S., et al.: Loihi: A neuromorphic manycore processor with on-chip learning. Ieee Micro 2018, 38, 82–99. [CrossRef]
- Han, B., Roy, K.: Deep spiking neural network: Energy efficiency through time based coding. European Conference on Computer Vision; 2020; pp. 388–404.
- Han, B., Srinivasan, G., Roy, K.: Rmp-snn: Residual membrane potential neuron for enabling deeper high-accuracy and low-latency spiking neural network. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition 2020, 13558–13567.
- Kim, S., Park, S., Na, B., Yoon, S.: Spiking-yolo: spiking neural network for energy-efficient object detection. In: . vol. 34, pp. In Proceedings of the AAAI conference on artificial intelligence; 2020; vol. 34, pp. 11270–11277.
- Ledinauskas, E., Ruseckas, J., Juršėnas, A., Burachas, G.: Training deep spiking neural networks. 2020.
- Diehl, P.U., Neil, D., Binas, J., Cook, M., Liu, S.C., Pfeiffer, M.: Fast-classifying, high-accuracy spiking deep networks through weight and threshold balancing. 2015 International joint conference on neural networks (IJCNN); ieee, 2015; pp. 1–8.
- Sengupta, A., Ye, Y., Wang, R., Liu, C., Roy, K.: Going deeper in spiking neural networks: Vgg and residual architectures. Frontiers in neuroscience 2019, 13, 95. [CrossRef]
- Bu, T., Fang, W., Ding, J., Dai, P., Yu, Z., Huang, T.: Optimal ann-snn conversion for high-accuracy and ultra-low-latency spiking neural networks. International Conference on Learning Representations; 2021.
- Neftci, E.O., Mostafa, H., Zenke, F.: Surrogate gradient learning in spiking neural networks: Bringing the power of gradient-based optimization to spiking neural networks. IEEE Signal Processing Magazine 2019, 36, 51–63. [CrossRef]
- Wu, Y., Deng, L., Li, G., Zhu, J., Shi, L.: Spatio-temporal backpropagation for training high-performance spiking neural networks. Frontiers in neuroscience 2018, 12, 331. [CrossRef]
- Deng, S. , Li, Y., Zhang, S., Gu, S.: Temporal efficient training of spiking neural network via gradient re-weighting. preprint arXiv:2202. arXiv 2022, arXiv:2202.11946. [Google Scholar]
- Zheng, H., Wu, Y., Deng, L., Hu, Y., Li, G.: Going deeper with directly-trained larger spiking neural networks. In: . vol. 35, pp. In Proceedings of the AAAI Conference on Artificial Intelligence; 2021; vol. 35, pp. 11062–11070.
- Krizhevsky, A., Nair, V., Hinton, G.: Cifar-10 (canadian institute for advanced research). URL http://www. cs. toronto. edu/kriz/cifar. 2010, 5, 1. Available online: http://www. cs. toronto. edu/kriz/cifar. html.
- Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: A large-scale hierarchical image database. 2009 IEEE conference on computer vision and pattern recognition; Ieee, 2009; pp. 248–255.
- Li, H., Liu, H., Ji, X., Li, G., Shi, L.: Cifar10-dvs: an event-stream dataset for object classification. Frontiers in neuroscience 2017, 11, 309. [CrossRef]
- Rathi, N., Roy, K.: Diet-snn: Direct input encoding with leakage and threshold optimization in deep spiking neural networks. arXiv 2020, arXiv:2008.03658.
- Fang, W., Yu, Z., Chen, Y., Huang, T., Masquelier, T., Tian, Y.: Deep residual learning in spiking neural networks. Advances in Neural Information Processing Systems; 2021; 34, pp. 21056–21069.
- Cubuk, E.D., Zoph, B., Mane, D., Vasudevan, V., Le, Q.V.: Autoaugment: Learning augmentation policies from data. arXiv 2018, arXiv:1805.09501.
- DeVries, T., Taylor, G.W.: Improved regularization of convolutional neural networks with cutout. preprint arXiv:1708. arXiv 2017, arXiv:1708.04552.
- Loshchilov, I., Hutter, F.: Sgdr: Stochastic gradient descent with warm restarts. preprint arXiv:1608. arXiv 2016, arXiv:1608.03983.
- Rathi, N., Srinivasan, G., Panda, P., Roy, K.: Enabling deep spiking neural networks with hybrid conversion and spike timing dependent backpropagation. arXiv 2020, arXiv:2005.01807.
- Lee, C. Lee, C., Sarwar, S.S., Panda, P., Srinivasan, G., Roy, K.: Enabling spike-based backpropagation for training deep neural network architectures. Frontiers in neuroscience 2020, 119. [Google Scholar] [CrossRef] [PubMed]
- Zhang, W., Li, P.: Temporal spike sequence learning via backpropagation for deep spiking neural networks. Advances in Neural Information Processing Systems 2020, 33, 12022–12033.
- Lu, S., Sengupta, A.: Exploring the connection between binary and spiking neural networks. Frontiers in Neuroscience 2020, 14, 535. [CrossRef]
- Park, S., Kim, S., Na, B., Yoon, S.: T2fsnn: Deep spiking neural networks with time-to-first-spike coding. 2020 57th ACM/IEEE Design Automation Conference 2020, 1–6.
- Park, S., Kim, S., Choe, H., Yoon, S.: Fast and efficient information transmission with burst spikes in deep spiking neural networks. 2019 56th ACM/IEEE Design Automation Conference (DAC); 2019; pp. 1–6.
- Kim, J., Kim, H., Huh, S., Lee, J., Choi, K.: Deep neural networks with weighted spikes. Neurocomputing 2018, 311, 373–386. [CrossRef]
- Kugele, A., Pfeil, T., Pfeiffer, M., Chicca, E.: Efficient processing of spatio-temporal data streams with spiking neural networks. Frontiers in Neuroscience 2020, 14, 439. [CrossRef]





| Dataset | Neuron model | Timestep | Accuracy |
|---|---|---|---|
| CIFAR-10 | “Hard Reset" LIF | 2 | 90.36% |
| “Hard Reset" IF | 2 | 90.07% | |
| “Soft Reset" IF (SRIF) | 2 | 90.38% | |
| “Hard Reset" LIF | 4 | 92.22% | |
| “Hard Reset" IF | 4 | 92.04% | |
| “Soft Reset" IF (SRIF) | 4 | 92.46% | |
| “Hard Reset" LIF | 6 | 92.66% | |
| “Hard Reset" IF | 6 | 92.26% | |
| “Soft Reset" IF (SRIF) | 6 | 93.40% | |
| “Hard Reset" LIF | 8 | 92.90% | |
| “Hard Reset" IF | 8 | 92.86% | |
| “Soft Reset" IF (SRIF) | 8 | 94.09% | |
| CIFAR-100 | “Hard Reset" LIF | 2 | 62.67% |
| “Hard Reset" IF | 2 | 63.43% | |
| “Soft Reset" IF (SRIF) | 2 | 63.85% | |
| “Hard Reset" LIF | 4 | 66.00% | |
| “Hard Reset" IF | 4 | 66.95% | |
| “Soft Reset" IF (SRIF) | 4 | 67.90% | |
| “Hard Reset" LIF | 6 | 67.44% | |
| “Hard Reset" IF | 6 | 68.31% | |
| “Soft Reset" IF (SRIF) | 6 | 69.59% | |
| “Hard Reset" LIF | 8 | 67.85% | |
| “Hard Reset" IF | 8 | 69.14% | |
| “Soft Reset" IF (SRIF) | 8 | 69.90% |
| Dataset | Architecture | Method | Timestep | Accuracy |
|---|---|---|---|---|
| CIFAR-10 | ResNet20 | SRIF w/o MPR | 4 | 92.46% |
| SRIF w/ MPR | 4 | 92.94% | ||
| ResNet19 | SRIF w/o MPR | 4 | 95.44% | |
| SRIF w/ MPR | 4 | 96.27% | ||
| CIFAR-100 | ResNet20 | SRIF w/o MPR | 4 | 67.90% |
| SRIF w/ MPR | 4 | 70.63% | ||
| ResNet19 | SRIF w/o MPR | 4 | 77.85% | |
| SRIF w/ MPR | 4 | 78.42% |
| Dataset | Architecture | Method | Timestep | Avg. error |
|---|---|---|---|---|
| CIFAR-10 | ResNet20 | Before MPR | 4 | 0.28 |
| After MPR | 4 | 0.04 | ||
| ResNet19 | Before MPR | 4 | 0.20 | |
| After MPR | 4 | 0.03 | ||
| CIFAR-100 | ResNet20 | Before MPR | 4 | 0.38 |
| After MPR | 4 | 0.05 | ||
| ResNet19 | Before MPR | 4 | 0.32 | |
| After MPR | 4 | 0.04 |
| Dataset | Method | Type | Architecture | Timestep | Accuracy |
|---|---|---|---|---|---|
| CIFAR-10 | SpikeNorm [28] | ANN2SNN | VGG16 | 2500 | 91.55% |
| Hybrid-Train [42] | Hybrid | VGG16 | 200 | 92.02% | |
| Spike-basedBP [43] | SNN training | ResNet11 | 100 | 90.95% | |
| STBP [17] | SNN training | CIFARNet | 12 | 90.53% | |
| TSSL-BP [44] | SNN training | CIFARNet | 5 | 91.41% | |
| PLIF [9] | SNN training | PLIFNet | 8 | 93.50% | |
| Diet-SNN [37] | SNN training | VGG16 | 5 | 92.70% | |
| 10 | 93.44% | ||||
| ResNet20 | 5 | 91.78% | |||
| 10 | 92.54% | ||||
| STBP-tdBN [33] | SNN training | ResNet19 | 2 | 92.34% | |
| 4 | 92.92% | ||||
| 6 | 93.16% | ||||
| ANN* | ANN | ResNet19 | 1 | 96.29% | |
| InfLoR-SNN | SNN training | ResNet19 | 2 | 94.44% | |
| 4 | 96.27% | ||||
| 6 | 96.49% | ||||
| ResNet20 | 5 | 93.01% | |||
| 10 | 93.65% | ||||
| VGG16 | 5 | 94.06% | |||
| 10 | 94.67% | ||||
| CIFAR-100 | BinarySNN [45] | ANN2SNN | VGG15 | 62 | 63.20% |
| Hybrid-Train [42] | Hybrid | VGG11 | 125 | 67.90% | |
| T2FSNN [46] | ANN2SNN | VGG16 | 680 | 68.80% | |
| Burst-coding [47] | ANN2SNN | VGG16 | 3100 | 68.77% | |
| Phase-coding [48] | ANN2SNN | VGG16 | 8950 | 68.60% | |
| Diet-SNN [37] | SNN training | ResNet20 | 5 | 64.07% | |
| VGG16 | 5 | 69.67% | |||
| ANN* | ANN | ResNet19 | 1 | 78.61% | |
| InfLoR-SNN | SNN training | ResNet20 | 5 | 71.19% | |
| VGG16 | 5 | 71.56% | |||
| 10 | 73.17% | ||||
| ResNet19 | 2 | 75.56% | |||
| 4 | 78.42% | ||||
| 6 | 79.51% | ||||
| ImageNet | Hybrid-Train [42] | Hybrid | ResNet34 | 250 | 61.48% |
| SpikeNorm [28] | ANN2SNN | ResNet34 | 2500 | 69.96% | |
| STBP-tdBN [33] | SNN training | ResNet34 | 6 | 63.72% | |
| SEW ResNet [38] | SNN training | ResNet18 | 4 | 63.18% | |
| ResNet34 | 4 | 67.04% | |||
| Spiking ResNet [38] | SNN training | ResNet18 | 4 | 62.32% | |
| ResNet34 | 4 | 61.86% | |||
| InfLoR-SNN | SNN training | ResNet18 | 4 | 64.78% | |
| ResNet34 | 4 | 65.54% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).