Submitted:
30 January 2024
Posted:
31 January 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Method
2.1. Spiking neurons
2.2. STDP fine-tune for SNN
2.3. Convolutional SNN
2.4. Loss
3. Experiment results
3.1. Dataset
3.2. Poisson encoder

3.3. Performance of convolutional SNN and comparison against the CNNs
3.4. Sparse weights of SNN and comparison against the CNN pruning
3.5. Ablation Studies
4. Feature visualization
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Maass, W. Networks of spiking neurons: The third generation of neural network models. Neural Networks 1997, 10, 1659–1671. [Google Scholar] [CrossRef]
- Neftci, E.O., Pedroni, B.U., Joshi, S., et al.: Stochastic synapses enable efficient brain-inspired learning machines. Front. Neurosci. 2016, 10, 241. [CrossRef] [PubMed]
- MFolowosele, F., Vogelstein, R.J., Etienne-Cummings, R.: Real-time silicon implementation of V1 in hierarchical visual information processing. In: Biomedical Circuits and Systems Conference, 2008, pp. 181–184. IEEE Press. [CrossRef]
- LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE Press 1999, 86, 2278–2324. [CrossRef]
- Brader, J.M., Senn, W., Fusi, S.: Grid Learning real-world stimuli in a neural network with spike-driven synaptic dynamics. Neural Comput. 2007, 19, 2881–2912. [CrossRef] [PubMed]
- Cao, Y., Chen, Y., Khosla, D.: Spiking deep convolutional neural networks for energy-efficient object recognition. Int. J. Comput. Vis. 2015, 113, 54–66. [CrossRef]
- Diehl, P.U., Neil, D., Binas, J., et al.: Fast-classifying, high-accuracy spiking deep networks through weight and threshold balancing. In: International Joint Conference on Neural Networks (IJCNN) 2015, IEEE, 2015, pp. 1–8. IEEE Press. [CrossRef]
- Hunsberger, E., Eliasmith. Spiking deep networks with LIF neurons. arXiv 2015, arXiv:1510.08829. [Google Scholar]
- Maass, W., Markram, H.: On the computational power of circuits of spiking neurons. J. Comput. Syst. Sci. 2004, 69, 593–616. [CrossRef]
- Lovelace JJ, Rickard JT, Cios KJ. A spiking neural network alternative for the analog to digital converter. In: Neural Networks (IJCNN), The 2010 International Joint Conference On. New Jersey, USA: Institute of Electrical and Electronics Engineers-IEEE: 2010. p. 1–8. [CrossRef]
- Ambard M, Guo B, Martinez D, Bermak A. A spiking neural network for gas discrimination using a tin oxide sensor array. In: Electronic Design, Test and Applications, 2008. DELTA 2008. 4th IEEE International Symposium On. New Jersey, USA: Institute of Electrical and Electronics Engineers-IEEE: 2008. p. 394–397. [CrossRef]
- Bouganis A, Shanahan M. Training a spiking neural network to control a 4-dof robotic arm based on spike timing-dependent plasticity. In: Neural Networks (IJCNN), The 2010 International Joint Conference On. New Jersey, USA: Institute of Electrical and Electronics Engineers-IEEE: 2010. p. 1–8. [CrossRef]
- Alnajjar F, Murase K. Sensor-fusion in spiking neural network that generates autonomous behavior in real mobile robot. In: Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference On. New Jersey, USA: Institute of Electrical and Electronics Engineers-IEEE: 2008. p. 2200–2206. [CrossRef]
- Perez-Carrasco JA, Acha B, Serrano C, Camunas-Mesa L, Serrano-Gotarredona T, Linares-Barranco B. Fast vision through frameless event-based sensing and convolutional processing: Application to texture recognition. Neural Networks IEEE Trans. 2010, 21(4), 609–620. [CrossRef] [PubMed]
- Botzheim J, Obo T, Kubota N. Human gesture recognition for robot partners by spiking neural network and classification learning. In: Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference On. New Jersey, USA: Institute of Electrical and Electronics Engineers-IEEE: 2012. p. 1954–1958. [CrossRef]
- Ratnasingam S, McGinnity TM. A spiking neural network for tactile form based object recognition. In: The 2011 International Joint Conference on Neural Networks (IJCNN). New Jersey, USA: Institute of Electrical and Electronics Engineers-IEEE: 2011. p. 880–885. [CrossRef]
- Fang H, Wang Y, He J. Spiking neural networks for cortical neuronal spike train decoding. Neural Comput. 2009, 22(4), 1060–1085. [CrossRef]
- Li W, Chen H, Guo J, et al. Brain-inspired Multilayer Perceptron with Spiking Neurons. 2022. [CrossRef]
- Zhang D, Zhang T, Jia S, et al. Multi-sacle dynamic coding improved spiking actor network for reinforcement learning. In: Proceedings of the AAAI Conference on Artificial Intelligence. 2022, 36(1): 59-67. [CrossRef]
- Cuadrado J, Rançon U, Cottereau B R, et al. Optical flow estimation from event-based cameras and spiking neural networks. Frontiers in Neuroscience 2023, 17, 1160034. [CrossRef] [PubMed]
- Zou S, Mu Y, Zuo X, et al. Event-based human pose tracking by spiking spatiotemporal transformer. arXiv 2023, arXiv:2303.09681.
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet Classification with Deep Convolutional Neural Networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef]
- Simonyan, K.; Zisserman, A. Very Deep Convolutional Networks for Large-scale Image Recognition. Comput. Sci. 2014, 56, 1–14. [Google Scholar]
- Szegedy, C.; Liu, W.; Jia, Y. Going Deeper with Convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 26 June–1 July 2016. [Google Scholar]
- Wang Z, Guo L, Adjouadi M. A generalized leaky integrate-and-fire neuron model with fast implementation method. International journal of neural systems 2014, 24(05), 1440004. [CrossRef]
- Markram H, Lubke J, Frotscher M, et al. Regulation of synaptic efficacy by coincidence of postsynaptic APS and EPSPS. Science 1997, 275(5297), 213–215. [CrossRef]
- Caporale N, Dan Y. Spike timing–dependent plasticity: a Hebbian learning rule. Annu. Rev. Neurosci. 2008, 31, 25–46. [CrossRef] [PubMed]
- Fang W, Chen Y, Ding J, et al. SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence. Science Advances 2023, 9(40), eadi1480. [CrossRef] [PubMed]
- Dumitru, Ian Goodfellow, Will Cukierski, Yoshua Bengio. (2013). Challenges in Representation Learning: Facial Expression Recognition Challenge. Kaggle.
- Barsoum E, Zhang C, Ferrer C C, et al. Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution. ACM, 2016. [CrossRef]
- Howard A G, Zhu M, Chen B, et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. 2017. [CrossRef]
- K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks for Large-Scale Image Recognition,”. CoRR 2014, 1409.
- K. He, X. Zhang, haoqing Ren, and J. Sun, “Deep Residual Learning for Image Recognition,”. CoRR 2015, 1512.
- C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, “Going Deeper with Convolutions,” in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
- Liu Z, Li J, Shen Z, et al. Learning Efficient Convolutional Networks through Network Slimming. IEEE, 2017. [CrossRef]
- Chen J, Qiu X, Ding C, et al. SAR image classification based on spiking neural network through spike-time dependent plasticity and gradient descent. ISPRS journal of photogrammetry and remote sensing 2022, 188. [CrossRef]









| Layer (type: depth-idx) | Title 2 | Title 3 |
|---|---|---|
| Poisson encoder (alternative) | — | — |
| Sequential: 1-1 └─Conv2d: 2-1 └─BatchNorm2d: 2-2 |
[1, 128, 48, 48] | — |
| [1, 128, 48, 48] [1, 128, 48, 48] |
1,152 256 |
|
| Sequential: 1-2 └─IF Node: 2-3 └─ATan: 3-1 └─MaxPool2d: 2-4 └─Conv2d: 2-5 └─BatchNorm2d: 2-6 └─IF Node: 2-7 └─ATan: 3-2 └─MaxPool2d: 2-8 |
[1, 128, 12, 12] [1, 128, 48, 48] [1, 128, 48, 48] [1, 128, 24, 24] [1, 128, 24, 24] [1, 128, 24, 24] [1, 128, 24, 24] [1, 128, 24, 24] [1, 128, 12, 12] |
— — — — 147,456 256 — — — |
| Sequential: 1-3 └─Flatten: 2-9 └─Dropout: 2-10 └─Linear: 2-11 └─LIF Node: 2-12 └─ATan: 3-3 └─Dropout: 2-13 └─Linear: 2-14 └─LIF Node: 2-15 └─ATan: 3-4 └─Linear: 2-16 └─LIF Node: 2-17 └─ATan: 3-5 |
[1, 7] [1, 18432] [1, 18432] [1, 1152] [1, 1152] [1, 1152] [1, 1152] [1, 128] [1, 128] [1, 128] [1, 7] [1, 7] [1, 7] |
— — — 21,233,664 — — — 147,456 — — 896 — — |
| STDP fine-tune (alternative) | — | — |
| Expression | Angry | Fear | Sad | Neutral | Happy | Surprise | Digust |
|---|---|---|---|---|---|---|---|
| Number of samples | 4953 | 5121 | 6077 | 6198 | 8989 | 4002 | 547 |
| Title 1 | Convolutional SNN | CNN (3 convolutional layers + fully connected layers) |
|---|---|---|
| Total params | 21,531,136 | 50,805,191 |
| Total mult-adds (M) Params size (MB) Train speed (fps) GPU occupancy Energy efficiency (fps/w) |
427.92 86.12 398.73 81% 2.84 |
1266.64 215.3 152.38 96% 0.92 |
| Title 1 | Title 2 | Title 3 | Title 4 |
|---|---|---|---|
| VGG [33] ResNet-34 [34] Inception [35] MobileNet v1 |
CCPCCPCCPCCPFF | 10 | 72.7% |
| 3R4R6R3RPF | 33 | 72.4% | |
| CIPIIPIIPIIPF C3C4C4C4C4CF |
16 21 |
71.6% 66.2% |
|
| Proposed SNN Shallow-CNN |
CPCPFFF | 5 | 60.2% |
| CPCPCPFF | 5 | 59.2% |
| Hyperparameters | Symbol | Value |
|---|---|---|
| Simulation time steps of the conv layer | Tconv | 8 |
| Simulation time steps of STDP Membrane potential time constant of conv LIF Node Membrane potential time constant of STDP Threshold voltage of conv LIF Node Threshold voltage of STDP Reset voltage of conv LIF Node Reset voltage of STDP Learning rate Batch size |
TSTDP τconv τSTDP Vthreshold_conv V threshold_STDP Vreset_conv Vreset_STDP r N |
20 2.0 10.0 1.0 5.0 0.0 0.0 1×e-3 16 |
| Network structure | Conv-SNN (T=4) |
Conv-SNN (T=8) |
ConvSNN (T=8, Poisson encoding) |
ConvSNN (T=8, Poisson encoding + STDP fine-tune) |
|---|---|---|---|---|
| Accuary in Fer2013 | 60.15% | 60.65% | 60.91% | 61.87% |
| Accuary in FER+ | 77.17% | 77.94% | 78.79% | 79.97% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).