Submitted:
22 July 2025
Posted:
22 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
3. Results
4. Discussion
5. Conclusions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| CNN | Convolutional neural network |
| EP | Electronic processor |
| LD | Laser diode |
| LED | Light-emitting diode |
| LRN | Local response normalization |
| OCNN | Optical convolutional neural network |
| PD | Photo detector |
| SPBONN | Smart-pixel-based bidirectional optical neural network |
| SPLM | Spatial light modulator |
| SLM | Smart pixel light modulator |
| SPOCNN | Smart-pixel-based optical convolutional neural network |
References
- Hinton, G.; Deng, L.; Yu, D.; Dahl, G.E.; Mohamed, A.R.; Jaitly, N.; Senior, A.; Vanhoucke, V.; Nguyen, P.; Sainath, T.N.; Kingsbury, B. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups IEEE Signal processing magazine 29, 82-97 (2012).
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning, Nature 521, 436–444 (2015).
- Lecun L.; Bottou L.; Bengio Y.; Haffner P. Gradient-based learning applied to document recognition Proceedings of the IEEE 86 2278-2324 (1998).
- Chetlur S.; Woolley C.; Vandermersch P.; Cohen J.; Tran J.; Catanzaro B.; Shelhamer E. cuDNN: efficient primitives for deep learning, arXiv:1410.0759v3 (2014).
- Han, S.; Pool, J.; Tran, J.; Dally, W. Learning both weights and connections for efficient neural network. Advances in Neural Information Processing Systems, 28 (2015).
- Rhu, M.; Gimelshein, N.; Clemons, J.; Zulfiqar, A.; Keckler, S.W. vDNN: Virtualized deep neural networks for scalable, memory-efficient neural network design. In 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 1-13 (2016).
- Chen, T.; Moreau, T.; Jiang, Z.; Zheng, L.; Yan, E.; Shen, H.; Cowan, M.; Wang, L.; Hu, Y.; Ceze, L.; Guestrin, C. TVM: An au-tomated end-to-end optimizing compiler for deep learning. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18), 578-594 (2018).
- Jouppi, N.P.; Young, C.; Patil, N.; Patterson, D.; Agrawal, G.; Bajwa, R.; Bates, S.; Bhatia, S.; Boden, N.; Borchers, A.; Boyle, R. In-datacenter performance analysis of a tensor processing unit. In Proceedings of the 44th Annual International Symposium on Computer Architecture, 1-12 (2017).
- Colburn, S.; Chu, Y.; Shilzerman, E.; Majumdar, A. Optical frontend for a convolutional neural network. Applied optics, 58 3179-3186 (2019).
- Chang, J.; Sitzmann, V.; Dun, X.; Heidrich, W.; Wetzstein, G. Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification, Scientific reports 8, 12324 (2018).
- Lin, X.; Rivenson, Y.; Yardimci, N.T.; Veli, M.; Luo, Y.; Jarrahi, M.; Ozcan, A. All-optical machine learning using diffractive deep neural networks, Science 361, 1004-1008 (2018).
- Sui, X.; Wu, Q.; Liu, J.; Chen, Q.; Gu, G. A review of optical neural networks, IEEE Access 8, 70773-70783 (2020).
- Goodman, J.W. Introduction to Fourier optics. Roberts and Company publishers (2005).
- Glaser, I. Lenslet array processors, Applied Optics 21, 1271-1280 (1982).
- Ju, Y.G. A scalable optical computer based on free-space optics using lens arrays and a spatial light modulator. Optical and Quantum Electronics, 55, 1-21 (2023).
- Ju, Y.G. Scalable Optical Convolutional Neural Networks Based on Free-Space Optics Using Lens Arrays and a Spatial Light Modulator. Journal of Imaging, 2023, 9(11), p.241.
- Cox, M.A.; Cheng, L.; Forbes, A. Digital micro-mirror devices for laser beam shaping, Proc. SPIE 11043, Fifth Conference on Sensors, MEMS, and Electro-Optic Systems, 110430Y (2019).
- Mihara, K.; Hanatani, K.; Ishida, T.; Komaki, K.; Takayama, R. High Driving Frequency (> 54 kHz) and Wide Scanning An-gle (> 100 Degrees) MEMS Mirror Applying Secondary Resonance For 2K Resolution AR/MR Glasses. 2022 IEEE 35th In-ter-national Conference on Micro Electro Mechanical Systems Conference (MEMS), 477-482 (2022).
- Seitz, P. Smart Pixels, PROCEEDINGS EDMO 2001 / VIENNA, 229-234 (2001).
- Hinton, H.S. , 1996. Progress in the smart pixel technologies. IEEE journal of selected topics in quantum electronics, 2(1), pp.14-23.
- Ju, Y.-G. A Conceptual Study of Rapidly Reconfigurable and Scalable Optical Convolutional Neural Networks Based on Free-Space Optics Using a Smart Pixel Light Modulator. Computers 2025, 14, 111. [Google Scholar] [CrossRef]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet Classification with Deep Convolutional Neural Networks. Advances in Neural Information Processing Systems 2012, 25, 1097–1105. [Google Scholar] [CrossRef]
- Jahns, J. , 1998. VI: Free-Space Optical Digital Computing and Interconnection. Progress in optics, 38, pp.419-513.
- Dally, W.J. and Towles, B.P., 2004. Principles and practices of interconnection networks. Elsevier. p.
- McKeown, N. , 1997. A fast switched backplane for a gigabit switched router. Business Communications Review, 27(12), pp.1-30.
- S. Hamza, J. S. Deogun and D. R. Alexander, "Free space optical multicast crossbar," in Journal of Optical Communications and Networking, vol. 8, no. 1, pp. 1-10, 1 January 2016. [CrossRef]
- Ju, Y.G. Bidirectional Optical Neural Networks Based on Free-Space Optics Using Lens Arrays and Spatial Light Modulator. Micromachines 2024, 15, 701. [Google Scholar] [CrossRef] [PubMed]
- Ju, Y.-G. A Conceptual Study of Rapidly Reconfigurable and Scalable Bidirectional Optical Neural Networks Leveraging a Smart Pixel Light Modulator. Photonics 2025, 12, 132. [Google Scholar] [CrossRef]








| Layer | Type | Kernel Size / Stride | Padding | Output Size | Remarks |
| Input | Input Image | - | - | 227×227×3 | RGB image |
| Conv1 | Convolution | 11×11×3 / 4 | 0 | 55×55×96 | |
| LRN1 | Normalization | - | - | 55×55×96 | Window size =5 |
| Pool1 | Max Pooling | 3×3 / 2 | 0 | 27×27×96 | |
| Conv2 | Convolution | 5×5×48 / 1 | 2 | 27×27×256 | Group split |
| LRN2 | Normalization | - | - | 27×27×256 | Window size =5 |
| Pool2 | Max Pooling | 3×3 / 2 | 0 | 13×13×256 | |
| Conv3 | Convolution | 3×3×128 / 1 | 1 | 13×13×384 | |
| Conv4 | Convolution | 3×3×192 / 1 | 1 | 13×13×384 | Group split |
| Conv5 | Convolution | 3×3×192 / 1 | 1 | 13×13×256 | Group split |
| Pool5 | Max Pooling | 3×3 / 2 | 0 | 6×6×256 | |
| FC6 | Fully Connected | - | - | 4096 | Flattened input size: 6×6×256 = 9216 |
| FC7 | Fully Connected | - | - | 4096 | |
| FC8 | Fully Connected | - | - | 1000 | Softmax output |
| Layer / Operation | Description | Instruction Cycles |
|---|---|---|
| RGB Multicast | Multicast RGB images to 96 kernels (2 per channel × 3) | 6 |
| Conv1 | Convolution with 11×11 kernel, stride 4 | 3 |
| LRN1 | Local Response Normalization, window size 5 | 5 |
| Pool1 | Max pooling with 3×3 window, stride 2 | 8 |
| Rearrangement after Pool1 | Data formatting using multicast (2 instructions) | 2 |
| Conv2 to Pool2 | Conv2, LRN2, and Pool2 combined | 16 |
| Conv3 to Conv5 | Convolution layers 3 to 5 | 12 |
| Pool5 | Max pooling layer | 8 |
| Intermediate Rearrangements | 4 rearrangement operations (2 cycles each) | 8 |
| FC6 to FC8 | Three fully connected layers (SPBONN) | 3 |
| Total | Total instruction cycles required | 71 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).