Submitted:
20 March 2026
Posted:
24 March 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- We investigate the robustness of our proposed PDE-CNN-VIB architecture under common image corruptions using the CIFAR-10-C benchmark.
- We extend the evaluation of the proposed architecture beyond the standard CIFAR-10 setting to the corrupted CIFAR-10-C benchmark, providing a broader picture of its robustness behavior.
- We demonstrate improved corruption robustness compared to a baseline CNN, achieving higher mean corruption accuracy across multiple corruption types.
- We provide corruption-specific and category-wise analyses of robustness, highlighting substantial gains under high-frequency noise perturbations.
- We analyze the computational cost of the proposed model by comparing training time, inference latency, and parameter count with a baseline CNN.
2. Proposed Architecture
2.1. PDE Regularization Layer
2.2. Convolutional Feature Adaptation
2.3. VIB Module
2.4. CNN Backbone
3. Materials and Methods
3.1. CIFAR-10 Dataset
3.2. CIFAR-10-C Corruption Benchmark
3.3. Training Protocol
3.4. Evaluation Metrics
4. Results
4.1. Clean CIFAR-10 Performance
4.2. Robustness on CIFAR-10-C
4.3. Corruption Category Analysis
4.4. Computational Cost Analysis
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Use of Artificial Intelligence
Conflicts of Interest
Abbreviations
| CNN | Convolutional Neural Network |
| PDE | Partial Differential Equation |
| VIB | Variational Information Bottleneck |
| mCA | Mean Corruption Accuracy |
| NLL | Negative Log-Likelihood |
| ECE | Expected Calibration Error |
| KL | Kullback–Leibler Divergence |
| CIFAR | Canadian Institute for Advanced Research |
| MPS | Metal Performance Shaders |
References
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [CrossRef]
- Krizhevsky, A.; Sutskever, I.; Hinton, G. ImageNet Classification with Deep Convolutional Neural Networks. Advances in Neural Information Processing Systems 2012, 10, 1097–1105.
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, 770–778.
- Hendrycks, D.; Dietterich, T. Benchmarking Neural Network Robustness to Common Corruptions and Perturbations. In Proceedings of the International Conference on Learning Representations (ICLR), New Orleans, LA, USA, March 2019. [CrossRef]
- Geirhos, R.; Rubisch, P.; Michaelis, C.; Bethge, M.; Wichmann, F.; Brendel, W. Generalisation in Humans and Deep Neural Networks. In Proceedings of the 32nd International Conference on Neural Information Processing Systems, Montréal, QC, Canada, December 2018, 7538–7550.
- Krizhevsky, A. Learning Multiple Layers of Features from Tiny Images. Technical Report, University of Toronto, Toronto, ON, Canada, 2009.
- Aghababaeyan, N. Towards Reliable Image Classification: A Systematic Robustness Analysis of CNN and Classical Models Under Natural Corruptions. Master’s Thesis, Concordia University, Montreal, Canada, 2025.
- Mitra, P.; Schwalbe, G.; Klein, N. Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study. arXiv preprint arXiv:2405.20876, 2024. [CrossRef]
- Tran, A.T.; Zeevi, T.; Payabvash, S. Strategies to Improve the Robustness and Generalizability of Deep Learning Segmentation and Classification in Neuroimaging. BioMedInformatics 2025, 5, 20. [CrossRef]
- Huang, Z. Trustworthy Machine Learning under Distribution Shifts. arXiv 2025, arXiv:2512.23524. [CrossRef]
- Sahakyan, V.; Melkonyan, V.; Gharagyozyan, G.; Avetisyan, A. Enhancing Image Recognition with Pre-Defined Convolutional Layers Based on PDEs. Programming and Computer Software 2023, 49(3), 192–197. [CrossRef]
- Tishby, N.; Pereira, F.; Bialek, W. The Information Bottleneck Method. In Proceedings of the 37th Annual Allerton Conference on Communication, Control, and Computing, Monticello, IL, USA, 1999, 368–377. [CrossRef]
- Harremoës, P.; Tishby, N. The Information Bottleneck Revisited or How to Choose a Good Distortion Measure. 2007 IEEE International Symposium on Information Theory (ISIT), Nice, France, 2007, 566–570. [CrossRef]
- Shwartz-Ziv, R.; Tishby, N. The Role of the Information Bottleneck in Representation Learning. International Conference on Learning Representations (ICLR) Workshop on Deep Learning Theory, Toulon, France, 2017. [CrossRef]
- Alemi, A.; Fischer, I.; Dillon, J.; Murphy, K. Deep Variational Information Bottleneck. In Proceedings of the International Conference on Learning Representations (ICLR), Toulon, France, 2017. [CrossRef]
- Wang, C.; Du, S.; Zhang, Y. Self-Supervised Learning for High-Resolution Remote Sensing Images Change Detection With Variational Information Bottleneck. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2023. [CrossRef]
- Haroutunian, M. E.; Gharagyozyan, G. A. Information Theory Tools and Techniques to Overcome Machine Learning Challenges. Mathematical Problems of Computer Science 2025, 63, 25–41. [CrossRef]
- Feder, M.; Urbanke, R.; Fogel, Y. Information-Theoretic Framework for Understanding Modern Machine Learning. arXiv 2025, arXiv:2506.07661. [CrossRef]
- Gharagyozyan, G. Improving CNN Generalization with PDE Preprocessing and the Variational Information Bottleneck. In Proceedings of the International Conference on Computer Science and Information Technologies (CSIT), Yerevan, Armenia, September 2025, 145–147.
- Gharagyozyan, G. A PDE-Based Convolutional Neural Network with Variational Information Bottleneck: Experimental Evaluation and Generalization Analysis. Mathematical Problems of Computer Science 2025, 64, 37–46. [CrossRef]
- Perona, P.; Malik, J. Scale-Space and Edge Detection Using Anisotropic Diffusion. IEEE Transactions on Pattern Analysis and Machine Intelligence 1990, 12(7), 629–639. [CrossRef]
- Weickert, J. Anisotropic Diffusion in Image Processing; Teubner: Stuttgart, Germany, 1998.
- Richtmyer, R. D.; Morton, K. W. Difference Methods for Initial-Value Problems, 2nd ed.; John Wiley & Sons: New York, NY, USA, 1967; pp. 1–30.
- Cover, T. M.; Thomas, J. A. Elements of Information Theory, 2nd ed.; John Wiley & Sons: New York, NY, USA, 2006.
- Mohanty, S.; Reddy, A.; Mopuri, K. R. DiRe: Diversity-promoting Regularization for Dataset Condensation. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV), Tucson, AZ, USA, 2026. [CrossRef]
- Qi, D.; Li, J.; Dou, S.; Gao, J.; Wang, Y.; Zhao, B. Active Dataset Distillation via Dual-Space Informative Matching. IEEE Transactions on Image Processing 2026. [CrossRef]
- Hendrycks, D.; Mu, N.; Cubuk, E.; Zoph, B.; Gilmer, J.; Lakshminarayanan, B. AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty. In Proceedings of the International Conference on Learning Representations (ICLR), Addis Ababa, Ethiopia, 2020. [CrossRef]
- Tishby, N.; Zaslavsky, N. Deep Learning and the Information Bottleneck Principle. In Proceedings of the 2015 IEEE Information Theory Workshop (ITW), Jerusalem, Israel, 2015, 1–5. [CrossRef]
- Goodfellow, I.; Shlens, J.; Szegedy, C. Explaining and Harnessing Adversarial Examples. In Proceedings of the International Conference on Learning Representations (ICLR), San Diego, CA, USA, 2015. [CrossRef]
- Hendrycks, D.; Gimpel, K. A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks. In Proceedings of the International Conference on Learning Representations (ICLR), Toulon, France, 2017. [CrossRef]
- Cubuk, E.; Zoph, B.; Mane, D.; Vasudevan, V.; Le, Q. AutoAugment: Learning Augmentation Strategies from Data. In Proceedings of the 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 2019, 113–123. [CrossRef]
- Madry, A.; Makelov, A.; Schmidt, L.; Tsipras, D.; Vladu, A. Towards Deep Learning Models Resistant to Adversarial Attacks. In Proceedings of the International Conference on Learning Representations (ICLR), Vancouver, BC, Canada, 2018. [CrossRef]





| Model | Accuracy (%) | NLL | ECE |
| Baseline CNN | 80.60 | 0.584965 | 0.037420 |
| PDE-CNN-VIB | 88.52 | 0.3684 | 0.0177 |
| Corruption Type | Baseline CNN | PDE-CNN-VIB |
| Gaussian Noise | 25.79 | 39.62 |
| Shot Noise | 38.66 | 53.37 |
| Impulse Noise | 50.14 | 58.97 |
| Defocus Blur | 79.93 | 79.48 |
| Glass Blur | 47.82 | 57.48 |
| Motion Blur | 66.60 | 65.54 |
| Zoom Blur | 70.69 | 70.82 |
| Snow | 72.81 | 74.13 |
| Frost | 67.63 | 69.88 |
| Fog | 82.81 | 82.29 |
| Brightness | 86.14 | 86.49 |
| Contrast | 74.34 | 74.64 |
| Elastic Transform | 76.74 | 76.40 |
| Pixelate | 74.89 | 70.71 |
| JPEG Compression | 72.98 | 73.43 |
| mCA | 65.86 | 68.88 |
| Model | Parameters (M) | Epoch Time (s) | Inference Time (ms/image) |
| Baseline CNN | 11.17 | 108.77 | 1.463 |
| PDE-CNN-VIB | 11.20 | 122.12 | 1.516 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).