Bayesian R-LayerNorm: Uncertainty-Aware Adaptive Normalization with Provable Robustness Bounds

Mohsen Mostafa

doi:10.20944/preprints202603.0450.v1

Submitted:

04 March 2026

Posted:

05 March 2026

You are already at the latest version

Abstract

This paper introduces Bayesian R-LayerNorm, a novel normalization layer that extends the previously proposed R-LayerNorm with formal mathematical foundations and uncertainty quantification. Building upon the empirical success of R-LayerNorm, we present a complete mathematical formalism using sta-tistical field theory, renormalization group methods, and information geometry. Our approach provides provable stability guarantees through three theorems: numerical stability, gradient stability, and training convergence. The Bayesian extension incorporates uncertainty estimation through a stable ψ-function, enabling adaptive noise suppression based on local entropy estimates. A key contribution is the integration of uncertainty quantification directly into the normal-ization operation, providing confidence estimates for each normalized activation without additional cost. The method is adaptive to local noise, varying its normalization strength spatially based on estimated noise levels. Despite its theoretical depth, the implementation is simple and serves as a drop-in replacement for existing normalization layers, adding only two learnable parameters per layer. Experimental validation on the full CIFAR-10-C dataset demonstrates consistent improvements: Bayesian R-LayerNorm achieves average accuracy gains of +0.49% over standard LayerNorm across four common corruptions, with the largest improvement of +0.74% on shot noise. The method requires minimal computational overhead (∼ 10%) and we provide complete open-source implementation. We further show that the learned λ parameters offer interpretability, revealing which layers adapt most strongly to different corruptions. While the accuracy gains are modest, the framework opens new di-rections for trustworthy and interpretable normalization in safety-critical applications where uncertainty matters as much as accuracy.

Keywords:

normalization

;

robust learning

;

Bayesian methods

;

uncertainty quantification

;

image corruption

Subject:

Computer Science and Mathematics - Artificial Intelligence and Machine Learning

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Bayesian R-LayerNorm: Uncertainty-Aware Adaptive Normalization with Provable Robustness Bounds

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe