Preprint
Article

This version is not peer-reviewed.

FedHeRM: Federated Heterogeneity-Aware Reinforcement Learning for Secure NOMA Resource Management in Vehicular Edge Networks

Submitted:

07 April 2026

Posted:

08 April 2026

You are already at the latest version

Abstract
The rapid expansion of 5G/6G technologies and the burgeoning demand for low-latency, high-reliability services at the network edge challenge traditional centralized cloud architectures. Edge computing offers a promising solution, yet its dynamic, resource-constrained, and heterogeneous nature necessitates decentralized intelligence for effective resource management. This paper addresses these challenges by synergistically integrating Federated Learning (FL), Multi-Agent Reinforcement Learning (MARL), and Non-Orthogonal Multiple Access (NOMA) resource management. While Federated Multi-Agent Reinforcement Learning (FMARL) is a critical direction, existing methods struggle with extreme heterogeneity and privacy concerns. To overcome these limitations, we propose FedHeRM: Federated Heterogeneity-aware Reinforcement Learning for Secure NOMA Resource Management in Vehicular Edge Networks. FedHeRM models the vehicular edge environment as a Partially Observable Stochastic Game, where RSU agents learn optimal resource allocation policies. Our core innovation lies in a novel Heterogeneity-aware Federated Multi-Agent Reinforcement Learning (HeRA-FMARL) framework, which introduces a Dynamic Heterogeneity Measure (DHM) for adaptive weighted aggregation of model updates, significantly accelerating convergence and enhancing generalization across diverse agents. Furthermore, FedHeRM integrates robust privacy-preserving mechanisms, including Differential Privacy for local updates and Secure Aggregation protocols. Comprehensive experiments in a simulated vehicular edge environment demonstrate that FedHeRM significantly outperforms state-of-the-art baselines across critical metrics, achieving superior system throughput, lower task latency, reduced energy consumption, and enhanced user fairness, while maintaining excellent scalability and strong privacy guarantees. An ablation study confirms the crucial roles of its key components, further validating FedHeRM's efficacy in highly dynamic and heterogeneous edge networks.
Keywords: 
;  ;  ;  ;  
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated