Submitted:
08 April 2026
Posted:
09 April 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction

- We propose FedMA-DRL, a federated multi-agent DRL framework that integrates CTDE architecture with nonlinear reward shaping and action-guided exploration for efficient joint channel selection and power control in CRNs.
- We introduce a GNN-augmented Q-value predictor that leverages the topological structure of wireless devices to improve prediction accuracy and a FedAge-based federated aggregation strategy for privacy-preserving distributed learning.
- We develop an attention-based domain adaptation module that enhances cross-domain generalization, enabling robust performance across heterogeneous wireless environments without requiring domain-specific retraining.
2. Related Work
2.1. Deep Reinforcement Learning for Dynamic Spectrum Access
2.2. Federated Learning and Graph Neural Networks in Wireless Networks
3. Method
3.1. Problem Formulation
3.2. Nonlinear Reward Shaping
3.3. GNN-Augmented Q-Value Predictor
3.4. Federated Aggregation with FedAge
3.5. Attention-Based Domain Adaptation Module
3.6. Overall Training Procedure
4. Experiments
4.1. Experimental Setup
4.2. Main Results
4.3. Effectiveness of FedMA-DRL
4.4. Human Evaluation
4.5. Scalability Analysis
4.6. Cross-Domain Generalization
5. Conclusion
References
- Ahmed, M.; Hassan, R. Dynamic Spectrum Access in Cognitive Radio Networks: A Comprehensive Review. IEEE Access 2024.
- Kumar, S.; Ahmad, I. A Review of Spectrum Sensing in Modern Cognitive Radio Networks. Telecommunication Systems 2023.
- Zhang, W.; Liu, C. Enhancing Cognitive Radio Network Performance through Channel Selection and Power Control. IEEE Transactions on Cognitive Communications 2024.
- Gupta, A.; Singh, R. A Novel Game Theoretic Approach for Market-Driven Dynamic Spectrum Access. Wireless Networks 2023.
- Chen, Y.; Zhao, M. Improved Spectrum Prediction Model for Cognitive Radio Networks Using Hybrid LSTM-MLP. ICT Express 2024.
- Yang, N.; Zhang, H.; Long, K.; Jiang, C.; Yang, Y. Spectrum management scheme in fog IoT networks. IEEE Communications Magazine 2018, 56, 101–107. [CrossRef]
- Yang, N.; Zhang, H.; Long, K.; Hsieh, H.Y.; Liu, J. Deep neural network for resource management in NOMA networks. IEEE Transactions on Vehicular Technology 2019, 69, 876–886. [CrossRef]
- Chen, M.; Zeng, Q. Applications of Deep Reinforcement Learning in Wireless Networks. IEEE Communications Surveys & Tutorials 2023.
- Zhang, Y.; Li, P. A Comprehensive Survey of Multi-Agent Deep Reinforcement Learning for Dynamic Spectrum Access. Neurocomputing 2025.
- Gronauer, S.; Diepold, K. An Introduction to Centralized Training for Decentralized Execution in Multi-Agent Reinforcement Learning. arXiv preprint arXiv:2409.03052 2024.
- Wang, J.; Li, P. A Heterogeneous-Agent Deep Reinforcement Learning Approach for Dynamic Spectrum Access. IEEE Transactions on Wireless Communications 2025.
- Wu, Y.; Yu, Y.; Yang, Z.; Zeng, Z.; Chen, G.; Xu, J. Brain-SAM: Modality-Agnostic Model for Brain Lesion Segmentation. In Proceedings of the 2025 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 2025, pp. 3000–3005.
- Xu, X.; Tu, W.; Yang, Y. CASE-Net: Integrating local and non-local attention operations for speech enhancement. Speech Communication 2023, 148, 31–39. [CrossRef]
- Xu, X.; Tu, W.; Yang, Y. Adaptive selection of local and non-local attention mechanisms for speech enhancement. Neural Networks 2024, 174, 106236. [CrossRef]
- Aggarwal, M.; Gupta, S. A Comprehensive Review of Federated Learning: Methods, Applications, and Challenges. Artificial Intelligence Review 2024.
- Li, F.; Yang, J. A Dynamic Spectrum Access Scheme for Internet of Things with Improved Federated Learning. Journal of Network and Computer Applications 2025. [CrossRef]
- Kumar, A.; Kumar, V. Dynamic Spectrum Access in Cognitive Radio Networks: A Reinforcement Learning Perspective. IEEE Access 2024.
- Xu, Y.; Chen, H. Deep Reinforcement Learning for Dynamic Spectrum Access: A Multi-Agent Approach. IEEE Transactions on Wireless Communications 2024.
- Ukpong, U.C.; Idowu-Bismark, O.; Adetiba, E. Deep Reinforcement Learning Agents for Dynamic Spectrum Access in Television Whitespace Cognitive Radio Networks. Scientific African 2025. [CrossRef]
- Malhotra, S. Deep Reinforcement Learning for Dynamic Resource Allocation in Wireless Networks. arXiv preprint arXiv:2502.01129 2025.
- Bai, W.; Zheng, G.; Xia, W.; Mu, Y.; Xue, Y. Multi-Agent Deep Reinforcement Learning-Based Joint Channel Selection and Power Control Method. Computers and Electrical Engineering 2025. [CrossRef]
- Li, P.; Wang, J. Multi-Agent Deep Reinforcement Learning for Dynamic Spectrum Access. Springer CCIS 2025.
- Venkatesan, P.; Kumaratharan, N. Reinforcement Learning-Based Dynamic Spectrum Allocation for Efficient Cognitive Radio Network Management. Computer Networks 2025. [CrossRef]
- Shah, S.S.; Ali, A. Optimizing Resource Allocation and Energy Efficiency in Federated Fog Computing for IoT. arXiv preprint arXiv:2504.00791 2025.
- Wu, F.; He, Z. Privacy-Preserving Federated Graph Neural Network with Local Differential Privacy. Security and Communication Networks 2023.
- He, C.; Fan, S. A Federated Graph Neural Network Framework for Privacy-Preserving Personalized Recommendation. Nature Communications 2022.
- Banday, Y. Empowering RIS-Assisted NOMA Networks with Deep Learning for User Clustering and Phase Shifter Optimization. Wireless Networks 2025. [CrossRef]
- Yang, N.; Zhang, H.; Berry, R. Partially observable multi-agent deep reinforcement learning for cognitive resource management. In Proceedings of the GLOBECOM 2020-2020 IEEE Global Communications Conference. IEEE, 2020, pp. 1–6.
- Xu, X.; Wang, Y.; Xu, D.; Peng, Y.; Zhang, C.; Jia, J.; Chen, B. Vsegan: Visual speech enhancement generative adversarial network. In Proceedings of the ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022, pp. 7308–7311.



| Method | SU Thr. | Coll. Prob. | Energy Eff. | Spectrum Eff. |
| C-DRL | 12.34 | 0.082 | 3.21 | 4.56 |
| DQN | 13.15 | 0.064 | 3.58 | 5.12 |
| QR-DQN | 13.48 | 0.059 | 3.72 | 5.34 |
| PPO | 12.87 | 0.071 | 3.45 | 4.89 |
| R2D2 | 13.92 | 0.051 | 3.88 | 5.67 |
| Ours | 14.87 | 0.038 | 4.35 | 6.23 |
| Variant | SU Thr. (Mbps) | Coll. Prob. | Energy Eff. | Spectrum Eff. |
| FedMA-DRL (Full) | 14.87 | 0.038 | 4.35 | 6.23 |
| w/o GNN | 13.92 | 0.052 | 3.88 | 5.67 |
| w/o FedAge | 14.15 | 0.046 | 4.02 | 5.85 |
| w/o Domain Adapt. | 14.21 | 0.044 | 4.08 | 5.91 |
| w/o Nonlinear Reward | 13.68 | 0.061 | 3.65 | 5.28 |
| Method | Spectrum Util. | Interference Mgmt. | Interpretability |
| C-DRL | 3.2 | 2.8 | 3.5 |
| DQN | 3.5 | 3.2 | 3.1 |
| QR-DQN | 3.6 | 3.3 | 3.0 |
| PPO | 3.4 | 3.1 | 3.3 |
| R2D2 | 3.8 | 3.5 | 3.2 |
| Ours | 4.2 | 4.0 | 3.8 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).