Preprint
Article

This version is not peer-reviewed.

Adaptive RL-Based FHSS Strategies: A Comparative Analysis of Baseline, Tabular Q-Learning, and DQN vs. 1st-Order Markov Jammer

Submitted:

22 May 2026

Posted:

26 May 2026

You are already at the latest version

Abstract
This paper presents a comparative analysis of Reinforcement Learning (RL)-based strategies for optimizing Frequency-Hopping Spread Spectrum (FHSS) systems against a first-order Markov jammer in Unmanned Aerial Vehicle (UAV) communications, addressing critical vulnerabilities in electronic warfare scenarios. The jammer model simulate adaptive threats in drone networks. Simulations were conducted within a Markov Decision Process (MDP) framework featuring 16 channels and episodes of 1000 steps. Three approaches were evaluated: Baseline random channel selection, Tabular Q-Learning, and Deep Q-Network (DQN) employing 16-128-128-16 neural architecture. Training spanned 100–500 episodes, with performance assessed via key metrics: Success Rate (%), Bit Error Rate (BER), Signal-to-Noise Ratio (SNR), action Entropy, and Packet Loss Rate (PLR) under Forward Error Correction (FEC).
Keywords: 
;  ;  ;  ;  ;  ;  
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated