Submitted:
26 May 2026
Posted:
27 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. Methodology
3.1. Data Preparation and Preprocessing
3.2. Data Balancing: The SMOTE-CTGAN Pipeline
3.3. Hybrid Model Architecture (LSTM-VAE)
3.4. Experimental Setup and Hyperparameter Optimization
4. Results
4.1. Evaluation on Test Set
4.2. 10-Fold Cross-Validation and Statistical Significance
4.3. Performance Comparison with State-of-the-Art Augmentation and Hybrid IDS Approaches
5. Model Explainability using SHAP Analysis
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| GenAI | Generative Artificial Intelligence |
| CTGAN | Conditional Tabular Generative Adversarial Network |
| SMOTE | Synthetic Minority Over-sampling Technique |
| LSTM | Long Short-Term Memory |
| SHAP | SHapley Additive exPlanations |
| VAE | Variational Autoencoders |
| IDS | Intrusion Detection System |
| AUC | Area Under the Curve |
| ROC | Receiver Operating Characteristic |
References
- Kedys, A. Fast-Changing Cyber Threat Landscape and a New Reality of Cyber Security. Cyber Secur. 2025, 8, 273. [CrossRef]
- Wang, P.; Lin, H.-C.; Chen, J.-H.; Lin, W.-H.; Li, H.-C. Improving Cyber Defense Against Ransomware: A Generative Adversarial Networks-Based Adversarial Training Approach for Long Short-Term Memory Network Classifier. Electronics 2025, 14, 810. [CrossRef]
- Coppolino, L.; D’Antonio, S.; Mazzeo, G.; Uccello, F. The good, the bad, and the algorithm: The impact of generative AI on cybersecurity. Neurocomputing 2025, 623, 129406. [CrossRef]
- Ferrag, M.A.; Maglaras, L.; Janicke, H. Generative AI in Cybersecurity: A Comprehensive Review of Applications, Challenges, and Future Directions. Comput. Secur. 2025, 111, 102–118.
- Reynaud, S.; Roxin, A. Review of eXplainable artificial intelligence for cybersecurity systems. Discover Artificial Intelligence 2025, 5, 78. [CrossRef]
- Whang, S.E.; Roh, Y.; Song, H.; Lee, J.-G. Data Collection and Quality Challenges in Deep Learning: A Data-Centric AI Perspective. arXiv 2021, arXiv:2112. [CrossRef]
- Bagui, S.; Li, K. Resampling Imbalanced Data for Network Intrusion Detection Datasets. J. Big Data 2021, 8, 6. [CrossRef]
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic Minority Over-Sampling Technique. J. Artif. Intell. Res. 2002, 16, 321–357. [CrossRef]
- Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780.
- Lundberg, S.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. Adv. Neural Inf. Process. Syst. 2017, 30, 4765–4774.
- Hermosilla, P. A Comparative Study of SHAP and LIME in Intrusion Detection Systems. Appl. Sci. 2025, 15, 7329.
- Hozouri, A.; Mirzaei, A.; Effatparvar, M. A comprehensive survey on intrusion detection systems with advances in machine learning, deep learning and emerging cybersecurity challenges. Discov. Artif. Intell. 2025, 5, 314. [CrossRef]
- Alashjaee, A.M. Deep learning for network security: an attention-CNN-LSTM model for accurate intrusion detection. Sci. Rep. 2025, 15, 21856. [CrossRef]
- Ekpo, O.; Casola, V.; De Benedictis, A.; Asuquo, P.; Agbor, B. A hybrid CNN–LSTM–attention framework for intrusion detection in smart mobility networks. Future Internet 2026, 18, 210. [CrossRef]
- Afraji, D.M.; Lloret, J.; Peñalver, L. An integrated hybrid deep learning framework for intrusion detection in IoT and IIoT networks using CNN-LSTM-GRU architecture. Computation 2025, 13, 222. [CrossRef]
- Zhu, G.; Yu, Y.; Deng, X.; Dai, Y.; Li, Z. A Hybrid Split-Attention and Transformer Architecture for High-Performance Network Intrusion Detection. Comput. Model. Eng. Sci. 2025, 145, 4317. [CrossRef]
- Agarwal, L.; Jaint, B.; Mandpura, A.K. Reducing overfitting in deep learning intrusion detection for power systems with CTGAN. Chaos Solitons Fractals 2024, 188, 115603. [CrossRef]
- Menssouri, S.; Amhoud, E.M. A conditional tabular GAN-enhanced intrusion detection system for rare attacks in IoT networks. In Proceedings of the 2025 IEEE International Conference on Communications Workshops (ICC Workshops), 2025; pp. 1918–1923.
- Saka, S.; Selis, V.; Marshall, A. AlignAD-VAE: a variational autoencoder with MMD-based dataset alignment for network anomaly detection. In Proceedings of the 2025 IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom), 2025; pp. 861–867.
- Qiu, Z.; Wang, Y.; Li, H.; Zhang, J. VAEMax: open-set intrusion detection based on OpenMax and variational autoencoder. In Proceedings of the 2024 IEEE International Conference on Information and Communication Technologies (ICTC), 2024; pp. 98–105.
- Neupane, S.; Ables, J.; Anderson, W.; Mittal, S.; Rahimi, S.; Banicescu, I.; Seale, M. Explainable intrusion detection systems (x-ids): A survey of current methods, challenges, and opportunities. IEEE Access 2022, 10, 112392–112415. [CrossRef]
- Yagiz, M.A.; Goktas, P. LENS-XAI: redefining lightweight and explainable network security through knowledge distillation and variational autoencoders for scalable intrusion detection in cybersecurity. arXiv 2025, arXiv:2501.00790.
- Doshi, R.; Hiran, K.K. Explainable artificial intelligence as a cybersecurity aid. In Advances in Explainable AI Applications for Smart Cities; IGI Global Scientific Publishing, 2024; pp. 98–113.
- Khan, N.; Ahmad, K.; Al Tamimi, A.; Alani, M.M.; Bermak, A.; Khalil, I. Explainable AI-based intrusion detection systems for Industry 5.0 and adversarial XAI: a systematic review. Information 2025, 16, 1036. [CrossRef]
- Brik, B.; Chergui, H.; Zanzi, L.; Devoti, F.; Ksentini, A.; Siddiqui, M.S.; Costa-Pérez, X.; Verikoukis, C. Explainable AI in 6G O-RAN: A tutorial and survey on architecture, use cases, challenges, and future research. IEEE Commun. Surv. Tutor. 2024, 27, 2826–2859. [CrossRef]
- Mohale, V.Z.; Obagbuwa, I.C. A systematic review on the integration of explainable artificial intelligence in intrusion detection systems to enhancing transparency and interpretability in cybersecurity. Front. Artif. Intell. 2025, 8, 1526221. [CrossRef]
- Barkah, A.S.; Selamat, S.R.; Abidin, Z.Z.; Wahyudi, R. Impact of data balancing and feature selection on machine learning-based network intrusion detection. Int. J. Inform. Vis. 2023, 7, 241–248. [CrossRef]
- Chandekar, P.; Mehta, M.; Chandan, S. Enhanced anomaly detection in iomt networks using ensemble ai models on the ciciomt2024 dataset. arXiv 2025, arXiv:2502.11854.
- Azzouni, A.; Pujolle, G. A long short-term memory recurrent neural network framework for network traffic matrix prediction. arXiv 2017, arXiv:1705.05690.
- Kingma, D.P.; Welling, M. Auto-Encoding Variational Bayes. In Proceedings of the International Conference on Learning Representations (ICLR), Banff, AB, Canada, 14–16 April 2014. (Also available as arXiv:1312.6114, 2013.).
- Cousineau, D.; Chartier, S. Outliers detection and treatment: a review. Int. J. Psychol. Res. 2010, 3, 58–67. [CrossRef]
- Randhawa, P.; Jasthi, V.N.; Piyush, K.; Kaushik, G.K.; Batamulay, M.; Prasad, S.N.; Rawat, M.; Veernapu, K.; Naik, N. Conditional Tabular Generative Adversarial Network Based Clinical Data Augmentation for Enhanced Predictive Modeling in Chronic Kidney Disease Diagnosis. BioMedInformatics 2026, 6, 6. [CrossRef]
- Xu, L.; Skoularidou, M.; Cuesta-Infante, A.; Veeramachaneni, K. Modeling tabular data using conditional GAN. Adv. Neural Inf. Process. Syst. 2019, 32.
- Zarkadis, I.C.; Douligeris, C. Machine Learning for Network Attacks Classification and Statistical Evaluation of Adversarial Learning Methodologies for Synthetic Data Generation. arXiv 2026, arXiv:2603.
- Bouidaine, A.B.; Moussaoui, D.; Hadjila, M.; Ferhi, W.; Hachemi, M.H. Deep Learning-Based Anomaly and Intrusion Detection Using the CSE-CIC-IDS2018 Dataset. Eng. Technol. Appl. Sci. Res. 2025, 15, 24782–24787. [CrossRef]
- Balasubramanian, S.K.; Perumal, S. Comparative Study of BiGRU with Multi-Head Attention and CNN for Network Intrusion Detection Using a Cleaned and Balanced CSE-CIC-IDS 2018 Dataset. Turk. J. Eng. 2025, 9, 725–737. [CrossRef]
- Mchina, J.P.; Mduma, N.; Sinde, R.S. Adaptive Decision-Level Intrusion Detection for Known and Zero-Day Attacks. Network 2026, 6, 23. [CrossRef]








| Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) | ROC AUC |
| 99.08 | 96.54 | 94.35 | 95.44 | 0.99499 |
![]() |
![]() |
![]() |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).


