Submitted:
14 January 2024
Posted:
15 January 2024
You are already at the latest version
Abstract
Keywords:
0. Introduction
1. Related Works
2. Materials and Methods
2.1. Datasets and Source
2.2. Proposed Novel Hybrid Method for DDoS Attack Detection using TOPT with genetics algorithm
2.3. Three machine learning algorithm Integration with Genetics Algorithms
2.4. The framework of the proposed DDoS diagnosis procedure
3. Performance parameters
- True Positive tp: When both the model's forecast and the actual values in the dataset are positive, we say that a value is true positive, or tp. meaning the classifier accurately differentiates between good and bad traffic.
- True negative tn: When both the model's forecast and the actual values in the dataset are negative, we say that b) is a true negative tn. i.e. it is the circumstance where the traffic is accurately categorized as malicious.
- False Positive fp: False positive is the error category where the model prediction is positive and actual values in the dataset is negative. i.e. \sit is the circumstance where the traffic is wrongly classed as innocuous.
- False Negative fn: A false negative is a form of error in which the actual values in the dataset contradict the prediction of the model. i.e., it is the circumstance where the traffic is wrongly categorized as harmful.
- As a performance metric, accuracy may be written as a fraction with the sum of correct answers (positive and negative) in the numerator and the sum of incorrect answers (positive and negative) in the denominator.
4. Results
4.1. Three machine learning classification results
4.2. SVM
4.3. Random Forest" (RF)
4.4. XGBoost
4.5. Receiver Operating Characteristic ROC(AUC) training performance
4.6. Performing accuracy tests using a variety of methodologies for fivefold cross validation
5. Three machine learning algorithm optimizations with genetic algorithms results
5.1. RF-GA Optimization with genetic algorithms results
5.2. SVM-GA optimization with genetic algorithms results
5.3. Proposed three TOPOT-Classifiers with other 7 seven GA optimization models results
6. Comparative analysis with existing results
| ML-GA Classifiers | 5 iterations/5-fold CV | Best Pipeline Test Accuracy Score |
| Extra Trees Classifier | Internal cv score | 0.8123 |
| K-Neighbors Classifier | Internal cv score | 0.8158 |
| Bernoulli NB | Internal cv score | 0.7322 |
| GBoosting Classifier | Internal cv score | 0.9910 |
| SGD Classifier | Internal cv score | 0.5283 |
| Multinomial NB | Internal cv score | 0.5307 |
| Logistic Regression | Internal cv score | 0.7151 |
| SVM-GA Optimization | Internal cv score | 0.9940 |
| Best pipeline test accuracy | Internal cv score | 0.9960 |
| RF-GA Optimization | Internal cv score | 0.9988 |
| Best pipeline test accuracy | Internal cv score | 0.9950 |
| Proposed XGB-GA | Accuracy: | 0.9999 |
| Best pipeline test accuracy: | 1.000 |
7. Discussion
8. Conclusion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Conflicts of Interest
References
- Bhutto, A.; Chandio, A.A.; Luhano, K.K.; Korejo, I.A. Analysis of Energy and Network Cost Effectiveness of Scheduling Strategies in Datacentre. Cybernetics and Information Technologies 2023, 23, 56–69. [Google Scholar] [CrossRef]
- Chandio, A.A.; Korejo, M.S.; Korejo, I.A.; Chandio, M.S. To investigate classical scheduling schemes with power management in IaaS cloud environment for HPC workloads. In 2017 IEEE 15th Student Conference on Research and Development (SCOReD). 2017. (pp. 121-126). IEEE.
- Wyld, D.C. The cloudy future of government IT: Cloud computing and the public sector around the world. International Journal of Web & Semantic Technology 2010, 1, 1–20. [Google Scholar]
- Muteeh, A.; Sardaraz, M.; Tahir, M. MrLBA: multi-resource load balancing algorithm for cloud computing using ant colony optimization. Cluster Computing 2021, 24, 3135–3145. [Google Scholar] [CrossRef]
- Maryam, K.; Sardaraz, M.; Tahir, M. Evolutionary algorithms in cloud computing from the perspective of energy consumption: A review. In 2018 14th international conference on emerging technologies (ICET), 2018. (pp. 1-6). IEEE.
- Ganesh Kumar, G.; Vivekanandan, P. Energy efficient scheduling for cloud data centers using heuristic based migration. Cluster Computing 2019, 22, 14073–14080. [Google Scholar] [CrossRef]
- Younas, I.; Naeem, A. Optimization of sensor selection problem in IoT systems using opposition-based learning in many-objective evolutionary algorithms. Computers & Electrical Engineering 2022, 97, 107625. [Google Scholar]
- Gubbi, J.; Buyya, R.; Marusic, S.; Palaniswami, M. Internet of Things (IoT): A vision, architectural elements, and future directions. Future generation computer systems 2013, 29, 1645–1660. [Google Scholar] [CrossRef]
- Al Bataineh, A. , Manacek, S. MLP-PSO hybrid algorithm for heart disease prediction. Journal of Personalized Medicine 2022, 12, 1208. [Google Scholar] [CrossRef]
- Samieinasab, M.; Torabzadeh, S.A.; Behnam, A.; Aghsami, A.; Jolai, F. Meta-Health Stack: A new approach for breast cancer prediction. Healthcare Analytics 2022, 2, 100010. [Google Scholar] [CrossRef]
- Jiao, B.; Guo, Y.; Yang, S.; Pu, J.; Gong, D. Reduced-space Multistream Classification based on Multi-objective Evolutionary Optimization. IEEE Transactions on Evolutionary Computation 2022. [Google Scholar]
- Hameed, B.Z.; Prerepa, G.; Patil, V.; Shekhar, P.; Zahid Raza, S.; Karimi, H.; Somani, B.K. Engineering and clinical use of artificial intelligence (AI) with machine learning and data science advancements: Radiology leading the way for future. Therapeutic Advances in Urology 2021, 13, 17562872211044880. [Google Scholar] [CrossRef] [PubMed]
- Tuli, S.; Ilager, S.; Ramamohanarao, K.; Buyya, R. Dynamic scheduling for stochastic edge-cloud computing environments using a3c learning and residual recurrent neural networks. IEEE transactions on mobile computing 2020, 21, 940–954. [Google Scholar] [CrossRef]
- Hu, C.; Zeng, S.; Li, C. An uncertainty measure for prediction of non-Gaussian process surrogates. Evolutionary Computation 2023, 31, 53–71. [Google Scholar] [CrossRef]
- Zelinka, I. A survey on evolutionary algorithms dynamics and its complexity–Mutual relations, past, present and future. Swarm and Evolutionary Computation 2015, 25, 2–14. [Google Scholar] [CrossRef]
- Casalino, L.; Masseni, F.; Pastrone, D. Robust Design Approaches for Hybrid Rocket Upper Stage. Journal of Aerospace Engineering 2019, 32, 04019087. [Google Scholar] [CrossRef]
- Jatoi, W.M.; Korejo, I.A.; Chandio, A.A.; Brohi, K.; Koondhar, Y.M. Meta-heuristic algorithms with immigrant techniques for nurse duty roster in public hospitals in Sindh, Pakistan, 2020.
- Dong, D.; Ye, Z.; Cao, Y.; Xie, S.; Wang, F.; Ming, W. An improved association rule mining algorithm based on ant lion optimizer algorithm and FP-growth. In 2019 10th IEEE international conference on intelligent data acquisition and advanced computing systems: technology and applications (IDAACS), 2019. (Vol. 1, pp. 458–463). IEEE.
- Ahmad, A.S.; Hassan, M.Y.; Abdullah, M.P.; Rahman, H.A.; Hussin, F.; Abdullah, H.; Saidur, R. A review on applications of ANN and SVM for building electrical energy consumption forecasting. Renewable and Sustainable Energy Reviews 2014, 33, 102–109. [Google Scholar] [CrossRef]
- Madni SH, H.; Latiff MS, A.; Coulibaly, Y.; Abdulhamid SI, M. Recent advancements in resource allocation techniques for cloud computing environment: a systematic review. cluster computing 2017, 20, 2489–2533. [Google Scholar] [CrossRef]
- Wang, L. Machine availability monitoring and machining process planning towards Cloud manufacturing. CIRP Journal of Manufacturing Science and Technology 2013, 6, 263–273. [Google Scholar] [CrossRef]
- Løken, E. Use of multicriteria decision analysis methods for energy planning problems. Renewable and sustainable energy reviews 2007, 11, 1584–1595. [Google Scholar] [CrossRef]
- Xia, W.; Wu, Z. An effective hybrid optimization approach for multi-objective flexible job-shop scheduling problems. Computers & industrial engineering 2005, 48, 409–425. [Google Scholar]
- Aslanpour, M.S.; Ghobaei-Arani, M.; Toosi, A.N. Auto-scaling web applications in clouds: A cost-aware approach. Journal of Network and Computer Applications 2017, 95, 26–41. [Google Scholar] [CrossRef]
- Buyya, R.; Broberg, J.; Goscinski, A.M. (Eds.). Cloud computing: Principles and paradigms. John Wiley & Sons 2010. Buyya, R.; Broberg, J.; Goscinski, A.M. (Eds.).
- Khalaf, B.A.; Mostafa, S.A.; Mustapha, A.; Mohammed, M.A.; Abduallah, W.M. Comprehensive review of artificial intelligence and statistical approaches in distributed denial of service attack and defense methods. IEEE Access 2019, 7, 51691–51713. [Google Scholar] [CrossRef]
- Dixit, P.; Silakari, S. Deep learning algorithms for cybersecurity applications: A technological and status review. Computer Science Review 2021, 39, 100317. [Google Scholar] [CrossRef]
- Basit, A.; Zafar, M.; Liu, X.; Javed, A.R.; Jalil, Z.; Kifayat, K. A comprehensive survey of AI-enabled phishing attacks detection techniques. Telecommunication Systems 2021, 76, 139–154. [Google Scholar] [CrossRef] [PubMed]
- Mohammed, M.A.; Gunasekaran, S.S.; Mostafa, S.A.; Mustafa, A.; Abd Ghani, M.K. Implementing an agent-based multi-natural language anti-spam model. In 2018 International symposium on agent, multi-agent systems and robotics (ISAMSR). 2018.(pp. 1-5). IEEE.
- Aburomman, A.A.; Reaz MB, I. A survey of intrusion detection systems based on ensemble and hybrid classifiers. Computers & security 2017, 65, 135–152. [Google Scholar]
- Dwivedi, S.; Vardhan, M.; Tripathi, S. Defense against distributed DoS attack detection by using intelligent evolutionary algorithm. International Journal of Computers and Applications 2022, 44, 219–229. [Google Scholar] [CrossRef]
- Natarajan, S., Mgen. 2014. Available online: https://ryu.readthedocs.io/en/latest/ryu_app_api.html.
- Kumar PA, R.; Selvakumar, S. Detection of distributed denial of service attacks using an ensemble of adaptive and hybrid neuro-fuzzy systems. Computer Communications 2013, 36, 303–319. [Google Scholar] [CrossRef]
- Da Silva, A.S. , Wickboldt, J.A., Granville, L.Z., Schaeffer-Filho, A., Atlantic: a framework for anomaly traffi detection, classifiation, and mitigation in sdn. In: NOMS IEEE/IFIP Network Operations and Management Symposium. IEEE 2016, pp. 27–35.
- Perez-Diaz, J.A.; Valdovinos, I.A.; Choo KK, R.; Zhu, D. A flexible SDN-based architecture for identifying and mitigating low-rate DDoS attacks using machine learning. IEEE Access 2020, 8, 155859–155872. [Google Scholar] [CrossRef]
- Ye, J.; Cheng, X.; Zhu, J.; Feng, L.; Song, L. A DDoS attack detection method based on SVM in software defined network. Security and Communication Networks 2018. [Google Scholar] [CrossRef]
- Ko, I.; Chambers, D.; Barrett, E. Self-supervised network traffic management for DDoS mitigation within the ISP domain. Future Generation Computer Systems 2020, 112, 524–533. [Google Scholar] [CrossRef]
- Han, B.; Yang, X.; Sun, Z.; Huang, J.; Su, J. OverWatch: A cross-plane DDoS attack defense framework with collaborative intelligence in SDN. Security and Communication Networks 2018, 1–15. [Google Scholar] [CrossRef]
- Myint Oo, M.; Kamolphiwong, S.; Kamolphiwong, T.; Vasupongayya, S. Advanced support vector machine-(ASVM-) based detection for distributed denial of service (DDoS) attack on software defined networking (SDN). Journal of Computer Networks and Communications 2019. [Google Scholar] [CrossRef]
- Ahuja, N.; Singal, G.; Mukhopadhyay, D.; Kumar, N. Automated DDOS attack detection in software defined networking. Journal of Network and Computer Applications 2021, 187, 103108. [Google Scholar] [CrossRef]




| Traffic Class | Benign | Malicious |
|---|---|---|
| ICMP | 24957 | 16364 |
| TCP | 18897 | 10539 |
| UDP | 22772 | 10816 |
| Algorithms | accuracy | precision | recall | f1-score |
| SVM | 72.00% | 0.71 | 0.83 | 0.76 |
| Random forest | 98.00% | 0.98 | 0.99 | 0.98 |
| XGBoost | 98.08% | 0.99 | 0.99 | 0.98 |
| Classifiers | MAE | MSE | R2 |
|---|---|---|---|
| Gradient Boost | 4.7917 | 4.7917 | 0.9997 |
| Classifiers | GA Generations | Best internal CV score | GA Optimization Best Accuracy Score |
| Gradient Boost | Generation 1 | Current best internal CV score: | 0.9999 |
| Generation 2 | Current best internal CV score: | 1.0 | |
| Generation 3 | Current best internal CV score: | 1.0 | |
| Generation 4 | Current best internal CV score: | 1.0 | |
| Generation 5 | Current best internal CV score: | 1.0 | |
| Best pipeline test accuracy: | 1.000 | ||
| Accuracy: | 0.9999 |
| Classifiers | Precision | Recall | F1-Score | Accuracy |
| Gradient Boost | 1.00 | 1.00 | 1.00 | 0.9999 |
| 1.00 | 1.00 | 1.00 |
| Classifiers | GA Generations | Best internal CV score | Best Pipeline Test Accuracy Score |
| RF-GA | Generation 1 | Current best internal CV score: | 0.9981 |
| Generation 2 | Current best internal CV score: | 0.9988 | |
| Generation 3 | Current best internal CV score: | 0.9983 | |
| Generation 4 | Current best internal CV score: | 0.9988 | |
| Generation 5 | Current best internal CV score: | 0.9988 | |
| Best pipeline test accuracy: | 0.9988 | ||
| Classifiers | GA Generations | Best internal CV score | Best Pipeline Test Accuracy Score |
| SVM-GA | Generation 1 | Current Pareto front scores: | 0.9739 |
| Generation 2 | Current Pareto front scores: | 0.9835 | |
| SVM-GA | Generation 3 | Current Pareto front scores: | 0.9925 |
| Generation 4 | Current Pareto front scores: | 0.9925 | |
| Generation 5 | Current Pareto front scores: | 0.9940 | |
| Best pipeline test accuracy: | 0.9960 | ||
| S. No | Authors | Testing Accuracy |
|---|---|---|
| 1 | Meti et al., 2017 [33] | 80% |
| 2 | Da Silva et al., 2016 [34] | 88.7% |
| 3 | Perez-Díaz et al. [35] | 95% |
| 4 | Ye et al., 2018 [36] | 95.24% |
| 5 | Ko et al. [37] | 96% |
| 6 | Han et al., 2018 [38] | 96% |
| 7 | MyintOo et al., 2019 [39] | 97% |
| 8 | Auhoja [40] | 98.8% |
| 9 | Proposed XGB-GA Optimization | 99.00% |
| 10 | Proposed TOPT Best pipeline test accuracy: | 1.000% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).