Submitted:
13 January 2025
Posted:
13 January 2025
You are already at the latest version
Abstract
In the current landscape of the Industrial Internet of Things (IIoT), Distributed Denial of Service (DDoS) attacks represent a significant security threat. Traditional defense mechanisms often require extensive computational and storage resources, resulting in substantial increases in operational costs. In response to this challenge, this study proposes a novel DDoS attack detection method for IIoT environments, named IIoT Attack Detection based on CNN-mLSTM-KAN (IAD-CLK). The method first employs an Adaptive Feature Selection Boosting (AFSB) technique during the data preprocessing phase to identify the most relevant features, thereby reducing the computational load on the model. Subsequently, the CNN-mLSTM-KAN model is introduced, which combines depthwise separable convolutions, an LSTM architecture enhanced by matrix operations (mLSTM), and the Kolmogorov–Arnold Network (KAN). This integration significantly improves the efficiency and accuracy of DDoS attack detection. Experimental results on the CICDDoS2019 dataset demonstrate that the model achieves an accuracy of 99.78%, while maintaining a low time cost of 0.122 ms. These findings not only highlight the model’s advantages in terms of accuracy and computational complexity but also demonstrate its ability to meet the stringent low-latency requirements of IIoT systems.
Keywords:
1. Introduction
- (1)
- We have employed a variety of machine learning techniques and analyzed feature selection methods using performance metrics such as precision, accuracy, F1 score, recall, loss, and time cost. Based on these indicators, we selected the AFSB algorithm to reduce feature dimensionality. This feature selection strategy effectively identifies significant features, thereby enhancing performance robustness.
- (2)
- We propose a hybrid approach for detecting IoT Distributed Denial-of-Service (DDoS) attacks, termed IAD-CLK, which combines convolutional neural networks (CNN) with an enhanced long short-term memory network utilizing matrix operations (mLSTM). The IAD-CLK method integrates a CNN-mLSTM-KAN model that employs a deeply separable convolutional neural network alongside mLSTM [14] and KAN [15] neural networks. This proposed model efficiently learns the intrinsic features of DDoS attacks in a computationally effective manner. Furthermore, residual connections are employed to enhance learning efficiency and mitigate the gradient vanishing problem. Experimental results demonstrate that this method achieves high detection accuracy and rapid convergence, facilitating effective detection of DDoS attacks in industrial Internet environments.
2. Related Work
2.1. ML Approaches
2.2. DL Approaches
3. Proposed Approach
3.1. Data Preprocessing
- (1)
- Data Preparation: The initial flow-based dataset contained 88 features. Several non-contributory features, such as "Unnamed," "Timestamp," "Source Port," "Source IP," "Flow ID," and "Similar HTTP," were removed. After discarding these features, 80 features remained for subsequent analysis. It was also essential to eliminate values containing NaN, infinite values, and empty entries. Given the large volume of samples in the dataset, instances with malformed values were removed.
- (2)
-
Data Normalization: Some features, including "Bwd IAT min," "Flow IAT Std," and "Flow IAT Max," "Flow Duration," displayed considerable variance between their minimum and maximum values. To address this high variability among features, data normalization was applied. This technique not only shortens the training duration but also improves the performance of the model. In this research, feature scaling was implemented using a Min-Max normalization method, which is based on the following principles:In the formula, represents the normalized numerical result, which ranges between [0, 1]. The terms ’max’ and ’min’ denote the maximum and minimum values of the represented feature, respectively. The performance of DDoS detection models declines as the dimensionality of features increases. Some features exhibit little or no relevance to the DDoS detection process. To enhance the model’s capability and reduce training time, it is essential to eliminate redundant datasets by decreasing feature dimensions. We employed the AFSB method to select the top 10 features with the highest weights. Table 1 presents the ten extracted features along with their corresponding values.
3.2. CNN-mLSTM-KAN
3.2.1. KAN
3.2.2. mLSTM
4. Experimental Findings and Interpretation
4.1. Experimental Environment
4.2. Experimental Dataset
- (1)
- Exploitation-based DDoS attacks: These attacks utilize TCP SYN flooding and UDP flooding techniques to deplete the victim’s resources and network bandwidth. In a TCP SYN flood attack, an overwhelming number of SYN packets are sent, causing the server to become unresponsive to connection requests as it is unable to process the replies. Meanwhile, a UDP flood attack sends a significant volume of UDP packets to occupy the ports of the targeted system, draining the network bandwidth and potentially resulting in a system crash.
- (2)
- Exploitation-based DDoS attacks: These attacks leverage TCP SYN flooding and UDP flooding to overload the victim’s resources and network capacity. In TCP SYN flooding, a multitude of SYN packets is transmitted, preventing the server from handling new connection requests due to its inability to respond to incoming replies. Conversely, UDP flooding attacks target the ports of the victim’s system with numerous UDP packets, which can exhaust network bandwidth and ultimately lead to a system failure.
4.3. Experimentation and Verification
4.3.1. Experimental Indicators
4.3.2. Comparative Experiment
4.3.3. Ablation Experiment
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- T. Qiu, J. Chi, X.Z.Z.N.M.A.; Wu, D.O. Edge computing in industrial Internet of Things: Architecture, advances and challenges. IEEE Commun. Surveys Tuts. 2020, 22, 2462–2488. [Google Scholar] [CrossRef]
- RAHHAL J, S.D. IOT based predictive maintenance using LSTM RNN estimator. 2020 International Conference on Electrical, Communication,and Computer Engineering 2020, pp. 1–5.
- V. Balasubramanian, M.A.; Reisslein, M. An SDN architecture for time sensitive industrial IoT. Comput. Netw. 2021, 186. [Google Scholar]
- Du, M.; Wang, K. An SDN-enabled pseudo-honeypot strategy for distributed denial of service attacks in industrial Internet of Things. IEEE Trans. Ind. Informat. 2019, 16, 648–657. [Google Scholar] [CrossRef]
- Romero-Gázquez, J.L.; Bueno-Delgado, M. Software architecture solution based on SDN for an industrial IoT scenario. Wireless Commun. Mobile Comput. 2018, 208. [Google Scholar] [CrossRef]
- A.A. Pranata, T.S.J.; Kim, D.S. Overhead reduction scheme for SDN-based data center networks. Comput. Stand. Interfaces 2019, 63, 1–15. [Google Scholar] [CrossRef]
- W. Mao, Z. Zhao, Z.C.G.M.; Gao, W. Energy-efficient industrial Internet of Things: Overview and open issues. IEEE Trans.Ind. Informat. 2021, 17, 7225–7237. [Google Scholar] [CrossRef]
- D. Mourtzis, K.A.; Zogopoulos, V. Mapping vulnerabilities in the industrial Internet of Things landscape. Procedia CIRP 2019, 84, 265–270. [Google Scholar] [CrossRef]
- G. C. Amaizu, C. I. Nwakanma, S.B.J.M.L.; Kim, D.S. Composite and efficient DDoS attack detection framework for B5Gnetworks. Comput. Netw. 2021, 188. [Google Scholar]
- N. M. Yungaicela-Naula, C.V.R.; Perez-Diaz, J.A. SDN-based architecture for transport and application layer DDoS attack detection by using machine and deep learning. IEEE Access 2021, 9, 108495–108512. [Google Scholar] [CrossRef]
- X. Jing, Z.Y.; Pedrycz, W. Security data collection and data analytics in the Internet: A survey. IEEE Commun. Surveys Tuts. 2018, 21, 586–618. [Google Scholar]
- R. Doriguzzi-Corin, S. Millar, S.S.H.J.M.d.R.a.D.S. LUCID: A practical, lightweight deep learning solution for DDoS attack detection. IEEE Trans. Netw. Service Manag. 2020, 17, 876–889. [Google Scholar] [CrossRef]
- Y. Wei, J. Jang-Jaccard, F.S.A.S.W.X.; Camtepe, S. AE-MLP: A hybrid deep learning approach for DDoS detection and classification. IEEE Access 2021, 9, 146810–146821. [Google Scholar] [CrossRef]
- Ziming Liu, Yixuan Wang, S.V. KAN: Kolmogorov–Arnold Networks. arXiv 2024.
- Maximilian Beck, Korbinian Pöppel, M.S.A.A. xLSTM: Extended Long Short-Term Memory. arXiv 2024, 2405.04517v1.
- R. Akter, V.-S. Doan, T.H.T.; Kim, D.S. RFDOA-net: Anefficient ConvNet for RF-based DOA estimation in UAV surveillance systems. IEEE Trans. Veh. Technol. 2021, 70, 12209–12214. [Google Scholar] [CrossRef]
- Alzahrani, R.J.; Alzahrani, A. Survey of traffic classification solution in IoT networks. Int. J. Comput. Appl. 2021, 183, 37–45. [Google Scholar] [CrossRef]
- L. A. C. Ahakonye, C. I. Nwakanma, J.M.L.; Kim, D.S. Efficient classification of enciphered SCADA network traffic in smart factory using decision tree algorithm. IEEE Access 2021, 9, 154892–154901. [Google Scholar] [CrossRef]
- A. O. Sangodoyin, M. O. Akinsolu, P.P.; Grout, V. Detection and classification of DDoS flooding attacks on software-defined networks: A case study for the application of machine learning. IEEEAccess 2021, 9, 122495–122508. [Google Scholar]
- Ullah, I.; Mahmoud, Q.H. Design and development of a deeplearning-based model for anomaly detection in IoT networks. IEEE Access 2021, 9, 103906–103926. [Google Scholar] [CrossRef]
- et al., S.H H. A deep CNN ensemble framework for efficient DDoS attack detection in software defined network. IEEE Access 2020, 8, 53972–53983. [Google Scholar] [CrossRef]
- D. Alghazzawi, O. Bamasaq, H.U.; Asghar, M.Z. Efficient detection of DDoS attacks using a hybrid deep learning model with improved feature selection. Appl. Sci. 2021, 11. [Google Scholar] [CrossRef]
- M. Lopez-Martin, A. Sanchez-Esguevillas, J.I.A.; Carro, B. Network intrusion detection based on extended RBF neural network with offline reinforcement learning. IEEE Access 2021, 9, 153153–153170. [Google Scholar] [CrossRef]
- S. K. Sahu, D. P. Mohapatra, J.K.R.K.S.S.Q.V.P.a.N.N.D. A LSTM-FCNN based multi-class intrusion detection using scalable framework. Comput. Elect. Eng. 2022, 99. [Google Scholar]
- A. R. Shaaban, E.A.E.; Hussein, M. DDoS attack detection and classification via convolutional neural network (CNN). in Proc.9th Int. Conf. Intell. Comput. Inf. Syst. (ICICIS) 2019, p. 233–238.
- L. Karanam, K.K.P.; Aldmour, R. Intrusion detection mechanism for large scale networks using CNN-LSTM. in Proc. 13th Int. Conf. Develop. Syst. Eng. (DeSE) 2020, p. 323–328.
- I. Sharafaldin, A. H. Lashkari, S.H.; Ghorbani, A.A. Developing realistic distributed denial of service (DDoS) attack dataset and taxonomy. in Proc. Int. Carnahan Conf. Security Technol. (ICCST) 2019, p. 1–8.
- A. H. Lashkari, G. Draper-Gil, M.S.I.M.; Ghorbani, A.A. Characterization of Tor traffic using time based features. in Proc.ICISSP 2017, p. 253–262.
- A. Khraisat, I. Gondal, P.V.; Kamruzzaman, J. Survey of intrusion detection systems: Techniques, datasets and challenges. Cybersecurity 2019, 2, 1–22. [Google Scholar]
- M. S. Elsayed, N.A.L.K.; Jurcut, A.D. InSDN: A novel SDN intrusion dataset. IEEE Access 2020, 8, 165263–165284. [Google Scholar] [CrossRef]
- Sarica, A.K.; Angin, P. A novel SDN dataset for intrusion detection in IoT networks. in Proc. 16th Int. Conf. Netw. Serv. Manag. (CNSM) 2020, pp. 1–5.
- A. Aldweesh, A.D.; Emam, A.Z. Deep learning approaches for anomaly-based intrusion detection systems: A survey, taxonomy, and open issues. Knowl. Based Syst. 2020, 189. [Google Scholar]
- Y. N. Kunang, S. Nurmaini, D.S.; Suprapto, B.Y. Attack classification of an intrusion detection system using deep learning and hyperparameter optimization. J. Inf. Security Appl. 2021, 58. [Google Scholar]
- S. Rajagopal, P.P.K.; Hareesha, K.S. Towards effective network intrusion detection: From concept to creation on azure cloud. IEEE Access 2021, 9, 19723–19742. [Google Scholar] [CrossRef]



| Feature | Sample |
|---|---|
| Time | 0.092 |
| PKT Len | 151 |
| IP Flags | 0X4000 |
| Highest Layer | 99602525 |
| Protocols | 0011010001006 |
| TCP ACK | 336 |
| TCP Len | 85 |
| TCP Window Size | 144 |
| TCP Flags | 0X018 |
| UDP Len | 0 |
| Component | Output Size | Detail |
|---|---|---|
| Input | [batchsize,10,10] | |
| Conv | [batchsize,10,10] | 10,3,ReLU,padding |
| Batch Normalization | [batchsize,10,10] | |
| Conv | [batchsize,8,10] | 8,1,ReLU,padding |
| Batch Normalization | [batchsize,8,10] | |
| MaxPool | [batchsize,8,5] | Pooling size = 2 |
| Conv | [batchsize,8,5] | 8,3,ReLU,padding |
| Conv | [batchsize,16,5] | 16,1,ReLU,padding |
| MaxPool | [batchsize,16,2] | Pooling size = 2 |
| Conv | [batchsize,16,5] | 16,1,ReLU,padding |
| MaxPool | [batchsize,16,2] | Pooling size = 2 |
| Addition | [batchsize,16,2] | |
| AvgPool | [batchsize,8,2] | Pooling size = 2 |
| mLSTM | [batchsize,16,2] | |
| KAN | [batchsize,2] |
| Param name | Param value |
|---|---|
| Selected Features | 10 |
| Total Features | 80 |
| Learning rate | 0.001 |
| Optimizer | Adam |
| Loss Function | Cross-entropy loss |
| Batch Size | 32 |
| Epoch | 50 |
| Activation Function | ReLU |
| FS | Number | A | P | R | F1 | Loss | ROC Score |
|---|---|---|---|---|---|---|---|
| ExtraTress | 10 | 99.52% | 99.51% | 99.49% | 99.48% | 0.025 | 99.50% |
| RandomForest | 10 | 97.24% | 97.21% | 97.20% | 97.19% | 0.088 | 97.08% |
| XGBoost] | 10 | 99.53% | 99.57% | 99.58% | 99.58% | 0.013 | 99.62% |
| ANOVA | 10 | 99.43% | 99.39% | 99.37% | 99.37% | 0.028 | 99.41% |
| LightGBM | 10 | 99.68% | 99.56% | 99.57% | 99.51% | 0.014 | 99.62% |
| AFSB | 10 | 99.78% | 99.66% | 99.65% | 99.65% | 0.012 | 99.75% |
| Performance | Number of feature | ||||
|---|---|---|---|---|---|
| f=10 | f=15 | f=20 | f=25 | f=30 | |
| A | 99.78% | 99.73% | 99.74% | 99.69% | 99.80% |
| P | 99.66% | 99.63% | 99.64% | 99.65% | 99.73% |
| R | 99.66% | 99.63% | 99.64% | 99.65% | 99.73% |
| F1 | 99.66% | 99.63% | 99.64% | 99.65% | 99.73% |
| Loss | 0.012 | 0.011 | 0.010 | 0.006 | 0.008 |
| Time-cost(ms) | 0.122 | 0.124 | 0.251 | 0.126 | 0.224 |
| Techniques | A | Time-Cost | Param | MFLOPS |
|---|---|---|---|---|
| Extend Decision Tree [34] | 97.70% | - | - | - |
| CART [19] | 98.90% | 8.40ms | - | - |
| CNN [10] | 99.63% | 0.105ms | 42.33K | 0.829 |
| Deep CNN [25] | 99.64% | 0.160ms | 32.53K | 0.759 |
| GRU [10] | 99.62% | 0.374ms | 114.49K | 5.26 |
| LSTM [10] | 99.25% | 0.255ms | 51.72K | 2.33 |
| Existing CNN-LSTM [26] | 99.65% | 0.188ms | 27.81K | 0.521 |
| CNN-mLSTM-KAN | 99.78% | 0.122ms | 17.83K | 0.085 |
| Techniques | A | Time-Cost | Parameter | MFLOPS |
|---|---|---|---|---|
| CNN [10] | 99.63% | 0.105ms | 42.33K | 0.829 |
| CNN-mLSTM | 99.67% | 0.160ms | 22.52K | 4.58 |
| CNN-KAN | 99.64% | 0.374ms | 21.50K | 4.24 |
| CNN-mLSTM-KAN | 99.78% | 0.122ms | 17.83K | 0.085 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).