Submitted:
29 November 2024
Posted:
29 November 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Need for Detecting Banking Malware
- The need to update signature-based malware systems.
- The inability of these systems to detect newer malware variants.
- Unable to detect malware that uses sophisticated obfuscation techniques.
- The inability to detect zero-day malware.
- From all the ML algorithms being analyzed, identify which one performs the best.
- Establish whether the features used to detect the Zeus banking malware can also be used to detect the other banking malware variants.
- Determine a minimum set of features that could be used for detecting Zeus.
- Determine a minimum set of features that could be used for detecting the other variants of the Zeus malware.
- Compare the performance results of all the ML algorithms.
1.2. Overview of the Zeus Banking Malware
1.3. Overview of the Zeus Panda Banking Malware
1.4. Overview of the Ramnit Banking Malware
1.5. Proposed Banking Malware Tree
- Most of the banking malware variants belong to one of the three families identified in Figure 3.
- All the banking malware variants have borrowed code from each other.
- Most new banking malware variants still share code and perform similar actions to those shown in Figure 3.
- Banking malware variants continue to evolve and are becoming more effective at targeting victims.
2. Related Studies
3. Problem Statement
4. Research Methodology
4.1. Machine Learning Algorithms
- Binary Classification – Two possible classifications can be predicted for example, an email can either be spam or not spam. The two possible classes are usually either normal or abnormal.
- Multi-Class classification - Multiple classes are involved and each data point is classified into one of the available class options.
- Multi-Label classification - Multiple classes can be predicted for each data point. For example, a house could be present in multiple photos.
4.2. System Architecture and Methodology
- The datasets are identified and collected.
- Features are extracted from these datasets.
- The extracted features are transferred to a csv file and prepared.
- The features are selected for training and testing.
- The algorithm is trained and tested, and a model is created. Only one dataset is used for the training.
- The model is tuned and trained and tested again if required.
- The model is used to test and evaluate the remaining datasets.
- Deploy the final model and test all the data samples and create a report highlighting the evaluation metrics.
4.3. Data Samples
4.4. Feature Selection
- Filter method - Feature selection is independent of the ML algorithm.
- Wrapper method - Features are selectively used to train the ML algorithm and through continual experimental analysis, the best features are selected for the final model. This method can be very time-consuming.
- Hybrid – Which is a fusion of the filter and wrapper approaches.
- mean_fpktl.
- min_fpktl.
- min_bpktl.
- min_fiat.
- mean_fiat.
- mean_biat.
- min_biat.
- sflow_fpackets.
- sflow_fbytes.
- Duration.
4.5. Evaluation Approach of the Experimental Analysis
5. Results
5.1. Training and Testing the Decision Tree Machine Learning Algorithms
5.2. Training and Testing the Random Forest (RF) Machine Learning Algorithm
5.3. Training and Testing the K-Neaest Neigbor (KNN) Machine Learning Algorithm
5.4. Training and Testing Using the Ensemble Machine Learning Approach
5.5. Comparing the Predication Results of All the Algorithms Tested
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Wadhwa, Amit, and Neerja Arora. "A Review on Cyber Crime: Major Threats and Solutions." International Journal of Advanced Research in Computer Science 8, no. 5 (2017).
- Morgan, Steve. 2022. “Cybercrime to Cost the World 8 Trillion Annually in 2023.” Cybercrime Magazine. , 2022. https://cybersecurityventures.com/cybercrime-to-cost-the-world-8-trillion-annually-in-2023/. 17 October.
- “Banking Malware Threats Surging as Mobile Banking Increases – Nokia Threat Intelligence Report.” n.d. Nokia. https://www.nokia.com/about-us/news/releases/2021/11/08/banking-malware-threats-surging-as-mobile-banking-increases-nokia-threat-intelligence-report/.
- Kuraku, Sivaraju, and Dinesh Kalla. "Emotet malware—a banking credentials stealer." Iosr J. Comput. Eng 22 (2020): 31-41.
- Etaher, Najla, George RS Weir, and Mamoun Alazab. "From zeus to zitmo: Trends in banking malware." In 2015 IEEE Trustcom/BigDataSE/ISPA, vol. 1, pp. 1386-1391. IEEE, 2015.
- “Godfather Banking Trojan Spawns 1.2K Samples across 57 Countries.” 2024. Darkreading.com. 2024. https://www.darkreading.com/endpoint-security/godfather-banking-trojan-spawns-1k-samples-57-countries.
- Pilania, Suruchi, and Rakesh Singh Kunwar. "Zeus: In-Depth Malware Analysis of Banking Trojan Malware." In Advanced Techniques and Applications of Cybersecurity and Forensics, pp. 167-195. Chapman and Hall/CRC.
- Kazi, Mohamed Ali, Steve Woodhead, and Diane Gan. "An investigation to detect banking malware network communication traffic using machine learning techniques." Journal of Cybersecurity and Privacy 3, no. 1 (2022): 1-23.
- Owen, Harry, Javad Zarrin, and Shahrzad M. Pour. "A survey on botnets, issues, threats, methods, detection and prevention." Journal of Cybersecurity and Privacy 2, no. 1 (2022): 74-88.
- Boukherouaa, El Bachir, Mr Ghiath Shabsigh, Khaled AlAjmi, Jose Deodoro, Aquiles Farias, Ebru S. Iskender, Mr Alin T. Mirestean, and Rangachary Ravikumar. Powering the digital economy: Opportunities and risks of artificial intelligence in finance. International Monetary Fund, 2021.
- AMR. 2022. “IT Threat Evolution in Q3 2022. Non-Mobile Statistics.” Securelist.com. Kaspersky. , 2022. https://securelist.com/it-threat-evolution-in-q3-2022-non-mobile-statistics/107963/. 18 November.
- Kazi, Mohamed Ali, Steve Woodhead, and Diane Gan. "Comparing the performance of supervised machine learning algorithms when used with a manual feature selection process to detect Zeus malware." International Journal of Grid and Utility Computing 13, no. 5 (2022): 495-504.
- Punyasiri, D. L. S. "Signature & Behavior Based Malware Detection." (2023).
- Gopinath, Mohana, and Sibi Chakkaravarthy Sethuraman. "A comprehensive survey on deep learning based malware detection techniques." Computer Science Review 47 (2023): 100529.
- Kazi, M.; Woodhead, S.; Gan, D. A contempory Taxonomy of Banking Malware. In Proceedings of the First International Conference on Secure Cyber Computing and Communications, Jalandhar, India, 15–17 December 2018. [Google Scholar]
- Falliere, N.; Chien, E. Zeus: King of the Bots. 2009. Available online: https://www.google.co.uk/url?sa=t&source=web&rct=j&opi=89978449&url=https://pure.port.ac.uk/ws/portalfiles/portal/42722286/Understanding_and_Mitigating_Banking_Trojans.pdf&ved=2ahUKEwizroXLwZqJAxU-VUEAHdgzKqEQFnoECDMQAQ&usg=AOvVaw1St11bbRwbhYj9IB4VdQv4 (accessed on 19 October 2024).
- Lelli, A. Zeusbot/Spyeye P2P Updated, Fortifying the Botnet. Available online: https://www.symantec.com/connect/blogs/zeusbotspyeye-p2p-updated-fortifying-botnet (accessed on 5 November 2019).
- Cluley, Graham. “GameOver Zeus Malware Returns from the Dead.” Graham Cluley, , 2014. https://grahamcluley.com/gameover-zeus-malware/. 14 July.
- Niu, Zequn, Jingfeng Xue, Dacheng Qu, Yong Wang, Jun Zheng, and Hongfei Zhu. "A novel approach based on adaptive online analysis of encrypted traffic for identifying Malware in IIoT." Information Sciences 601 (2022): 162-174.
- Ebach, Luca. "Analysis Results of Zeus. Variant. Panda." G DATA Advanced Analytics (2017).
- Lamb, Christopher. Advanced Malware and Nuclear Power: Past Present and Future. No. SAND2019-14527C. Sandia National Lab.(SNL-NM), Albuquerque, NM (United States), 2019.
- De Carli, Lorenzo, Ruben Torres, Gaspar Modelo-Howard, Alok Tongaonkar, and Somesh Jha. "Botnet protocol inference in the presence of encrypted traffic." In IEEE INFOCOM 2017-IEEE Conference on Computer Communications, pp. 1-9. IEEE, 2017.
- Lioy, Antonio, Andrea Atzeni, and Francesco Romano. "Machine Learning for malware characterization and identification." (2023).
- Black, Paul, Iqbal Gondal, and Robert Layton. “A Survey of Similarities in Banking Malware Behaviours.” Computers & Security 77 (August 2018): 756–72. [CrossRef]
- Pilania, Suruchi, and Rakesh Singh Kunwar. "Zeus: In-Depth Malware Analysis of Banking Trojan Malware." In Advanced Techniques and Applications of Cybersecurity and Forensics, pp. 167-195. Chapman and Hall/CRC.
- CLULEY, Graham. “Russian Creator of NeverQuest Banking Trojan Pleads Guilty in American Court.” Hot for Security, 2019. https://www.bitdefender.com/en-us/blog/hotforsecurity/russian-creator-of-neverquest-banking-trojan-pleads-guilty-in-american-court/.
- Fisher, Dennis. “Cridex Malware Takes Lesson from GameOver Zeus.” Threatpost.com. Threatpost, , 2014. https://threatpost.com/cridex-malware-takes-lesson-from-gameover-zeus/107785/. 15 August.
- Ionut Ilascu. “Softpedia.” softpedia, , 2014. https://news.softpedia.com/news/Cridex-Banking-Malware-Variant-Uses-Gameover-Zeus-Thieving-Technique-455193.shtml. 16 August.
- Andriesse, Dennis, Christian Rossow, Brett Stone-Gross, Daniel Plohmann, and Herbert Bos. "Highly resilient peer-to-peer botnets are here: An analysis of gameover zeus." In 2013 8th International Conference on Malicious and Unwanted Software:" The Americas"(MALWARE), pp. 116-123. IEEE, 2013.
- Sarojini, S. , and S. Asha. "Botnet detection on the analysis of Zeus panda financial botnet." Int. J. Eng. Adv. Technol 8 (2019): 1972-1976.
- Aboaoja, Faitouri A., Anazida Zainal, Fuad A. Ghaleb, Bander Ali Saleh Al-Rimy, Taiseer Abdalla Elfadil Eisa, and Asma Abbas Hassan Elnour. "Malware detection issues, challenges, and future directions: A survey." Applied Sciences 12, no. 17 (2022): 8482.
- Chen, Ruidong, Weina Niu, Xiaosong Zhang, Zhongliu Zhuo, and Fengmao Lv. "An effective conversation-based botnet detection method." Mathematical Problems in Engineering 2017, no. 1 (2017): 4934082.
- Jha, Jayshree, and Leena Ragha. "Intrusion detection system using support vector machine." International Journal of Applied Information Systems (IJAIS) 3 (2013): 25-30.
- Singla, Sanjam, Ekta Gandotra, Divya Bansal, and Sanjeev Sofat. "A novel approach to malware detection using static classification." International Journal of Computer Science and Information Security 13, no. 3 (2015): 1-5.
- Wu, Wei, Jaime Alvarez, Chengcheng Liu, and Hung-Min Sun. "Bot detection using unsupervised machine learning." Microsystem Technologies 24 (2018): 209-217.
- Yahyazadeh, Mosa, and Mahdi Abadi. "BotOnus: An Online Unsupervised Method for Botnet Detection." ISeCure 4, no. 1 (2012).
- Soniya, B. , and M. Wilscy. "Detection of randomized bot command and control traffic on an end-point host." Alexandria Engineering Journal 55, no. 3 (2016): 2771-2781.
- Ghafir, I.; Prenosil, V.; Hammoudeh, M.; Baker, T.; Jabbar, S.; Khalid, S.; Jaf, S. BotDet: A System for Real Time Botnet Command and Control Traffic Detection. IEEE Access 2018, 6, 38947–38958. [Google Scholar] [CrossRef]
- P. Agarwal.; S. Satapathy, Implementation of signature-based detection system using snort in windows, International Journal of Computer Applications & Information Technology 2014, vol. 3, no. 3.
- He, S.; Zhu, J.; He, P.; Lyu, M.R. Experience report: System log analysis for anomaly detection. In Proceedings of the 2016 IEEE 27th international symposium on software reliability engineering (ISSRE), Ottawa, ON, Canada, 23–27 October 2016; pp. 207–218. [Google Scholar]
- Zhou, J.; Qian, Y.; Zou, Q.; Liu, P.; Xiang, J. DeepSyslog: Deep Anomaly Detection on Syslog Using Sentence Embedding and Metadata. IEEE Transactions on Information Forensics and Security 2022, 17, 3051–3061. [Google Scholar] [CrossRef]
- Khraisat, A.; Gondal, I.; Vamplew, P.; Kamruzzaman, J. Survey of intrusion detection systems: Techniques, datasets and challenges. Cybersecurity 2019, 2, 1–22. [Google Scholar] [CrossRef]
- Sharma, P.; Said, Z.; Memon, S.; Elavarasan, R.M.; Khalid, M.; Nguyen, X.P.; Arıcı, M.; Hoang, A.T.; Nguyen, L.H. Comparative evaluation of AI-based intelligent GEP and ANFIS models in prediction of thermophysical properties of Fe3O4-coated MWCNT hybrid nanofluids for potential application in energy systems. Int. J. Energy Res. 2022, 1–16. [Google Scholar]
- Ahsan, M.; Nygard, K.E.; Gomes, R.; Chowdhury, M.M.; Rifat, N.; Connolly, J.F. Cybersecurity Threats and Their Mitigation Approaches Using Machine Learning—A Review. J. Cybersecur. Priv. 2022, 2, 527–555. [Google Scholar] [CrossRef]
- Choi, Rene Y., Aaron S. Coyner, Jayashree Kalpathy-Cramer, Michael F. Chiang, and J. Peter Campbell. "Introduction to machine learning, neural networks, and deep learning." Translational vision science & technology 9, no. 2 (2020): 14-14.
- Kumar, Ajitesh. “Difference: Binary, Multiclass & Multi-Label Classification.” Data Analytics, , 2022. https://vitalflux.com/difference-binary-multi-class-multi-label-classification/?utm_content=cmp-true. 16 May.
- Elmachtoub, Adam N., Jason Cheuk Nam Liang, and Ryan McNellis. "Decision trees for decision-making under the predict-then-optimize framework." In International conference on machine learning, pp. 2858-2867. PMLR, 2020.
- Liberman, Neil. “Decision Trees and Random Forests.” Towards Data Science. Towards Data Science, , 2017. https://towardsdatascience.com/decision-trees-and-random-forests-df0c3123f991. 27 January.
- Oshiro, Thais Mayumi, Pedro Santoro Perez, and José Augusto Baranauskas. "How many trees in a random forest?." In Machine Learning and Data Mining in Pattern Recognition: 8th International Conference, MLDM 2012, Berlin, Germany, -20, 2012. Proceedings 8, pp. 154-168. Springer Berlin Heidelberg, 2012. 13 July.
- Kazi, Mohamed Ali, Steve Woodhead, and Diane Gan. "Detecting Zeus Malware Network Traffic Using the Random Forest Algorithm with Both a Manual and Automated Feature Selection Process." In IOT with Smart Systems: Proceedings of ICTIS 2022, Volume 2, pp. 547-557. Singapore: Springer Nature Singapore, 2022.
- Suyal, Manish, and Parul Goyal. "A review on analysis of k-nearest neighbor classification machine learning algorithms based on supervised learning." International Journal of Engineering Trends and Technology 70, no. 7 (2022): 43-48.
- Aggarwal, Charu C., and Charu C. Aggarwal. Data classification. Springer International Publishing, 2015.
- Chung, Jetli, and Jason Teo. "Single classifier vs. ensemble machine learning approaches for mental health prediction." Brain informatics 10, no. 1 (2023): 1.
- Salur, Mehmet Umut, and İlhan Aydın. "A soft voting ensemble learning-based approach for multimodal sentiment analysis." Neural Computing and Applications 34, no. 21 (2022): 18391-18406.
- Jabbar, Hanan Ghali. "Advanced Threat Detection Using Soft and Hard Voting Techniques in Ensemble Learning." Journal of Robotics and Control (JRC) 5, no. 4 (2024): 1104-1116.
- Shomiron. zeustracker. Available online: https://github.com/dnif-archive/enrich-zeustracker (accessed on 25 July 2022).
- Stratosphere. Stratosphere Laboratory Datasets. Available online: https://www.stratosphereips.org/datasets-overview Retrieved (accessed on 20 September 2024).
- Abuse.ch. Fighting malware and botnets. Available online: https://abuse.ch/ (accessed on 13 May 2022).
- Haddadi, F.; Zincir-Heywood, A.N. Benchmarking the effect of flow exporters and protocol filters on botnet traffic classification. IEEE Syst. J. 2014, 10, 1390–1401. [Google Scholar] [CrossRef]
- Kasongo, Sydney Mambwe, and Yanxia Sun. "A deep learning method with filter based feature engineering for wireless intrusion detection system." IEEE access 7 (2019): 38597-38607.
- Miller, Shane, Kevin Curran, and Tom Lunney. "Multilayer perceptron neural network for detection of encrypted VPN network traffic." In 2018 International conference on cyber situational awareness, data analytics and assessment (Cyber SA), pp. 1-8. IEEE, 2018.
- Kazi, Mohamed Ali, Steve Woodhead, and Diane Gan. 2023. "An Investigation to Detect Banking Malware Network Communication Traffic Using Machine Learning Techniques" Journal of Cybersecurity and Privacy 3, no. 1: 1-23.
- Nasiri, Hamid, and Seyed Ali Alavi. "A Novel Framework Based on Deep Learning and ANOVA Feature Selection Method for Diagnosis of COVID-19 Cases from Chest X-Ray Images." Computational intelligence and neuroscience 2022, no. 1 (2022): 4694567.
- Alshanbari, Huda M., Tahir Mehmood, Waqas Sami, Wael Alturaiki, Mauawia A. Hamza, and Bandar Alosaimi. "Prediction and classification of COVID-19 admissions to intensive care units (ICU) using weighted radial kernel SVM coupled with recursive feature elimination (RFE)." Life 12, no. 7 (2022): 1100.
- Kavya D, “Optimizing Performance: SelectKBest for Efficient Feature Selection in Machine Learning,” Medium, , 2023, https://medium.com/@Kavya2099/optimizing-performance-selectkbest-for-efficient-feature-selection-in-machine-learning-3b635905ed48. 16 February.
- Luan, Hui, and Chin-Chung Tsai. "A review of using machine learning approaches for precision education." Educational Technology & Society 24, no. 1 (2021): 250-266.
- Davis, Jesse, and Mark Goadrich. "The relationship between Precision-Recall and ROC curves." In Proceedings of the 23rd international conference on Machine learning, pp. 233-240. 2006.
- Fourure, Damien, Muhammad Usama Javaid, Nicolas Posocco, and Simon Tihon. "Anomaly detection: How to artificially increase your f1-score with a biased evaluation protocol." In Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 3-18. Cham: Springer International Publishing, 2021.
- Visa, Sofia, Brian Ramsay, Anca L. Ralescu, and Esther Van Der Knaap. "Confusion matrix-based feature selection." Maics 710, no. 1 (2011): 120-127.













| Classifier | Classification Results FP |
FN | Accuracy |
|---|---|---|---|
| Kstar | 0.275 | 0.026 | 88.69 |
| J48 | 0.156 | 0.026 | 92.84 |
| DT | 0.14 | 0.031 | 97.47 |
| FP | TN | FP | FN | Accuracy | |
|---|---|---|---|---|---|
| Zeus 1 | 14,678 | 4352 | 969 | 1 | 0.9515 |
| Zeus 2 | 14,663 | 4341 | 991 | 5 | 0.9502 |
| Waledac 1 | 14,536 | 4500 | 963 | 1 | 0.9518 |
| Waledac 2 | 14,521 | 4525 | 963 | 1 | 0.9523 |
| Storm 1 | 10,139 | 4499 | 501 | 1386 | 0.8858 |
| Storm 2 | 2300 | 503 | 247 | 3 | 0.9181 |
| Botnet | Average Detection Rate | Average False Alarm Rate |
|---|---|---|
| HTTP-based | 0.95 | 0.041 |
| IRC-based | 0.96 | 0.033 |
| P2P-based | 0.91 | 0.037 |
| Dataset type | Malware Name/Year | Number of flows | Name of dataset for this paper |
|---|---|---|---|
| Malware Benign |
Zeus/2019 | 66009 | Dataset1 |
| N/A | 66009 | ||
| Malware Benign |
Zeus/2019 | 38282 | Dataset2 |
| N/A | 38282 | ||
| Malware Benign |
Zeus/2022 | 272425 | Dataset3 |
| N/A | 272425 | ||
| Malware Benign |
ZeusPanda/2022 | 11864 | Dataset4 |
| N/A | 11864 | ||
| Malware Benign |
Ramnit/2022 | 10204 | Dataset5 |
| N/A | 10204 | ||
| Malware Benign |
Dridex/2018 | 134998 | Dataset6 |
| N/A | 134998 |
| Predicted Benign | Predicted Zeus | |
|---|---|---|
| Actual Benign (Total) | TN | FP |
| Actual Zeus (Total) | FN | TP |
| Dataset Name | Malware Precision Score |
Malware Recall Score |
Malware f1-score |
Benign Precision Score |
Benign Recall Score |
Benign f1-score |
|---|---|---|---|---|---|---|
| Dataset1 | 1.00 | 0.95 | 0.97 | 0.95 | 1.00 | 0.97 |
| Dataset2 | 1.00 | 0.95 | 0.97 | 0.96 | 1.00 | 0.98 |
| Dataset3 | 1.00 | 0.99 | 0.99 | 0.99 | 1.00 | 0.99 |
| Dataset4 | 1.00 | 0.99 | 0.99 | 0.99 | 1.00 | 0.99 |
| Dataset5 | 0.87 | 0.97 | 0.92 | 0.97 | 0.86 | 0.91 |
| Dataset6 | 0.78 | 0.66 | 0.71 | 0.70 | 0.82 | 0.76 |
| Dataset Name | Malware Total Samples Tested |
Malware Samples Classified Correctly |
Malware Samples Classified Incorrectly |
Total Benign Samples Tested |
Benign Samples Classified Correctly |
Benign Samples Classified Incorrectly |
|---|---|---|---|---|---|---|
| Dataset1 | 66009 | 62906 | 3103 | 66009 | 65722 | 287 |
| Dataset2 | 38282 | 36519 | 1763 | 38282 | 38152 | 130 |
| Dataset3 | 272425 | 270328 | 2097 | 272425 | 271439 | 986 |
| Dataset4 | 11864 | 11728 | 136 | 11864 | 11820 | 44 |
| Dataset5 | 10204 | 9941 | 263 | 10204 | 8759 | 1445 |
| Dataset6 | 134998 | 88500 | 46498 | 134998 | 110167 | 24831 |
| Dataset Name | Malware Precision Score |
Malware Recall Score |
Malware f1-score |
Benign Precision Score |
Benign Recall Score |
Benign f1-score |
|---|---|---|---|---|---|---|
| Dataset1 | 1.00 | 0.95 | 0.97 | 0.95 | 1.00 | 0.97 |
| Dataset2 | 1.00 | 0.95 | 0.97 | 0.96 | 1.00 | 0.98 |
| Dataset3 | 1.00 | 0.99 | 0.99 | 0.99 | 1.00 | 0.99 |
| Dataset4 | 1.00 | 0.99 | 0.99 | 0.99 | 1.00 | 0.99 |
| Dataset5 | 0.87 | 0.97 | 0.92 | 0.97 | 0.86 | 0.91 |
| Dataset6 | 0.78 | 0.66 | 0.71 | 0.70 | 0.82 | 0.76 |
| Dataset Name | Total Malware Samples Tested |
Malware Samples Classified Correctly |
Malware Samples Classified Incorrectly |
Total Benign Samples Tested |
Benign Samples Classified Correctly |
Benign Samples Classified Incorrectly |
|---|---|---|---|---|---|---|
| Dataset1 | 66009 | 65051 | 958 | 66009 | 66003 | 6 |
| Dataset2 | 38282 | 37737 | 545 | 38282 | 38278 | 4 |
| Dataset3 | 272425 | 272276 | 149 | 272425 | 272401 | 24 |
| Dataset4 | 11864 | 11758 | 106 | 11864 | 11863 | 1 |
| Dataset5 | 10204 | 9990 | 214 | 10204 | 8852 | 1352 |
| Dataset6 | 134998 | 88586 | 46412 | 134998 | 111428 | 23570 |
| Dataset Name | Malware Precision Score |
Malware Recall Score |
Malware f1-score |
Benign Precision Score |
Benign Recall Score |
Benign f1-score |
|---|---|---|---|---|---|---|
| Dataset1 | 1.00 | 0.90 | 0.95 | 0.91 | 1.00 | 0.95 |
| Dataset2 | 1.00 | 0.91 | 0.95 | 0.91 | 1.00 | 0.95 |
| Dataset3 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| Dataset4 | 1.00 | 0.99 | 0.99 | 0.99 | 1.00 | 0.99 |
| Dataset5 | 0.92 | 0.97 | 0.95 | 0.97 | 0.92 | 0.95 |
| Dataset6 | 0.85 | 0.50 | 0.63 | 0.65 | 0.91 | 0.76 |
| Dataset Name | Total Malware Samples Tested |
Malware Samples Classified Correctly |
Malware Samples Classified Incorrectly |
Total Benign Samples Tested |
Benign Samples Classified Correctly |
Benign Samples Classified Incorrectly |
|---|---|---|---|---|---|---|
| Dataset1 | 66009 | 59476 | 6533 | 66009 | 66003 | 6 |
| Dataset2 | 38282 | 34659 | 3623 | 38282 | 38278 | 4 |
| Dataset3 | 272425 | 272423 | 2 | 272425 | 272401 | 24 |
| Dataset4 | 11864 | 11719 | 145 | 11864 | 11863 | 1 |
| Dataset5 | 10204 | 9939 | 265 | 10204 | 9397 | 807 |
| Dataset6 | 134998 | 68156 | 66842 | 134998 | 123232 | 11766 |
| Dataset Name | Malware Precision Score |
Malware Recall Score |
Malware f1-score |
Benign Precision Score |
Benign Recall Score |
Benign f1-score |
|---|---|---|---|---|---|---|
| Dataset1 | 1.00 | 0.95 | 0.97 | 0.95 | 1.00 | 0.97 |
| Dataset2 | 1.00 | 0.95 | 0.97 | 0.96 | 1.00 | 0.98 |
| Dataset3 | 1.00 | 0.99 | 0.99 | 0.99 | 1.00 | 0.99 |
| Dataset4 | 1.00 | 0.99 | 0.99 | 0.99 | 1.00 | 0.99 |
| Dataset5 | 0.87 | 0.97 | 0.92 | 0.97 | 0.86 | 0.91 |
| Dataset6 | 0.78 | 0.66 | 0.71 | 0.70 | 0.82 | 0.76 |
| Dataset Name | Total Malware Samples Tested |
Malware Samples Classified Correctly |
Malware Samples Classified Incorrectly |
Total Benign Samples Tested |
Benign Samples Classified Correctly |
Benign Samples Classified Incorrectly |
|---|---|---|---|---|---|---|
| Dataset1 | 66009 | 65051 | 958 | 66009 | 66003 | 6 |
| Dataset2 | 38282 | 37737 | 545 | 38282 | 38278 | 4 |
| Dataset3 | 272425 | 272276 | 149 | 272425 | 272401 | 24 |
| Dataset4 | 11864 | 11758 | 106 | 11864 | 11863 | 1 |
| Dataset5 | 10204 | 9990 | 214 | 10204 | 8852 | 1352 |
| Dataset6 | 134998 | 88586 | 46412 | 134998 | 111428 | 23570 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).