Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

PDF Malware Detection using Machine learning

Version 1 : Received: 26 January 2023 / Approved: 30 January 2023 / Online: 30 January 2023 (12:55:47 CET)

How to cite: AlMahadeen, A.; alkasassbeh, M. PDF Malware Detection using Machine learning. Preprints 2023, 2023010557. https://doi.org/10.20944/preprints202301.0557.v1 AlMahadeen, A.; alkasassbeh, M. PDF Malware Detection using Machine learning. Preprints 2023, 2023010557. https://doi.org/10.20944/preprints202301.0557.v1

Abstract

Portable Document Format (PDF) is one of the most widely used files types worldwide in data exchange, this has encourage hackers to utilize such files to spread any malicious content through PDF, utilizing different methods and techniques to accomplish that, on the other hand, security researches kept trying to improve detection methods to cope up to the rapidly increasing number of malwares daily, one of the commonly used detection technique nowadays is by utilizing artificial intelligence and Machine learning classificat; thision to help detecting PDF Malwares, in this paper, we utilize machine learning classifier Random Forest on a newly released PDF Malware dataset CIC-Evasive-PDFMal2022 to achieve the main goal of detecting malicious PDF documents, results showing a detection accuracy of around 99.5%

Keywords

PDF; Malware; Machine Learning; Python; Random Forest

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.