Submitted:
21 November 2025
Posted:
24 November 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Malicious links, which appear as impostor websites infected with malware;
- Malicious file attachments, which are infected with malware to compromise the user’s computer or files;
- Fraudulent data entry forms, which prompt the user to enter their login credentials or other sensitive information to log in.
- the ambiguity in choosing a suitable dataset (training dataset);
- the composition of features included in the dataset;
- the architecture of the classifier model;
- the values of its hyperparametersl;
- the way of selection effective training algorithms (tuning) for the detection system;
- and the mechanisms for interpreting the results, considering the natural limitations associated with the required computational costs of system implementation.
2. Analysis of Relevant Works
2.1. The Methodology for Data Collection for the Review has Been Further Explored
- «multimodal phishing detection» AND («explainable AI» OR «XAI»);
- «deep learning» AND «phishing detection» AND («SHAP» OR «LIME»);
- «large language models» AND «phishing»;
- «visual phishing detection» AND «interpretability»;
- «multi-modal fusion» AND «cybersecurity».
2.2. Review of Major Publications on Research into Multimodal Phishing Detection Systems
- There are now fairly representative open datasets containing a large number of URL samples with a wide variety of characteristics for both phishing sites (Kaggle, Phishtank, OpenPhish, etc.). and legitimate sites (Alexa Rank, crawlcommon, etc.), which can be used in the training phase for phishing web-detection systems (see also [18]);
- An important stage preceding the training of such systems is a detailed study (Pre-processing) of the set of features of phishing and legitimate websites contained in the dataset, including their normalization and coding, analysis of correlation and informational significance, possible reduction of the dimensionality of the feature space, etc. using special methods of feature selection (Feature Selection) (detailed information on this can be found in [19,20,21] );
- Based on their research, authors of several publications [21,22,23] pay close attention to the comparative assessment of the effectiveness of different ML models (as used in the specific issue of phishing web addresses) using such quality metrics as accuracy, recall, accuracy, and F1 score (harmonic mean); based on their research, random forest algorithms, decision trees and multi-layer perceptrons are preferred;
- A hybrid ML model based on a committee of classifiers [11] (for example implemented by RF algorithm) is particularly interesting in terms of improving the accuracy of phishing detection, where each individual classifier interacts with its own set of URL features and the final decision on the presence or absence of a phishing threat is taken by an arbitrator (MLP) using a separate voting system;
- Combining Natural Language Processing (NLP) capabilities - URL, text blocks, domains, logs, etc. - processing visual elements (images) of the website and the structure of the HTML code in a multimodal approach [10,24,25] not only increases accuracy, completeness and predict the evolution of new types of phishing attacks, including generative attacks.
| No. | Authors | Dataset | ML/DL models | Accuracy |
|---|---|---|---|---|
| 1. | Al-diabat M. [19] | University of Irvine Repository (11,000 URLs, based on Phishtank and Yahoo Directory, 30 features) |
Classifiers based on C 4.5 and IREP algorithms | C 4.5–96% IREP–95% |
| 2. | Mahajan R., Siddavatam I., [22] | Phishtank (19653 URLs) + Alexa Rank (17058 URLs) |
Decision Trees (DT), RF, SVM | RF–97,14% |
| 3. | Almomani A., Alauthman M., Shatnawi M.T. et al, [20] |
Huddersfield University, Dataset 1 (2,456 URLs, 30 features); Dataset 2 (10,000 URLs, 48 features) |
16 ML models (RF, Classification and Regression Trees (CART), Logistic Regression (LR), SVM, NN, Bayesian Additive Regression Trees (BART), etc.) |
RF–96% (Dataset 1) and 98% (Dataset 2) |
| 4. | Khera M., Prasad T., Xess L.D. et al, [23] |
Kaggle.com | 9 ML models (RF, DT, LR, K-nearest neighbors (kNN), XGBoost, XBNet, etc.) |
RF–97,3% |
| 5. | Alazaidah R., Al-Shaikh A., Almousa M.R. et al, [21] |
Dataset 1 (11,055 URLs, 30 features, 2 classes) Dataset 2 (1,353 URLs, 9 features, 3 classes) |
24 ML models, 6 learning strategies (Naive Bayes, LR, MLP, AdaBoost, RF, Random Tree, etc.) |
Filtered Classification J-48– 90,76% (Dataset 1); RF–97,26% (Dataset 2) |
| 6. | Dutta A.K., [4] | Phishtank.com (6042 URLs) + Crawlez (7658 URLs, based on Alexa Rank) |
LSTM | 97% |
| 7. | Ghaleb F.A., Alsaedi M., Saeed F. et al, [11] |
Kaggle.com (651191 URLs) | Hybrid ML model: 3 x RF + Multi-Layer Perceptron (MLP) |
96,80% |
| 8. | Krotov E.Yu., [10] | Kaggle.com (10000 URLs) + Common Crawe |
Multimodal system: CNN + LSTM (image + text) |
92% |
| 9. | Lee J., Lim P., et al, [24] | Open Phish (1585 URLs) + Alexa Rank (3000 URLs) |
Multimodal system: GPT-4-turbo, Claude 3, Gemini Pro-Vision 1.0 |
GPT-4–92% Claude 3–90% |
| 10. | Syed Sh. A., [25] | Phishtank, APWG, Open Phish | BERT (text) + ResNet + VGG16 (image) |
96,2% |
2.3. Classification of Data Types for Feature Generation in the Detection of Phishing Attacks
- The main categories of URL features: lexical, syntactic, semantic, entropic, statistical;
- The main categories of structural HTML features: DOM structural, data entry form structure, characteristics of the page’s script component, CSS and style features of the design template, and semantic HTML features;
- The main categories of visual features: Logo Detection, Layout Analysis, Color Analysis, Visual Similarity, and Image Quality.
- URL features provide fast initial analysis with a relatively high accuracy (up to 93%), but are vulnerable to attack by URL obfuscation;
- HTML features provide the most detailed information on the attack structure and are particularly effective in detecting credential phishing;
- Visual features are critical for detecting brand impersonation and visual cloning attacks, achieving 94-98% accuracy, but require significant computational resources;
- Multimodal integration outperforms single-modal approaches in F1 scores by 8-15 percent, but increases the time to classify (up to 5-10 seconds);
- Feature engineering remains an important step even in the use of deep learning, as hand-selected and hand-crafted features ensure interpretability and resilience to adversarial perturbations.
- Early fusion: concatenates the properties of all modalities at the input level, followed by processing with one model. Advantages: captures low level correlations of features, simple architecture of the final model. Disadvantages: loss of specificity of the modality, complexity of the training.
- Intermediate fusion: each modalities is processed by a separate encoder, and then the features are combined at an intermediate level by means of an attentional mechanism. It provides a balance between specificity and interaction between modalities.
- Late fusion: independently train specialized models for each modality with decision aggregation by means of voting or stacking. Advantages: model specialisation, modularity, parallelism. Best results for heterogeneous data.
- Hybrid fusion: Combination of early and late fusion with adaptive weights. This strategy shows maximum performance with high architectural complexity.
2.4. Explainable Artificial Intelligence Methods for Detecting Phishing Attacks
- Phishing website detection system [16]: Base classifier model (ML)—Random Forest (RF); Training datasets (datasets)—Phish Tank and Tranco; Number of used URL features is 26; Lorenz Zonoids (multivariate extension of the Gini coefficient) was used as a feature selection procedure (XAI method);
- Phishing website detection system [42]: Base classifier models–SVM and RF; Dataset–Ebbu 2017; Number of URL features–40; XAI methods–LIME and EBM (Explainable Boosting Machine); Prediction accuracy–94.7% (for SVM) and 97.3% (for RF);
- A system for detecting phishing websites [43]: 6 ML models were studied (XGBoost, LighGBM, RF, KNN, Twin SVM, CNN); 4 datasets (ISCX-URLs, P.L-URLs, Phish Guard URLs, Suspicious-URLs); the XAI-LIME method; The XGBoost (Extreme Gradient Boosting) model shows the highest forecast accuracy (Accuracy) of 96.8%.

2.5. Main Datasets Used to Train Multimodal ML Models for Phishing Attack Detection
- Phishing sites are accessible for an average of 4-8 hours before being blocked, and datasets become quickly outdated;
- The ratio of legitimate URLs to phishing URLs ranges from 100 to 1 to 1000 to 1 in real traffic of corporate information systems;
- The percentage of annotation errors in community-driven datasets is quite high (up to 15%);
- Features distributions are dynamic and change over time (concept drift).
2.6. Promising Multimodal Systems for Analyzing Phishing Attacks
2.7. Findings from the Systematic Review
- Contemporary machine learning models exhibit susceptibility to adversarial attacks [53], while existing certified defenses impose prohibitive computational overheads. This highlights critical research needs: (1) the development of scalable certified defenses for deployment in production systems, (2) the implementation of adaptive adversarial training robust to unknown attack types, and (3) the creation of robustness benchmarks tailored to the domain of phishing attacks.
- The most advanced detection systems for zero-day phishing threats attain recall metrics in the range of 87-94%. Existing systems consistently fail to detect 6-13% of unknown threats. To mitigate this vulnerability, actual research [26,28] efforts are focused on advancing few-shot learning, meta-learning, and anomaly detection to address these limitations.
- Current hybrid architectures demonstrate detection accuracy, achieving performance rates of 98-99.68%. The integration of Large Language Models (LLMs) has yielded a significant advancement in zero-day detection, with recall metrics ranging from 87-94%, while also enabling the generation of natural language explanations. Additionally, Explainable Artificial Intelligence (XAI) methodologies are utilized in a substantial portion (80%) of recent research.
- SThe adoption of multimodal systems has resulted in an improvement in accuracy metrics ranging from 5% to 12%. It has been demonstrated that late fusion techniques produce better outcomes when dealing with heterogeneous modalities. Furthermore, agent-based LLM architectures constitute a new paradigm characterized by their integrated reasoning functionalities.
- A critical challenge is to ensure model adversarial robustness. This problem characterized by a 3–9% decrease in accuracy, temporal decay effect (an 8–15% degradation in performance without overfitting) and high LLM latency, which imposes significant real-time operational constraints.
3. Design of a Multimodal Phishing Website Detection System Using Explainable Artificial Intelligence
3.1. Structural Diagram of the Web Resource Analysis System
- Feature Extraction;
- Context enrichment with data from external sources;
- Normalization, standardization, and generation of a multimodal context.
- A module dedicated to data labeling, re-labeling, and label validation;
- A database (DB1) for storing enriched and prepared data;
- A version management module for dataset tracking and synchronization with corresponding machine learning (ML) models;
- A balancing and augmentation module configured to generate training, validation, and test datasets.
- Machine learning (ML) models specifically designed for the analysis of URLs/metadata, images, and the source code of target resource pages;
- A late fusion multimodal module that operates based on a weighted model voting mechanism;
- A product model database (DB2).
- A performance monitoring module;
- A data drift detection module, which detects decreased model classification accuracy on freshly labeled data;
- A retraining and/or updating of model modules at specified intervals.
- SOC Analytics dashboard;
- Decision verification and data markup module;
3.2. Key ML Models in a Multimodal Web Resource Analysis System
3.2.1. Developing URL Analysis Model
–CatBoost Model for Analyzing URL-Extracted Features and Metadatal
- more than 150,000 domain names from real phishing campaigns (picked by PhishTank, OpenPhish, VirusTotal);
- 200,000 legitimate domains from Alexa Top and SSL certificates (Let’s Encrypt, Mozilla).
–CNN1D for URL Analysis at the Character Level
- the ability to extract local patterns as substrings characteristic of phishing URLs (e.g., "login," "secure") through the use of convolutions with kernels of varying sizes;
- high analysis efficiency on the CPU because of the reduced number of model parameters, which allows for the simultaneous analysis of short ("@," "//") and long ("login") URL anomalies, improving classification accuracy.
- Phishing URLs (from open sources such as PhishTank and OpenPhish);
- Legitimate URLs (from popular services, including Google, Microsoft, and banking portals);
- Metadata: page source code, domain WHOIS data, screenshots, and binary labels ("phishing"/"not phishing").
3.2.2. Development of a Model for Analyzing the Visual Image of the Main Page of a Web Resource
–CNN2D Model for Visual Image Analysis Based on Pre-Trained EfficientNet-B7
3.2.3. Development of a Model for Analyzing the HTML Code of the Main Page of a Web Resource
–Transformer for Analyzing the HTML Code of the Main Page Based on the Pre-trained CodeBERT
3.3. Key XAI Technologies as Part of a Multimodal Web Resource Analysis System
3.3.1. SHAP Coefficients for Interpreting the Performance of the M1: CatBoost Model
3.3.2. GradCam and Integrated Gradients for Interpreting the Performance of the M2: CNN1D Modell
3.3.3. GradCam for Interpreting the Performance of the : CNN2D Model
3.3.4. Attention Matrix for Interpreting the Work of the Model: CodeBERT
3.3.5. LLM Explainer for Preparing Final Reports in Natural Language
3.4. Software Architecture of a Web Resource Analysis System
- "Telegram bot" (App-tg_bot) implements the functionality of receiving incoming messages from users and serves as the first link in the value verification chain.
- "Machine learning module"–a container with a machine learning model is deployed on the basis of the "m_service" image, where the "/check_domain" endpoint is implemented via the Flask API. When a POST request with a domain is received, multi-stage processing occurs.
- "The Threat Intelligence module" (App-ti_service) is a container for integration with external reputation services such as Kaspersky TI, VirusTotal TI and others via an API implemented on FastAPI.
- "Data storage service"–a database for long-term storage of all system operation results; a container with a MongoDB database is used.
- Container for updating Elasticsearch ("App-update_es"). In the test environment, the project uses Elasticsearch to analyze domains, where the "update_es" container is responsible for regularly updating the “idecoutm” index. Each time the script is run, a check is made to see if the current domain has been processed before: if a match is found, the document is updated to reflect the new results, and if not, a full check cycle is started. The update occurs every 3 seconds, which ensures that the data in Elasticsearch is up-to-date and allows to quickly track changes in the analyzed domains. With the help of Praeco, a correlation rule was created to notify analysts of the SOC, which allows for a prompt response to information security incidents.
- The container management container ("Portainer"), which uses the "portainer/portainer-ce:latest" image, provides a web interface for managing all of the project’s Docker containers. Special attention has been paid during design to safety and data isolation. All containers run in a single virtual segment, with no external ports exposed except for the Telegram API and the Web Administration interfaces. All requests are handled over HTTPS, logged, and may be signed for investigation.
- processing queries in real time;
- storing results and metadata for subsequent analysis;
- scaling individual modules independently (e.g., multiplying ml_service);
- integrating with external sources via APIs and with internal security systems via SIEM interfaces;
- providing a visual interface for both users (via Telegram) and administrators (via Portainer and Mongo Express).
4. Computational Experiment to Evaluate the Performance of the Web-Based Resource-Analysis System
4.1. Computational Experiment I on the Prepared Dataset for Models and
4.1.1. The First Phase of Testing is Collecting the Dataset () and Preparing Models
- phishing URLs were obtained from open repositories: Phishing Site URLs (Kaggle), OpenPhish, URLHaus and PhishTank, where malicious link databases are updated daily;
- legitimate URLs were selected from Alexa and Majestic Million Top 1 Million Websites lists, as well as downloaded from secure corporate proxy logs.
- cross-checks via VirusTotal API;
- built-in Reputational Score checking module;
- manual check by WHOIS data (registration date, TLD zone, domain activity).
- validation using OpenAI and QwQ-32B models in batch mode.
- 1.6 million phishing web-resource;
- 1.6 million legitimate web-resource.
| Model | Operation | Batch performance evaluation | Learning mode / inference during integration |
|---|---|---|---|
| HTML tokenization | 25-50 ms | GPU / GPU | |
| Code-BERT ONNX | 200 ms | GPU | |
| Attention extraction | 15-20 ms | GPU | |
| Feature extraction | Less than 8 ms per URL | GPU / CPU | |
| SHAP computation | Less than 12 ms per URL | GPU / CPU | |
| CatBoost C-compiled | Less than 8 ms per URL | GPU / CPU |
4.1.2. Second Stage of Testing–System Throughput Assessment
- corporate mail gateway logs;
- traffic through proxy servers and web filters;
- specially generated requests through a Telegram bot.
4.1.3. Third Stage of Testing–testing as part of the SOC
4.1.4. The Fourth Stage of Testing–Zero-Day Resource Testing
- Google Safe Browsing;
- OpenPhish Detection API;
- URLhaus API.
4.1.5. Testing the Subsystem for Explaining Positive Responses
4.1.6. Summary Results of Computational Experiment I
4.2. Computational Experiment II on the Prepared Dataset for Models and
4.3. Computational Experiment III on the Prepared Data Set for all Models
4.4. Discussion
- Enhanced Detection Accuracy: It improves the detection accuracy of phishing attacks that are not identifiable by a single modality. This is because attackers face a greater challenge in bypassing all analysis modalities simultaneously.
- Reduced False Positives: The use of a committee of models reduces the number of false positives by leveraging a committee of models.
- Zero-Day Attack Detection: The system is capable of detecting novel (zero-day) attacks by analyzing a comprehensive combination of features across different modalities.
- Improved Interpretability: The XAI (explainable artificial intelligence) subsystem provides insights into the contribution of each modality, thereby assisting analysts in understanding the rationale behind a detection decision.
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A. Excerpt From a Technical Report on the Results of the Classification of a Synthetic Web Resource, which was Generated Utilizing a Localized Large Language Model
| ## TECHNICAL APPENDIX ### Complete Model Predictions CatBoost (URL features): 98.7% phishing Character-CNN (URL chars): 96.2% phishing CodeBERT (HTML structure): 95.8% phishing EfficientNetB7 (Visual): 97.8% phishing Consensus Average: 97.1% phishing Standard Deviation: 1.3% Agreement Level: VERY HIGH ### Complete SHAP Values (CatBoost) domain_age: -0.45 (3 days old → suspicious) tld_type: -0.38 (.tk → free, abuse-prone) typosquatting_score: -0.52 (paypa1 vs paypal → deliberate) url_length: -0.12 (slightly long) subdomain_count: -0.08 (1 subdomain) has_https: +0.15 (has HTTPS, but...) ... [additional features] ### Character-CNN Attribution Top 10 Position 8-13 "paypa1": IG attribution 0.92 Position 13 "1": IG attribution 0.95 (critical!) Position 22-24 ".tk": IG attribution 0.88 Position 15-21 "secure": IG attribution 0.45 ... [full attribution map available] ### CodeBERT Attention Top 10 Tokens <input name="creditcard">: 0.95 weight (CRITICAL) <input type="password">: 0.89 weight <form action="http://...">: 0.82 weight (insecure) "urgent": 0.78 weight "suspended": 0.78 weight "verify": 0.68 weight "account": 0.65 weight ... [full token list available] “` Visual GradCAM Regions (Top 5) Region 1: Logo area - Position: top-left (header) - Activation: 0.95/1.0 - Area: 2.1% of page - Text: "PAYPA1 Security Center" - Elements: visual_element, square_element |
| Region 2: Warning banner - Position: center (main_content) - Activation: 1.0/1.0 (maximum!) - Area: 8.5% of page - Text: "URGENT: Account suspended in 24 hours" - Elements: visual_element Region 3: Form area - Position: center (form_area) - Activation: 0.88/1.0 - Area: 3.2% of page - Text: "Credit Card Number", "CVV" - Elements: input_field (multiple) Region 4: URL bar representation - Position: top-center (header) - Activation: 0.85/1.0 - Text: "https://paypa1-secure.tk" Region 5: Footer - Position: bottom (footer) - Activation: 0.45/1.0 - Text: "Copyright 2024 PayPa1" [with typo] - Elements: text, poor_quality |
| REPORT METADATA Report Generated: 2025-10-23 18:30:51 UTC Analysis Duration: 3.7 seconds Models Used: 4 (CatBoost, CharCNN, CodeBERT, EfficientNetB7) XAI Methods: 6 (SHAP, GradCAM 1D, IntGrad, Attention, GradCAM 2D, Text Convert) LLM Model: Qwen2.5-32B-Instruct Confidence: VERY HIGH Analyst Review: RECOMMENDED (for quality assurance) |
| END OF REPORT This report was generated by an AI-powered phishing detection system combining multiple deep learning models with explainable AI techniques. While highly accurate, human expert review is recommended for critical cases. |
References
- Golushko, A. Current Cyber Threats: Q4 2024–Q1 2025. Technical report, Positive Technologies.
- Phishing Activity Trends Reports. Technical report, APWG Report.
- Phishing Detection in Depth: Attack Types, Detection Tools, and More. Technical report, zvelo.
- Dutta, A.K. Detecting phishing websites using machine learning technique. PLOS ONE 2021, 16, e0258361. [Google Scholar] [CrossRef]
- Basit, A.; Zafar, M.; Liu, X.; Javed, A.R.; Jalil, Z.; Kifayat, K. A comprehensive survey of AI-enabled phishing attacks detection techniques. Telecommunication Systems 2021, 76, 139–154. [Google Scholar] [CrossRef] [PubMed]
- Lukmanova, K.; Kartak, V. The development of a phishing attack protection system using software-hardware implementation of machine learning methods. Modeling, optimization and information technology 2024, 12. [Google Scholar] [CrossRef]
- Kavya, S.; Sumathi, D. Staying ahead of phishers: a review of recent advances and emerging methodologies in phishing detection. Artificial Intelligence Review 2024, 58, 50. [Google Scholar] [CrossRef]
- Kornyukhina, S.; Laponina, O. Exploring the Potential of Deep Learning Algorithms to Protect Against Phishing Attacks 2023. 11, 163–174.
- Li, W.; Manickam, S.; Chong, Y.W.; Leng, W.; Nanda, P. A State-of-the-Art Review on Phishing Website Detection Techniques. IEEE Access 2024, 12, 187976–188012. [Google Scholar] [CrossRef]
- Krotov, E. Applying Deep Learning Methods to Phishing Website Detection: Performance Analysis and Model Optimization. Current research, 256.
- Alsaedi, M.; Ghaleb, F.; Saeed, F.; Ahmad, J.; Alasli, M. Cyber Threat Intelligence-Based Malicious URL Detection Model Using Ensemble Learning. Sensors 2022, 22, 3373. [Google Scholar] [CrossRef]
- Aljofey, A.; Jiang, Q.; Rasool, A.; Chen, H.; Liu, W.; Qu, Q.; Wang, Y. An effective detection approach for phishing websites using URL and HTML features. Scientific Reports 2022, 12, 8842. [Google Scholar] [CrossRef]
- Wangchuk, T.; Gonsalves, T. Multimodal Phishing Detection on Social Networking Sites: A Systematic Review. IEEE Access 2025, 13, 103405–103416. [Google Scholar] [CrossRef]
- Baltrušaitis, T.; Ahuja, C.; Morency, L.P. Multimodal Machine Learning: A Survey and Taxonomy, 2017. Version Number: 2. [CrossRef]
- J, G. The Role of Explainable AI in Understanding Phishing Susceptibility. JOURNAL OF RECENT TRENDS IN COMPUTER SCIENCE AND ENGINEERING ( JRTCSE) 2024, 12, 1–6. [Google Scholar]
- Calzarossa, M.C. ; Department Of Economics&Management, P.G.S.; Zieni, R. Explaining Explainable Ai, with Applications to Phishing Detection, 2024. [CrossRef]
- Shyni, E. Enhancing Phishing Detection with Explainable AI (XAI): A Transparent Cybersecurity Approach. Technical report, Smart Security Tips.
- Vrbančič, G.; Fister, I.; Podgorelec, V. Datasets for phishing websites detection. Data in Brief 2020, 33, 106438. [Google Scholar] [CrossRef] [PubMed]
- Al-diabat, M. Detection and Prediction of Phishing Websites using Classification Mining Techniques. International Journal of Computer Applications 2016, 147, 5–11. [Google Scholar] [CrossRef]
- Almomani, A.; Alauthman, M.; Shatnawi, M.T.; Alweshah, M.; Alrosan, A.; Alomoush, W.; Gupta, B.B.; Gupta, B.B.; Gupta, B.B. Phishing Website Detection With Semantic Features Based on Machine Learning Classifiers: A Comparative Study. International Journal on Semantic Web and Information Systems 2022, 18, 1–24. [Google Scholar] [CrossRef]
- Alazaidah, R.; Al-Shaikh, A.; Almousa, M.; Khafajeh, H.; Samara, G.; Alzyoud, M.; Al-shanableh, N.; Almatarneh, S. Website Phishing Detection Using Machine Learning Techniques. Journal of Statistics Applications & Probability 2024, 13, 119–129. [Google Scholar] [CrossRef]
- Mahajan, R.; Siddavatam, I. Phishing Website Detection using Machine Learning Algorithms. International Journal of Computer Applications 2018, 181, 45–47. [Google Scholar] [CrossRef]
- Khera, M.; Prasad, T.; Xess, L. Malicious Website Detection using Machine Learning. International Journal of Engineering Research and Technology (IJERT) 2022, 11. [Google Scholar]
- Lee, J.; Lim, P.; Hooi, B.; Divakaran, D.M. Multimodal Large Language Models for Phishing Webpage Detection and Identification, 2024. Version Number: 1. [CrossRef]
- University of the Cumberlands, Kentucky, United States of America.; Syed, S.A. AI-Driven Detection of Phishing Attacks through Multimodal Analysis of Content and Design. International Journal of Innovative Research in Computer and Communication Engineering 2024, 12. [CrossRef]
- Trad, F.; Chehab, A. Large Multimodal Agents for Accurate Phishing Detection with Enhanced Token Optimization and Cost Reduction. In Proceedings of the 2024 2nd International Conference on Foundation and Large Language Models (FLLM), Dubai, United Arab Emirates, 2024; pp. 229–237. [CrossRef]
- Cao, T.; Huang, C.; Li, Y.; Wang, H.; He, A.; Oo, N.; Hooi, B. PhishAgent: A Robust Multimodal Agent for Phishing Webpage Detection, 2024. Version Number: 3. [CrossRef]
- Li, Y.; Huang, C.; Deng, S.; Lock, M.L.; Cao, T.; Oo, N.; Lim, H.W.; Hooi, B. KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection 2024. Publisher: arXiv Version Number: 2. 2. [CrossRef]
- Alshingiti, Z.; Alaqel, R.; Al-Muhtadi, J.; Haq, Q.E.U.; Saleem, K.; Faheem, M.H. A Deep Learning-Based Phishing Detection System Using CNN, LSTM, and LSTM-CNN. Electronics 2023, 12, 232. [Google Scholar] [CrossRef]
- Abdelnabi, S.; Krombholz, K.; Fritz, M. VisualPhishNet: Zero-Day Phishing Website Detection by Visual Similarity, 2019. Version Number: 4. [CrossRef]
- PhiUSIIL Phishing URL Dataset.
- Chiew, K.L.; Tan, C.L.; Wong, K.; Yong, K.S.; Tiong, W.K. A new hybrid ensemble feature selection framework for machine learning-based phishing detection system. Information Sciences 2019, 484, 153–166. [Google Scholar] [CrossRef]
- Shafin, S.S. An explainable feature selection framework for web phishing detection with machine learning. Data Science and Management 2025, 8, 127–136. [Google Scholar] [CrossRef]
- Yoon, J.H.; Buu, S.J.; Kim, H.J. Phishing Webpage Detection via Multi-Modal Integration of HTML DOM Graphs and URL Features Based on Graph Convolutional and Transformer Networks. Electronics 2024, 13, 3344. [Google Scholar] [CrossRef]
- Asiri, S.; Xiao, Y.; Li, T. PhishTransformer: A Novel Approach to Detect Phishing Attacks Using URL Collection and Transformer. Electronics 2023, 13, 30. [Google Scholar] [CrossRef]
- (PDF) Phishpedia: A Hybrid Deep Learning Based Approach to Visually Identify Phishing Webpages.
- Abdelnabi, S.; Krombholz, K.; Fritz, M. VisualPhishNet: Zero-Day Phishing Website Detection by Visual Similarity. In Proceedings of the ACM Conference on Computer and Communications Security (CCS). ACM, 2020.
- Zhang, L.; Zhang, P.; Liu, L.; Tan, J. Multiphish: Multi-Modal Features Fusion Networks for Phishing Detection. In Proceedings of the ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada; 2021; pp. 3520–3524. [Google Scholar] [CrossRef]
- Salih, A.M.; Raisi-Estabragh, Z.; Galazzo, I.B.; Radeva, P.; Petersen, S.E.; Lekadir, K.; Menegaz, G. A Perspective on Explainable Artificial Intelligence Methods: SHAP and LIME. Advanced Intelligent Systems 2025, 7, 2400304. [Google Scholar] [CrossRef]
- Lim, B.; Huerta, R.; Sotelo, A.; Quintela, A.; Kumar, P. EXPLICATE: Enhancing Phishing Detection through Explainable AI and LLM-Powered Interpretability, 2025. arXiv:2503.20796 [cs]. [CrossRef]
- Shendkar, B.D.; Chandre, P.R.; Madachane, S.S.; Kulkarni, N.; Deshmukh, S. Enhancing Phishing Attack Detection Using Explainable AI: Trends and Innovations. ASEAN Journal on Science and Technology for Development 2024, 42, 8. [Google Scholar] [CrossRef]
- Galego Hernandes, P.R.; Floret, C.P.; Cardozo De Almeida, K.F.; Da Silva, V.C.; Papa, J.P.; Pontara Da Costa, K.A. Phishing Detection Using URL-based XAI Techniques. In Proceedings of the 2021 IEEE Symposium Series on Computational Intelligence (SSCI), Orlando, FL, USA; 2021; pp. 01–06. [Google Scholar] [CrossRef]
- Akhtar, H.M.U.; Nauman, M.; Akhtar, N.; Hameed, M.; Hameed, S.; Tareen, M.Z. Mitigating Cyber Threats: Machine Learning and Explainable AI for Phishing Detection. VFAST Transactions on Software Engineering 2025, 13, 170–195. [Google Scholar] [CrossRef]
- Rashid, F.; Ranaweera, N.; Doyle, B.; Seneviratne, S. LLMs are One-Shot URL Classifiers and Explainers, 2024. Version Number: 1. [CrossRef]
- Feng, Z.; Guo, D.; Tang, D.; Duan, N.; Feng, X.; Gong, M.; Shou, L.; Qin, B.; Liu, T.; Jiang, D.; et al. CodeBERT: A Pre-Trained Model for Programming and Natural Languages. In Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, Online, 2020; pp. 1536–1547. [CrossRef]
- Moreau, L. AI Explainability with Grad-CAM: Visualizing Neural Network Decisions, 2025.
- Shrikumar, A.; Greenside, P.; Kundaje, A. Learning Important Features Through Propagating Activation Differences 2017. Publisher: arXiv Version Number: 2. [CrossRef]
- Nori, H.; Jenkins, S.; Koch, P.; Caruana, R. InterpretML: A Unified Framework for Machine Learning Interpretability, 2019. Version Number: 1. [CrossRef]
- Colhak, F.; Ecevit, M.I.; Dag, H. Transfer Learning for Phishing Detection: Screenshot-Based Website Classification. In Proceedings of the 2024 9th International Conference on Computer Science and Engineering (UBMK), Antalya, Turkiye; 2024; pp. 1–6. [Google Scholar] [CrossRef]
- Combined BERT Phishing Dataset.
- Alexa Top 1 Million Sites.
- Roy, S.S.; Nilizadeh, S. PhishLang: A Real-Time, Fully Client-Side Phishing Detection Framework Using MobileBERT, 2024. Version Number: 3. 3. [CrossRef]
- Yuan, Y.; Hao, Q.; Apruzzese, G.; Conti, M.; Wang, G. "Are Adversarial Phishing Webpages a Threat in Reality?" Understanding the Users’ Perception of Adversarial Webpages. In Proceedings of the Proceedings of the ACM Web Conference 2024, Singapore Singapore, 2024; pp. 1712–1723. [CrossRef]
- Prokhorenkova, L.; Gusev, G.; Vorobev, A.; Dorogush, A.V.; Gulin, A. CatBoost: unbiased boosting with categorical features. In Proceedings of the Advances in Neural Information Processing Systems. Curran Associates, Inc., 2018, Vol. 31.
- Tan, M.; Le, Q.V. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks 2019. Publisher: arXiv Version Number: 5. [CrossRef]
- Tekkali, C.G.; Natarajan, K. Transfer learning of pre-trained CNNs on digital transaction fraud detection. International Journal of Knowledge-based and Intelligent Engineering Systems 2024, 28, 571–580. [Google Scholar] [CrossRef]
- Cik, I.; Rasamoelina, A.D.; Mach, M.; Sincak, P. Explaining Deep Neural Network using Layer-wise Relevance Propagation and Integrated Gradients. In Proceedings of the 2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI), Herl’any, Slovakia, 2021; pp. 000381–000386. [CrossRef]
- Ali, A.; Schnake, T.; Eberle, O.; Montavon, G.; Müller, K.R.; Wolf, L. XAI for Transformers: Better Explanations through Conservative Propagation, 2022. Version Number: 2. [CrossRef]
- Yang, A.; Yang, B.; Hui, B.; Zheng, B.; Yu, B.; Zhou, C.; Li, C.; Li, C.; Liu, D.; Huang, F.; et al. Qwen2 Technical Report, 2024. arXiv:2407.10671. [CrossRef]
- Colhak, F.; Ecevit, M.I.; Ucar, B.E.; Creutzburg, R.; Dag, H. Phishing Website Detection through Multi-Model Analysis of HTML Content, 2024. Version Number: 3. [CrossRef]











| Modality type | features | Extraction methods |
|---|---|---|
| Text | URL, email body, HTML content, domain/subdomain |
NLP, tokenization, embeddings (BERT, Word2Vec) |
| Visual | Page screenshots, logos, favicons, color schemes, layout |
CNN, Transfer Learning (ResNet, VGG), Object Detection |
| Structural | HTML DOM, CSS, JavaScript, metadata, certificates |
DOM Parsing, Graph Analysis, Feature Extraction |
| Network | DNS queries, IP addresses, WHOIS, SSL certificates |
Network analysis, OSINT, Reputation services |
| Combination of modalities | Number of systems | Average accuracy | Typical architectures of ML models |
|---|---|---|---|
| URL + HTML | 18 | 96,5-99,2% | CNN + LSTM, BERT [34,35] |
| URL + Visual | 12 | 93,4-97,8% | CNN + ResNet/VGG [36,37] |
| URL + HTML + Visual | 8 | 98,0-99,4% | Transformers, LLM [27,28] |
| All 5 types | 3 | 98,4-99,6% | Agent-based LLM systems [27] |
| Fusion strategies | Advantages | Disadvantages | Examples of systems |
|---|---|---|---|
| Early Fusion | – accounting for low-level correlations; –simple architecture; |
- loss of modality specificity; - train difficulty |
MultiPhish [38] |
| Intermediate Fusion | - balance of specificity and interaction; - mechanisms of attention |
–average learning difficulty | PhishTransformer [35] |
| Late Fusion | – model specialization; modularity; –parallelization |
–loss of intermodule connections | Yoon et al. [33] |
| Hybrid Fusion | - maximum performance; adaptability | – high model complexity; –risk of overfitting |
KnowPhish [28] |
| Method | Type of explanation | Applicability | Clarity | Accuracy | Application in phishing |
|---|---|---|---|---|---|
| SHAP | Global + Local | Model-agnostic | Average | High | XGBoost, RF–URL features |
| LIME | Local | Model-agnostic | Average | Average | Any models - URL, HTML |
| Grad-CAM | Local visual | CNN-specific | High | Average | ResNet, VGG–screenshots |
| DeepLIFT | Local | DNN-specific | Low | High | CNN, LSTM–all modalities |
| Explainable Boosting Machine (EBM) | Intrinsic | Special model | High | High | Email phishing |
| Attention Mechanisms | Intrinsic | Transformers | Average | Average | BERT, Transformers–text |
| LLM text | Post-processing | Any | Very high | High | All modalities with context |
| Level | Methods | Difficulty of implementation | Cost of inference | User-friendliness | Application in industrial systems |
|---|---|---|---|---|---|
| Post-hoc | LIME, SHAP, Grad-CAM, DeepLIFT | Average | Low | Technical | Wide |
| Intrinsic | Decision trees, EBM, logistic regression | Low | Very low | High | Moderate |
| Attention | Transformers, BERT family | High | Average | Average | Growing |
| LLM text | On-premises and cloud LLM | Very high | High | Very high | Experimental |
| Name | Year | Data type | Volume | Balance of classes (phishing/legitimate) | Sources | Peculiarities |
|---|---|---|---|---|---|---|
| PhiUSIIL | 2024 | URL + features | 235795 URLs | 57% / 43% | UCI ML Repository [31] | 48 features, Focus on the Latest Threats |
| TR-OP | 2024 | Web pages | 10000 | 50% / 50% | USENIX Security | Manual validation, 440 brands [27] |
| Combined BERT | 2023 | Multimodal | 800K+ URLs, 18K emails, 6K SMS |
52% / 47% | HuggingFace | Cross-channel data [50] |
| Phishtank | Updates daily | URL | 1,5M+ | Only phishing | phishtank.org | Community-driven, API access [31] |
| APWG | Quarterly | URL + reports | Varies | Only phishing | apwg.org | Industrial standard [2] |
| MTLP | 2024 | Screenshots | 15000 | 50% / 50% | IEEE | Multi-type logo phishing [49] |
| Alexa Top 1M | Archival | URL | 1000000 | 100% legitimate | archive.org | Baseline of legitimate sites [51] |
| No. | Multimodal component | Data type | Extracted features (briefly) |
|---|---|---|---|
| 1 | URL | URL components and domain features | – URL structure and statistical features (length, special characters, presence of IP instead of domain); –Lexical features; –Syntactic features; –Phonetic features; –Semantic features; –SSL/TLS certificates; |
| 2 | HTML | Source code of the main page of the target resource | – HTML components; –DOM structure; –JavaScript code; –CSS; |
| 3 | Image | Image of the main page of the analyzed resource | –RGB image 600 x 600 pixels; |
| 4 | Metadata | – WHOIS information (domain registration) – domain analysis (age, registrar); –reputation databases (VirusTotal, Google Safe Browsing) –Threat Intelligence feeds (PhishTank, URLhaus, OpenPhish) |
| No. | Multimodal component | Model designation |
Model type |
|---|---|---|---|
| 1 | URL + Metadata | Using the Optuna framework, hyperparameters of the CatBoost model [54] are selected in k-cross-validation mode with k=5. |
|
| CNN1D for character-level URL analysis. |
|||
| 2 | HTML | Fine-tuning of the CodeBERT model [45]. |
|
| 3 | Image | Fine-tuning of the EfficientNet-B7 model [55,56]. |
|
| 4 | Late binding | Late binding of trained models using weighted voting. |
| No | Multimodal component | Model designation | Model | Formed features of XAI |
|---|---|---|---|---|
| 1 | URL + Metadata | CatBoost | SHAP Values | |
| CNN1D | GradCam, Integrated Gradients [57] | |||
| 2 | HTML | CodeBERT | Attention matrix [58] | |
| 3 | Image | EfficientNet-B7 | GradCam |
| Metrics | CatBoost | RandomForest | XGBoost |
|---|---|---|---|
| ROC-AUC | 0.924 | 0.915 | 0.918 |
| Accuracy | 0.910 | 0.890 | 0.900 |
| Recall | 0.880 | 0.850 | 0.860 |
| F1-score | 0.900 | 0.870 | 0.880 |
| Layers and architecture parameters | Description |
|---|---|
| Input layer– embedding | EMBED_DIM = 64 – converting characters to vector representation |
| CNN layers | three parallel convolutional layers with kernels of sizes KERNEL_SIZES = [3, 4, 5], number of filters NUM_KERNELS = 64 |
| MaxPooling | Two layers |
| Output layer | fully connected layer with sigmoid activation function |
| Loss function | Binary Crossentropy |
| Metrics | Accuracy, Precision, Recall, F1-score |
| Optimizer | Adam (LR = 0,001) |
| Data separation | The validation sample is 20% of the available data. |
| Regularization | Dropout (DROPOUT = 0.5) and early stopping (ReduceLROnPlateau). |
| Synthetic link | http://paypa1-security.tk/login | ||
|---|---|---|---|
| features | Features values | SHAP | Interpretation |
| domain_age_days | 3 | -0,450 | A critical feature of a phishing resource |
| is_suspicious_tld | 1 | -0,380 | An essential feature of a phishing resource |
| typosquatting_score | 0,75 | -0,315 | A critical feature of a phishing resource |
| Model | Synthetic example of a phishing URL | Description |
|---|---|---|
| Grad-CAM | http://paypa1-security.tk/login | The model focuses on the domain when deciding whether a resource belongs to a phishing resource. |
| Integrated Gradients |
http://paypa1-security.tk/login +0,9 +0,7 +0,95 +0,6 |
Details of URL symbols and their importance coefficients n terms of their contribution to the resulting solution |
| Parameter | Characteristic |
|---|---|
| GPU | 4 GPU Tesla V100 |
| GPU video memory capacity | 128 GB |
| CPU | Intel Xeon E5-2698 v4 2.2 GHz (20-core) |
| RAM capacity | 256GB RDIMM DDR4 |
| Experiment | Models (designation) | Models (name) | Dataset | F1 score |
|---|---|---|---|---|
| I | M1 | CatBoost | D1 (own dataset) | 0.900 |
| M3 | CodeBERT | 0.950 | ||
| M1 + M3 | CatBoost + CodeBERT + Voting Classifier | 0.972 |
| Model | Operation | Batch performance evaluation | Learning mode / inference during integration |
|---|---|---|---|
| Tokenization | 3 ms | GPU / GPU | |
| CNN1D forward pass | 7 ms | GPU | |
| GradCAM + Integrated Gradients | 55 ms | GPU | |
| Screenshot capture | 250 ms | GPU / CPU | |
| Image preprocessing | 15 ms | GPU / CPU | |
| EfficientNetB7 inference | 200 ms | GPU | |
| GradCAM computation | 150 ms | GPU | |
| Region analysis | 50 ms | GPU / CPU |
| Experiment | Models (designation) | Models (name) | Dataset | F1 score |
|---|---|---|---|---|
| II | CNN1D | D2 (MTLP) | 0.914 | |
| EfficientNet-B7 | 0.932 | |||
| + | CNN1D + EfficientNet-B7 + Voting Classifier |
0.944 |
| Experiment | Models (designation) | Models (name) | Dataset | F1 score |
|---|---|---|---|---|
| III | CatBoost + CodeBERT + CNN1D + EfficientNet-B7 + Voting Classifier |
(own dataset) | 0.989 | |
| CatBoost + CodeBERT + CNN1D + EfficientNet-B7 + Voting Classifier |
(MTLP) | 0.953 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).