Submitted:
26 May 2026
Posted:
27 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Background and Related Work
3. Data and Methodology
3.1. Data
3.1.1. Data Collection
3.1.2. Data Split and Preparation
3.2. Methodology
3.2.1. Lung Segmentation
3.2.2. Model Training
3.2.3. Adversarial Attack Generation and Configuration
- White-box attacks: In this setting, the attacker has complete access to the model’s architecture, weights, and gradients [18]. This allows direct computation of perturbations that maximize prediction errors by exploiting internal model parameters.
- Black-box attacks: In contrast, the attacker has no knowledge of the target model’s internal structure or parameters [18]. To simulate this scenario, a separate CNN architecture, i.e., DenseNet121 [30], was employed to generate adversarial samples. These samples were then transferred to the target model to evaluate its susceptibility to cross-model perturbations.
- Fast Gradient Sign Method (FGSM) [37], a single-step gradient-based attack that introduces small pixel-level perturbations along the gradient’s sign direction.
- Projected Gradient Descent (PGD) [38], a multi-step iterative extension of FGSM that refines perturbations while projecting them within a defined epsilon boundary.
- Basic Iterative Method (BIM) [39], an iterative variant of FGSM that applies repeated small perturbations, enabling more precise adversarial manipulation.
- Momentum Iterative Method (MIM) [39], an improved iterative technique that incorporates momentum in gradient updates, enhancing attack stability and effectiveness.
3.2.4. Defence Mechanisms for Adversarial Robustness
- Adversarial Training (AT): AT was adopted as a foundational defence mechanism and as a benchmark for evaluating alternative robustness strategies. Using the Adversarial Robustness Toolbox library [40], the model was iteratively trained with a mixture of clean and adversarially perturbed samples. Specifically, adversarial examples constituted 30% of the training data to balance robustness improvements with feature-space stability and prevent excessive drift from clinically relevant image representations. This process ensures the model is continuously exposed to new adversarial examples during training, allowing it to learn more invariant and resilient decision boundaries. By integrating adversarial examples directly into the optimization process, the model becomes better equipped to recognize and mitigate subtle perturbations that could otherwise lead to misclassification. As a result, AT serves as a strong baseline defence [18], improving overall robustness while maintaining diagnostic accuracy across diverse CXR inputs. In this study, two variants of AT are implemented. The first one, employs PGD as the method to generate the adversarial images for training (AT-PGD) [41]. The second variant, Tradeoff-inspired Adversarial Defense via Surrogate-loss minimization (TRADES) [42], introduces a theoretically grounded trade-off between natural accuracy and robustness. TRADES decomposes the objective into a natural classification loss on clean samples and a robustness regularization term that penalizes the divergence between predictions on clean and adversarial inputs, thus it allows more stable optimization and mitigates excessive degradation of clean performance compared to the basic PGD-based AT [42].
- Multivariate Gaussian Model (MGM): To complement AT, an MGM was employed following the methodology outlined in Li et al. [17]. MGM operates as a post hoc detection mechanism that models the distribution of high-level features extracted from the final layer of the CNN. During training, these features are fitted to a Gaussian distribution characterized by a mean vector and covariance matrix. At inference time, MGM computes the log-likelihood or Mahalanobis distance of each input’s feature vector relative to the learned distribution. Samples that deviate significantly from the distribution of clean images are flagged as potential adversarial inputs. By thresholding the log-likelihood score, the model can selectively ignore or reject inputs suspected of being adversarial, thereby reducing the risk of incorrect predictions.
4. Results
4.1. Effect of Region-of-Interest Isolation
4.2. Pre-Attack Performace Analysis
4.3. Comparative Analysis of Model Robustness to Adversarial Perturbations
4.4. Comparative Evaluation of Defence Mechanisms
5. Discussion
5.1. Impact of Lung Segmentation on Model Performance
5.2. Vulnerability of Models to Adversarial Perturbations
5.3. Comparative Evaluation of Defence Strategies
5.4. Mechanistic Insights into MGM’s Stability
5.5. Implications for Medical AI Deployment
6. Conclusions and Limitaions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Wong, A.; Lee, J.R.H.; Rahmat-Khah, H.; Sabri, A.; Alaref, A.; Liu, H. TB-Net: a tailored, self-attention deep convolutional neural network design for detection of tuberculosis cases from chest X-ray images. Frontiers in Artificial Intelligence 2022, 5. [CrossRef]
- Patel, N.; Wong, A.; Ebadi, A. Empowering Tuberculosis Screening with Explainable Self-Supervised Deep Neural Networks. In Proceedings of the 2024 IEEE International Conference on Machine Learning and Applications (ICMLA). IEEE, 2024, pp. 794–797.
- Patel, N.; Wong, A.; Ebadi, A. An Explainable Hybrid AI Framework for Enhanced Tuberculosis and Symptom Detection. arXiv 2025. arXiv:2510.18819. [CrossRef]
- Organization, W.H. Global Tuberculosis Report 2023. World Health Organization, ISBN 978-92-4-008385-1 2023.
- Nations, U. Sustainable Development Goals. New York: United Nations, Available at https://sdgs.un.org/ 2022.
- Emery, J.C.; Richards, A.S.; Dale, K.D.; McQuaid, C.F.; White, R.G.; Denholm, J.T.; Houben, R.M. Self-clearance of Mycobacterium tuberculosis infection: implications for lifetime risk and population at-risk of tuberculosis disease. Proceedings of the Royal Society B, vol. 288, pp. 20201635 2021. [CrossRef]
- Tiemersma, E.W.; van der Werf, M.J.; Borgdorff, M.W.; Williams, B.G.; Nagelkerke, N.J. Natural history of tuberculosis: duration and fatality of untreated pulmonary tuberculosis in HIV negative patients: a systematic review. PloS one, vol. 6, pp. e17601 2011. [CrossRef] [PubMed]
- Balakrishna, S.; Solanki, V.K. A comprehensive review on ai-driven healthcare transformation. Ingeniería Solidaria 2024, 20. [CrossRef]
- Esteva, A.; Chou, K.; Yeung, S.; Naik, N.; Madani, A.; Mottaghi, A.; Liu, Y.; Topol, E.; Dean, J.; Socher, R. Deep learning-enabled medical computer vision. NPJ digital medicine 2021, 4. [CrossRef] [PubMed]
- Tavanapong, W.; Oh, J.; Riegler, M.A.; Khaleel, M.; Mittal, B.; De Groen, P.C. Artificial intelligence for colonoscopy: past, present, and future. IEEE journal of biomedical and health informatics 2022, 26. [CrossRef]
- Yadav, S.S.; Jadhav, S.M. Deep convolutional neural network based medical image classification for disease diagnosis. Journal of Big data 2019, 6. [CrossRef]
- Shamshad, F.; Khan, S.; Zamir, S.W.; Khan, M.H.; Hayat, M.; Khan, F.S.; Fu, H. Transformers in medical imaging: A survey. Medical Image Analysis 2023, 88, 102802. [CrossRef]
- Kumar, R.R.; Priyadarshi, R. Denoising and segmentation in medical image analysis: A comprehensive review on machine learning and deep learning approaches. Multimedia Tools and Applications 2025, 84, 10817–10875. [CrossRef]
- Yao, W.; Bai, J.; Liao, W.; Chen, Y.; Liu, M.; Xie, Y. From cnn to transformer: A review of medical image segmentation models. Journal of Imaging Informatics in Medicine 2024. [CrossRef]
- Finlayson, S.G.; Bowers, J.D.; Ito, J.; Zittrain, J.L.; Beam, A.L.; Kohane, I.S. Adversarial attacks on medical machine learning. Science 2019, 363. [CrossRef]
- Dong, J.; Chen, J.; Xie, X.; Lai, J.; Chen, H. Adversarial attack and defense for medical image analysis: Methods and applications. arXiv 2023. arXiv:2303.14133.
- Li, X.; Zhu, D. Robust detection of adversarial attacks on medical images. In Proceedings of the 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI). IEEE, 2020.
- Bai, T.; Luo, J.; Zhao, J.; Wen, B.; Wang, Q. Recent advances in adversarial training for adversarial robustness. arXiv 2021. arXiv:2102.01356. [CrossRef]
- Dong, J.; Chen, J.; Xie, X.; Lai, J.; Chen, H. Survey on adversarial attack and defense for medical image analysis: Methods and challenges. ACM Computing Surveys 2024, 57, 1–38. [CrossRef]
- Nie, W.; Guo, B.; Huang, Y.; Xiao, C.; Vahdat, A.; Anandkumar, A. Diffusion Models for Adversarial Purification. In Proceedings of the International Conference on Machine Learning (ICML), 2022.
- Meng, D.; Chen, H. MagNet: A Two-Pronged Defense against Adversarial Examples. In Proceedings of the ACM Conference on Computer and Communications Security (CCS), 2017, pp. 135–147.
- Jaeger, S.; Candemir, S.; Antani, S.; Wáng, Y.X.; Lu, P.X.; Thoma, G. Two public chest X-ray datasets for computer-aided screening of pulmonary diseases. Quantitative Imaging in Medicine and Surgery 2014, 4, 475–477. [CrossRef]
- Krizhevsky, A.; Hinton, G.; et al. Learning multiple layers of features from tiny images 2009.
- Rahman, T.; Rahman, M.M.; Zubair, S.; Islam, K.M.R.; Karim, A. Reliable Tuberculosis Detection Using Chest X-Ray With Deep Learning, Segmentation and Visualization. IEEE Access 2020, 8. [CrossRef]
- Portals, N.T. TB Portals. https://tbportals.niaid.nih.gov. Accessed: Aug. 23, 2025.
- Liu, Y.; Wu, Y.H.; Zhang, S.C.; Liu, L.; Wu, M.; Cheng, M.M. Revisiting computer-aided tuberculosis diagnosis. IEEE transactions on pattern analysis and machine intelligence 2023, 46, 2316–2332. [CrossRef] [PubMed]
- Wang, L.; Lin, Z.Q.; Wong, A. COVID-Net: a tailored deep convolutional neural network design for detection of COVID-19 cases from chest X-ray images. Scientific Reports 2020, 10, 19549. [CrossRef]
- Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation, 2015. [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the Proceedings of the IEEE conference on computer vision and pattern recognition, 2016.
- Huang, G.; Liu, Z.; Van Der Maaten, L.; Weinberger, K.Q. Densely connected convolutional networks. In Proceedings of the Proceedings of the IEEE conference on computer vision and pattern recognition, 2017.
- Howard, A.; Sandler, M.; Chu, G.; Chen, L.C.; Chen, B.; Tan, M.; Wang, W.; Zhu, Y.; Pang, R.; Vasudevan, V.; et al. Searching for MobileNetV3. In Proceedings of the Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019, pp. 1314–1324.
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In Proceedings of the International Conference on Learning Representations (ICLR), 2021.
- Touvron, H.; Cord, M.; Douze, M.; Massa, F.; Sablayrolles, A.; Jégou, H. Training Data-Efficient Image Transformers and Distillation through Attention. In Proceedings of the International Conference on Machine Learning (ICML), 2021, pp. 10347–10357.
- Caron, M.; Touvron, H.; Misra, I.; Jégou, H.; Mairal, J.; Bojanowski, P.; Joulin, A. Emerging Properties in Self-Supervised Vision Transformers, 2021, [arXiv:cs.CV/2104.14294].
- Park, S.; Kim, G.; Oh, Y.; Seo, J.B.; Lee, S.M.; Kim, J.H.; Moon, S.; Lim, J.K.; Park, C.M.; Ye, J.C. Self-evolving vision transformer for chest X-ray diagnosis through knowledge distillation. Nature communications 2022, 13, 3848. [CrossRef] [PubMed]
- Caron, M.; Touvron, H.; Misra, I.; Jégou, H.; Mairal, J.; Bojanowski, P.; Joulin, A. Emerging properties in self-supervised vision transformers. In Proceedings of the Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 9650–9660.
- Goodfellow, I.J.; Shlens, J.; Szegedy, C. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 2014.
- Mądry, A.; Makelov, A.; Schmidt, L.; Tsipras, D.; Vladu, A. Towards deep learning models resistant to adversarial attacks. stat 2017, 1050.
- Dong, Y.; Liao, F.; Pang, T.; Su, H.; Zhu, J.; Hu, X.; Li, J. Boosting adversarial attacks with momentum. In Proceedings of the Proceedings of the IEEE conference on computer vision and pattern recognition, 2018.
- Nicolae, M.I.; Sinn, M.; Tran, M.N.; Buesser, B.; Rawat, A.; Wistuba, M.; Zantedeschi, V.; Baracaldo, N.; Chen, B.; Ludwig, H.; et al. Adversarial Robustness Toolbox v1.0.0, 2019. [CrossRef]
- Madry, A.; Makelov, A.; Schmidt, L.; Tsipras, D.; Vladu, A. Towards Deep Learning Models Resistant to Adversarial Attacks. In Proceedings of the International Conference on Learning Representations (ICLR), 2018.
- Zhang, H.; Yu, Y.; Jiao, J.; Xing, E.P.; El Ghaoui, L.; Jordan, M.I. Theoretically Principled Trade-off between Robustness and Accuracy. In Proceedings of the International Conference on Machine Learning (ICML), 2019, pp. 7472–7482.
- Han, T.; Nebelung, S.; Pedersoli, F.; Zimmermann, M.; Schulze-Hagen, M.; Ho, M.; Haarburger, C.; Kiessling, F.; Kuhl, C.; Schulz, V.; et al. Advancing diagnostic performance and clinical usability of neural networks via adversarial training and dual batch normalization. Nature communications 2021, 12, 4315. [CrossRef]



| Dataset | Total | Healthy | TB | COVID-19 |
|---|---|---|---|---|
| Shenzhen | 662 | 326 | 336 | 0 |
| Montgomery | 138 | 80 | 58 | 0 |
| TB Radio | 4200 | 3500 | 700 | 0 |
| TB Portals | 1049 | 0 | 1049 | 0 |
| TBX11K | 4600 | 3800 | 800 | 0 |
| COVIDx-CXR Train | 61440 | 0 | 0 | 61440 |
| Total | 76330 | 7706 | 2943 | 61440 |
| Dataset | Total | Healthy | TB | COVID-19 |
|---|---|---|---|---|
| CXR Train Unlabeled | 8332 | 3170 | 1719 | 3443 |
| CXR Train Labeled | 925 | 345 | 207 | 373 |
| CXR Test | 1031 | 391 | 215 | 425 |
| Total | 10288 | 3906 | 2141 | 4241 |
| Normal | TB | COVID-19 | |
|---|---|---|---|
| Clean | ![]() |
![]() |
![]() |
| Segmented | ![]() |
![]() |
![]() |
| Normal | TB | COVID-19 | |
|---|---|---|---|
| Clean | ![]() |
![]() |
![]() |
| White-box attack | ![]() |
![]() |
![]() |
| Black-box attack | ![]() |
![]() |
![]() |
| Model | Accuracy | F1 Score | AUC-ROC |
|---|---|---|---|
| DenseNet121 | 0.976 | 0.976 | 0.998 |
| ViT Base | 0.962 | 0.962 | 0.995 |
| ResNet50 | 0.967 | 0.967 | 0.998 |
| MobileNet | 0.985 | 0.985 | 0.999 |
| DeiT | 0.982 | 0.982 | 0.997 |
| DISTL | 0.988 | 0.992 | 0.998 |
| Model | Accuracy | F1 Score | AUC-ROC |
|---|---|---|---|
| DenseNet121 | 0.938 | 0.938 | 0.993 |
| ViT Base | 0.942 | 0.942 | 0.994 |
| ResNet50 | 0.949 | 0.949 | 0.994 |
| MobileNet | 0.943 | 0.943 | 0.993 |
| DeiT | 0.952 | 0.952 | 0.995 |
| DISTL | 0.936 | 0.936 | 0.989 |
| Model | Attack | White-Box | Black-Box | ||||
|---|---|---|---|---|---|---|---|
| F1 | AUC | ACC | F1 | AUC | ACC | ||
| ResNet50 | FGSM | 0.393 | 0.471 | 0.539 | 0.393 | 0.474 | 0.535 |
| PGD | 0.475 | 0.498 | 0.472 | 0.413 | 0.488 | 0.535 | |
| BIM | 0.479 | 0.494 | 0.472 | 0.458 | 0.512 | 0.535 | |
| MIM | 0.477 | 0.499 | 0.473 | 0.399 | 0.484 | 0.540 | |
| DenseNet121 | FGSM | 0.468 | 0.479 | 0.497 | 0.503 | 0.499 | 0.537 |
| PGD | 0.439 | 0.499 | 0.413 | 0.487 | 0.508 | 0.552 | |
| BIM | 0.419 | 0.490 | 0.396 | 0.437 | 0.508 | 0.414 | |
| MIM | 0.437 | 0.501 | 0.403 | 0.483 | 0.501 | 0.554 | |
| MobileNet | FGSM | 0.403 | 0.470 | 0.383 | 0.412 | 0.472 | 0.392 |
| PGD | 0.385 | 0.452 | 0.367 | 0.403 | 0.455 | 0.381 | |
| BIM | 0.379 | 0.444 | 0.360 | 0.397 | 0.447 | 0.373 | |
| MIM | 0.381 | 0.451 | 0.363 | 0.401 | 0.450 | 0.377 | |
| ViT Base | FGSM | 0.067 | 0.512 | 0.037 | 0.041 | 0.502 | 0.022 |
| PGD | 0.195 | 0.494 | 0.124 | 0.083 | 0.503 | 0.049 | |
| BIM | 0.331 | 0.507 | 0.253 | 0.412 | 0.506 | 0.359 | |
| MIM | 0.207 | 0.496 | 0.131 | 0.069 | 0.507 | 0.040 | |
| DeiT | FGSM | 0.280 | 0.505 | 0.210 | 0.315 | 0.509 | 0.240 |
| PGD | 0.262 | 0.497 | 0.191 | 0.298 | 0.501 | 0.222 | |
| BIM | 0.301 | 0.503 | 0.230 | 0.334 | 0.506 | 0.258 | |
| MIM | 0.271 | 0.498 | 0.200 | 0.306 | 0.503 | 0.232 | |
| DISTL | FGSM | 0.268 | 0.505 | 0.359 | 0.244 | 0.496 | 0.317 |
| PGD | 0.294 | 0.499 | 0.298 | 0.243 | 0.491 | 0.320 | |
| BIM | 0.346 | 0.499 | 0.380 | 0.392 | 0.498 | 0.392 | |
| MIM | 0.311 | 0.504 | 0.305 | 0.253 | 0.492 | 0.347 | |
| Model | Attack | White-Box | Black-Box | ||||
|---|---|---|---|---|---|---|---|
| F1 | AUC | ACC | F1 | AUC | ACC | ||
| ResNet50 | BIM | 0.697 | 0.499 | 0.535 | 0.738 | 0.464 | 0.585 |
| FGSM | 0.010 | 0.485 | 0.005 | 0.030 | 0.457 | 0.009 | |
| MIM | 0.379 | 0.480 | 0.234 | 0.034 | 0.478 | 0.017 | |
| PGD | 0.346 | 0.503 | 0.210 | 0.002 | 0.529 | 0.001 | |
| DenseNet121 | BIM | 0.191 | 0.512 | 0.106 | 0.527 | 0.531 | 0.358 |
| FGSM | 0.085 | 0.514 | 0.045 | 0.137 | 0.520 | 0.074 | |
| MIM | 0.142 | 0.476 | 0.077 | 0.112 | 0.491 | 0.059 | |
| PGD | 0.190 | 0.532 | 0.105 | 0.119 | 0.503 | 0.063 | |
| MobileNet | BIM | 0.451 | 0.465 | 0.451 | 0.451 | 0.476 | 0.451 |
| FGSM | 0.466 | 0.522 | 0.466 | 0.466 | 0.511 | 0.466 | |
| MIM | 0.453 | 0.472 | 0.453 | 0.453 | 0.488 | 0.453 | |
| PGD | 0.454 | 0.481 | 0.454 | 0.454 | 0.474 | 0.454 | |
| ViT Base | BIM | 0.515 | 0.463 | 0.347 | 0.631 | 0.471 | 0.461 |
| FGSM | 0.513 | 0.535 | 0.345 | 0.598 | 0.494 | 0.427 | |
| MIM | 0.469 | 0.478 | 0.306 | 0.571 | 0.486 | 0.400 | |
| PGD | 0.460 | 0.491 | 0.299 | 0.582 | 0.477 | 0.410 | |
| DeiT | BIM | 0.462 | 0.472 | 0.462 | 0.462 | 0.475 | 0.462 |
| FGSM | 0.471 | 0.551 | 0.471 | 0.471 | 0.525 | 0.471 | |
| MIM | 0.465 | 0.481 | 0.465 | 0.465 | 0.487 | 0.465 | |
| PGD | 0.466 | 0.493 | 0.466 | 0.466 | 0.479 | 0.466 | |
| DISTL | BIM | 0.131 | 0.468 | 0.070 | 0.359 | 0.475 | 0.219 |
| FGSM | 0.098 | 0.549 | 0.051 | 0.122 | 0.532 | 0.065 | |
| MIM | 0.349 | 0.479 | 0.211 | 0.120 | 0.488 | 0.064 | |
| PGD | 0.357 | 0.493 | 0.217 | 0.180 | 0.481 | 0.099 | |
| Defence | Attack | White-Box | Black-Box | |||||
|---|---|---|---|---|---|---|---|---|
| F1 | AUC | ACC | F1 | AUC | ACC | |||
| MGM* | FGSM | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 | |
| PGD | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 | ||
| BIM | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 | ||
| MIM | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 | ||
| AT-PGD | FGSM | 0.385 | 0.459 | 0.405 | 0.396 | 0.487 | 0.428 | |
| PGD | 0.374 | 0.505 | 0.396 | 0.379 | 0.430 | 0.425 | ||
| BIM | 0.371 | 0.504 | 0.391 | 0.387 | 0.438 | 0.425 | ||
| MIM | 0.370 | 0.501 | 0.389 | 0.378 | 0.452 | 0.418 | ||
| TRADES | FGSM | 0.399 | 0.467 | 0.390 | 0.410 | 0.448 | 0.450 | |
| PGD | 0.413 | 0.507 | 0.411 | 0.411 | 0.436 | 0.444 | ||
| BIM | 0.417 | 0.503 | 0.414 | 0.421 | 0.477 | 0.427 | ||
| MIM | 0.423 | 0.504 | 0.422 | 0.440 | 0.507 | 0.434 | ||
| Model | Attack | White-Box | Black-Box | ||||
|---|---|---|---|---|---|---|---|
| F1 | AUC | ACC | F1 | AUC | ACC | ||
| ResNet50 | FGSM | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 |
| PGD | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 | |
| BIM | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 | |
| MIM | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 | |
| DenseNet121* | FGSM | 0.984 | 0.999 | 0.985 | 0.984 | 0.999 | 0.985 |
| PGD | 0.984 | 0.999 | 0.985 | 0.984 | 0.999 | 0.985 | |
| BIM | 0.980 | 0.997 | 0.981 | 0.980 | 0.997 | 0.981 | |
| MIM | 0.984 | 0.999 | 0.985 | 0.984 | 0.999 | 0.985 | |
| MobileNet | FGSM† | 0.994 | 1.000 | 0.994 | 0.994 | 1.000 | 0.994 |
| PGD | 0.918 | 0.994 | 0.916 | 0.918 | 0.994 | 0.916 | |
| BIM | 0.770 | 0.937 | 0.760 | 0.770 | 0.937 | 0.760 | |
| MIM | 0.782 | 0.953 | 0.772 | 0.782 | 0.953 | 0.772 | |
| ViT Base | FGSM | 0.964 | 0.994 | 0.964 | 0.964 | 0.994 | 0.964 |
| PGD | 0.964 | 0.994 | 0.964 | 0.964 | 0.994 | 0.964 | |
| BIM | 0.824 | 0.877 | 0.811 | 0.824 | 0.877 | 0.811 | |
| MIM | 0.964 | 0.994 | 0.964 | 0.964 | 0.994 | 0.964 | |
| DeiT | FGSM | 0.903 | 0.944 | 0.894 | 0.903 | 0.944 | 0.894 |
| PGD | 0.908 | 0.946 | 0.901 | 0.908 | 0.946 | 0.901 | |
| BIM | 0.876 | 0.919 | 0.864 | 0.876 | 0.919 | 0.864 | |
| MIM | 0.908 | 0.946 | 0.900 | 0.908 | 0.946 | 0.900 | |
| DISTL | FGSM | 0.953 | 0.990 | 0.983 | 0.957 | 0.990 | 0.988 |
| PGD | 0.945 | 0.987 | 0.982 | 0.951 | 0.987 | 0.983 | |
| BIM | 0.927 | 0.936 | 0.975 | 0.932 | 0.936 | 0.980 | |
| MIM | 0.951 | 0.987 | 0.983 | 0.951 | 0.987 | 0.985 | |
| Model | Attack | White-Box | Black-Box | ||||
|---|---|---|---|---|---|---|---|
| F1 | AUC | ACC | F1 | AUC | ACC | ||
| ResNet50* | FGSM | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 |
| PGD | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 | |
| BIM | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 | |
| MIM | 0.969 | 0.998 | 0.969 | 0.969 | 0.998 | 0.969 | |
| DenseNet121 | FGSM | 0.966 | 0.996 | 0.966 | 0.966 | 0.996 | 0.966 |
| PGD | 0.966 | 0.996 | 0.966 | 0.966 | 0.996 | 0.966 | |
| BIM | 0.966 | 0.996 | 0.966 | 0.966 | 0.996 | 0.966 | |
| MIM | 0.966 | 0.996 | 0.966 | 0.966 | 0.996 | 0.966 | |
| MobileNet | FGSM | 0.962 | 0.995 | 0.962 | 0.962 | 0.995 | 0.962 |
| PGD | 0.962 | 0.995 | 0.962 | 0.962 | 0.995 | 0.962 | |
| BIM | 0.962 | 0.995 | 0.962 | 0.962 | 0.995 | 0.962 | |
| MIM | 0.962 | 0.995 | 0.962 | 0.962 | 0.995 | 0.962 | |
| ViT Base | FGSM | 0.954 | 0.992 | 0.954 | 0.954 | 0.992 | 0.954 |
| PGD | 0.954 | 0.992 | 0.954 | 0.954 | 0.992 | 0.954 | |
| BIM | 0.954 | 0.992 | 0.954 | 0.954 | 0.992 | 0.954 | |
| MIM | 0.954 | 0.992 | 0.954 | 0.954 | 0.992 | 0.954 | |
| DeiT | FGSM | 0.946 | 0.990 | 0.946 | 0.946 | 0.990 | 0.946 |
| PGD | 0.946 | 0.990 | 0.946 | 0.946 | 0.990 | 0.946 | |
| BIM | 0.946 | 0.990 | 0.946 | 0.946 | 0.990 | 0.946 | |
| MIM | 0.946 | 0.990 | 0.946 | 0.946 | 0.990 | 0.946 | |
| DISTL | FGSM | 0.958 | 0.993 | 0.958 | 0.958 | 0.993 | 0.958 |
| PGD | 0.958 | 0.993 | 0.958 | 0.958 | 0.993 | 0.958 | |
| BIM | 0.958 | 0.993 | 0.958 | 0.958 | 0.993 | 0.958 | |
| MIM | 0.958 | 0.993 | 0.958 | 0.958 | 0.993 | 0.958 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).














