Submitted:
31 March 2025
Posted:
31 March 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- We propose DeepPrior-EG, a deep prior-guided EG framework for addressing the longstanding issue of misalignment between baselines and the concept of missingness. It strategically initiates gradient path integration from the prior baselines, computing expectation gradients along the trajectory spanning to the input image. It autonomously also extracts priors from the intrinsic deep features of the CNN layers.
- We achieve these priors through two strategies: a multivariate Gaussian model (MGM) formulation that captures high-dimensional feature interdependencies, and a Bayesian nonparametric Gaussian mixture model(BGMM) approach that adaptively infers mixture complexity while representing heterogeneous feature distributions.
- We re-train models by incorporating the explanations from the proposed framework. It improves model robustness to noise and minimizes interference from irrelevant background features. Experimental evaluations across multiple metrics demonstrate that our approach outperforms traditional methods, while the BGMM approach achieves the best performances.
- We conduct extensive experiments using a range of evaluation metrics (e.g., KPM, KNM) to comprehensively assess the interpretability of our method. Results confirm its superiority in capturing relevant features and enhancing interpretive fidelity.
2. Related Work
2.1. Gradient-Based Explanation Methods
2.2. Integrated Gradients
2.3. Expected Gradients
3. Methodology
3.1. Motivation and Proposed Framework
- Inherent Prior Encoding: CNNs naturally encode hierarchical priors through layered architectures, capturing low-level features (edges/textures) and high-level semantics (object shapes/categories), enabling effective modeling of complex nonlinear patterns.
- Superior Feature Learning: Unlike traditional methods (SIFT/HOG) limited to low-level geometric features, CNNs automatically learn task-specific representations through end-to-end training, eliminating manual feature engineering.
- Noise-Robust Representation: By mapping high-dimensional pixels to compact low-dimensional spaces, CNNs reduce data redundancy while enhancing feature robustness.
- Transfer Learning Efficiency: Pretrained models (ResNet/VGG) on large datasets (ImageNet) provide transferable general features, improving generalization and reducing training costs.
3.2. Multivariate Gaussian Model for Deep Priors
3.3. Bayesian Gaussian Mixture Models for Deep Priors
- vs GMM: Avoids preset K and singular covariance issues through nonparametric regularization
- vs KDE: Explicitly models multimodality versus kernel-based density biases
- vs VAE: Maintains strict likelihood-based generation unlike decoder-induced distribution shifts
4. Experiments
4.1. ImageNet Dataset
4.2. Evaluation Metrics
- KPM/KAM: Measure the ability of the model to recover positive or important features, respectively. These metrics highlight the extent to which explanations emphasize feature contributions that positively influence predictions.
- RPM/RNM: Assess the impact of removing important features on model predictions. These metrics quantify how sensitive the model is to the absence of critical features, thereby evaluating the robustness of the explanation method.
- RAM: Provides a general measure of feature importance based on absolute values, capturing the overall contribution of features irrespective of their directionality.
4.3. Results
4.3.1. Evaluation on ImageNet



4.3.2. Improving Noise Robustness of Model Re-trained with Attribution Priors
5. Discussion
5.1. Comparison with Shape Priors
5.2. Comparison of Methods with CAM
6. Conclusion
Acknowledgments
Conflicts of Interest
References
- Wang H, Fu T, Du Y, et al. Scientific discovery in the age of artificial intelligence[J]. Nature, 2023, 620(7972): 47-60. [CrossRef]
- Alowais S A, Alghamdi S S, Alsuhebany N, et al. Revolutionizing healthcare: the role of artificial intelligence in clinical practice[J]. BMC medical education, 2023, 23(1): 689. [CrossRef]
- Khaleel M, Ahmed A A, Alsharif A. Artificial Intelligence in Engineering[J]. Brilliance: Research of Artificial Intelligence, 2023, 3(1): 32-42.
- Nazir S, Dickson D M, Akram M U. Survey of explainable artificial intelligence techniques for biomedical imaging with deep neural networks[J]. Computers in Biology and Medicine, 2023, 156: 106668. [CrossRef]
- Hassija V, Chamola V, Bajpai B C, et al. Security issues in implantable medical devices: Fact or fiction?[J]. Sustainable Cities and Society, 2021, 66: 102552.
- Wichmann F A, Geirhos R. Are deep neural networks adequate behavioral models of human visual perception?[J]. Annual Review of Vision Science, 2023, 9(1): 501-524.
- Hassija V, Chamola V, Mahapatra A, et al. Interpreting black-box models: a review on explainable artificial intelligence[J]. Cognitive Computation, 2024, 16(1): 45-74. [CrossRef]
- Slack D, Hilgard A, Singh S, et al. Reliable post hoc explanations: Modeling uncertainty in explainability[J]. Advances in neural information processing systems, 2021, 34: 9391-9404. [CrossRef]
- Ai Q, Narayanan. R L. Model-agnostic vs. model-intrinsic interpretability for explainable product search[C]//Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 2021: 5-15.
- Song Y Y, Ying L U. Decision tree methods: applications for classification and prediction[J]. Shanghai archives of psychiatry, 2015, 27(2): 130. [CrossRef]
- Su X, Yan X, Tsai C L. Linear regression[J]. Wiley Interdisciplinary Reviews: Computational Statistics, 2012, 4(3): 275-294.
- LaValley M, P. Logistic regression[J]. Circulation, 2008, 117(18): 2395-2399.
- Rigatti S, J. Random forest[J]. Journal of Insurance Medicine, 2017, 47(1): 31-39.
- Kontschieder P, Fiterau M, Criminisi A, et al. Deep neural decision forests[C]//Proceedings of the IEEE international conference on computer vision. 2015: 1467-1475.
- Chen T, Guestrin C. Xgboost: A scalable tree boosting system[C]//Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. 2016: 785-794.
- Fernando T, Gammulle H, Denman S, et al. Deep learning for medical anomaly detection–a survey[J]. ACM Computing Surveys (CSUR), 2021, 54(7): 1-37. [CrossRef]
- Sundararajan M, Taly A, Yan Q. Gradients of counterfactuals[J]. arxiv preprint arxiv:1611.02639, 2016.
- Shrikumar A, Greenside P, Kundaje A. Learning important features through propagating activation differences[C]//International conference on machine learning. PMlR, 2017: 3145-3153.
- Bach S, Binder A, Montavon G, et al. On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation[J]. PloS one, 2015, 10(7): e0130140. [CrossRef]
- Erion G, Janizek J D, Sturmfels P, et al. Improving performance of deep learning models with axiomatic attribution priors and expected gradients[J]. Nature machine intelligence, 2021, 3(7): 620-631.
- Hamarneh G, Li X. Watershed segmentation using prior shape and appearance knowledge[J]. Image and Vision Computing, 2009, 27(1-2): 59-68.
- Ulyanov D, Vedaldi A, Lempitsky V. Deep image prior[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 9446-9454.
- Do C, B. The multivariate Gaussian distribution[J]. Section Notes, Lecture on Machine Learning, CS, 2008, 229.
- Nosrati, Masoud S., and Ghassan Hamarneh. "Incorporating prior knowledge in medical image segmentation: a survey." arXiv preprint (2016). arXiv:1607.01092 (2016).
- Zhao X, Huang W, Huang X, et al. Baylime: Bayesian local interpretable model-agnostic explanations[C]//Uncertainty in artificial intelligence. PMLR, 2021: 887-896.
- Baehrens D, Schroeter T, Harmeling S, et al. How to explain individual classification decisions[J]. The Journal of Machine Learning Research, 2010, 11: 1803-1831.
- Enguehard, J. Sequential Integrated Gradients: a simple but effective method for explaining language models[J]. arxiv preprint arxiv:2305.15853, 2023.
- Sayres R, Taly A, Rahimy E, et al. Using a deep learning algorithm and integrated gradients explanation to assist grading for diabetic retinopathy[J]. Ophthalmology, 2019, 126(4): 552-564.
- Sturmfels P, Lundberg S, Lee S I.Visualizing the Impact of Feature Attribution Baselines[J].Distill, 2020, 5(1). [CrossRef]
- Pignat E, Calinon S. Bayesian Gaussian mixture model for robotic policy imitation[J]. IEEE Robotics and Automation Letters, 2019, 4(4): 4452-4458. [CrossRef]
- Deng J, Dong W, Socher R, et al. Imagenet: A large-scale hierarchical image database[C]//2009 IEEE conference on computer vision and pattern recognition. Ieee, 2009: 248-255.
- Krizhevsky A, Sutskever I, Hinton G E. 2012 AlexNet[J]. Adv. Neural Inf. Process. Syst, 2012: 1-9.
- Lundberg S M, Erion G, Chen H, et al. From local explanations to global understanding with explainable AI for trees[J]. Nature machine intelligence, 2020, 2(1): 56-67. [CrossRef]
- Petsiuk, V. Rise: Randomized Input Sampling for Explanation of black-box models[J]. arxiv preprint arxiv:1806.07421, 2018.
- Lu, J. A survey on Bayesian inference for Gaussian mixture model[J]. arXiv preprint. arXiv:2108.11753, 2021.




| method | KPM | KNM | KAM | RPM | RNM | RAM |
|---|---|---|---|---|---|---|
| EG | 1.0952 | -1.1014 | 0.9653 | 1.2502 | -1.2558 | 1.4305 |
| LIFT | 1.1027 | -1.1283 | 0.9288 | 1.2120 | -1.2316 | 1.4432 |
| DeepBGMM-EG | 1.1193 | -1.1279 | 0.9580 | 1.2224 | -1.2340 | 1.4361 |
| DeepMGM-EG | 1.1192 | -1.1283 | 0.9565 | 1.2217 | -1.2333 | 1.4342 |
| method | KPM | KNM | KAM | RPM | RNM | RAM |
|---|---|---|---|---|---|---|
| EG | 1.2418 | -1.2390 | 1.0217 | 2.4655 | -2.4639 | 2.8490 |
| LIFT | 1.0055 | -0.8827 | 0.9758 | 2.7043 | -2.6894 | 2.8403 |
| DeepBGMM-EG | 1.2603 | -1.2412 | 1.0244 | 2.3692 | -2.3510 | 2.7601 |
| DeepMGM-EG | 1.2540 | -1.2416 | 1.0131 | 2.4678 | -2.4566 | 2.8758 |
| method | KPM | KNM | KAM | RPM | RNM | RAM |
|---|---|---|---|---|---|---|
| EG | 1.0952 | -1.1014 | 0.9653 | 1.2502 | -1.2558 | 1.4305 |
| DeepBGMM-EG | 1.1193 | -1.1279 | 0.9580 | 1.2224 | -1.2340 | 1.4361 |
| Shape-EG | 1.1018 | -1.1044 | 0.9649 | 1.2485 | -1.2504 | 1.4317 |
| method | insertion | deletion |
|---|---|---|
| Lime | 0.1178 | 0.1127 |
| Grad-CAM | 0.1233 | 0.1290 |
| Baylime | 0.1178 | 0.1127 |
| EG | 0.6849 | 0.1206 |
| DeepBGMM-EG | 0.6849 | 0.1202 |
| DeepMGM-EG | 0.6842 | 0.1204 |
| Shape-EG | 0.6841 | 0.1207 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).