Submitted:
01 June 2026
Posted:
03 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- A dimensionally explicit transition-matrix formulation that maps compressed deep representations to interpretable semantic attributes without retraining the underlying feature extractor.
- The Weighted Entropy-Density Discretization (WEDD) method, a supervised discretizer that balances class-conditional entropy with a local density penalty to ensure stable symbolic boundaries.
- A rough-set rule-induction mechanism that constructs a global, conflict-aware production rulebook, featuring explicit handling of structural abstentions and boundary regions.
- A comprehensive evaluation on the AwA2 benchmark and controlled synthetic data, demonstrating the framework’s ability to carefully balance predictive variance with rigorous algorithmic auditability.
2. Related works
2.1. Local Post-Hoc Explanation
2.2. Concept-Based and Ante-Hoc Models
2.3. Formal Knowledge Granulation and Rough Sets
2.4. Semantic Transfer as a Validation Proxy
3. Materials and Methods
3.1. Framework Architecture Overview
3.2. Phase 1: Neural-to-Semantic Transition
3.3. Phase 2.1: Semantic Attribute Selection and Stable Discretization (WEDD)
| Algorithm 1: Neural-to-Semantic Transition and WEDD Discretization (Phases 1 and 2.1) |
|
3.4. Phase 2.2: Rough-Set Granulation and Knowledge Discovery
3.5. Phase 2.3: Conflict-Aware Inference and Evaluation
3.5.1. Conflict Resolution and Soft Matching
3.5.2. Fidelity and Coverage Metrics
| Algorithm 2: Rough-Set Granulation, Reduct Solver, and Inference (Phases 2.3–2.5) |
|
3.6. Evaluation Metrics for Auditability
3.7. Experimental Protocol and Proof-of-Concept Design
3.7.1. Datasets, Feature Extraction, and Base Modeling
3.7.2. Empirical Evaluation Splits
3.7.3. Methodological Ablations and XAI Baselines
3.7.4. Algorithmic Recovery on Synthetic Data
3.7.5. Diagnostic Tracking and Inference Stability
4. Results
4.1. Semantic Reconstruction Performance (Protocol A)
4.2. Rulebook Auditability and Fidelity
4.2.1. The Fidelity-Coverage Trade-off
4.2.2. Conflict Awareness and Representative Logic
4.3. Zero-Shot Transfer as Semantic Validation (Protocol B)
4.3.1. Contextual Performance and Semantic Integrity
4.3.2. Diagnostic Per-Class Analysis
4.4. Comparison with Local Surrogates and Concept Baselines
4.4.1. The Fragmentation Gap: SEMTRA vs. Local Surrogates
4.4.2. Post-hoc Auditability vs. Ante-hoc Design
4.5. Algorithmic Recovery (Synthetic Benchmark)
4.5.1. Exactness of Logic Extraction
4.5.2. Robustness Under Controlled Noise
5. Discussion
5.1. The Scientific Value of Global Auditability
5.2. Interpreting the Audit Tax: Accuracy vs. Verifiability
5.3. The Power of Honest Silence and Indiscernibility
5.4. Linear Transition as a Defensive Choice
5.5. Limitations and Strategic Directions
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| AUROC | Area Under the Receiver Operating Characteristic |
| AwA2 | Animals with Attributes 2 |
| CART | Classification and Regression Trees |
| CBM | Concept Bottleneck Model |
| Cov | Rulebook Coverage |
| CPU | Central Processing Unit |
| CR | Conflict Rate |
| DAP | Direct Attribute Prediction |
| ECE | Expected Calibration Error |
| GFZSL | Generative Framework for Zero-Shot Learning |
| GPU | Graphics Processing Unit |
| IAP | Indirect Attribute Prediction |
| ILSVRC | ImageNet Large Scale Visual Recognition Challenge |
| LIME | Local Interpretable Model-agnostic Explanations |
| LLM | Large Language Model |
| MAE | Mean Absolute Error |
| MDLP | Minimum Description Length Principle |
| MLP | Multi-Layer Perceptron |
| RBF | Radial Basis Function |
| ResNet | Residual Network |
| RMSE | Root Mean Square Error |
| SEMTRA | Global Semantic Transition |
| SHAP | Shapley Additive Explanations |
| SVD | Singular-Value Decomposition |
| TCAV | Testing with Concept Activation Vectors |
| WEDD | Weighted Entropy-Density Discretization |
| XAI | Explainable Artificial Intelligence |
| ZSL | Zero-Shot Learning |
Appendix A. Experimental Protocol and Hyperparameters
| Component | Value | Implementation detail |
|---|---|---|
| Feature extractor | ResNet-101 features released with AwA2 | ILSVRC-pretrained representation layer; no image-level augmentation in this package |
| Feature dimension | 2048 | Global average/penultimate representation coordinates |
| Auxiliary compression | 64 | Principal Component Analysis (PCA) reduced to 64 components retaining 95% variance |
| Base predictor | Ridge classifier | Trained on variance-screened representation coordinates |
| Random seed | 42 | Used for all stochastic revision experiments |
| Semantic bridge | Ridge regression | Grid alpha in {0.01, 0.1, 1, 10, 100} |
| Rule thresholds | WEDD | alpha=0.65, max_depth=2, min_bin_size=30, min_gain=0.002 |
| Rule induction | Greedy reducts | tau=0.84, s_min=18 for AwA2 Protocol A |
| Soft matching threshold | Maximum allowed masked Hamming distance |
| Split | Top-1 | Top-5 | Macro-F1 | Weighted-F1 | AUROC | ECE |
|---|---|---|---|---|---|---|
| Validation | 0.7093 | 0.9306 | 0.5399 | 0.6639 | 0.9832 | 0.6622 |
| Test | 0.7116 | 0.9291 | 0.5434 | 0.6681 | 0.9836 | 0.6645 |
| Control knob | Value | Rules | Rulebook Coverage | Covered Fidelity | Covered Accuracy |
|---|---|---|---|---|---|
| 0.25 | 50 | 0.7687 | 0.4796 | 0.4414 | |
| 0.50 | 54 | 0.8640 | 0.3829 | 0.4073 | |
| 0.75 | 57 | 0.7847 | 0.3856 | 0.3935 | |
| 0.25 | 58 | 0.7136 | 0.3514 | 0.3535 | |
| 0.65 | 54 | 0.8640 | 0.3829 | 0.4073 | |
| 0.90 | 53 | 0.8977 | 0.4349 | 0.4205 |
Appendix B. Full Semantic Attribute Discretization Diagnostics
| Method | Thresholds | Rules | Avg. len. | Coverage | Abstention | All acc. | Cov. fidelity | Conflict |
|---|---|---|---|---|---|---|---|---|
| WEDD | 54 | 54 | 4.0370 | 0.8640 | 0.1360 | 0.3519 | 0.3829 | 0.1354 |
| MDLP-like entropy | 54 | 53 | 4.0566 | 0.8714 | 0.1286 | 0.3928 | 0.4556 | 0.1125 |
| Equal frequency | 36 | 52 | 4.2885 | 0.7474 | 0.2526 | 0.2324 | 0.3126 | 0.2506 |
| Equal width | 36 | 61 | 3.6721 | 0.5830 | 0.4170 | 0.2521 | 0.4750 | 0.4170 |
Appendix C. Rule Inference Traces and Structural Perturbation Stability
| Case | Object | True | Pred. | Mode | Rule | Support | Conf. | Antecedent states |
|---|---|---|---|---|---|---|---|---|
| correct exact | 5083 | collie | collie | exact | R0027 | 1675 | 0.208 | stripes=s0; hooves=s0; swims=s1; paws=s3; longneck=s0; hunter=s1; ocean=s1; quadrapedal=s3 |
| abstention | 24024 | ox | abstain | exact | R0029 | 2820 | 0.195 | stripes=s0; hooves=s2; swims=s1; paws=s0; longneck=s1; hunter=s0; ocean=s1; quadrapedal=s2 |
| fallback error or boundary | 16781 | hamster | mole | fallback | – | – | – | stripes=s0; hooves=s1; swims=s1; paws=s2; longneck=s0; hunter=s0; ocean=s1; quadrapedal=s2 |
Appendix C.1. Trace Walkthrough for a Zebra Instance
| Sigma | Rule consistency | Decision consistency | Coverage | Coverage change | Abstention | Conflict |
|---|---|---|---|---|---|---|
| 0.0000 | 1.0000 | 1.0000 | 0.8796 | 0.0000 | 0.1204 | 0.1180 |
| 0.0100 | 0.9745 | 0.9663 | 0.8792 | -0.0004 | 0.1208 | 0.1183 |
| 0.0250 | 0.9326 | 0.9176 | 0.8808 | 0.0012 | 0.1192 | 0.1168 |
| 0.0500 | 0.8667 | 0.8401 | 0.8816 | 0.0020 | 0.1184 | 0.1160 |
| 0.1000 | 0.7307 | 0.7112 | 0.8940 | 0.0145 | 0.1060 | 0.1026 |
References
- Longo, L.; Brcic, M.; Cabitza, F.; Choi, J.; Confalonieri, R.; Ser, J.D.; Guidotti, R.; Hayashi, Y.; Herrera, F.; Holzinger, A.; et al. Explainable artificial intelligence (XAI) 2.0: A manifesto of open challenges and interdisciplinary research directions. Inf. Fusion 2024, 106, 102301. [Google Scholar] [CrossRef]
- Mersha, M.; Lam, K.N.; Wood, J.; AlShami, A.K.; Kalita, J. Explainable artificial intelligence: A survey of needs, techniques, applications, and future direction. Neurocomputing 2024, 599, 128111. [Google Scholar] [CrossRef]
- Radiuk, P.; Barmak, O.; Manziuk, E.; Krak, I. Explainable deep learning: A visual analytics approach with transition matrices. Mathematics 2024, 12, 1024. [Google Scholar] [CrossRef]
- Barmak, O.; Krak, I.; Yakovlev, S.; Manziuk, E.; Radiuk, P.; Kuznetsov, V. Toward explainable deep learning in healthcare through transition matrix and user-friendly features. Front. Artif. Intell. 2024, 7, 1482141. [Google Scholar] [CrossRef]
- Radiuk, P.; Barmak, O.; Bedratyuk, L.; Krak, I. Equivariant transition matrices for explainable deep learning: A Lie group linearization approach. Mach. Learn. Knowl. Extr. 2026, 8, 92. [Google Scholar] [CrossRef]
- Pawlak, Z. Rough sets. Int. J. Comput. Inf. Sci. 1982, 11, 341–356. [Google Scholar] [CrossRef]
- Pawlak, Z. Rough sets: Theoretical aspects of reasoning about data . In Theory and Decision Library D; Kluwer Academic Publishers: Dordrecht, The Netherlands, 1991; Vol. 9. [Google Scholar] [CrossRef]
- Salih, A.M.; Raisi-Estabragh, Z.; Galazzo, I.B.; Radeva, P.; Petersen, S.E.; Lekadir, K.; Menegaz, G. A perspective on explainable artificial intelligence methods: SHAP and LIME. Adv. Intell. Syst. 2025, 7, 2400304. [Google Scholar] [CrossRef]
- Bobek, S.; Nalepa, G.J. Local universal rule-based explainer (LUX). SoftwareX 2025, 30, 102102. [Google Scholar] [CrossRef]
- Kozielski, M.; Sikora, M.; Wawrowski. Towards consistency of rule-based explainer and black box model: Fusion of rule induction and XAI-based feature importance. Knowl.-Based Syst. 2025, 311, 113092. [Google Scholar] [CrossRef]
- Nicolson, A.; Schut, L.; Noble, J.A.; Gal, Y. Explaining explainability: Recommendations for effective use of concept activation vectors. In Trans. Mach. Learn. Res.; 2025. [Google Scholar]
- Srivastava, D.; Yan, G.; Weng, T.W. VLG-CBM: Training concept bottleneck models with vision-language guidance. In Proceedings of the Advances in Neural Information Processing Systems, 2024; Vol. 37. [Google Scholar]
- Zarlenga, M.E.; Barbiero, P.; Ciravegna, G.; Marra, G.; Giannini, F.; Diligenti, M.; Shams, Z.; Precioso, F.; Melacci, S.; Weller, A.; et al. Concept embedding models: Beyond the accuracy-explainability trade-off. Proc. Adv. Neural Inf. Process. Syst. 2022, Vol. 35, 21400–21413. [Google Scholar]
- Yüksekgönül, M.; Wang, M.; Zou, J. Post-hoc concept bottleneck models. In Proceedings of the Proceedings of the 11th International Conference on Learning Representations, 2023. [Google Scholar]
- Mienye, I.D.; Jere, N.R. A survey of decision trees: Concepts, algorithms, and applications. IEEE Access 2024, 12, 86716–86727. [Google Scholar] [CrossRef]
- Bollaert, H.; Palangetić, M.; Cornelis, C.; Greco, S.; Słowiński, R. FRRI: A novel algorithm for fuzzy-rough rule induction. Inf. Sci. 2025, 686, 121362. [Google Scholar] [CrossRef]
- Zhang, Y.; Li, R.; Wu, N.; Li, Q.; Lin, X.; Hu, Y.; Li, T.; Jiang, Y. Dissect black box: Interpreting for rule-based explanations in unsupervised anomaly detection. In Proceedings of the Advances in Neural Information Processing Systems, 2024; Vol. 37. [Google Scholar]
- Şenozan, H.; Soylu, B. A flexible non-monotonic discretization method for pre-processing in supervised learning. Pattern Recognit. Lett. 2024, 181, 77–85. [Google Scholar] [CrossRef]
- Biroli, G.; Mézard, M. Kernel density estimators in large dimensions. SIAM J. Math. Data Sci. 2026, 8, 46–76. [Google Scholar] [CrossRef]
- Xian, Y.; Lampert, C.H.; Schiele, B.; Akata, Z. Zero-shot learning: A comprehensive evaluation of the good, the bad and the ugly. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 41, 2251–2265. [Google Scholar] [CrossRef]
- Lampert, C.H.; Pucher, D.; Dostal, J. Animals with attributes 2. Dataset web page. Accessed. 2017. (accessed on 22 May 2026).
- Lampert, C.H.; Nickisch, H.; Harmeling, S. Learning to detect unseen object classes by between-class attribute transfer. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009; pp. 951–958. [Google Scholar] [CrossRef]
- Lampert, C.H.; Nickisch, H.; Harmeling, S. Attribute-based classification for zero-shot visual object categorization. IEEE Trans. Pattern Anal. Mach. Intell. 2014, 36, 453–465. [Google Scholar] [CrossRef] [PubMed]
- Verma, V.K.; Rai, P. A simple exponential family framework for zero-shot learning. In Proceedings of the Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2017, Proceedings, Part II, 2017; Springer; pp. 792–808. [Google Scholar] [CrossRef]
- Norouzi, M.; Mikolov, T.; Bengio, S.; Singer, Y.; Shlens, J.; Frome, A.; Corrado, G.S.; Dean, J. Zero-shot learning by convex combination of semantic embeddings. In Proceedings of the Proceedings of the 2nd International Conference on Learning Representations, 2014. [Google Scholar]
- Socher, R.; Ganjoo, M.; Manning, C.D.; Ng, A.Y. Zero-shot learning through cross-modal transfer. Proc. Adv. Neural Inf. Process. Syst. 2013, Vol. 26, 935–943. [Google Scholar]
- Zhang, Z.; Saligrama, V. Zero-shot learning via semantic similarity embedding. In Proceedings of the Proceedings of the IEEE International Conference on Computer Vision, 2015; pp. 4166–4174. [Google Scholar] [CrossRef]
- Xian, Y.; Akata, Z.; Sharma, G.; Nguyen, Q.; Hein, M.; Schiele, B. Latent embeddings for zero-shot classification. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016; pp. 69–77. [Google Scholar] [CrossRef]
- Akata, Z.; Perronnin, F.; Harchaoui, Z.; Schmid, C. Label-embedding for image classification. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 38, 1425–1438. [Google Scholar] [CrossRef]
- Frome, A.; Corrado, G.S.; Shlens, J.; Bengio, S.; Dean, J.; Ranzato, M.; Mikolov, T. DeViSE: A deep visual-semantic embedding model. Proc. Adv. Neural Inf. Process. Syst. 2013, Vol. 26, 2121–2129. [Google Scholar]
- Akata, Z.; Reed, S.; Walter, D.; Lee, H.; Schiele, B. Evaluation of output embeddings for fine-grained image classification. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015; pp. 2927–2936. [Google Scholar] [CrossRef]
- Romera-Paredes, B.; Torr, P.H.S. An embarrassingly simple approach to zero-shot learning. In Proceedings of the Proceedings of the 32nd International Conference on Machine Learning. PMLR, 2015, Vol. 37, Proceedings of Machine Learning Research, pp. 2152–2161.
- Changpinyo, S.; Chao, W.L.; Gong, B.; Sha, F. Synthesized classifiers for zero-shot learning. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016; pp. 5327–5336. [Google Scholar] [CrossRef]
- Kodirov, E.; Xiang, T.; Gong, S. Semantic autoencoder for zero-shot learning. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017; pp. 3174–3183. [Google Scholar] [CrossRef]
- Halko, N.; Martinsson, P.G.; Tropp, J.A. Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions. SIAM Rev. 2011, 53, 217–288. [Google Scholar] [CrossRef]
- Kalyta, O.; Barmak, O.; Radiuk, P.; Krak, I. Facial emotion recognition for photo and video surveillance based on machine learning and visual analytics. Appl. Sci. 2023, 13, 9890. [Google Scholar] [CrossRef]







| Split | MAE | RMSE | Mean semantic correlation |
|---|---|---|---|
| train | |||
| validation | |||
| test |
| Operator | MAE | RMSE | Corr. | Rule cov. | Rule acc. | Runtime (s) |
|---|---|---|---|---|---|---|
| Linear ridge transition | 0.1573 | 0.7759 | 0.1941 | 0.0451 | ||
| RBF kernel ridge | 0.1958 | 0.2583 | 0.1176 | 1.0000 | 0.0027 | 2.8573 |
| Two-layer MLP regressor | 0.1685 | 0.2355 | 0.4497 | 0.8390 | 0.0915 | 0.6914 |
| Semantic attribute | Salience | Test MAE | Selection score |
|---|---|---|---|
| stripes | 0.025 | 0.105 | 0.202 |
| hooves | 0.036 | 0.153 | 0.208 |
| swims | 0.032 | 0.090 | 0.288 |
| paws | 0.045 | 0.180 | 0.228 |
| longneck | 0.020 | 0.099 | 0.166 |
| hunter | 0.037 | 0.159 | 0.204 |
| ocean | 0.032 | 0.072 | 0.346 |
| quadrapedal | 0.030 | 0.111 | 0.225 |
| water | 0.033 | 0.082 | 0.325 |
| flippers | 0.031 | 0.073 | 0.323 |
| hands | 0.025 | 0.075 | 0.271 |
| plankton | 0.016 | 0.043 | 0.257 |
| Metric | Value (Ours) | CART baseline | Separate-and-conquer |
|---|---|---|---|
| Number of rules | 53 | 197 | |
| Test coverage | 1.0000 | 0.2620 | |
| Test accuracy (non-abstained) | 0.4129 | 0.6334 | |
| Average conditions per rule | 7.0000 | 3.2589 | |
| Test abstention rate | 0.0000 | 0.7380 | |
| Test conflict rate | 0.0000 | 0.2301 |
| Rule | Antecedent | Numeric bounds | Class | Support* | Conf. |
|---|---|---|---|---|---|
| R0001 | paws=s2 AND hunter=s2 AND small=s0 AND yellow=s2 | paws ; hunter ; small ; yellow | tiger | 115 | 0.896 |
| R0003 | stripes=s1 AND hooves=s3 AND paws=s0 AND hands=s0 AND small=s1 AND yellow=s1 | stripes ; hooves ; paws ; hands ; small ; yellow | antelope | 132 | 0.871 |
| R0004 | hunter=s0 AND water=s0 AND small=s0 AND jungle=s2 AND yellow=s1 | hunter ; water ; small ; jungle ; yellow | zebra | 434 | 0.866 |
| Method | Family | Accuracy (%) | Source and scope |
|---|---|---|---|
| DAP [22] | Attribute transfer | 46.10 | Published ZSL baseline |
| IAP [23] | Attribute transfer | 35.90 | Published ZSL baseline |
| Ours: Semantic Transition | Post-hoc bridge | 48.43 | Class-averaged |
| Ours: Symbolic Template | Post-hoc rule | 39.93 | Class-averaged |
| GFZSL [24] | Generative | 63.80 | Published ZSL baseline |
| Unseen class | Prototype acc. | Symbolic template acc. | Mean Hamming |
|---|---|---|---|
| blue whale | 0.9195 | 0.8851 | 0.1858 |
| bobcat | 0.9397 | 0.7952 | 0.5134 |
| sheep | 0.5908 | 0.1465 | 0.4949 |
| bat | 0.0000 | 0.3655 | 0.5909 |
| seal | 0.6110 | 0.4500 | 0.4120 |
| walrus | 0.5200 | 0.3200 | 0.4800 |
| dolphin | 0.4500 | 0.3800 | 0.5200 |
| cow | 0.3320 | 0.2000 | 0.4600 |
| rat | 0.1500 | 0.1000 | 0.5500 |
| raccoon | 0.3300 | 0.3507 | 0.4750 |
| Average | 0.4843 | 0.3993 | 0.4682 |
| Method | n | Top-k agreement | Direction agreement | Rank correlation | 95% CI |
|---|---|---|---|---|---|
| LIME-style | 1000 | 0.2610 | 0.5907 | 0.0222 | [0.24, 0.27] |
| SHAP-style | 1000 | 0.3213 | 0.6071 | 0.1271 | [0.30, 0.33] |
| Component | Type | Accuracy / Concept Relevance | MAE | Concept Corr. |
|---|---|---|---|---|
| Frozen-feature CBM | Ante-hoc (Trained) | 0.7232 | 0.1076 | 0.7759 |
| SEMTRA (Ours) | Post-hoc (Audit) | 0.4073 | 0.1076 | 0.7759 |
| TCAV | Concept direction | 0.4850 | – | – |
| Class | Ground-truth rule | Numeric cut summary |
|---|---|---|
| 1 | high AND low | ; |
| 2 | medium AND high | ; |
| 3 | low, high, AND medium | ; ; |
| Noise () | MAE | Threshold error | Rule Jaccard | Macro-F1 | Coverage |
|---|---|---|---|---|---|
| 0.000 | 0.000 | 0.014 | 0.700 | 0.879 | 0.971 |
| 0.100 | 0.021 | 0.018 | 0.763 | 0.881 | 0.962 |
| 0.200 | 0.041 | 0.017 | 0.743 | 0.838 | 0.952 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).