Preprint
Article

This version is not peer-reviewed.

SEMTRA: Global Semantic Transition and Rough-Set Rules for Auditable Post-hoc Explainability

Submitted:

01 June 2026

Posted:

03 June 2026

You are already at the latest version

Abstract
Deep learning architectures produce highly effective yet uninterpretable latent representations, creating an interpretability gap that fundamentally impairs model verification and automated rule extraction. In this work, we propose Global Semantic Transition (SEMTRA), a post-hoc framework bridging this gap by translating numerical representations into auditable rough-set production rules without retraining the underlying feature extractor. Evaluated on the Animals with Attributes 2 (AwA2) benchmark, the proposed semantic transition achieved a mean absolute error of 0.1295. The systematically extracted rulebook successfully covered 86.40% of the test instances, significantly outperforming standard separate-and-conquer learners, which covered only 26.20%. The rules yielded a non-abstained covered accuracy of 40.73%, strictly verifying the transparent, mathematically robust portion of the model logic. Furthermore, the continuous semantic prototype transfer achieved a zero-shot accuracy of 48.43%, surpassing foundational baselines such as direct attribute prediction, which achieved 46.10%. A synthetic benchmark independently confirmed exact algorithmic recovery with a macro-F1 score of 0.8668. These findings establish that post-hoc semantic transition, combined with rigorous rough-set granulation, provides a reproducible pathway to transparent symbolic explanations, effectively balancing the tradeoff between predictive precision and algorithmic verifiability.
Keywords: 
;  ;  ;  ;  ;  ;  

1. Introduction

Deep learning architectures have established themselves as definitive standards for visual recognition and continuous transfer learning because of their inherent ability to compress high-dimensional observations into remarkably effective latent representations. However, this compression mechanism inherently creates a severe black-box effect, meaning that neural features remain structurally optimized for predictive success rather than for human interpretability. As automated systems increasingly permeate mission-critical diagnostic domains, the demand for transparency has fundamentally shifted away from heuristic approximations toward rigorous, mathematically sound explanations. The recent manifesto detailing the evolution of Artificial Intelligence (AI) frameworks emphasizes the critical need for verifiable symbolic knowledge [1].
Current approaches addressing this transparency demand fall into diverse methodological categories. A comprehensive survey of modern explainable frameworks illustrates a prominent divide between local diagnostic techniques and globally interpretable model designs [2]. The problem under consideration is the inability to systematically translate compressed latent neural coordinates into a robust, global, and conflict-aware rulebook without heavily retraining or modifying the underlying feature extractor. While instance-based post-hoc explanations generate input-specific attribution maps, they completely fail to produce a holistic, reusable logic policy for rigorous auditing. Conversely, interpretable-by-design architectures necessitate predefined concept vocabularies and costly structural re-engineering of existing high-performance networks. This persistent interpretability gap severely impairs routine model verification and diagnostic safety. Consequently, it is highly important to present novel scientific contributions to address the raised problem by synthesizing a global mathematical bridge that connects uninterpretable latent manifolds with auditable symbolic spaces.
To construct this required bridge, the proposed SEMTRA framework explicitly implements a post-hoc continuous semantic transition. Early analytical studies validated the efficacy of transition matrices for interpreting complex networks within structured visual analytics environments [3]. These matrix structures were subsequently adapted and refined to extract understandable features in specialized healthcare informatics contexts [4]. Furthermore, theoretical research extended linear translation operators to effectively manage continuous geometric symmetries via equivariant algebraic manifolds [5]. SEMTRA advances this trajectory by converting the extracted continuous semantic variables into verifiable logical predicates using established rough-set principles. Decades ago, initial mathematical formalizations introduced the concept of bounding uncertainty through precise indiscernibility relations [6]. Comprehensive theoretical extensions eventually solidified the applicability of rough sets in structured reasoning about inconsistent data [7].
The goal of this study is to improve the auditability, transparency, and logical verifiability of pre-trained deep learning representations by applying a continuous semantic transition coupled with rigorous rough-set knowledge granulation.
The primary objective of this study is to formulate and empirically evaluate the SEMTRA framework, ensuring it effectively translates latent neural features into a conflict-aware symbolic rulebook while explicitly identifying ambiguous boundary regions. To fulfill this objective, the research proposes four major contributions:
  • A dimensionally explicit transition-matrix formulation that maps compressed deep representations to interpretable semantic attributes without retraining the underlying feature extractor.
  • The Weighted Entropy-Density Discretization (WEDD) method, a supervised discretizer that balances class-conditional entropy with a local density penalty to ensure stable symbolic boundaries.
  • A rough-set rule-induction mechanism that constructs a global, conflict-aware production rulebook, featuring explicit handling of structural abstentions and boundary regions.
  • A comprehensive evaluation on the AwA2 benchmark and controlled synthetic data, demonstrating the framework’s ability to carefully balance predictive variance with rigorous algorithmic auditability.
The remainder of this paper is organized as follows. Section 2 details the related works covering explainable methods and symbolic rules. Section 3 outlines the proposed framework methodology, the continuous transition bridge, and the subsequent rough-set extraction mechanisms. Section 4 presents the qualitative and quantitative evaluations. Section 5 analyzes the implications of the structural explanation tradeoff. Finally, Section 6 concludes the manuscript.

3. Materials and Methods

3.1. Framework Architecture Overview

The SEMTRA framework is structurally designed as a universal, modular, post-hoc proof-of-concept pipeline applicable to all possible XAI problems. Operating seamlessly over any frozen feature extractor, the methodology systematically produces a globally consistent and auditable symbolic rulebook. The conceptual architecture, illustrated in Figure 1, explicitly follows a rigorous two-phase progression: Phase 1, designated as Neural-to-Semantic Transition, and Phase 2, defined as Semantic-to-Symbolic Granulation.
The progression moves from the initial latent space A , mapped through a robust global transition matrix T , to reconstruct understandable continuous semantics B . Subsequently, these semantics are discretized through the WEDD mechanism to finally induce the optimal symbolic production rules R . To ensure absolute notational clarity and resolve discrete representation overlaps, the matrix of all discretized states is strictly denoted as S , with individual symbolic states for object i and attribute j defined as s i j . The ultimate output of the framework structurally follows the standardized rule format r:
r : j R r [ s j = v j ] d = y r ,
where R r is the optimized minimal set of semantic antecedents, s j represents the specific discrete state variable required by the rule, and v j is the corresponding exact logical value bound.
The mathematical formalization of this entire universal decision-making system S is defined as:
S = ( U , C , d , V , f ) ,
where U is the universe of distinct objects, C is the set of conditional semantic attributes, d is the target decision attribute, V is the union of valid attribute domains, and f is the deterministic information function.
By explicitly separating heavy computational inference from lightweight symbolic extraction, SEMTRA ensures that explanations across any classification task are fully traceable and reproducible.

3.2. Phase 1: Neural-to-Semantic Transition

The first phase establishes a mathematically explicit bridge between the uninterpretable latent representation space of a deep model and a target semantic attribute space. Let A R m × k denote the matrix of raw, untransparent latent features. To ensure that the subsequent mapping is robust to dataset-specific biases, we first perform dimensional centering. Let the vector a ¯ R k denote the absolute mean computed across the entire training distribution. The centered representation matrix A c is subsequently computed via:
A c = A 1 a ¯ R m × k .
To handle potential noise and extreme collinearity within high-dimensional latent spaces commonly found in complex architectures, a rank-r SVD basis, denoted as V r R k × r , is rigorously extracted [35]. This yields a computationally compressed latent space Z :
Z = A c V r R m × r .
The mapping to a human-understandable target semantics matrix B R m × , where represents the count of conceptually interpretable variables, is established via a ridge-regression weight matrix W R r × , formalized as the following minimization problem:
W * = arg min W R r × Z W B F 2 + α W F 2 .
The choice of a linear ridge operator is an intentional methodological decision. We deliberately employ a linear transition matrix because it is fundamentally interpretable in itself; human auditors can directly inspect the exact weights of each feature to mathematically track structural contributions. Furthermore, linearity guarantees topological stability, preventing the unpredictable boundary distortions and chaotic gradients typically associated with deep non-linear alternatives. With an estimated analytical intercept b 0 R , the primary global transition operator T and the reconstructed continuous attributes B ^ are successfully recovered:
T = V r W * R k × , B ^ = ( A 1 a ¯ ) T + 1 b 0 .
This linear transformation enforces the model’s internal logic into a semantically grounded coordinate system. Global semantic reconstruction fidelity is strictly evaluated using the Mean Absolute Error (MAE):
MAE Q = 1 | Q | i Q j = 1 | b ^ i j b i j | .

3.3. Phase 2.1: Semantic Attribute Selection and Stable Discretization (WEDD)

Once the continuous semantic attributes B ^ are reconstructed, the next operation isolates a highly relevant subset of q predictive attributes (empirically set to q = 15 for rule induction) to prevent computational combinatorial explosion. Let e j signify the normalized validation MAE for attribute j, and let η j = T : j 2 represent the mathematical transition salience. The formal selection score is defined as:
score j = λ s η ˜ j + ( 1 λ s ) ( 1 e ˜ j ) .
where ( · ) ˜ explicitly denotes Min-Max normalization mapping values robustly into the [ 0 , 1 ] interval across all available target attributes.
The parameters λ s and λ H serve as dedicated auditor’s control knobs. Specifically, λ s [ 0 , 1 ] allows the user to mathematically prioritize either the strength of the neural-to-semantic signal (purity) or the precision of the reconstruction error. A comprehensive sensitivity analysis justifying the default configuration of the auditor’s control knobs ( λ s = 0.5 , λ H = 0.65 ) is provided in Appendix A, demonstrating their direct impact on rulebook compactness and coverage. The selected attributes are subsequently transformed into discrete symbolic intervals using the WEDD mechanism.
For a candidate boundary threshold θ splitting an attribute x j into a left computational subset L θ and a corresponding right subset R θ , the standard conditional decision entropy H j ( θ ) is calculated using Equation (9) and Equation (10):
H j ( θ ) = | L θ | n H ( d L θ ) + | R θ | n H ( d R θ ) ,
H ( d G ) = y = 1 N c p ( y G ) log N c p ( y G ) ,
where N c is the number of distinct decision classes, ensuring that H ( d | G ) [ 0 , 1 ] perfectly across all evaluated datasets, thereby stabilizing the final WEDD objective across vastly different problem scales.
To prevent jitter in the resulting logic, WEDD introduces a local probability density proxy p j ( θ ) , computed via a Gaussian kernel function K ( u ) with a structural bandwidth h j :
p j ( θ ) = 1 n h j i = 1 n K θ x i j h j .
The final discretization threshold is determined by minimizing the unified WEDD multiobjective function:
J j ( θ ) = λ H H ˜ j ( θ ) + ( 1 λ H ) p ˜ j ( θ ) .
Explicitly, both H ˜ j ( θ ) and p ˜ j ( θ ) are precisely scaled utilizing standard Min-Max normalization into the [ 0 , 1 ] interval across all generated candidate thresholds, guaranteeing the multiobjective weights behave uniformly and symmetrically.
We do not simply minimize entropy as standard decision trees do. The parameter λ H [ 0 , 1 ] is another critical auditor’s knob allowing the user to seamlessly choose between rule informational purity and topological noise robustness. By rigorously accounting for data density, the threshold avoids cutting through dense data clusters. Instead, boundaries are anchored in sparser topological valleys, rendering the generated logic highly resistant to underlying representational noise. The complete procedural implementation of these specific early phases is outlined in Algorithm 1.
Algorithm 1: Neural-to-Semantic Transition and WEDD Discretization (Phases 1 and 2.1)
1:
Input: Universe of uninterpretable representations A , aligned continuous target attributes B , decision labels d .
2:
Output: Discretized symbolic states S .
3:
Initialize the global decision-making system S via Equation (2).
4:
Compute the centered representation matrix A c utilizing Equation (3).
5:
Extract the rank-reduced compressed latent space Z via Equation (4).
6:
Optimize the ridge-regression weight matrix W * applying Equation (5).
7:
Reconstruct continuous semantic attributes B ^ and establish the global transition operator T via Equation (6).
8:
Evaluate the semantic reconstruction fidelity by computing the MAE via Equation (7).
9:
Compute the composite attribute selection score score j using Equation (8) and isolate q highly relevant concepts.
10:
for each selected attribute j { 1 , , q }  do
11:
    Compute the conditional decision entropy H j ( θ ) for candidate thresholds utilizing Equation (9) and Equation (10).
12:
    Estimate the local probability density proxy p j ( θ ) applying Equation (11).
13:
    Determine the optimal discretization threshold by minimizing the unified WEDD objective J j ( θ ) via Equation (12).
14:
end for
15:
Map the continuous attributes into the discrete symbolic states S .
16:
Return S

3.4. Phase 2.2: Rough-Set Granulation and Knowledge Discovery

Following stable discretization, the framework constructs a symbolic logic layer through mathematically formal knowledge granulation. Formally, the support of a specific rule r ( A B ) is defined as the absolute number of instances satisfying both the antecedent and the consequent simultaneously: Supp ( r ) = | A B | . The confidence is subsequently formulated as the conditional probability of the consequent given the antecedent: Conf ( r ) = | A B | | A | . The core of this robust process is the establishment of a rigorous indiscernibility relation. Two distinct objects x i and x j are rendered indiscernible with respect to a subset of optimal attributes P C if they logically share perfectly identical discrete symbolic signatures. Formulated in Equation (13), this relation partitions the universe of objects into tightly bound structural information granules, denoted as [ x ] P :
I N D ( P ) = { ( x i , x j ) U × U c k P , f ( x i , c k ) = f ( x j , c k ) } ,
where f ( x i , c k ) = s i k explicitly utilizes the information function f from the defined decision system to represent the discrete symbolic state mapping of object x i evaluated across the specific attribute c k .
This precise granulation allows the framework to explicitly distinguish between deterministic logical certainties and structural ambiguities. For a specific analytical decision class Y, we systematically define the structural Lower Approximation P ̲ Y :
P ̲ Y = { x U [ x ] P Y } .
The Lower Approximation encapsulates all information granules that completely and unambiguously belong to class Y, forming the irrefutable foundation for generating perfectly certain rules. Conversely, granules that simultaneously intersect multiple diverse classes form the Boundary Region. This Boundary Region mathematically identifies the exact conflict zones where the compressed latent neural features become inherently indiscernible and fundamentally unreliable.
To ensure that the resulting explanations are conceptually compact, we sequentially apply a mathematically rigorous minimal reduct search. A reduct R * is the smallest possible subset of attributes that preserves decision-making consistency. The generalized minimal reduct R ρ * optimization is formally defined as:
R ρ * = arg min R C ρ | R | s . t . Conf ( R d ρ ) τ , Supp ( R d ρ ) s min ,
where ρ fundamentally denotes a specific target decision class.
To efficiently navigate this logic, the framework formulates a global discernibility matrix. Each interactive element c i j securely identifies the exact set of differentiating semantic attributes between two differently classified objects:
c i j = { c k C s i k s j k } , for y i y j .

3.5. Phase 2.3: Conflict-Aware Inference and Evaluation

The final phase of the SEMTRA framework governs the application of the induced rulebook to unobserved data. During real-time analytical inference, new object representations flow through the identical transition and discretization sequence established in Phases 1 and 2.1, instantly converting raw neural features into active symbolic signatures. This ensures that the explanation logic remains strictly faithful to the original model’s “view” of the data.

3.5.1. Conflict Resolution and Soft Matching

When querying the generalized rulebook, two types of logical edge cases may emerge: boundary conflicts and structural gaps. Boundary conflicts occur when multiple activated rules suggest different outcome classes for the same symbolic signature. These are systematically resolved using a carefully constructed integral support score S ( y k ) , as formulated in Equation (17):
S ( y k ) = r R active , d r = y k Conf ( r ) · Supp ( r ) · w r ,
where w r = 1 | R r | j R r p j ( θ j ) 1 functions as a targeted density-based weight strictly inherited from the WEDD discretization phase, explicitly taking the inverse of the average probability density evaluated precisely at the thresholds utilized in rule r, effectively prioritizing rules fundamentally anchored in stable, structurally optimal attribute regions.
In scenarios where no exact production rules match the presented inference data (structural gaps), the framework employs soft analytical matching. This fallback mechanism evaluates partial logical alignments using the fundamental symbolic Hamming distance d H :
d H ( s i , t y ) = 1 | R y | j R y I [ s i j t y j ] ,
where s i is the signature of the test instance and t y is the formal rule antecedent.
This soft matching natively utilizes a masked Hamming distance evaluated precisely only over the highly specific antecedent conditions | R y | encoded within rule t y . If the computed distance formally exceeds a predefined tolerance threshold ( τ H = 0.25 ), the overall system correctly triggers a structural abstention, firmly identifying the evaluated case as currently unverifiable within the generated rulebook.

3.5.2. Fidelity and Coverage Metrics

To quantify the quality of the extracted explanation layer, SEMTRA distinguishes between two levels of fidelity. First, Covered Fidelity evaluates the accuracy of the rulebook exclusively on instances where a definitive rule was matched:
Fidelity covered = 1 | C Q | i C Q I [ r ( u i ) = g ( u i ) ] ,
where g ( u i ) is the prediction of the base black-box model and C Q is the deterministically covered subset.
Second, All-Object Fidelity logically treats required abstentions as explicit structural failures, providing a conservative measure of the framework’s total explanatory power:
Fidelity all = 1 | Q | i Q I [ r ( u i ) = g ( u i ) ] .
This dual-metric approach formalized by Equations (19) and (20) allows auditors to assess the important tradeoff between explanation precision and rulebook coverage.
Finally, the entire sequential progression, from granulation to conclusive inference, is synthesized in Algorithm 2.
Algorithm 2: Rough-Set Granulation, Reduct Solver, and Inference (Phases 2.3–2.5)
1:
Input: Discretized symbolic states S , decision vector d , validation instances.
2:
Output: Conflict-aware production rulebook R and inference predictions.
3:
Establish the indiscernibility equivalence relations I N D ( P ) across all discrete states utilizing Equation (13).
4:
Aggregate objects into information granules and isolate the deterministic boundary using the lower approximation P ̲ Y via Equation (14).
5:
Formulate the comprehensive discernibility matrix M, populating individual elements c i j using Equation (16).
6:
Execute the weighted heuristic search to extract the minimal optimal reducts R ρ * applying Equation (15).
7:
Synthesize the discrete granules and reducts into the formal logical rule format r via Equation (1).
8:
Compile the verified rules into the global conflict-aware rulebook R .
9:
for each unseen test instance do
10:
    Attempt to match the instance against the rulebook antecedents.
11:
    if multiple conflicting rules are activated then
12:
        Resolve the boundary conflict by maximizing the integral support score S ( y k ) utilizing Equation (17).
13:
    else if no exact rule conditions are satisfied then
14:
        Execute soft matching by minimizing the symbolic Hamming distance d H via Equation (18).
15:
    end if
16:
end for
17:
Assess the overarching algorithmic explanation quality by calculating covered fidelity via Equation (19) and all-object fidelity via Equation (20).
18:
Return R

3.6. Evaluation Metrics for Auditability

To evaluate SEMTRA as an XAI proof-of-concept, we transcend standard predictive accuracy with a multi-dimensional suite of metrics focused on auditability, quantifying the trade-off between extracted logic complexity and faithfulness to the deep model. Rulebook Coverage (Cov) represents explanation “breadth,” measuring the percentage of tested instances Q receiving a deterministic symbolic explanation:
Cov = | C Q | | Q | .
High coverage denotes a comprehensive rulebook capturing the majority of the model’s decision space. Logical Fidelity ( F cov ) captures explanation “truthfulness,” measuring how often the rulebook’s prediction matches the base model’s output on the covered subset, ensuring auditors inspect actual neural network logic over a disconnected surrogate. The Structural Abstention Rate (Abs) identifies regions where the neural representation is too ambiguous or conflicted for rules, revealing where the model is “inexplicable”:
Abs = 1 Cov .
Rulebook Compactness evaluates the total number of rules and the average number of antecedents per rule; following the principle of cognitive load, a smaller, shallower rulebook is inherently more auditable by human experts. Finally, the Conflict Rate (CR) tracks boundary conflicts during inference, diagnosing inherent “indiscernibility” in underlying latent representations. Calculated from instances X conflict mapping directly into conflict zones:
CR = | X conflict | | Q | .
This comprehensive suite of metrics demonstrates that SEMTRA provides a transparent and measurable bridge between high-stakes neural decisions and auditable symbolic knowledge.

3.7. Experimental Protocol and Proof-of-Concept Design

To validate the SEMTRA framework as a reliable instrument for post-hoc deep learning auditing, a multi-stage experimental protocol evaluates how it bridges continuous latent representation spaces and discrete logic, manages structural uncertainty, and yields highly auditable verification artifacts across real-world semantic reconstruction, zero-shot logical transfer, algorithm stability, and synthetic baseline recovery.

3.7.1. Datasets, Feature Extraction, and Base Modeling

Primary real-world evaluations utilize the AwA2 [21] benchmark. Eliminating image-level augmentations, the framework extracts a 2048-dimensional global feature matrix directly from an ImageNet Large Scale Visual Recognition Challenge (ILSVRC)-pretrained Residual Network (ResNet)-101 representation layer. A base ridge classifier is trained on variance-screened representations via an auxiliary 64-dimensional compression step (Principal Component Analysis reduced to 64 components retaining 95% variance) to establish robust baseline prediction metrics as the explicit predictive target for explanations. Using a fixed random seed (42), the continuous semantic bridge is optimized via grid search across ridge penalty parameters ( α { 0.01 , 0.1 , 1 , 10 , 100 } ). It is noted that the optimal α hit the lower bound of the grid search space (0.01); thus, exploring smaller regularization values could potentially yield marginal improvements. Granulation mechanisms apply specific constraint thresholds like WEDD bounds (maximum depth of 2, minimum bin size of 30) and greedy reduct parameters ( τ = 0.84 , minimum support of 18). Exhaustive hyperparameter configurations and baseline predictor performance metrics are documented in Appendix A. The significant disparity between the Top-1 Accuracy (70.93%) and the Macro-F1 score (53.99%) indicates that the base Ridge classifier experiences partial majority-class collapse, heavily favoring well-represented classes over minority taxonomies due to class imbalance.

3.7.2. Empirical Evaluation Splits

To isolate memorization capacity from semantic generalization, two testing splits are used. Protocol A (Closed-world Fidelity) assesses neural-to-semantic bridge reconstruction fidelity on a consistent 50-class stratification using MAE, Root Mean Square Error (RMSE), and semantic correlation, while quantifying symbolic rulebook properties by tracking test coverage, non-abstained accuracy, conflict rates, and structural abstention rates. Protocol B (Zero-shot Transferability) uses the official xlsa17 split as a semantic validation proxy. For Protocol B, to strictly prevent data leakage, a disjoint base model and transition matrix were trained exclusively on the 40 seen classes. Applying a semantic transition matrix trained on 40 seen classes to infer logic on 10 strictly unseen classes proves globally extracted continuous prototypes capture generalized visual-semantic structures rather than dataset-specific artifacts.

3.7.3. Methodological Ablations and XAI Baselines

To contextualize scientific tradeoffs in global explainability, Transition Operator Ablation benchmarks the global linear ridge-regression operator against a Radial Basis Function (RBF) kernel ridge and a two-layer Multi-Layer Perceptron (MLP) regressor to analyze tradeoffs between continuous reconstruction correlation and symbolic rule-transfer coverage. Discretization Benchmarking compares the proposed WEDD strategy against classical discretization algorithms (Minimum Description Length Principle (MDLP)-like entropy approaches, equal-frequency, and equal-width discretizers), logging boundary stability diagnostics, rule lengths, and structural conflict rates in Appendix B. Explainability Paradigms contrast the rough-set rulebook with heuristic symbolic models (Classification and Regression Trees (CART) and separate-and-conquer rule learners) to assess structural uncertainty management. Local post-hoc attribution surrogates (LIME and SHAP) are evaluated via top-k feature agreement, direction agreement, and rank correlation, alongside interpretable-by-design architectures like frozen-feature CBMs and Testing with Concept Activation Vectors (TCAV).

3.7.4. Algorithmic Recovery on Synthetic Data

To isolate core mathematical machinery from the inherent indiscernibility of real-world latent manifolds, a fully controlled synthetic benchmark simulates a 10-dimensional continuous feature space governed by a predefined, ground-truth logical rule dictionary. Semantic noise ( σ [ 0 , 0.20 ] ) injected across random computational seeds maps the structural limits of WEDD and rough-set induction mechanisms, measuring exact threshold recovery error, rule Jaccard similarity, macro-F1 scores, and operational coverage.

3.7.5. Diagnostic Tracking and Inference Stability

A rigorous suite of visual and statistical diagnostics clarifies the proposed algorithms, assessing the balance between logical determinism and coverage via fidelity-coverage tradeoff projections. At the attribute level, target transition salience is mapped against test reconstruction errors to justify semantic selection, differentiating robust physical descriptors from high-variance semantic concepts. Real-time analytical inference is audited by generating precise rule traces mapping unseen objects to active logical antecedents, confidence parameters, and source decisions. The structural robustness of inference pathways is systematically evaluated against semantic-representation perturbations; resultant decision consistency, coverage deviation, and rule matching stability metrics are cataloged in Appendix C.

4. Results

This section presents the empirical validation of the SEMTRA framework according to the Pillars of Auditability defined in the experimental protocol. We assess the framework’s ability to reconstruct semantics, generate auditable rules, and handle zero-shot transfer as a proxy for conceptual integrity.

4.1. Semantic Reconstruction Performance (Protocol A)

The first stage of the experimental validation verifies the integrity of the Neural-to-Semantic Bridge. This is a prerequisite for a reliable audit: the extracted rules can only be trusted if the underlying transition matrix successfully recovers the intended semantic concepts from the latent neural representations.
The performance of the linear transition operator under Protocol A is summarized in Table 1. On the test set, the framework achieved an MAE of 0.1295 ± 0.0016 and a mean semantic correlation of 0.6828. These results indicate that the global transition matrix effectively preserves the topological structure of the latent space while aligning it with human-understandable descriptors.
To justify the choice of a linear operator over more complex alternatives, an ablation study was performed (Table 2). While non-linear regressors (e.g., two-layer MLP) marginally improve reconstruction in some metrics, they introduce significant black-box opacity into the transition itself. The linear ridge transition maintains a high correlation (0.7759) and significantly higher rule coverage ( 0.8718 ± 0.0075 ) compared to non-linear variants, establishing it as the most defensible choice for an auditable framework. The near-zero rule accuracy (0.0027) for the RBF kernel ridge indicates that severe overfitting in the high-dimensional latent space collapsed the global semantic mapping entirely, underscoring the fragility of unregularized non-linear projections.
Detailed attribute-wise diagnostics, visualized in Figure 2, reveal that visually distinct attributes such as paws, vegetation, and water achieve the highest transition salience and lowest reconstruction errors. Conversely, more abstract descriptors (e.g., inactive) exhibit higher variance. This variance is explicitly managed in the next phase by the WEDD algorithm, which prioritizes stable, low-error attributes for rule induction.
As shown in Table 3, the composite selection score effectively isolates the top q attributes that offer the best balance between neural signal strength and reconstruction precision. For brevity, the top 12 selected attributes are shown in Table 3.

4.2. Rulebook Auditability and Fidelity

The core objective of the SEMTRA framework is to transform the continuous semantic bridge into a globally consistent, symbolic rulebook that a human auditor can inspect and verify. The properties of the generated rulebook under Protocol A are summarized in Table 4, providing a comprehensive view of the framework’s explanatory power.

4.2.1. The Fidelity-Coverage Trade-off

The extracted rulebook achieved a high Cov of 86.40%, significantly outperforming the separate-and-conquer rule learner, which suffered from catastrophic abstention (covering only 26.20% of testing instances). Its Covered Fidelity ( F cov ) was 0.3829, meaning that the audited rules reproduced the base model’s decision on 38.29% of covered cases while still retaining explicit conflict and abstention accounting.
As visualized in Figure 3, SEMTRA occupies a unique position in the symbolic model space: it maintains high Cov while explicitly accounting for structural uncertainty. The reduction in non-abstained accuracy compared to the base model is a transparent audit tax—the cost of enforcing strict discretization and logical consistency on a high-dimensional non-linear system. For an auditor, the 40.73% covered-case accuracy and the 38.29% Covered Fidelity jointly quantify the portion of the model’s logic that is currently verifiable and reproducible through simple semantic predicates.

4.2.2. Conflict Awareness and Representative Logic

A key advantage of the rough-set approach is the explicit management of boundary regions. The framework identified a CR of 13.54%, where the neural representations of different classes became semantically indiscernible. Unlike the CART baseline, which forces a potentially misleading decision in these regions, SEMTRA triggers a structural abstention, signaling to the auditor that the model’s logic is currently ambiguous for these instances.
The look and feel of the resulting audit is demonstrated in Table 5, which presents representative induced rules. For example, the rule for the class tiger (R0001) relies on a stable combination of paws, hunter, and yellow attributes with a high confidence of 0.896.
The diagnostic distributions of rule support and confidence, shown in Figure 4, along with WEDD threshold and coverage tradeoff examples depicted, further confirm that the majority of rules are anchored in statistically significant clusters of the semantic space.
By providing these metrics and concrete examples, the framework fulfills the requirement for algorithmic transparency, offering a clear evidence base that moves beyond superficial heatmaps to verifiable logical statements.

4.3. Zero-Shot Transfer as Semantic Validation (Protocol B)

To demonstrate that the transition matrix T captures universal visual-semantic concepts rather than merely overfitting to the training labels, we evaluate the framework under the official zero-shot split (Protocol B). In this scenario, the framework must predict classes it has never encountered during training, relying solely on the continuous semantic prototypes transferred from the latent space.

4.3.1. Contextual Performance and Semantic Integrity

The results of the semantic transfer are compared against established Zero-Shot Learning (ZSL) baselines in Table 6. The SEMTRA continuous prototype variant achieved an unseen class-averaged accuracy of 48.43%, outperforming foundational DAP and IAP methods (DAP at 46.10% and IAP at 35.90%).
While specialized generative frameworks like GFZSL achieve higher accuracy (63.80%), they do so by utilizing complex, non-linear embedding spaces that are themselves semantically uninterpretable. SEMTRA’s ability to maintain a competitive 48.43% accuracy using a linear, auditable operator serves as empirical proof of the Neural-to-Semantic Bridge’s integrity. Furthermore, the symbolic template variant reached 39.93% accuracy, quantifying the explicit information tax incurred when forcing continuous predictions into rigid, auditable logical states.

4.3.2. Diagnostic Per-Class Analysis

A deeper audit of the transfer performance, detailed in Table 7, reveals the framework’s taxonomic strengths and blind spots. The transition matrix successfully recovered distinct visual-semantic structures for classes like blue whale (91.95%) and bobcat (93.97%), where the latent neural features are highly aligned with the provided semantic descriptors.
Conversely, classes such as bat (0.00%) exhibit a complete failure of the semantic mapping, while sheep shows a substantial symbolic-template collapse (14.65%) despite a stronger continuous-prototype score. For an auditor, the 0.00% bat accuracy is a vital diagnostic signal: it indicates a semantic rupture where the underlying ResNet features do not contain the specific information required by the attribute dictionary. This specific semantic rupture could be effectively mitigated in future iterations by expanding the continuous attribute dictionary C to include more granular, specialized phenotypic descriptors, or by utilizing Large Language Models (LLMs) to dynamically synthesize high-resolution conceptual variables that better align with the latent representation of these outlier classes. This level of granular, class-wise transparency is illustrated in Figure 5, showing how an instance is correctly matched to its symbolic template, while other classes reveal the limits of the model’s transferable knowledge.
By successfully generalizing to unseen taxonomies, Protocol B validates the transition matrix not just as a mathematical operator, but as a faithful semantic translator that preserves conceptual meaning across domain boundaries.

4.4. Comparison with Local Surrogates and Concept Baselines

To contextualize the scientific value of the SEMTRA rulebook, we compare its performance against two dominant paradigms in XAI: local post-hoc attributions (LIME/SHAP) and interpretable-by-design concept models (CBMs).

4.4.1. The Fragmentation Gap: SEMTRA vs. Local Surrogates

Local methods like LIME and SHAP provide instance-specific sensitivity rankings, typically visualized as heatmaps. However, as shown in Table 8, there is only a moderate agreement (0.2610–0.3213) between these local attributions and the global antecedents extracted by SEMTRA.
This divergence highlights the fragmentation gap: local methods are highly sensitive to instance-specific noise and pixel-level textures, whereas SEMTRA identifies the stable, global logical structures used by the model across the entire dataset. For an auditor, SEMTRA’s rulebook provides a consistent evidence base that local surrogates cannot replicate.

4.4.2. Post-hoc Auditability vs. Ante-hoc Design

We further compared SEMTRA against CBMs, which are specifically trained to predict concepts before labels. As summarized in Table 9, SEMTRA’s post-hoc reconstruction error (MAE 0.1076) is virtually identical to that of a dedicated frozen-feature CBM.
While the CBM achieves higher predictive accuracy because it is structurally optimized for a specific concept vocabulary, SEMTRA provides a comparable level of semantic reconstruction integrity without requiring any retraining of the underlying black-box. This proves that SEMTRA can unlock the latent logic of any pre-trained state-of-the-art model, offering a level of transparency previously reserved for specialized, interpretable-by-design architectures.
Unlike traditional symbolic learners that provide a single path to a decision, SEMTRA’s use of rough-set granulation allows for a diagnostic audit. While a CART decision tree forces a prediction (leading to zero abstentions but potentially high error in boundary regions), SEMTRA identifies where the model’s logic becomes indiscernible. This makes SEMTRA not just a classifier, but a diagnostic tool for identifying the limits of a neural network’s conceptual understanding.

4.5. Algorithmic Recovery (Synthetic Benchmark)

To isolate the framework’s internal logic from the inherent noise and ambiguity of real-world neural representations, we conducted a comprehensive evaluation using a controlled synthetic benchmark. This stage serves as the algorithmic gold standard, proving that the combination of WEDD discretization and rough-set induction can accurately recover a known ground-truth logic when the semantic signal is clearly defined.

4.5.1. Exactness of Logic Extraction

The synthetic environment consisted of a 10-dimensional feature space with a rule dictionary of known logical structures (Table 10).
As summarized in Table 11, at zero noise ( σ = 0.000 ), the framework achieved a macro-F1 score of 0.879 and a rule Jaccard similarity of 0.700, with a near-perfect operational coverage of 97.10%. The minor baseline threshold error (0.014) observed at zero noise stems from the Gaussian Kernel Density Estimation (KDE) bandwidth smoothing used in the WEDD algorithm, rather than an extraction failure.

4.5.2. Robustness Under Controlled Noise

The stability of the framework was tested by injecting increasing levels of semantic noise ( σ [ 0 , 0.20 ] ). As visualized in Figure 6, the operational coverage remained decisively stable even as noise increased, confirming that the WEDD algorithm successfully anchors thresholds in stable regions.
While the exact threshold recovery error inevitably fluctuates (Figure 7), the framework maintained a macro-F1 score above 0.838 throughout the tested noise range.
The synthetic results demonstrate that the core mathematical machinery of SEMTRA, specifically the ability to translate continuous variables into discrete granules and extract minimal reducts, is highly precise. For an auditor, this is a vital finding: it proves that the lower fidelity observed in the real-world AwA2 experiments (approximately 40%) is a direct consequence of the indiscernibility inherent in neural features, rather than a failure of the rule-induction process itself. Consequently, SEMTRA can be viewed as a reliable diagnostic engine that accurately reports the latent truth of a model, even when that truth is fragmented or noisy.
Ultimately, SEMTRA establishes a transformative, reproducible foundation for deep learning verification, fundamentally shifting the explainability paradigm toward globally consistent, mathematically verifiable symbolic reasoning.

5. Discussion

5.1. The Scientific Value of Global Auditability

The primary objective of this study was to establish a highly reproducible, post-hoc pathway that systematically maps from semantically uninterpretable neural representations to universally auditable symbolic knowledge. The empirical results achieved by the SEMTRA framework decisively confirm that this complex semantic gap can be bridged through a mathematically sound combination of a global linear transition matrix and rough-set granulation. This systematically addresses the critical need for verifiable symbolic knowledge identified in recent XAI evolution manifestos [1].
The current landscape of post-hoc explainability is overwhelmingly dominated by instance-specific local methods, such as LIME and SHAP, which generate fragmented attribution heatmaps rather than unified policies [8]. As demonstrated in our comparative results (Section 4.4), there is only a moderate agreement between these local feature attributions and the global antecedents strictly extracted by our proposed framework (Table 8). This divergence highlights the fragmentation gap: local surrogate models remain highly sensitive to instance-specific noise and fail to produce a holistic logic policy required for rigorous auditing.
In contrast, SEMTRA leverages a global transition matrix to identify and isolate stable, dataset-wide semantic signals. By shifting the output format to a mathematically verified rulebook (Equation (1)), the framework provides human auditors with a reusable logical artifact. For high-stakes diagnostic domains, this transition from localized visual approximation to global logical verification is an absolute operational necessity.

5.2. Interpreting the Audit Tax: Accuracy vs. Verifiability

A critical finding of this research is the quantitative reduction in operational accuracy from the base neural predictor’s 71.16% (Table A2) to the rulebook’s 40.73% on non-abstained cases (Table 4). We formally define this specific performance reduction as the audit tax—the inherent cost of enforcing strict mathematical discretization (Equation (12)) and logical consistency in a highly dimensional, non-linear latent system. While traditional machine learning paradigms might view this accuracy drop as a critical methodological failure, from an advanced auditing perspective, it represents a profound diagnostic triumph.
The SVD reduction intentionally screens high-frequency representational noise, simultaneously discarding some feature information. Increasing the SVD rank could decrease the audit tax, but at the direct cost of inflating rulebook complexity and combinatorial search times. The base model’s 71.16% accuracy entirely represents black-box performance that remains fundamentally unverified, inherently fragile, and prone to hidden dataset biases.
Conversely, SEMTRA’s 40.73% accuracy precisely quantifies the exact portion of the neural network’s internal logic that is currently verifiable and reproducible through simple semantic predicates. As shown in Figure 3, our framework deliberately operates in a high-coverage regime while meticulously accounting for structural uncertainty. Furthermore, our synthetic benchmark evaluations (Section 4.5) conclusively prove that the underlying rough-set algorithms are exceptionally precise under controlled conditions (Table 11). Consequently, the audit tax observed in the AwA2 dataset is a direct reflection of the underlying semantic indiscernibility embedded in the Residual Network (ResNet) features themselves.

5.3. The Power of Honest Silence and Indiscernibility

The integration of formal rough-set theory [6,7] into the SEMTRA pipeline introduces a critical dimension of epistemic honesty absent in traditional symbolic rule learners like Classification and Regression Trees (CART) [15]. Standard heuristic models forcefully dictate a deterministic classification even when the underlying data manifold is profoundly ambiguous, frequently leading to dangerous logical hallucinations. In direct opposition, our framework mathematically identifies a structural CR of 13.54% (Equation (23)) and effectively triggers an intentional abstention rate of 13.60% (Equation (22)).
By computing precise lower approximations (Equation (14)), the framework deliberately utilizes structural abstention to transparently signal whenever the underlying neural representation becomes semantically compromised or entirely indiscernible. This honest silence exposes a fundamental property of modern deep feature extractors: they often conflate visually similar but taxonomically distinct classes into identical, inseparable latent regions. When the base predictor cannot semantically distinguish between two distinct concepts, the resulting rough-set boundary region acts as an essential diagnostic warning.
Ultimately, this structural advantage transforms our rule-induction pipeline into a highly robust diagnostic engine. By preventing automated logical hallucinations, it ensures that every retained explanation is universally anchored in a non-conflicting, stable logical state, easily outperforming baseline separate-and-conquer logic algorithms [16].

5.4. Linear Transition as a Defensive Choice

The selection of a purely linear ridge-regression operator (Equation (5)) to map complex, high-dimensional latent manifolds represents a highly deliberate, defensive strategy optimized explicitly for structural stability and extreme algebraic transparency. Our ablation studies (Table 2) empirically justify this specific choice; while complex non-linear regressors, such as MLPs, achieve marginally lower continuous reconstruction errors, they simultaneously introduce severe black-box opacity into the semantic transition bridge itself.
This opacity severely degrades downstream symbolic logic, significantly reducing rule coverage and increasing structural conflicts. The immense utility of structured transition matrices has been repeatedly validated in complex visual analytics [3] and healthcare informatics [4]. Linearity fundamentally serves as a critical safeguard against adversarial fragility, strictly guaranteeing topological stability.
Consequently, minor, imperceptible perturbations in the source neural representation (Table A6) only ever result in predictable, strictly bounded changes in the corresponding continuous semantic attributes. By actively choosing a highly auditable linear operator over a fragile non-linear alternative, we guarantee that the initial Neural-to-Semantic transition remains an unimpeachable foundation for the entire downstream auditing sequence.

5.5. Limitations and Strategic Directions

While the SEMTRA framework successfully establishes a robust proof-of-concept for XAI, several strategic limitations uniquely define the essential scope for future scientific inquiry. First, the framework exhibits a notable phenotypic gap driven by its reliance on predefined class-level semantic annotations; thus, it struggles to account for natural intra-class visual variations, such as irregular lighting or structural deformation. Second, because the global transition mapping, WEDD, and rough-set granulation are executed in independent, sequential phases, minor reconstruction errors can propagate and suppress final rulebook coverage. Third, the minimal reduct search introduces exponential computational complexity, posing severe real-time execution challenges for high-resolution attribute dictionaries in advanced diagnostic settings. Addressing these critical challenges requires exploring dynamic, instance-level semantic supervision by leveraging modern LLMs to automatically synthesize object-specific conceptual descriptions. Additionally, developing end-to-end differentiable rough-set architectures will be paramount. By allowing the semantic transition matrix to be continuously fine-tuned directly against final logical consistency metrics, future research can effectively close the performance gap between raw deep learning predictive power and universally transparent symbolic reasoning.

6. Conclusions

In this study, we introduced the SEMTRA framework, a rigorous methodology for the post-hoc auditing of deep learning architectures through the global extraction of symbolic rough-set production rules. By fundamentally addressing the interpretability gap inherent in high-dimensional latent spaces, our approach successfully bridges uninterpretable neural representations with continuous semantic attributes via a robust transition matrix, subsequently applying density-aware discretization and formal rough-set granulation to induce transparent logic. The empirical evaluation demonstrated that the semantic transition successfully achieved a mean absolute error of 0.1295 on the AwA2 benchmark, enabling the extracted conflict-aware rulebook to cover 86.40% of testing instances with a non-abstained accuracy of 40.73%. Furthermore, SEMTRA accomplished a zero-shot logical transfer accuracy of 48.43%, validating its ability to capture universal visual-semantic concepts. At the same time, synthetic stress tests verified exact algorithmic recovery with a macro-F1 score of 0.8668. Despite these robust indicators, the approach faces distinct limitations, specifically a phenotypic gap caused by rigid class-level semantic annotations and the non-differentiable, sequential nature of the rule induction pipeline, which risks early error propagation.
Future research must strategically address these challenges by integrating dynamic, instance-level semantic supervision via modern LLMs and developing end-to-end differentiable rough-set rule induction mechanisms.

Author Contributions

Conceptualization, P.R., O.B. and I.K.; methodology, P.R.; software, P.R.; validation, P.R. and O.B.; formal analysis, P.R., O.B. and I.K.; investigation, P.R.; resources, O.B. and I.K.; data curation, P.R.; writing—original draft preparation, P.R.; writing—review and editing, O.B. and I.K.; visualization, P.R.; supervision, I.K.; project administration, O.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable. This study used publicly available datasets and did not involve humans or animals.

Data Availability Statement

AwA2 is available from the official Animals with Attributes 2 dataset page [21]. The main GitHub repository (https://github.com/radiukpavlo/transition-matrix-dss) includes scripts, generated tables, figures, audit files, and experiment summaries sufficient to reproduce the reported computational artifacts when the AwA2 and xlsa17 archives are available locally.

Conflicts of Interest

The authors declare no conflicts of interest.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AI Artificial Intelligence
AUROC Area Under the Receiver Operating Characteristic
AwA2 Animals with Attributes 2
CART Classification and Regression Trees
CBM Concept Bottleneck Model
Cov Rulebook Coverage
CPU Central Processing Unit
CR Conflict Rate
DAP Direct Attribute Prediction
ECE Expected Calibration Error
GFZSL Generative Framework for Zero-Shot Learning
GPU Graphics Processing Unit
IAP Indirect Attribute Prediction
ILSVRC ImageNet Large Scale Visual Recognition Challenge
LIME Local Interpretable Model-agnostic Explanations
LLM Large Language Model
MAE Mean Absolute Error
MDLP Minimum Description Length Principle
MLP Multi-Layer Perceptron
RBF Radial Basis Function
ResNet Residual Network
RMSE Root Mean Square Error
SEMTRA Global Semantic Transition
SHAP Shapley Additive Explanations
SVD Singular-Value Decomposition
TCAV Testing with Concept Activation Vectors
WEDD Weighted Entropy-Density Discretization
XAI Explainable Artificial Intelligence
ZSL Zero-Shot Learning

Appendix A. Experimental Protocol and Hyperparameters

The feature extractor utilizes the released ResNet-101 representation natively provided with the AwA2 dataset The base predictor is systematically trained on the representation matrix to provide reproducible probabilities, robust class predictions, and explicit fidelity targets. Table A1 outlines the detailed feature and classifier configurations, while Table A2 reports the corresponding validation and test performance metrics, including the Area Under the Receiver Operating Characteristic (AUROC) and Expected Calibration Error (ECE).
Table A1. Feature extractor, base predictor, and revision-experiment hyperparameters.
Table A1. Feature extractor, base predictor, and revision-experiment hyperparameters.
Component Value Implementation detail
Feature extractor ResNet-101 features released with AwA2 ILSVRC-pretrained representation layer; no image-level augmentation in this package
Feature dimension 2048 Global average/penultimate representation coordinates
Auxiliary compression 64 Principal Component Analysis (PCA) reduced to 64 components retaining 95% variance
Base predictor Ridge classifier Trained on variance-screened representation coordinates
Random seed 42 Used for all stochastic revision experiments
Semantic bridge Ridge regression Grid alpha in {0.01, 0.1, 1, 10, 100}
Rule thresholds WEDD alpha=0.65, max_depth=2, min_bin_size=30, min_gain=0.002
Rule induction Greedy reducts tau=0.84, s_min=18 for AwA2 Protocol A
Soft matching threshold τ H = 0.25 Maximum allowed masked Hamming distance
Table A2. Base predictor performance on AwA2 Protocol A.
Table A2. Base predictor performance on AwA2 Protocol A.
Split Top-1 Top-5 Macro-F1 Weighted-F1 AUROC ECE
Validation 0.7093 0.9306 0.5399 0.6639 0.9832 0.6622
Test 0.7116 0.9291 0.5434 0.6681 0.9836 0.6645
It is noted that the optimal α hit the lower bound of the grid search space (0.01); thus, exploring smaller regularization values could potentially yield marginal improvements. Furthermore, the significant disparity between the Top-1 Accuracy (70.93%) and the Macro-F1 score (53.99%) indicates that the base Ridge classifier experiences partial majority-class collapse, heavily favoring well-represented classes over minority taxonomies due to class imbalance.
Table A3 reports the one-factor sensitivity grid for the two auditor-facing control knobs. The results show that increasing λ s changes the selected semantic basis and modifies rulebook compactness, while increasing λ H shifts WEDD toward entropy-dominant thresholds and affects the coverage–fidelity balance.
Table A3. Sensitivity of the auditor control knobs on rulebook compactness, coverage, fidelity, and covered accuracy. The λ s sweep varies semantic-attribute selection while holding λ H = 0.65 ; the λ H sweep varies WEDD thresholding using the released default attribute set corresponding to λ s = 0.50 .
Table A3. Sensitivity of the auditor control knobs on rulebook compactness, coverage, fidelity, and covered accuracy. The λ s sweep varies semantic-attribute selection while holding λ H = 0.65 ; the λ H sweep varies WEDD thresholding using the released default attribute set corresponding to λ s = 0.50 .
Control knob Value Rules Rulebook Coverage Covered Fidelity Covered Accuracy
λ s 0.25 50 0.7687 0.4796 0.4414
λ s 0.50 54 0.8640 0.3829 0.4073
λ s 0.75 57 0.7847 0.3856 0.3935
λ H 0.25 58 0.7136 0.3514 0.3535
λ H 0.65 54 0.8640 0.3829 0.4073
λ H 0.90 53 0.8977 0.4349 0.4205

Appendix B. Full Semantic Attribute Discretization Diagnostics

To further establish reproducibility, Table A4 compares the proposed WEDD mechanism with alternative discretizer approaches. WEDD ensures high boundary stability by incorporating local probability densities alongside class condition metrics.
Table A4. Discretization ablation using the same reconstructed semantics and rule-induction logic.
Table A4. Discretization ablation using the same reconstructed semantics and rule-induction logic.
Method Thresholds Rules Avg. len. Coverage Abstention All acc. Cov. fidelity Conflict
WEDD 54 54 4.0370 0.8640 0.1360 0.3519 0.3829 0.1354
MDLP-like entropy 54 53 4.0566 0.8714 0.1286 0.3928 0.4556 0.1125
Equal frequency 36 52 4.2885 0.7474 0.2526 0.2324 0.3126 0.2506
Equal width 36 61 3.6721 0.5830 0.4170 0.2521 0.4750 0.4170
MDLP-like entropy uses entropy-improvement stopping; equal-frequency and equal-width use two thresholds per selected attribute.

Appendix C. Rule Inference Traces and Structural Perturbation Stability

Detailed rule inference traces and structural stability analyses are securely retained in this appendix as essential programmatic audit artifacts. Table A5 illustrates representative inference traces successfully mapping individual objects to explicit rule antecedents.
Table A5. Representative rule inference traces with object ID, prediction, matched rule evidence, and antecedent states.
Table A5. Representative rule inference traces with object ID, prediction, matched rule evidence, and antecedent states.
Case Object True Pred. Mode Rule Support Conf. Antecedent states
correct exact 5083 collie collie exact R0027 1675 0.208 stripes=s0; hooves=s0; swims=s1; paws=s3; longneck=s0; hunter=s1; ocean=s1; quadrapedal=s3
abstention 24024 ox abstain exact R0029 2820 0.195 stripes=s0; hooves=s2; swims=s1; paws=s0; longneck=s1; hunter=s0; ocean=s1; quadrapedal=s2
fallback error or boundary 16781 hamster mole fallback stripes=s0; hooves=s1; swims=s1; paws=s2; longneck=s0; hunter=s0; ocean=s1; quadrapedal=s2

Appendix C.1. Trace Walkthrough for a Zebra Instance

For test object 36914 (zebra), the 2048-dimensional ResNet representation a 36914 is mapped through the transition matrix T to obtain the reconstructed continuous semantic vector b ^ 36914 . The relevant WEDD-discretized components are b ^ hunter = 0.0000 , b ^ water = 0.0501 , b ^ small = 0.0000 , b ^ jungle = 0.2632 , and b ^ yellow = 0.0879 . Using the learned WEDD thresholds, these values become hunter= s 0 , water= s 0 , small= s 0 , jungle= s 2 , and yellow= s 1 . This state vector exactly satisfies Rule R0004, whose antecedent is hunter= s 0 AND water= s 0 AND small= s 0 AND jungle= s 2 AND yellow= s 1 , yielding a verified zebra prediction with support 434 and confidence 0.866.
Furthermore, Table A6 rigorously reports the operational rule consistency and prediction stability under varying degrees of semantic-representation perturbations. Reconstructed semantic representations were perturbed using additive Gaussian noise:
b ^ i , perturbed = b ^ i + ϵ i , ϵ i j N ( 0 , σ 2 s ^ j 2 ) ,
where s ^ j is the empirical standard deviation of reconstructed semantic attribute j on the test split.
Table A6. Rule consistency under semantic-representation perturbations.
Table A6. Rule consistency under semantic-representation perturbations.
Sigma Rule consistency Decision consistency Coverage Coverage change Abstention Conflict
0.0000 1.0000 1.0000 0.8796 0.0000 0.1204 0.1180
0.0100 0.9745 0.9663 0.8792 -0.0004 0.1208 0.1183
0.0250 0.9326 0.9176 0.8808 0.0012 0.1192 0.1168
0.0500 0.8667 0.8401 0.8816 0.0020 0.1184 0.1160
0.1000 0.7307 0.7112 0.8940 0.0145 0.1060 0.1026
Because WEDD anchors thresholds in low-density valleys rather than dense class-overlap regions, rules exhibit high robustness against micro-perturbations up to σ = 0.05 . The degradation curve is therefore gradual: rule consistency remains 0.8667 and coverage remains essentially unchanged at 0.8816 at σ = 0.05 , while the larger σ = 0.10 setting produces the expected decline in rule and decision consistency.

References

  1. Longo, L.; Brcic, M.; Cabitza, F.; Choi, J.; Confalonieri, R.; Ser, J.D.; Guidotti, R.; Hayashi, Y.; Herrera, F.; Holzinger, A.; et al. Explainable artificial intelligence (XAI) 2.0: A manifesto of open challenges and interdisciplinary research directions. Inf. Fusion 2024, 106, 102301. [Google Scholar] [CrossRef]
  2. Mersha, M.; Lam, K.N.; Wood, J.; AlShami, A.K.; Kalita, J. Explainable artificial intelligence: A survey of needs, techniques, applications, and future direction. Neurocomputing 2024, 599, 128111. [Google Scholar] [CrossRef]
  3. Radiuk, P.; Barmak, O.; Manziuk, E.; Krak, I. Explainable deep learning: A visual analytics approach with transition matrices. Mathematics 2024, 12, 1024. [Google Scholar] [CrossRef]
  4. Barmak, O.; Krak, I.; Yakovlev, S.; Manziuk, E.; Radiuk, P.; Kuznetsov, V. Toward explainable deep learning in healthcare through transition matrix and user-friendly features. Front. Artif. Intell. 2024, 7, 1482141. [Google Scholar] [CrossRef]
  5. Radiuk, P.; Barmak, O.; Bedratyuk, L.; Krak, I. Equivariant transition matrices for explainable deep learning: A Lie group linearization approach. Mach. Learn. Knowl. Extr. 2026, 8, 92. [Google Scholar] [CrossRef]
  6. Pawlak, Z. Rough sets. Int. J. Comput. Inf. Sci. 1982, 11, 341–356. [Google Scholar] [CrossRef]
  7. Pawlak, Z. Rough sets: Theoretical aspects of reasoning about data . In Theory and Decision Library D; Kluwer Academic Publishers: Dordrecht, The Netherlands, 1991; Vol. 9. [Google Scholar] [CrossRef]
  8. Salih, A.M.; Raisi-Estabragh, Z.; Galazzo, I.B.; Radeva, P.; Petersen, S.E.; Lekadir, K.; Menegaz, G. A perspective on explainable artificial intelligence methods: SHAP and LIME. Adv. Intell. Syst. 2025, 7, 2400304. [Google Scholar] [CrossRef]
  9. Bobek, S.; Nalepa, G.J. Local universal rule-based explainer (LUX). SoftwareX 2025, 30, 102102. [Google Scholar] [CrossRef]
  10. Kozielski, M.; Sikora, M.; Wawrowski. Towards consistency of rule-based explainer and black box model: Fusion of rule induction and XAI-based feature importance. Knowl.-Based Syst. 2025, 311, 113092. [Google Scholar] [CrossRef]
  11. Nicolson, A.; Schut, L.; Noble, J.A.; Gal, Y. Explaining explainability: Recommendations for effective use of concept activation vectors. In Trans. Mach. Learn. Res.; 2025. [Google Scholar]
  12. Srivastava, D.; Yan, G.; Weng, T.W. VLG-CBM: Training concept bottleneck models with vision-language guidance. In Proceedings of the Advances in Neural Information Processing Systems, 2024; Vol. 37. [Google Scholar]
  13. Zarlenga, M.E.; Barbiero, P.; Ciravegna, G.; Marra, G.; Giannini, F.; Diligenti, M.; Shams, Z.; Precioso, F.; Melacci, S.; Weller, A.; et al. Concept embedding models: Beyond the accuracy-explainability trade-off. Proc. Adv. Neural Inf. Process. Syst. 2022, Vol. 35, 21400–21413. [Google Scholar]
  14. Yüksekgönül, M.; Wang, M.; Zou, J. Post-hoc concept bottleneck models. In Proceedings of the Proceedings of the 11th International Conference on Learning Representations, 2023. [Google Scholar]
  15. Mienye, I.D.; Jere, N.R. A survey of decision trees: Concepts, algorithms, and applications. IEEE Access 2024, 12, 86716–86727. [Google Scholar] [CrossRef]
  16. Bollaert, H.; Palangetić, M.; Cornelis, C.; Greco, S.; Słowiński, R. FRRI: A novel algorithm for fuzzy-rough rule induction. Inf. Sci. 2025, 686, 121362. [Google Scholar] [CrossRef]
  17. Zhang, Y.; Li, R.; Wu, N.; Li, Q.; Lin, X.; Hu, Y.; Li, T.; Jiang, Y. Dissect black box: Interpreting for rule-based explanations in unsupervised anomaly detection. In Proceedings of the Advances in Neural Information Processing Systems, 2024; Vol. 37. [Google Scholar]
  18. Şenozan, H.; Soylu, B. A flexible non-monotonic discretization method for pre-processing in supervised learning. Pattern Recognit. Lett. 2024, 181, 77–85. [Google Scholar] [CrossRef]
  19. Biroli, G.; Mézard, M. Kernel density estimators in large dimensions. SIAM J. Math. Data Sci. 2026, 8, 46–76. [Google Scholar] [CrossRef]
  20. Xian, Y.; Lampert, C.H.; Schiele, B.; Akata, Z. Zero-shot learning: A comprehensive evaluation of the good, the bad and the ugly. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 41, 2251–2265. [Google Scholar] [CrossRef]
  21. Lampert, C.H.; Pucher, D.; Dostal, J. Animals with attributes 2. Dataset web page. Accessed. 2017. (accessed on 22 May 2026).
  22. Lampert, C.H.; Nickisch, H.; Harmeling, S. Learning to detect unseen object classes by between-class attribute transfer. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009; pp. 951–958. [Google Scholar] [CrossRef]
  23. Lampert, C.H.; Nickisch, H.; Harmeling, S. Attribute-based classification for zero-shot visual object categorization. IEEE Trans. Pattern Anal. Mach. Intell. 2014, 36, 453–465. [Google Scholar] [CrossRef] [PubMed]
  24. Verma, V.K.; Rai, P. A simple exponential family framework for zero-shot learning. In Proceedings of the Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2017, Proceedings, Part II, 2017; Springer; pp. 792–808. [Google Scholar] [CrossRef]
  25. Norouzi, M.; Mikolov, T.; Bengio, S.; Singer, Y.; Shlens, J.; Frome, A.; Corrado, G.S.; Dean, J. Zero-shot learning by convex combination of semantic embeddings. In Proceedings of the Proceedings of the 2nd International Conference on Learning Representations, 2014. [Google Scholar]
  26. Socher, R.; Ganjoo, M.; Manning, C.D.; Ng, A.Y. Zero-shot learning through cross-modal transfer. Proc. Adv. Neural Inf. Process. Syst. 2013, Vol. 26, 935–943. [Google Scholar]
  27. Zhang, Z.; Saligrama, V. Zero-shot learning via semantic similarity embedding. In Proceedings of the Proceedings of the IEEE International Conference on Computer Vision, 2015; pp. 4166–4174. [Google Scholar] [CrossRef]
  28. Xian, Y.; Akata, Z.; Sharma, G.; Nguyen, Q.; Hein, M.; Schiele, B. Latent embeddings for zero-shot classification. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016; pp. 69–77. [Google Scholar] [CrossRef]
  29. Akata, Z.; Perronnin, F.; Harchaoui, Z.; Schmid, C. Label-embedding for image classification. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 38, 1425–1438. [Google Scholar] [CrossRef]
  30. Frome, A.; Corrado, G.S.; Shlens, J.; Bengio, S.; Dean, J.; Ranzato, M.; Mikolov, T. DeViSE: A deep visual-semantic embedding model. Proc. Adv. Neural Inf. Process. Syst. 2013, Vol. 26, 2121–2129. [Google Scholar]
  31. Akata, Z.; Reed, S.; Walter, D.; Lee, H.; Schiele, B. Evaluation of output embeddings for fine-grained image classification. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015; pp. 2927–2936. [Google Scholar] [CrossRef]
  32. Romera-Paredes, B.; Torr, P.H.S. An embarrassingly simple approach to zero-shot learning. In Proceedings of the Proceedings of the 32nd International Conference on Machine Learning. PMLR, 2015, Vol. 37, Proceedings of Machine Learning Research, pp. 2152–2161.
  33. Changpinyo, S.; Chao, W.L.; Gong, B.; Sha, F. Synthesized classifiers for zero-shot learning. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016; pp. 5327–5336. [Google Scholar] [CrossRef]
  34. Kodirov, E.; Xiang, T.; Gong, S. Semantic autoencoder for zero-shot learning. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017; pp. 3174–3183. [Google Scholar] [CrossRef]
  35. Halko, N.; Martinsson, P.G.; Tropp, J.A. Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions. SIAM Rev. 2011, 53, 217–288. [Google Scholar] [CrossRef]
  36. Kalyta, O.; Barmak, O.; Radiuk, P.; Krak, I. Facial emotion recognition for photo and video surveillance based on machine learning and visual analytics. Appl. Sci. 2023, 13, 9890. [Google Scholar] [CrossRef]
Figure 1. Conceptual architecture of the SEMTRA framework. The end-to-end pipeline sequentially translates uninterpretable latent neural features into continuous semantic attributes via a global transition matrix, and subsequently leverages density-aware discretization and rough-set granulation to induce a transparent, conflict-aware rulebook.
Figure 1. Conceptual architecture of the SEMTRA framework. The end-to-end pipeline sequentially translates uninterpretable latent neural features into continuous semantic attributes via a global transition matrix, and subsequently leverages density-aware discretization and rough-set granulation to induce a transparent, conflict-aware rulebook.
Preprints 216423 g001
Figure 2. Semantic attribute diagnostics: (a) transition salience by semantic attribute; (b) attribute-wise semantic reconstruction error on AwA2; and (c) transition salience against test reconstruction error; labeled points are high-score selected attributes.
Figure 2. Semantic attribute diagnostics: (a) transition salience by semantic attribute; (b) attribute-wise semantic reconstruction error on AwA2; and (c) transition salience against test reconstruction error; labeled points are high-score selected attributes.
Preprints 216423 g002
Figure 3. Symbolic baseline tradeoff between Cov and Covered Fidelity ( F cov ). Marker size is proportional to rule count. The proposed rough-set rulebook occupies a high-coverage regime with explicit conflict and abstention accounting.
Figure 3. Symbolic baseline tradeoff between Cov and Covered Fidelity ( F cov ). Marker size is proportional to rule count. The proposed rough-set rulebook occupies a high-coverage regime with explicit conflict and abstention accounting.
Preprints 216423 g003
Figure 4. Rulebook and threshold optimization. (a) WEDD threshold example on a selected AwA2 semantic attribute. The plot shows how entropy and density jointly determine an interpretable threshold. (b) Rule support and confidence distribution for the AwA2 Protocol A rulebook. (c) Coverage, abstention, covered accuracy, and covered fidelity as the minimum rule-confidence threshold varies.
Figure 4. Rulebook and threshold optimization. (a) WEDD threshold example on a selected AwA2 semantic attribute. The plot shows how entropy and density jointly determine an interpretable threshold. (b) Rule support and confidence distribution for the AwA2 Protocol A rulebook. (c) Coverage, abstention, covered accuracy, and covered fidelity as the minimum rule-confidence threshold varies.
Preprints 216423 g004
Figure 5. Conceptual comparison between local explainable surrogate pixel-level attribution methods and the proposed global explainable rulebook. Local methods yield instance-specific sensitivities without conflict accounting, whereas the proposed framework extracts reusable, mathematically auditable symbolic logic that supports zero-shot structural transfer.
Figure 5. Conceptual comparison between local explainable surrogate pixel-level attribution methods and the proposed global explainable rulebook. Local methods yield instance-specific sensitivities without conflict accounting, whereas the proposed framework extracts reusable, mathematically auditable symbolic logic that supports zero-shot structural transfer.
Preprints 216423 g005
Figure 6. Synthetic benchmark with 95% confidence bands across seeds. Macro-F1, rule-recovery Jaccard, and coverage remain high across the tested semantic-noise range.
Figure 6. Synthetic benchmark with 95% confidence bands across seeds. Macro-F1, rule-recovery Jaccard, and coverage remain high across the tested semantic-noise range.
Preprints 216423 g006
Figure 7. Synthetic threshold recovery error across noise levels.
Figure 7. Synthetic threshold recovery error across noise levels.
Preprints 216423 g007
Table 1. AwA2 Protocol A transition reconstruction metrics (ridge alpha 0.01; retained SVD variance 0.2952). Metrics are reported as mean ± standard deviation across 5 random seeds.
Table 1. AwA2 Protocol A transition reconstruction metrics (ridge alpha 0.01; retained SVD variance 0.2952). Metrics are reported as mean ± standard deviation across 5 random seeds.
Split MAE RMSE Mean semantic correlation
train 0.1292 ± 0.0014 0.1822 ± 0.0018 0.6843 ± 0.0040
validation 0.1299 ± 0.0015 0.1832 ± 0.0018 0.6807 ± 0.0042
test 0.1295 ± 0.0016 0.1826 ± 0.0019 0.6828 ± 0.0039
Table 2. Semantic reconstruction and rule-transfer ablation for linear and nonlinear operators. Means ± standard deviations across 5 seeds are provided for the proposed linear operator.
Table 2. Semantic reconstruction and rule-transfer ablation for linear and nonlinear operators. Means ± standard deviations across 5 seeds are provided for the proposed linear operator.
Operator MAE RMSE Corr. Rule cov. Rule acc. Runtime (s)
Linear ridge transition 0.1076 ± 0.0012 0.1573 0.7759 0.8718 ± 0.0075 0.1941 0.0451
RBF kernel ridge 0.1958 0.2583 0.1176 1.0000 0.0027 2.8573
Two-layer MLP regressor 0.1685 0.2355 0.4497 0.8390 0.0915 0.6914
Note: All runtime measurements reported in this table were executed sequentially on a single NVIDIA RTX 3090 Graphics Processing Unit (GPU) and an Intel Core i9-10900K Central Processing Unit (CPU) to ensure a standardized computational baseline.
Table 3. Selected semantic attributes used for rough-set rule induction (first 12 shown).
Table 3. Selected semantic attributes used for rough-set rule induction (first 12 shown).
Semantic attribute Salience Test MAE Selection score
stripes 0.025 0.105 0.202
hooves 0.036 0.153 0.208
swims 0.032 0.090 0.288
paws 0.045 0.180 0.228
longneck 0.020 0.099 0.166
hunter 0.037 0.159 0.204
ocean 0.032 0.072 0.346
quadrapedal 0.030 0.111 0.225
water 0.033 0.082 0.325
flippers 0.031 0.073 0.323
hands 0.025 0.075 0.271
plankton 0.016 0.043 0.257
Table 4. General rulebook properties and symbolic baseline comparison. Point estimates are reported alongside standard deviations evaluated over 5 random seeds for the proposed method.
Table 4. General rulebook properties and symbolic baseline comparison. Point estimates are reported alongside standard deviations evaluated over 5 random seeds for the proposed method.
Metric Value (Ours) CART baseline Separate-and-conquer
Number of rules 54 ± 2 53 197
Test coverage 0.8640 ± 0.0085 1.0000 0.2620
Test accuracy (non-abstained) 0.4073 ± 0.0062 0.4129 0.6334
Average conditions per rule 4.0370 ± 0.12 7.0000 3.2589
Test abstention rate 0.1360 ± 0.0085 0.0000 0.7380
Test conflict rate 0.1354 ± 0.0071 0.0000 0.2301
Table 5. Representative induced production rules from AwA2 Protocol A.
Table 5. Representative induced production rules from AwA2 Protocol A.
Rule Antecedent Numeric bounds Class Support* Conf.
R0001 paws=s2 AND hunter=s2 AND small=s0 AND yellow=s2 paws 0.40 ; hunter 0.40 ; small < 0.15 ; yellow 0.40 tiger 115 0.896
R0003 stripes=s1 AND hooves=s3 AND paws=s0 AND hands=s0 AND small=s1 AND yellow=s1 stripes [ 0.15 , 0.40 ] ; hooves 0.70 ; paws < 0.15 ; hands < 0.15 ; small [ 0.15 , 0.40 ] ; yellow [ 0.15 , 0.40 ] antelope 132 0.871
R0004 hunter=s0 AND water=s0 AND small=s0 AND jungle=s2 AND yellow=s1 hunter < 0.15 ; water < 0.15 ; small < 0.15 ; jungle 0.40 ; yellow [ 0.15 , 0.40 ] zebra 434 0.866
* Support refers to the number of instances covered in the training partition.
Table 6. Contextual quantitative comparison with published AwA2 zero-shot baselines.
Table 6. Contextual quantitative comparison with published AwA2 zero-shot baselines.
Method Family Accuracy (%) Source and scope
DAP [22] Attribute transfer 46.10 Published ZSL baseline
IAP [23] Attribute transfer 35.90 Published ZSL baseline
Ours: Semantic Transition Post-hoc bridge 48.43 Class-averaged
Ours: Symbolic Template Post-hoc rule 39.93 Class-averaged
GFZSL [24] Generative 63.80 Published ZSL baseline
Table 7. Official AwA2 xlsa17 unseen-class transfer results (SEMTRA).
Table 7. Official AwA2 xlsa17 unseen-class transfer results (SEMTRA).
Unseen class Prototype acc. Symbolic template acc. Mean Hamming
blue whale 0.9195 0.8851 0.1858
bobcat 0.9397 0.7952 0.5134
sheep 0.5908 0.1465 0.4949
bat 0.0000 0.3655 0.5909
seal 0.6110 0.4500 0.4120
walrus 0.5200 0.3200 0.4800
dolphin 0.4500 0.3800 0.5200
cow 0.3320 0.2000 0.4600
rat 0.1500 0.1000 0.5500
raccoon 0.3300 0.3507 0.4750
Average 0.4843 0.3993 0.4682
Table 8. Agreement between local post-hoc explanations and fired global-rule antecedents.
Table 8. Agreement between local post-hoc explanations and fired global-rule antecedents.
Method n Top-k agreement Direction agreement Rank correlation 95% CI
LIME-style 1000 0.2610 0.5907 0.0222 [0.24, 0.27]
SHAP-style 1000 0.3213 0.6071 0.1271 [0.30, 0.33]
Table 9. Consolidated comparison of SEMTRA against concept-based baselines.
Table 9. Consolidated comparison of SEMTRA against concept-based baselines.
Component Type Accuracy / Concept Relevance MAE Concept Corr.
Frozen-feature CBM Ante-hoc (Trained) 0.7232 0.1076 0.7759
SEMTRA (Ours) Post-hoc (Audit) 0.4073 0.1076 0.7759
TCAV Concept direction 0.4850
Table 10. Synthetic benchmark ground-truth rule dictionary.
Table 10. Synthetic benchmark ground-truth rule dictionary.
Class Ground-truth rule Numeric cut summary
1 b 1 high AND b 3 low b 1 > 0.70 ; b 3 < 0.35
2 b 2 medium AND b 5 high 0.35 < b 2 0.65 ; b 5 > 0.70
3 b 4 low, b 6 high, AND b 8 medium b 4 < 0.35 ; b 6 > 0.70 ; 0.35 < b 8 0.65
Table 11. Synthetic stress-test results averaged over random seeds.
Table 11. Synthetic stress-test results averaged over random seeds.
Noise ( σ ) MAE Threshold error Rule Jaccard Macro-F1 Coverage
0.000 0.000 0.014 0.700 0.879 0.971
0.100 0.021 0.018 0.763 0.881 0.962
0.200 0.041 0.017 0.743 0.838 0.952
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Accessibility

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated