1. Introduction
Premier-class motorcycle racing is poised for a transformative shift following the publication of the
MotoGP 2027 technical package. This new regulatory framework aims to reduce cornering speeds and redefine the sport’s performance envelope through three primary pillars: (i) a reduction in engine displacement to
850 cc with a
75 mm maximum bore; (ii) a significant restriction of aerodynamic appendages; and (iii) the
prohibition of all mechanical ride-height and holeshot devices. These constraints, explicitly detailed in the official Grand Prix Commission decisions and technical summaries
123, fundamentally redefine how load transfer and chassis attitude are managed on-track.
From an engineering standpoint, the key discontinuity is not only the engine capacity reduction (
Figure 1) but the loss of
mechanical ride-height actuation that previously helped teams tune squat/pitch control during launch and acceleration phases. With reduced aero load and restricted mechanical actuation, motorcycles are expected to become more sensitive to oscillatory stability phenomena (e.g., headshake, braking-induced vibration modes, and chassis/suspension coupling). The dynamics of these high-performance instabilities have been studied in depth in the racing context, notably under the umbrella of
chatter and related self-excited vibration modes [
1,
2,
3,
4].
The same regulatory package implies a shift in on-track strategy
4. With reduced acceleration headroom and altered aerodynamic support, riders are expected to prioritize maintaining momentum and corner speed rather than relying on stop-and-go exits.
Figure 2 provides a schematic view of this transition, which matters computationally because it changes visual reference points (braking markers, apex approach textures, exit trajectories) that can be exploited as
static context by caching mechanisms.
Critically, the prohibition of mechanical ride-height devices removes one of the practical “knobs” teams used to shape attitude transitions during launch and acceleration. In racing motorcycles, attitude transitions and vibration modes are tightly coupled; classical studies describe how racing chatter emerges from interactions between tire dynamics, suspension compliance, and chassis modes, often producing observable high-frequency oscillations in the front assembly [
1,
2].
Figure 3 is therefore presented
as a motivation schematic: it does not claim official pitch numbers, but it visualizes why small degradations in damping or tire state can become operationally critical under the 2027 constraints.
These dynamics motivate
visual monitoring. Standard telemetry channels (IMU, suspension travel, wheel speeds) are invaluable but cannot directly observe contact-patch surface state, tire-sidewall texture evolution, or subtle bodywork/suspension oscillations visible in high-resolution video. Multimodal learning surveys and telemetry-driven motorsport case studies support the view that fusing heterogeneous channels is often necessary to disambiguate aggressive maneuvers from failure precursors [
5,
6]. Moreover, recent vision-based defect/anomaly detection methods show that CNN features can capture fine-grained texture signatures that may precede macroscopic failures [
7,
8]. The specific visual anomaly categories targeted in this work are illustrated in
Figure 4.
The remaining barrier is
latency. At 300 km/h (
m/s), a 100 ms end-to-end perception delay implies an 8.3 m “blind distance”. Even if video is processed locally, real deployments must satisfy tight power and determinism budgets typical of edge inference [
9,
10]. Furthermore, retrieval-based context injection (standard RAG) introduces additional non-deterministic overhead due to vector search, re-ranking, and cross-modal grounding [
11].
Figure 5 summarizes the operational latency budget that motivates the hybrid design.
To address this, we propose an
agentic visual perception framework (conceptually outlined in
Figure 6) that orchestrates two memory paths: a
cache for static circuit context and a
retrieval channel for rare, uncertainty-triggered anomaly grounding. The agent is implemented using a
ReAct (Reason+Act) control loop [
12], while fine-grained texture extraction is performed with
UNet++ (nested skip connections) to preserve high-frequency cues [
13]. The cache exploits track-level spatiotemporal redundancy (i.e., recurrent backgrounds and landmarks), closely related to place-recognition principles in long image sequences [
14]. Vector search components follow best practices for efficient similarity retrieval (e.g., FAISS-based indexing) [
10]. Importantly, the agent’s gating policy is driven by uncertainty signals; we avoid naïvely equating softmax entropy with epistemic uncertainty by drawing on modern uncertainty estimation literature [
15,
16].
1.1. Contributions and Paper Organization
This paper makes the following contributions:
Regulatory-aware problem framing: We formalize the 2027 MotoGP technical package as a perception-latency problem in which reduced mechanical/aero stabilization raises the value of real-time visual anomaly cues [
17,
18].
Hybrid memory (RAG–CAG) for millisecond budgets: We introduce a cache-first design that exploits circuit redundancy for
context access, reserving retrieval for rare events [
10,
11,
14].
Texture-sensitive visual encoder: We integrate UNet++ to preserve fine-grained texture and oscillation cues relevant to tires and front-end dynamics [
7,
13].
Uncertainty-driven agent orchestration: We deploy a ReAct loop whose routing decisions are informed by modern uncertainty estimation methods rather than raw softmax entropy alone [
12,
15,
16].
The remainder of this paper details the proposed methodology (
Section 3), the experimental validation (
Section 4), and a discussion of practical deployment considerations under racing constraints (
Section 5).
Figure 1.
Engine Regulation Shift (2027). Displacement reduction from 1000 cc to 850 cc is mandated by the 2027 framework.The power values shown are illustrative (not an official specification) and are included only to motivate the expected reduction in acceleration margin.
Figure 1.
Engine Regulation Shift (2027). Displacement reduction from 1000 cc to 850 cc is mandated by the 2027 framework.The power values shown are illustrative (not an official specification) and are included only to motivate the expected reduction in acceleration margin.
Figure 2.
Trajectory Evolution (schematic). A conceptual illustration based on the 2027 technical summary of how reduced acceleration margin can bias riders toward momentum preservation.
Figure 2.
Trajectory Evolution (schematic). A conceptual illustration based on the 2027 technical summary of how reduced acceleration margin can bias riders toward momentum preservation.
Figure 3.
Motivation Schematic: attitude sensitivity without RHD. Not an official measurement. Included to visualize why damping/tire degradations can become critical when mechanical ride-height actuation is prohibited.
Figure 3.
Motivation Schematic: attitude sensitivity without RHD. Not an official measurement. Included to visualize why damping/tire degradations can become critical when mechanical ride-height actuation is prohibited.
Figure 4.
Target Anomaly Classes. Representative visual anomaly categories addressed in this paper. They are not “defined” by regulation, but become more consequential as the 2027 package constrains mechanical/aero stabilization.
Figure 4.
Target Anomaly Classes. Representative visual anomaly categories addressed in this paper. They are not “defined” by regulation, but become more consequential as the 2027 package constrains mechanical/aero stabilization.
Figure 5.
Latency budget motivation. Standard retrieval-heavy perception can exceed a practical reaction window at racing speeds. A cache-first design aims to keep most frames within a sub-50 ms operational envelope.
Figure 5.
Latency budget motivation. Standard retrieval-heavy perception can exceed a practical reaction window at racing speeds. A cache-first design aims to keep most frames within a sub-50 ms operational envelope.
Figure 6.
System concept. A ReAct agent routes perception through a cache-first static context (CAG) path for nominal lapping, and triggers deeper historical retrieval (RAG) only when uncertainty/anomaly signals justify the cost.
Figure 6.
System concept. A ReAct agent routes perception through a cache-first static context (CAG) path for nominal lapping, and triggers deeper historical retrieval (RAG) only when uncertainty/anomaly signals justify the cost.
Figure 7.
Taxonomy of real-time perception in motorsport. Prior work emphasizes detection/tracking, temporal modeling, or video optimization in isolation. Our approach adds an agentic decision layer with entropy-gated conditional computation and a hybrid memory (CAG+RAG) designed for tight latency budgets.
Figure 7.
Taxonomy of real-time perception in motorsport. Prior work emphasizes detection/tracking, temporal modeling, or video optimization in isolation. Our approach adds an agentic decision layer with entropy-gated conditional computation and a hybrid memory (CAG+RAG) designed for tight latency budgets.
Figure 8.
High-level module interaction. The vision encoder produces embeddings ; a ReAct-style agent computes a budget-aware routing decision and selects either a fast cache path (CAG) for invariant circuit context or a deep retrieval path (RAG) for anomaly grounding in historical exemplars.
Figure 8.
High-level module interaction. The vision encoder produces embeddings ; a ReAct-style agent computes a budget-aware routing decision and selects either a fast cache path (CAG) for invariant circuit context or a deep retrieval path (RAG) for anomaly grounding in historical exemplars.
Figure 9.
Nested U-Net Architecture. Unlike standard U-Nets, the nested topology (nodes with ) aggregates features at multiple semantic levels via dense skip connections. This preserves high-frequency vibration cues (e.g., chatter) that typically vanish in deep bottlenecks.
Figure 9.
Nested U-Net Architecture. Unlike standard U-Nets, the nested topology (nodes with ) aggregates features at multiple semantic levels via dense skip connections. This preserves high-frequency vibration cues (e.g., chatter) that typically vanish in deep bottlenecks.
Figure 12.
Hysteresis-based Routing Logic. Instead of a single threshold, the system employs a Schmitt trigger mechanism. To enter the high-cost RAG mode, uncertainty must exceed (0.6). To return to CAG, uncertainty must drop below (0.4). The band prevents "flickering" (rapid switching) during ambiguous transitions.
Figure 12.
Hysteresis-based Routing Logic. Instead of a single threshold, the system employs a Schmitt trigger mechanism. To enter the high-cost RAG mode, uncertainty must exceed (0.6). To return to CAG, uncertainty must drop below (0.4). The band prevents "flickering" (rapid switching) during ambiguous transitions.
Figure 14.
Evolution of Braking Strategy (2026 vs 2027). A) The 2026 baseline (Blue) utilizes a "V-Shape" approach with late, high-pressure braking points allowed by high engine torque. B) The 2027 generated nodes (Orange) shift systematically earlier and lower in pressure ("U-Shape") to maintain corner speed. The faint blue circles in B represent the original 2026 positions, highlighting the spatial drift () and pressure reduction () visualized by the green arrows.
Figure 14.
Evolution of Braking Strategy (2026 vs 2027). A) The 2026 baseline (Blue) utilizes a "V-Shape" approach with late, high-pressure braking points allowed by high engine torque. B) The 2027 generated nodes (Orange) shift systematically earlier and lower in pressure ("U-Shape") to maintain corner speed. The faint blue circles in B represent the original 2026 positions, highlighting the spatial drift () and pressure reduction () visualized by the green arrows.
Figure 15.
RAG Precision Analysis. Comparing retrieval quality before (A) and after (B) applying regulation constraints. By zooming the similarity axis (), we observe that unfiltered retrieval prioritizes high-similarity but obsolete features (e.g., banned mechanics), whereas our domain filter ensures all retrieved exemplars are physically valid for the 2027 season.
Figure 15.
RAG Precision Analysis. Comparing retrieval quality before (A) and after (B) applying regulation constraints. By zooming the similarity axis (), we observe that unfiltered retrieval prioritizes high-similarity but obsolete features (e.g., banned mechanics), whereas our domain filter ensures all retrieved exemplars are physically valid for the 2027 season.
Figure 16.
Scenario B Spatial Heatmap. The track coloring visualizes the system’s routing decision during the "Mechanical Stress" test. The system maintains efficient CAG mode (Green) on straights but correctly escalates to RAG (Red) in Sector 3/4 where the suspension chatter anomaly manifests.
Figure 16.
Scenario B Spatial Heatmap. The track coloring visualizes the system’s routing decision during the "Mechanical Stress" test. The system maintains efficient CAG mode (Green) on straights but correctly escalates to RAG (Red) in Sector 3/4 where the suspension chatter anomaly manifests.
| System State Map |
| [fill=caggreen, draw=none] (0,0) circle (0.8ex); |
CAG (Fast path) |
| [fill=warnorange, draw=none] (0,0) circle (0.8ex); |
Gate (Hysteresis) |
| [fill=ragred, draw=none] (0,0) circle (0.8ex); |
RAG (Retrieval) |
Figure 17.
Real-Time Safety Logic. A watchdog enforces deterministic latency. Frames exceeding the safety threshold are dropped (fail-silent) to prevent stale advisories.
Figure 17.
Real-Time Safety Logic. A watchdog enforces deterministic latency. Frames exceeding the safety threshold are dropped (fail-silent) to prevent stale advisories.
Figure 18.
Physical Integration Concept for 2027-Spec Prototype. Edge compute in the tail subframe for thermal management and mass centralization; shielded links for signal integrity; isolation mounting for vibration robustness.
Figure 18.
Physical Integration Concept for 2027-Spec Prototype. Edge compute in the tail subframe for thermal management and mass centralization; shielded links for signal integrity; isolation mounting for vibration robustness.
Figure 19.
Latency Optimization Results. A) ECDF showing tail risks. B1 (Blue) breaches the 50ms deadline significantly at P95. B5 (Green) effectively truncates the tail (P99=46.5ms) via hybrid gating. B) Latency breakdown reveals that B5 amortizes the expensive Retrieval cost (Red bar) by only triggering it during high-entropy events, unlike B1 which pays the full cost every frame.
Figure 19.
Latency Optimization Results. A) ECDF showing tail risks. B1 (Blue) breaches the 50ms deadline significantly at P95. B5 (Green) effectively truncates the tail (P99=46.5ms) via hybrid gating. B) Latency breakdown reveals that B5 amortizes the expensive Retrieval cost (Red bar) by only triggering it during high-entropy events, unlike B1 which pays the full cost every frame.
Figure 20.
F1-Score Analysis by Physics Regime. The proposed Hybrid architecture (B5, Green) matches baselines in static tasks but provides a decisive uplift in the Dynamic Regime (shaded area). The retrieval mechanism allows B5 to outperform the static baseline (B0, Gray) and the simple cache (B2, Orange) in oscillatory failure modes.
Figure 20.
F1-Score Analysis by Physics Regime. The proposed Hybrid architecture (B5, Green) matches baselines in static tasks but provides a decisive uplift in the Dynamic Regime (shaded area). The retrieval mechanism allows B5 to outperform the static baseline (B0, Gray) and the simple cache (B2, Orange) in oscillatory failure modes.
Figure 21.
Confusion Matrix Comparison (Suspension Chatter). Left (A): Baseline performance shows high risk of missed detection. Right (B): Proposed hybrid system significantly increases True Positives (TP) and reduces safety-critical False Negatives (FN).
Figure 21.
Confusion Matrix Comparison (Suspension Chatter). Left (A): Baseline performance shows high risk of missed detection. Right (B): Proposed hybrid system significantly increases True Positives (TP) and reduces safety-critical False Negatives (FN).
Figure 23.
Operational Burden Analysis. At a mandatory safety recall of 90%, the memoryless baseline (B0) overwhelms the operator with false alarms (FAR 1.27). The proposed B5 system (Green) suppresses spurious warnings, keeping the False Alarm Ratio well below 1.0.
Figure 23.
Operational Burden Analysis. At a mandatory safety recall of 90%, the memoryless baseline (B0) overwhelms the operator with false alarms (FAR 1.27). The proposed B5 system (Green) suppresses spurious warnings, keeping the False Alarm Ratio well below 1.0.
Figure 24.
Performance Stability across Scenarios. While the baseline (Gray) degrades under stress, our Hybrid method (Teal) maintains robustness comparable to the expensive RAG oracle (Purple), validating its suitability for variable racing conditions.
Figure 24.
Performance Stability across Scenarios. While the baseline (Gray) degrades under stress, our Hybrid method (Teal) maintains robustness comparable to the expensive RAG oracle (Purple), validating its suitability for variable racing conditions.
Figure 25.
Energy–Accuracy Pareto Landscape. B1 (Purple) represents the theoretical ceiling but is energetically prohibitive. B5 (Teal) sits on the efficient frontier, retaining ∼99% of B1’s accuracy while reducing energy consumption by 61%, making it the only viable candidate for the 50W edge budget.
Figure 25.
Energy–Accuracy Pareto Landscape. B1 (Purple) represents the theoretical ceiling but is energetically prohibitive. B5 (Teal) sits on the efficient frontier, retaining ∼99% of B1’s accuracy while reducing energy consumption by 61%, making it the only viable candidate for the 50W edge budget.
Figure 26.
Power trace (Scenario B). Retrieval induces short power spikes (∼45–47W), while CAG remains near ∼31W. The 50W cap is never exceeded.
Figure 26.
Power trace (Scenario B). Retrieval induces short power spikes (∼45–47W), while CAG remains near ∼31W. The 50W cap is never exceeded.
Figure 27.
Cost Dynamics and Efficiency Gap. Energy consumption scales with retrieval frequency (). Crucially, the B5 operating points (Teal stars) lie below the naive linear interpolation (dashed line). This "convexity" proves that B5 is more efficient per-retrieval than B1, thanks to domain filtering reducing the vector search space.
Figure 27.
Cost Dynamics and Efficiency Gap. Energy consumption scales with retrieval frequency (). Crucially, the B5 operating points (Teal stars) lie below the naive linear interpolation (dashed line). This "convexity" proves that B5 is more efficient per-retrieval than B1, thanks to domain filtering reducing the vector search space.
Figure 28.
Topology-Aware Computation. A clear inverse correlation is observed. In high-speed sectors (Bars, grey), the system relies on cache (low lines). In complex technical sectors (Braking/Chicane), entropy rises (dashed purple), triggering the RAG mechanism (solid teal) to handle the uncertainty.
Figure 28.
Topology-Aware Computation. A clear inverse correlation is observed. In high-speed sectors (Bars, grey), the system relies on cache (low lines). In complex technical sectors (Braking/Chicane), entropy rises (dashed purple), triggering the RAG mechanism (solid teal) to handle the uncertainty.
Figure 29.
Impact of Domain Filtering. While filtering improves retrieval precision (Rel@1/5), its critical contribution is the total elimination of Physics Hallucinations (Rightmost bars). The unfiltered baseline retrieves obsolete failure modes (e.g., 2026 Ride-Height devices), creating a 28% error rate that our method sanitizes to 0%.
Figure 29.
Impact of Domain Filtering. While filtering improves retrieval precision (Rel@1/5), its critical contribution is the total elimination of Physics Hallucinations (Rightmost bars). The unfiltered baseline retrieves obsolete failure modes (e.g., 2026 Ride-Height devices), creating a 28% error rate that our method sanitizes to 0%.
Figure 30.
Drop Burstiness Analysis. 96% of watchdog triggers are isolated single-frame drops (Teal). No bursts exceeding 2 frames were observed, preventing sustained data blackouts.
Figure 30.
Drop Burstiness Analysis. 96% of watchdog triggers are isolated single-frame drops (Teal). No bursts exceeding 2 frames were observed, preventing sustained data blackouts.
Figure 32.
Thermal FSM with Hysteresis Guard. The system uses a bi-stable controller to manage thermal load. It enters Safe Mode only when the junction temperature reaches C and enforces a C cooling requirement (C) before restoring full RAG capabilities, preventing oscillation.
Figure 32.
Thermal FSM with Hysteresis Guard. The system uses a bi-stable controller to manage thermal load. It enters Safe Mode only when the junction temperature reaches C and enforces a C cooling requirement (C) before restoring full RAG capabilities, preventing oscillation.
Figure 33.
Dual-Process Decision Logic. Visualizing the routing between the reflexive “Green Path” (System 1) and the deliberative “Purple Path” (System 2). The timeline on the right illustrates how the worst-case hybrid latency fits safely within the real-time deadline.
Figure 33.
Dual-Process Decision Logic. Visualizing the routing between the reflexive “Green Path” (System 1) and the deliberative “Purple Path” (System 2). The timeline on the right illustrates how the worst-case hybrid latency fits safely within the real-time deadline.
Figure 34.
The Entropy-Compute Inverse. Visualizing the optimization principle. Standard RAG (Dashed Purple) maintains high compute cost regardless of context. Our B5 policy (Teal line) acts as the inverse of Visual Redundancy (Gray area): it minimizes compute during stationary straights and surges resources only during high-entropy events (corners), aligning energy expenditure with information gain.
Figure 34.
The Entropy-Compute Inverse. Visualizing the optimization principle. Standard RAG (Dashed Purple) maintains high compute cost regardless of context. Our B5 policy (Teal line) acts as the inverse of Visual Redundancy (Gray area): it minimizes compute during stationary straights and surges resources only during high-entropy events (corners), aligning energy expenditure with information gain.
Figure 35.
Resolution of Epistemic Uncertainty. Visual ambiguity (e.g., motion blur) causes the stateless model to output a flat, high-entropy distribution (Left). By retrieving a semantically aligned exemplar and injecting it into the fusion process (Center), the hybrid system grounds the observation, resolving the ambiguity into a sharp, confident diagnosis (Right).
Figure 35.
Resolution of Epistemic Uncertainty. Visual ambiguity (e.g., motion blur) causes the stateless model to output a flat, high-entropy distribution (Left). By retrieving a semantically aligned exemplar and injecting it into the fusion process (Center), the hybrid system grounds the observation, resolving the ambiguity into a sharp, confident diagnosis (Right).
Figure 36.
The Regulatory "Air Gap". Unlike standard RAG approaches that rely on cloud APIs, our architecture is engineered for the Air-Gapped reality of racing. The "Regulatory Firewall" prevents high-bandwidth cloud dependency and active control signals. Our system (Right) is self-contained on the edge, ensuring compliance with homologation rules that forbid external interference during the race.
Figure 36.
The Regulatory "Air Gap". Unlike standard RAG approaches that rely on cloud APIs, our architecture is engineered for the Air-Gapped reality of racing. The "Regulatory Firewall" prevents high-bandwidth cloud dependency and active control signals. Our system (Right) is self-contained on the edge, ensuring compliance with homologation rules that forbid external interference during the race.
Figure 39.
The "Air-Gapped" Security Pipeline. To protect strategic IP, the Vector Index is compiled and encrypted at the secure factory (Left). It is deployed to the edge (Right) as a read-only artifact. A logical "Data Diode" ensures that while the AI can read sensor data to generate advisories, it has no write path to the vehicle’s ECU, preventing control-level cyberattacks.
Figure 39.
The "Air-Gapped" Security Pipeline. To protect strategic IP, the Vector Index is compiled and encrypted at the secure factory (Left). It is deployed to the edge (Right) as a read-only artifact. A logical "Data Diode" ensures that while the AI can read sensor data to generate advisories, it has no write path to the vehicle’s ECU, preventing control-level cyberattacks.
Figure 40.
The System Operating Envelope. The architecture excels in low-to-medium entropy regimes (Green/Yellow zones), maintaining high throughput. However, global high-entropy events (e.g., severe weather, Red zone) saturate the retrieval budget (), causing throughput to breach the real-time floor. Future work must address this "Thermal Wall."
Figure 40.
The System Operating Envelope. The architecture excels in low-to-medium entropy regimes (Green/Yellow zones), maintaining high throughput. However, global high-entropy events (e.g., severe weather, Red zone) saturate the retrieval budget (), causing throughput to breach the real-time floor. Future work must address this "Thermal Wall."
Figure 41.
Strategic research roadmap. We progress from a validated single-agent edge loop (Phase 1), to on-device cache adaptation (Phase 2), and to post-session collaborative learning that avoids raw video exchange (Phase 3).
Figure 41.
Strategic research roadmap. We progress from a validated single-agent edge loop (Phase 1), to on-device cache adaptation (Phase 2), and to post-session collaborative learning that avoids raw video exchange (Phase 3).
Table 1.
Notation and constraints. Symbols used in the problem setting and real-time budgeting.
Table 1.
Notation and constraints. Symbols used in the problem setting and real-time budgeting.
| Symbol |
Meaning |
|
RGB frame at time t,
|
|
Telemetry vector at time t,
|
|
Ground-truth anomaly label (or multi-label vector) |
|
Predicted posterior over classes,
|
|
Output vector (posterior + advisory ) |
|
Deadline (ms) for end-to-end processing |
|
End-to-end latency at time t
|
|
Allowed tail-latency violation probability (chance constraint) |
|
Energy proxy per step t (J) |
|
Gate variable selecting memory path (0=CAG, 1=RAG) |
Table 2.
Latency budget breakdown. Measured on NVIDIA Jetson AGX Orin (MaxN mode, 50W cap). The vision encoder utilizes TensorRT (INT8), while RAG retrieval uses a GPU-accelerated HNSW index. represents the VRAM hash lookup.
Table 2.
Latency budget breakdown. Measured on NVIDIA Jetson AGX Orin (MaxN mode, 50W cap). The vision encoder utilizes TensorRT (INT8), while RAG retrieval uses a GPU-accelerated HNSW index. represents the VRAM hash lookup.
| Module |
Median (ms) |
p95 (ms) |
p99 (ms) |
|
(HW VIC: decode, resize, norm) |
1.20 |
1.35 |
1.80 |
|
(Nested U-Net Encoder [INT8]) |
8.45 |
8.60 |
9.12 |
|
(Telemetry MLP fusion) |
0.30 |
0.35 |
0.45 |
|
(Entropy calculation) |
0.15 |
0.18 |
0.22 |
|
(VRAM Context Cache) |
0.80 |
0.92 |
1.15 |
|
(HNSW Index + Re-ranking) |
26.50 |
32.10 |
38.40 |
|
(Decoder Heads) |
1.50 |
1.65 |
1.85 |
| Total (CAG Path - Low Entropy) |
12.40 |
13.05 |
14.59 |
| Total (RAG Path - High Entropy) |
38.10 |
44.23 |
49.82 |
Table 3.
System components and interfaces. Inputs/outputs, statefulness, and latency-criticality.
Table 3.
System components and interfaces. Inputs/outputs, statefulness, and latency-criticality.
| Module |
Input |
Output |
State |
Criticality |
| Preprocess
|
|
|
stateless |
medium |
| Vision encoder
|
|
|
parametric |
high |
| Telemetry norm.
|
|
|
stateless |
low |
| Gate/agent
|
|
|
stateful (EMA/hyst.) |
high |
| CAG memory
|
key from
|
|
cached (VRAM) |
very high |
| RAG memory
|
|
|
external index |
very high |
| Fusion
|
|
|
stateless |
medium |
| Decision head
|
|
|
parametric |
high |
Table 4.
Encoder configuration. Hyperparameters used to train the hardware-aware Nested U-Net. The setup prioritizes high-frequency texture retention and retrieval discriminability.
Table 4.
Encoder configuration. Hyperparameters used to train the hardware-aware Nested U-Net. The setup prioritizes high-frequency texture retention and retrieval discriminability.
| Item |
Value |
| Input resolution |
(RGB, fp16 normalized) |
| Embedding dimension
|
512 (L2-normalized) |
| Backbone Architecture |
ResNet-18 (with dense skip links) |
| Base channels |
|
| Normalization |
GroupNorm (groups=32) |
| Activation |
SiLU (Sigmoid Linear Unit) |
| Optimizer |
AdamW (lr=, wd=) |
| Loss weights
|
|
| Contrastive temperature
|
|
| Window W for
|
5 frames (ms at 120fps) |
Table 5.
Orchestrator hyperparameters. calibrated for the Jetson AGX Orin target. These settings prioritize deadline compliance (ms) over maximum retrieval depth.
Table 5.
Orchestrator hyperparameters. calibrated for the Jetson AGX Orin target. These settings prioritize deadline compliance (ms) over maximum retrieval depth.
| Param |
Value |
Description / Rational |
|
|
Base entropy threshold (balanced routing) |
|
|
Hysteresis band to prevent flicker |
| m |
5 |
Min dwell time (frames) ms at 120fps |
|
|
EMA smoothing factor (noise rejection) |
| T |
|
Temperature scaling for calibrated softmax |
|
|
Weights for Entropy, Energy, and Drift |
|
45 ms |
Tail-latency guard (conservative RAG cap) |
Table 6.
Hybrid memory configuration. Specific hyperparameters tuned for the NVIDIA Jetson AGX Orin (MaxN mode) to satisfy the 50ms real-time deadline.
Table 6.
Hybrid memory configuration. Specific hyperparameters tuned for the NVIDIA Jetson AGX Orin (MaxN mode) to satisfy the 50ms real-time deadline.
| Parameter |
Value |
Rationale / Constraint |
|
m |
Cache bin size (m safety margin) |
|
|
Strict drift test ( confidence) to limit re-keying |
|
|
Slow EMA adaptation rate to filter sensor noise |
| w |
60 frames |
0.5s persistence required to confirm systematic drift |
| k |
5 |
Soft-voting consensus size (precision vs. latency trade-off) |
|
|
Softmax temperature for similarity weighting |
| HNSW M
|
32 |
Graph connectivity optimized for vectors |
| efSearch |
64 |
Limits graph traversal depth to cap tail latency (ms) |
| Partitions |
Year_Track |
Physical index isolation (2026 vs 2027) |
Table 7.
Aspar-Synth-10K anomaly taxonomy and injection parameters. Frequencies and triggers are calibrated to match the 2027 850cc chassis dynamics.
Table 7.
Aspar-Synth-10K anomaly taxonomy and injection parameters. Frequencies and triggers are calibrated to match the 2027 850cc chassis dynamics.
| Class |
Phenomenon |
Freq. Band |
Trigger / Onset |
Severity Function |
Samples |
|
Nominal (Track/Rain) |
– |
Random |
– |
4,000 |
|
Headshake (Geometry) |
6–9 Hz |
Accel ( thr) |
Linear Ramp () |
1,500 |
|
Suspension Chatter |
18–24 Hz |
Lean () |
Sigmoid Step |
1,500 |
|
Brake Resonance |
12–16 Hz |
Brake ( bar) |
Pressure-Coupled |
1,500 |
|
Tire Graining (Visual) |
Spatial |
Late Stint () |
Exp. Accumulation |
1,500 |
Table 8.
Edge inference specifications. Key hardware constraints for the deployment target.
Table 8.
Edge inference specifications. Key hardware constraints for the deployment target.
| Parameter |
Configuration |
| Platform |
NVIDIA Jetson AGX Orin (64GB) |
| Compute Power |
275 TOPS (INT8 Sparse) |
| Memory BW |
204.8 GB/s (Critical for 4K) |
| Pipeline |
TensorRT 8.5 + HW VIC (Zero-copy) |
| Constraints |
50W TDP / ms Latency |
Table 9.
Compliance-by-design matrix (constraints → design decisions).
Table 9.
Compliance-by-design matrix (constraints → design decisions).
| Constraint |
Design decision |
| No bike↔pit signals while moving |
On-device inference; no network; local RAG; deferred updates post-session. |
| Organizer-controlled onboard cameras |
Uses simulation/engineering camera (test mode); no broadcast feed dependency. |
| No interference with ECU/actuation |
Advisory-only; read-only taps; no inline hardware; fail-silent watchdog. |
| Deterministic latency requirement |
Watchdog drops late frames; hysteresis; cache-first routing. |
Table 10.
Experimental variants (Ablation Study). We compare the full system (B5) against architectural subsets to isolate the impact of memory, hybrid routing, and domain awareness.
Table 10.
Experimental variants (Ablation Study). We compare the full system (B5) against architectural subsets to isolate the impact of memory, hybrid routing, and domain awareness.
| ID |
Configuration |
Research Question / Hypothesis |
| B0 |
No-Mem (CNN Encoder only) |
Is external memory actually necessary, or is the frozen encoder sufficient? |
| B1 |
RAG-only (Always retrieve top-k) |
What is the latency/energy penalty of continuous retrieval? (Upper bound on accuracy). |
| B2 |
CAG-only (Static cache lookup) |
Can simple caching handle novel anomalies without deep retrieval? |
| B3 |
Hybrid (Basic Entropy Gate) |
Does uncertainty-based routing effectively balance B1 and B2? |
| B4 |
Hybrid + Hysteresis (Equation (45)) |
Does the Schmitt trigger reduce "flicker" and routing instability? |
| B5 |
Ours (Full + Domain Filter Equation (63)) |
Does filtering obsolete (2026) data improve precision in the 2027 regime? |
Table 11.
Summary of Evaluation Metrics. Definitions and targets for the Jetson AGX Orin deployment.
Table 11.
Summary of Evaluation Metrics. Definitions and targets for the Jetson AGX Orin deployment.
| Symbol |
Metric Name |
Definition / Objective |
| Real-Time Safety |
|
Tail Latency |
99th percentile of end-to-end time (↓ better) |
| DMR |
Deadline Miss Rate |
(↓, target ) |
|
Blind Distance |
Meters traveled during latency at
|
| Diagnostic Quality |
| F1 |
Macro F1-Score |
Harmonic mean of precision/recall (↑) |
| ECE |
Calibration Error |
Weighted gap between confidence and accuracy (↓) |
| Rel@k
|
Retrieval Relevance |
Fraction of top-k neighbors matching GT class (↑) |
| System Efficiency |
|
Energy Cost |
Joules consumed per inference step (↓) |
|
Routing Rate |
Frequency of deep memory access () |
| FPS |
Throughput |
Frames processed per second (↑, target ) |
Table 12.
Main Results (Mean ± Std where applicable). End-to-end latency percentiles, deadline miss-rate (ms), diagnostic performance, and energy. Red values indicate violations of the 50ms safety budget.
Table 12.
Main Results (Mean ± Std where applicable). End-to-end latency percentiles, deadline miss-rate (ms), diagnostic performance, and energy. Red values indicate violations of the 50ms safety budget.
| |
Latency (ms) |
Miss |
Diagnosis |
Energy/Perf |
Energy |
| Variant |
P50 |
P95 |
P99 |
Rate (%) |
Macro-F1 |
PR-AUC |
FPS |
Avg W |
J/frame |
| B0 (No-Mem) |
12.4 |
16.1 |
18.5 |
0.0 |
0.62 |
0.70 |
75 |
28.0 |
0.37 |
| B1 (RAG-only) |
38.2 |
95.4 |
112.1 |
16.8 |
0.89 |
0.94 |
26 |
36.0 |
1.38 |
| B2 (CAG-only) |
13.1 |
17.5 |
21.3 |
0.0 |
0.71 |
0.79 |
72 |
29.0 |
0.40 |
| B3 (Hybrid) |
16.5 |
42.1 |
58.4 |
2.1 |
0.84 |
0.90 |
55 |
33.0 |
0.60 |
| B4 (Hyst.) |
16.8 |
39.5 |
49.2 |
0.9 |
0.86 |
0.91 |
56 |
32.5 |
0.58 |
| B5 (Ours) |
16.9 |
38.2 |
46.5 |
0.4 |
0.88 |
0.93 |
58 |
31.5 |
0.54 |
Table 13.
Per-Class F1 Score Analysis. The hybrid architecture yields decisive gains in dynamic/oscillatory classes (Chatter, Shaking) compared to static baselines. denotes the net improvement of B5 over B0.
Table 13.
Per-Class F1 Score Analysis. The hybrid architecture yields decisive gains in dynamic/oscillatory classes (Chatter, Shaking) compared to static baselines. denotes the net improvement of B5 over B0.
| Anomaly Class |
Dynamics |
B0 (No-Mem) |
B5 (Ours) |
Gain |
| Normal (Nominal) |
Static |
0.93 |
0.98 |
+5% |
| Track Limits |
Static |
0.92 |
0.94 |
+2% |
| Tire Blistering |
Visual |
0.78 |
0.88 |
+10% |
| Brake Shaking |
12–16 Hz |
0.66 |
0.85 |
+19% |
| Susp. Chatter |
18–24 Hz |
0.61 |
0.89 |
+28% |
Table 14.
Operational Burden at High Recall (). B5 maintains high precision where baselines fail. The "False Alarm Ratio" indicates the operational noise: B0 generates more noise than signal (1.27), while B5 is clean (0.35).
Table 14.
Operational Burden at High Recall (). B5 maintains high precision where baselines fail. The "False Alarm Ratio" indicates the operational noise: B0 generates more noise than signal (1.27), while B5 is clean (0.35).
| Method |
Precision @
|
False Alarm Ratio |
Operational Status |
| B0 (No-Mem) |
0.44 |
1.27 (High) |
Unusable (Noise > Signal) |
| B2 (CAG) |
0.52 |
0.92 (Med) |
Marginal |
| B5 (Ours) |
0.74 |
0.35 (Low) |
Viable (Signal > Noise) |
Table 15.
Robustness Analysis (Macro PR-AUC). While the baseline degrades under stress (Scenarios B/C), B5 retains the stability of the full retrieval system.
Table 15.
Robustness Analysis (Macro PR-AUC). While the baseline degrades under stress (Scenarios B/C), B5 retains the stability of the full retrieval system.
| Variant |
A: Nominal |
B: Stress |
C: Env. Shift |
Stability () |
| B0 (No-Mem) |
0.72 |
0.68 |
0.69 |
-5.6% |
| B1 (RAG-only) |
0.95 |
0.93 |
0.92 |
-3.1% |
| B5 (Ours) |
0.94 |
0.92 |
0.91 |
-3.2% |
Table 16.
B5 scenario profile. Energy correlates with RAG usage (). Nominal laps are most efficient; stress remains feasible under the 50ms deadline and 50W cap.
Table 16.
B5 scenario profile. Energy correlates with RAG usage (). Nominal laps are most efficient; stress remains feasible under the 50ms deadline and 50W cap.
| Scenario |
|
P99 (ms) |
Miss (%) |
FPS |
Avg W |
J/frame |
| A: Qualifying (Nominal) |
0.12 |
24.0 |
0.0 |
76 |
30.5 |
0.40 |
| B: Mechanical Stress |
0.45 |
46.5 |
0.4 |
51 |
34.5 |
0.68 |
| C: Environmental Shift |
0.23 |
39.0 |
0.2 |
61 |
31.1 |
0.51 |
Table 18.
Sector-wise Analysis. The system adapts to track topology. High-speed sectors allow for cache reuse (Low ), while technical low-speed sectors trigger retrieval to handle uncertainty.
Table 18.
Sector-wise Analysis. The system adapts to track topology. High-speed sectors allow for cache reuse (Low ), while technical low-speed sectors trigger retrieval to handle uncertainty.
| Sector |
Avg speed (km/h) |
|
Mean
|
| S1 (Main straight) |
270 |
0.05 |
0.15 |
| S2 (Turn 1 braking) |
60 |
0.68 |
0.75 |
| S3 (Turn 2 apex) |
100 |
0.45 |
0.55 |
| S4 (Banked) |
160 |
0.30 |
0.40 |
| S5 (Back straight) |
240 |
0.08 |
0.20 |
| S6 (Tight chicane) |
55 |
0.72 |
0.80 |
| S7 (Fast curve) |
180 |
0.25 |
0.35 |
| S8 (Finish straight) |
280 |
0.04 |
0.12 |
Table 19.
Retrieval Hygiene. Standard retrieval is polluted by obsolete data (2026 spec). Domain filtering eliminates these "Physics Hallucinations," ensuring that all retrieved context is mechanically compliant with the 2027 chassis regulations.
Table 19.
Retrieval Hygiene. Standard retrieval is polluted by obsolete data (2026 spec). Domain filtering eliminates these "Physics Hallucinations," ensuring that all retrieved context is mechanically compliant with the 2027 chassis regulations.
| Configuration |
Rel@1 ↑ |
Rel@5 ↑ |
Hallucination Rate ↓ |
Status |
| Unfiltered RAG |
0.78 |
0.72 |
0.28 |
Unsafe (Obsolete Physics) |
| Domain-Aware |
0.92 |
0.88 |
0.00 |
Compliant (2027 Spec) |
Table 22.
Scientific Traceability Matrix. A rigorous mapping of architectural claims to empirical evidence, ensuring no "orphan claims" exist in the discussion.
Table 22.
Scientific Traceability Matrix. A rigorous mapping of architectural claims to empirical evidence, ensuring no "orphan claims" exist in the discussion.
| Core Claim |
Empirical Artifact (Evidence) |
Status |
| 1. Real-Time Feasibility |
Figure 19 (ECDF) & Table 12 (Budget) confirm tail latency stays below the 50ms hard deadline (P99 ms). |
Verified |
| 2. Diagnostic Gain |
Figure 20 (F1 Uplift) & Figure 21 (Conf. Matrix) demonstrate a +28% sensitivity gain in critical Chatter faults. |
Verified |
| 3. Fail-Silent Safety |
Figure 30 (Burst Analysis) & Figure 32 (FSM Logic) prove deterministic degradation without staleness. |
Verified |
| 4. Energy Efficiency |
Figure 25 (Pareto) & Figure 27 (Convexity) validate a 61% energy reduction vs. pure RAG. |
Verified |