Submitted:
27 January 2026
Posted:
29 January 2026
Read the latest preprint version here
Abstract
Keywords:
1. Introduction
1.1. Contributions and Paper Organization
1.1.1. Joint Optimization Framework
1.1.2. Empirical Stability Analysis
1.1.3. Comprehensive Performance Validation
- Statistically superior fusion accuracy and fault detection compared to classical base-lines.
- Hard real-time performance profiling on edge hardware, confirming its feasibility for latency-sensitive control loops.
- Effective handling of imbalanced fault data via established techniques, ensuring robust classifier training.
2. Literature Review
| Sensor | Modality | Sampling Rate | Preprocessing |
|---|---|---|---|
| S1 | Vibration (accelerometer) |
1kHz | High-pass (10 Hz), rolling std (100), cubic interpolation |
| S2 | Temperature | 1Hz | Moving median (5), z-score, zero order hold up sampling |
| S3 | Pressure | 100Hz | Low-pass (50 Hz), cubic spline interpolation |
| S4 | Current | 500Hz | Calibration offset, exponentiation smoothing (α = 0.3) |
2.1. Explainable AI and Stability in Learning-Based Control
2.2. Traditional VS. Intelligent Control Approaches
2.3. Fault Detection and Tolerance Mechanisms
2.4. Deep Learning Applications in Control Systems
3. Methodology
3.1. Research Framework Overview
3.2. Data Acquisition and Synchronization
- Vibration (S1): High-pass filter (10 Hz cutoff), rolling standard deviation (100-sample window);
- Temperature (S2): Moving median filter (5-sample window), z-score normalization;
- Pressure (S3): Low-pass filter (50 Hz cutoff), cubic spline interpolation for missing samples;
- Current (S4): Calibration offset correction, exponential smoothing (α = 0.3).
3.3. Adaptive Sensor Fusion Architecture
3.3.1. Joint Optimization Framework
- : 1D convolutional layers with kernel size= 5, stride= 1, channels= [32, 64, 128];
- : 4 encoder layers, 8 attention heads, hidden dimension= 256, feedforward dimension= 512;
- : Learnable projection matrices;
- : Interpretable fusion weights used for both estimation and fault analysis.
3.4. Joint Optimization Objective Function

3.5. Control Integration and Stability Analysis
3.5.1. Stability-Aware Design Principles
- 1.
- Pre-validated Gain Sets: All controller gains (K_nominal, K_degraded, K_safe) are designed using robust control synthesis. Each gain set satisfies stability criteria independently:
- K_nominal: Optimized for performance (settling time < 2s, overshoot < 10%)
- K_degraded: Reduced bandwidth (40% gain reduction) for robustness
- K_safe: Conservative failsafe with guaranteed stability margins > 6dB
- 2.
- Bounded Adaptation: The fault severity index triggers discrete switches between stable controllers rather than continuous gain modulation.
- 3.
- Hysteresis and Dwell Time: To prevent chattering:
- Hysteresis bands: τ₁ = 0.3, τ₁^upper = 0.35 and τ₂ = 0.7, τ₂^upper = 0.75
- Minimum dwell time: 0.5 seconds between switches
- Rate limiting: per second
3.5.2. Small-Gain Analysis Framework
- We analyze the system's robustness using the small-gain theorem as a guiding framework.
- The learned variations in effective process dynamics, introduced when the fusion model
- Pole placement verification across 1000+ switching events
- Energy-based metrics (V = x^T P x) decreasing monotonically post-fault
- Bounded trajectories over 500 fault injection trials
| Parameter | Value |
|---|---|
| Optimizer | Adam (,) |
| Batch Size | 64 (stratified sampling) |
| Epochs | 100 (Early stopping patience= 10) |
| Windows lengths | 5 seconds (5000 samples at 1 kHz) |
| Loss weights | , , |
| Class weights | Inverse frequency weighting |
| Edge device | NVIDIA Jetson AGX Xavier |
| Training hardware | NVIDIA RTX 3080 (32GB RAM) |
- Weighted Loss: Inverse frequency weighting in ;
- Data Augmentation: Synthetic minority oversampling (SMOTE) for fault classes;
- Batch Sampling: Stratified sampling ensuring equal representation per epoch.
3.6. Learning and Explainable Attention Mechanisms
- Attention Visualization: Time-varying sensor importance scores.
- Activation Clustering: Feature space analysis for fault discrimination.
- Gradient-based Saliency: Input-space importance mapping.
3.7. Fault Detection and Classification with Imbalance Handling
3.8. Fault Severity Estimation and Recovery Dynamics
3.9. Control Performance and Stability Validation

3.10. Hard Real-Time Performance Analysis
| Fault Label | Description | Raw Samples | Augmented Samples | Class Weight |
|---|---|---|---|---|
| F0 (Normal) | Nominal operation | 7000 | 1000 | 0.14 |
| F1 (Sensor Bias) | Gradual offset on sensor | 500 | 2000 | 1.0 |
| F2 (Stuck) | Sensor stuck at constant value | 500 | 2000 | 1.0 |
| F3 (Impulse) | Short high-amplitude spikes | 500 | 2000 | 1.0 |

3.11. Extended Fault Scenario Evaluation
3.11.1. Single-Sensor Fault Types
- Type 1 - Bias Drift: Gradual offset accumulation: si(t) = sitrue(t) + βt, where β ~ U (0.01, 0.05)
- Type 2 - Gain Degradation: Multiplicative scaling: si(t) = γ · sitrue(t), where γ ~ U (0.5, 0.8)
- Type 3 - Stuck-at Fault: Sensor freezes at last valid reading for duration Δt ~ U (2, 10) seconds
- Type 4 - Impulse Noise: Random spikes: si(t) = sitrue(t) + ξ(t), where ξ(t) is Pois-son-distributed impulses with amplitude ~ N (0, 5σs)
3.11.2. Compound Fault Scenarios
- 1.
- Scenario A - Sequential Faults:
- Sensor 1 develops bias drift at t = 10s
- Sensor 3 experiences stuck-at fault at t = 15s
- Tests recovery under cascading degradation
- 2.
- Scenario B - Simultaneous Multi-Sensor:
- Two sensors (randomly selected) fail concurrently
- Fault types: one bias, one impulse noise
- Tests fusion robustness with reduced redundancy
- 3.
- Scenario C - Fault + High Noise:
- Single sensor bias fault during 3× elevated ambient noise (σnoise = 3σnominal)
- Tests detection sensitivity under challenging conditions
- 4.
- Scenario D - Intermittent Faults:
- Sensor alternates between normal and stuck-at states every 3-5 seconds
- Tests rapid adaptation and detection consistency
| Scenario | Detection Rate | Mean Recovery Time | RMSE During Fault |
|---|---|---|---|
| Single (Type 1-4) | 94.2 ± 2.1% | 1.1 ± 0.2s | 0.086 ± 0.012 |
| Sequential (A) | 89.7 ± 3.4% | 1.8 ± 0.4s | 0.121 ± 0.018 |
| Simultaneous (B) High Noise (C) Intermittent (D) |
82.3 ± 4.7% 85.1 ± 3.9% 91.5 ± 2.8% |
2.3 ± 0.6s 1.4 ± 0.3s 0.9 ± 0.2s |
0.167 ± 0.031 0.143 ± 0.024 0.094 ± 0.015 |
4. Results and Discussion
4.1. Statistical Validation of Fusion Accuracy
4.2. Convergence and Training Stability
4.3. Fault Classification Robustness with Imbalance Handling
4.4. Fault Recovery Dynamics and Stability Guarantees
4.5. Hard Real-Time Performance Validation
| Metric | Nominal | During Fault | After Recovery |
|---|---|---|---|
| Rise Time (s) | 0.25 ± 0.02 | 0.38 ± 0.05 | 0.27 ± 0.03 |
| Overshoot (%) | 0.90 ± 0.08 | 1.60 ± 0.15 | 1.00 ± 0.09 |
| Settling time (s) | 0.03 ±0.005 | 0.12 ± 0.02 | 0.04 ± 0.006 |
| RMSE | 6.2 dB ±0.5 | 2.1 dB ± 0.3 | 5.8 dB ± 0.4 |
| Stability Margin | 0.90 ± 0.08 | 1.60 ± 0.15 | 1.00 ± 0.09 |
4.5. Statistical Significance of Comparative Result
4.6. Generalization and Industrial Viability
4.6.1. Limitations and Future Work
4.7. Ablation Studies
5. Conclusion
Author Contributions
Funding
Conflicts of Interest
References
- M. H. El-Mahdy and M. I. Awad, “Deep-learning-based design of active fault-tolerant control for automated manufacturing systems subjected to faulty sensors,” The International Journal of Advanced Manufacturing Technology and Control, 2024. [CrossRef]
- J. Desikan, S. K. Singh, A. Jayanthiladevi, S. Bhushan, and S. Al-Jabari, “Hybrid Machine Learning-Based Fault-Tolerant Sensor Data Fusion and Anomaly Detection for Fire Risk Mitigation in IIoT Environment,” Sen- sors, vol. 25, no. 3, p. 753, 2025. [CrossRef]
- M. M. Quamar and A. Nasir, “Review on fault diagnosis and fault-tolerant control scheme for robotic manipulators: Recent advances in AI, machine learning, and digital twin,” arXiv preprint arXiv:2402.02980, 2024.
- M. Elhoseny, D. D. Rao, B. D. Veerasamy, N. Alduaiji, and V. Thandi, “Deep Learning Algorithm for Optimized Sensor Data Fusion in Fault Diagnosis and Tolerance,” in International Conference on Machine Intelli- gence Systems and Applications, vol. 64, 2024, pp. 249–260. [CrossRef]
- U. Farinu, “Optimizing Sensor Data Fusion Using Deep Reinforcement Learning for Fault Tolerance,” Available at SSRN 5130119, 2025.
- L. Ren, Z. Jia, Y. Laili, and D. Huang, “Deep learning for time-series prediction in IIoT: progress, challenges, and prospects,” IEEE Transactions on Neural Networks and Learning Systems, vol. 34, no. 10, pp. 7837– 7854, 2023. [CrossRef]
- H. Bilal, M. S. Obaidat, M. S. Aslam, J. Zhang, A. G. G. Khan, and H. J. Al-Hajji, “Online fault diagnosis of industrial robot using IoRT and hybrid deep learning techniques: An experimental approach,” IEEE Internet of Things Journal, vol. 11, no. 12, pp. 20908–20915, 2024. [CrossRef]
- J. Dong, Z. Li, Y. Zheng, J. Luo, M. Zhang, Y. Wang, and Z. Sun, “Real- time fault detection for IIoT facilities using GA-Att-LSTM based on edge- cloud collaboration,” Frontiers in Neurosciences, vol. 18, p. 1380922, 2024. [CrossRef]
- Z. H. Khan, S. Mekid, L. A. Al-Haddad, R. T. S. K. Al-Ani, and A. T. Al- Ani, “AI Enabled Manufacturing: A Deep Learning Approach to Network Fault Detection,” in 2025 4th International Conference on Computer and Robotics Applications (ICCRA), 2025, pp. 111–117.
- H. Belgacem and I. Chihi, “Toward Reliable and Intelligent Sensor Sys- tems: A Comprehensive Study of Fault Diagnosis and Mitigation,” IEEE Sensors Reviews, 2025. [CrossRef]
- D. K. Sah, S. Almujaiwel, K. Cengiz, S. K. Singh, M. B. E. G. O. A.
- M. H. El-Mahdy, and M. I. Awad, “Energy-Efficient Task Allocation for IIoT Deep Learning Applications: An Embedded Edge Clusters Solution,” IEEE Internet of Things Magazine, vol. 8, no. 2, pp. 12–19, 2025.
- H. Mohapatra, “A Comprehensive Review on Urban Resilience via Fault- Tolerant IoT and Sensor Networks,” Computers, Materials & Continua, vol. 74, no. 1, pp. 883–901, 2025. [CrossRef]
- I. S. Gherghina, N. Bizon, G. V. Iana, and B. V. Vasilica˘, “Recent Advances in Fault Detection and Analysis of Synchronous Motors: A Review,” Machines, vol. 13, no. 1, p. 55, 2025. [CrossRef]
- Y. Feng, “Adaptive control system for collaborative sorting robotic arms based on multimodal sensor fusion and edge computing,” Scientific Re- ports, vol. 15, no. 1, p. 1125, 2025. [CrossRef]
- A. Masri and M. Al-Jabi, “Toward fault tolerant modelling for SCADA based electricity distribution networks, machine learning approach,” PeerJ Computer Science, vol. 7, e339, 2021. [CrossRef]
- M. Mousavi, M. Kordestani, M. Moradi, and M. Ghassemi, “A New Triple Deep Learning Strategy for Fault Isolation and Tolerant Cruise Control in Connected Autonomous Vehicles,” IEEE Sensors Journal, 2025. [CrossRef]
- S. Alshathri, E. E. D. Hemdan, W. El-Shafai, A. H. Al-Quraishi, K. S. M. I. Awad, and M. M. Quamar, “Digital twin-based automated fault diagnosis in industrial IoT applications,” Computers, Materials & Continua, vol. 77, no. 1, pp. 1025–1045, 2023.
- Z. Lan, “A comprehensive review of fault-tolerant routing mechanisms for the internet of things,” International Journal of Advanced Computer Science and Applications, vol. 14, no. 10, 2023. [CrossRef]
- O. Dogru, J. Xie, O. Prakash, R. Chiplunkar, and S. Alshathri, “Reinforce- ment learning in process industries: Review and perspective,” IEEE/CAA Journal of Automatica Sinica, vol. 11, no. 6, pp. 1285–1306, 2024. [CrossRef]
- Al-Salim, “Real-Time Fault Diagnosis in Control Systems Using Big Data,” Australian Journal of Instrumentation and Control, vol. 32, no. 1,.
- pp. 101–108, 2025.
- A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, "Attention is all you need," in Proc. 31st Int. Conf. Neural Inf. Process. Syst. (NeurIPS), 2017, pp. 5998–6008.
- Taylor, A. Singletary, Y. Yue, and A. Ames, "Learning for safety-critical control with control barrier functions," in Proc. 2nd Conf. Learn. Dyn. Control (L4DC), vol. 120, 2020, pp. 708–717.
- M. Fazlyab, A. Robey, H. Hassani, M. Morari, and G. J. Pappas, "Efficient and accurate estimation of Lipschitz constants for deep neural networks," in Proc. 33rd Int. Conf. Neural Inf. Process. Syst. (NeurIPS), 2019, pp. 11423–1143.









| Method | Fusion RMSE | F1 (detection) | Recovery Time (s) | Latency (ms) | Energy/Inf (J) |
|---|---|---|---|---|---|
| Kalman Filter | 0.118 ± 0.008 | 0.70 ± 0.04 | 2.5 ± 0.3 | 2.1 ± 0.3 | 0.05 ± 0.005 |
| Extended KF | 0.095 ± 0.007 | — | — | 4.3 ± 0.5 | 0.005 |
| Particle Filter SVM (fault only) LSTM Fusion Fixed Average |
0.087 ± 0.009 — 0.076 ± 0.005 0.152 ± 0.010 |
— 0.71 ± 0.03 0.79 ± 0.04 0.62 ± 0.05 |
— 2.8 ± 0.5 1.9 ± 0.43.1 ± 0.4 3.1 ± 0.4 |
18.7 ± 2.1 3.4 ± 0.4 34.2 ± 3.8 3.0 |
0.012 0.004 0.038 0.03 ± 0.003 |
| Proposed (FP32) | 0.049 ± 0.003 | 0.89 ± 0.02 | 1.1 ± 0.2 | 45.0 ± 3.1 | 0.071 |
| Propose Proposed (INT8) | 0.049 ± 0.03 | 0.89 ± 0.02 | 1.1 ± 0.2 | 23.7 ± 2.4 | 0.043 |
| Method | RMSE (mean ± std) | Correlation Gain (∆r ± std) |
|---|---|---|
| Kalman Filter | 0.152 ± 0.010 | 0.00 ± 0.00 |
| Fixed Average |
0.118 ± 0.008 |
+0.12 ± 0.015 |
| Average Proposed | 0.049 ± 0.003 | +0.27 ± 0.020 |
| Variant | RMSE | F1-score | 99th %ile Latency (ms) | Notes |
|---|---|---|---|---|
| Full model (Proposed) | 0.049 ± 0.003 | 0.89 ± 0.02 | 58 | Baseline performance |
| CNN only (no Transformer) | 0.078 ± 0.005 | 0.82 ± 0.03 | 42 | Loses long-range dependencies |
| No attention fusion (simple average) | 0.092 ± 0.006 | 0.79 ± 0.04 | 51 | Largest drop; confirms adaptive weighting key |
| Single-task loss (estimation only) | 0.061 ± 0.004 | 0.75 ± 0.03 | 58 | Fault detection suffers without multi-task |
| Half attention heads (8 → 4 | 0.055 ± 0.003 | 0.87 ± 0.02 | 55 | Minor degradation; 8 heads optimal |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.