Submitted:
13 April 2026
Posted:
15 April 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Gearbox Fault Diagnosis Dataset from SpectraQuest, which includes vibration data collected under various load conditions for both healthy and faulty gearboxes.
- Wind Turbine SCADA Dataset, which contains operational metrics such as wind speed, power output, theoretical power curves, and wind direction.
- 3.
- Application of a hybrid CNN-LSTM architecture to jointly exploit spatial and temporal patterns.
- 4.
- Visualization and interpretation of signal characteristics through FFT, spectrograms, and SCADA trend plots.
- 5.
- Demonstration of improved fault detection capability compared to conventional analysis.
2. Literature Review
2.1. Gear Fault Diagnosis in Wind Turbines
2.2. Time-Series and Frequency-Domain Signal Analysis
2.3. Deep Learning for Condition Monitoring
2.4. CNN-LSTM Architectures in Industrial Applications
2.5. Gaps in Existing Research
- 6.
- Limited Real-Time Application: Many proposed models are developed and tested in offline settings using curated datasets. Their effectiveness in real-time fault detection remains largely unexplored due to computational complexity and latency issues.
- 7.
- Interpretability and Black Box Concerns: Deep learning models, especially hybrid architectures, are often criticized for their lack of interpretability, which can hinder adoption in critical applications like wind energy where decision transparency is essential.
- 8.
- Underrepresentation of Dynamic Conditions: Few models adequately address variable load, speed, or environmental noise, which are prevalent in wind turbine operations and significantly affect model generalization.
- 9.
- Limited Use of Spectrogram-Based Temporal Features: While spectrograms provide powerful time-frequency representations, their dynamic nature is underutilized in LSTM-based models that could otherwise leverage this information for better temporal modeling.
3. Methodology
3.1. Rationale for Hybrid Model Architecture
- Spatial Patterns in Vibration: Localized faults (e.g., a broken tooth) generate periodic impulsive shocks. In the frequency domain, this manifests as sidebands around the GMF and its harmonics, caused by amplitude modulation. A CNN is uniquely suited to automatically learn these complex, localized patterns from vibration spectrograms without relying on manual feature extraction.
- Temporal Patterns: These impulsive shocks occur at a rate determined by the shaft speed, creating a temporal signature. Furthermore, the fault progression causes a gradual deviation in SCADA trends (e.g., a growing discrepancy between theoretical and actual power). An LSTM is explicitly designed to capture such long-term temporal dependencies and sequential anomalies.
3.2. Dataset Description
3.3. Data Preprocessing
3.4. Model Architecture
3.5. Model Training and Evaluation
4. Data Visualization and Analysis
4.1. Block Diagram of Wind Turbine Gearbox System
4.2. Training Process Visualization
4.2.1. Model Training Loss Over Epochs
4.2.2. Validation Accuracy over Epochs
4.3. Vibration Signal Analysis
4.3.1. Healthy vs Faulty Gearbox Time-Series Signals
4.3.2. Frequency Domain Analysis Using FFT
4.3.3. Time-Frequency Analysis Using Spectrogram
4.4. SCADA Data Analysis
4.4.1. Wind Speed vs Power Output Curve
4.4.2. Actual vs Theoretical Power Comparison
4.4.3. Anomaly Detection in Power Output Using Z-Score
5. Results and Discussion
5.1. Performance of the Hybrid CNN-LSTM Model
5.2. Interpretability Through Feature Maps
5.3. Gear Fault Detection Insights from Vibration Data
5.4. SCADA-Based Operational Fault Detection
5.5. Comparison with Existing Methods
5.6. Confusion Matrix Analysis
- CNN-LSTM (Proposed): Achieves 95 true positives (faulty correctly identified) and 95 true negatives (healthy correctly identified), with only 5 false positives and 5 false negatives. This represents a balanced performance across both classes.
- CNN Only: Shows 85 true positives and 88 true negatives, with 12 false positives and 15 false negatives. The higher false negative rate (15%) indicates difficulty in detecting some fault patterns.
- LSTM Only: Demonstrates 88 true positives and 90 true negatives, performing better than CNN alone but still inferior to the hybrid approach.
- SVM with FFT: Exhibits the poorest performance with 80 true positives and 82 true negatives, confirming the superiority of deep learning approaches for this task.
5.7. Training Dynamics
- Rapid convergence within the first 20 epochs
- Final training loss of 0.08 and validation loss of 0.10
- Final training accuracy of 97.2% and validation accuracy of 96.8%
- Minimal gap between training and validation curves, indicating no overfitting
5.8. Comparative Performance Analysis
5.9. Model Interpretability
- Healthy condition: Feature maps show relatively uniform, low-magnitude activations
- Faulty condition: Distinct high-activation regions appear, particularly in deeper layers
- Progressive abstraction: Early layers capture basic frequency patterns, while deeper layers learn fault-specific signatures
5.10. Vibration Signal Analysis Results


- Healthy gearbox: Clean spectrum with dominant gear mesh frequency (56 Hz)
- Faulty gearbox: Presence of sidebands around gear mesh frequency (56 ± 28 Hz)
- Faulty gearbox: Additional harmonics at 84 Hz and 112 Hz
- Spectrograms show time-varying frequency content in faulty cases
6. Conclusions and Future Work
Acknowledgments
References
- Alsumaidaee, Y. A. M.; Paw, J. K. S.; Yaw, C. T.; Tiong, S. K.; Chen, C. P.; Yusaf, T.; et al. Fault detection for medium voltage switchgear using a deep learning hybrid 1D-CNN-LSTM model. IEEE Access 2023, 11, 97574–97589. [Google Scholar] [CrossRef]
- Alsumaidaee, Y. A. M.; Yaw, C. T.; Koh, S. P.; Tiong, S. K.; Chen, C. P.; Yusaf, T.; et al. Detection of corona faults in switchgear by using 1D-CNN, LSTM, and 1D-CNN-LSTM methods. Sensors 2023, 23(6), 3108. [Google Scholar]
- Borré, A.; Seman, L. O.; Camponogara, E.; Stefenon, S. F.; Mariani, V. C.; Coelho, L. D. S. Machine fault detection using a hybrid CNN-LSTM attention-based model. Sensors 2023, 23(9), 4512. [Google Scholar] [CrossRef] [PubMed]
- Crabbé, J.; Huynh, N.; Stanczuk, J.; Van Der Schaar, M. Time series diffusion in the frequency domain. arXiv 2024, arXiv:2402.05933. [Google Scholar] [CrossRef]
- Dao, F.; Zeng, Y.; Zou, Y.; Qian, J. Wear fault diagnosis in hydro-turbine via the incorporation of the IWSO algorithm optimized CNN-LSTM neural network. Scientific Reports 2024, 14(1), 25278. [Google Scholar] [CrossRef] [PubMed]
- Elmaz, F.; Eyckerman, R.; Casteels, W.; Latré, S.; Hellinckx, P. CNN-LSTM architecture for predictive indoor temperature modeling. Building and Environment 2021, 206, 108327. [Google Scholar] [CrossRef]
- Gangsar, P.; Bajpei, A. R.; Porwal, R. A review on deep learning based condition monitoring and fault diagnosis of rotating machinery. Noise & Vibration Worldwide 2022, 53(11), 550–578. [Google Scholar] [CrossRef]
- Gebreamlak, B. Hybrid CNN-Integrated LSTM for Fault Detection and Diagnosis of Wind Turbines. Master's thesis, Itä-Suomen yliopisto, 2024. [Google Scholar]
- Keil, A.; Bernat, E. M.; Cohen, M. X.; Ding, M.; Fabiani, M.; Gratton, G.; et al. Recommendations and publication guidelines for studies using frequency domain and time-frequency domain analyses of neural time series. Psychophysiology 2022, 59(5), e14052. [Google Scholar] [CrossRef] [PubMed]
- Mohamad, T. H.; Abbasi, A.; Kim, E.; Nataraj, C. Application of deep CNN-LSTM network to gear fault diagnostics. 2021 IEEE International Conference on Prognostics and Health Management (ICPHM); 2021; pp. 1–6. [Google Scholar]
- Mohammed Alsumaidaee, Y. A.; Yaw, C. T.; Koh, S. P.; Tiong, S. K.; Chen, C. P.; Yusaf, T.; et al. Detection of corona faults in switchgear by using 1D-CNN, LSTM, and 1D-CNN-LSTM methods. Sensors 2023, 23(6), 3108. [Google Scholar]
- Nguyen-Da, T.; Nguyen-Thanh, P.; Cho, M. Y. Real-time AIoT anomaly detection for industrial diesel generator based an efficient deep learning CNN-LSTM in industry 4.0. Internet of Things 2024, 27, 101280. [Google Scholar] [CrossRef]
- Qi, L.; Zhang, Q.; Xie, Y.; Zhang, J.; Ke, J. Research on wind turbine fault detection based on CNN-LSTM. Energies 2024, 17(17), 4497. [Google Scholar] [CrossRef]
- Sahu, D.; Dewangan, R. K.; Matharu, S. P. S. Hybrid CNN-LSTM model for fault diagnosis of rolling element bearings with operational defects. International Journal on Interactive Design and Manufacturing (IJIDeM) 2024, 1–12. [Google Scholar] [CrossRef]
- Serin, G.; Sener, B.; Ozbayoglu, A. M.; Unver, H. O. Review of tool condition monitoring in machining and opportunities for deep learning. The International Journal of Advanced Manufacturing Technology 2020, 109(3), 953–974. [Google Scholar] [CrossRef]
- Wahid, A.; Breslin, J. G.; Intizar, M. A. Prediction of machine failure in industry 4.0: a hybrid CNN-LSTM framework. Applied Sciences 2022, 12(9), 4221. [Google Scholar] [CrossRef]
- Wang, W.; Taylor, J.; Rees, R. J. Recent advancement of deep learning applications to machine condition monitoring part 1: a critical review. Acoustics Australia 2021, 49(2), 207–219. [Google Scholar] [CrossRef]
- Wu, Y.; Ma, X. A hybrid LSTM-KLD approach to condition monitoring of operational wind turbines. Renewable Energy 2022, 181, 554–566. [Google Scholar] [CrossRef]
- Yi, K.; Zhang, Q.; Fan, W.; Wang, S.; Wang, P.; He, H.; et al. Frequency-domain MLPs are more effective learners in time series forecasting. Advances in Neural Information Processing Systems 2023, 36, 76656–76679. [Google Scholar]
- Zegarra, F. C.; Vargas-Machuca, J.; Coronado, A. M. Comparison of CNN and CNN-LSTM architectures for tool wear estimation. 2021 IEEE Engineering International Research Conference (EIRCON); 2021; pp. 1–4. [Google Scholar]
- Zhang, F.; Zhu, Y.; Zhang, C.; Yu, P.; Li, Q. Abnormality detection method for wind turbine bearings based on CNN-LSTM. Energies 2023, 16(7), 3291. [Google Scholar] [CrossRef]
- Zhang, Q.; Yang, P.; Wen, H.; Li, X.; Wang, H.; Sun, F.; et al. Beyond the time domain: Recent advances on frequency transforms in time series analysis. arXiv 2025, arXiv:2504.07099. [Google Scholar]
- Zhou, Y.; Zhi, G.; Chen, W.; Qian, Q.; He, D.; Sun, B.; Sun, W. A new tool wear condition monitoring method based on deep learning under small samples. Measurement 2022, 189, 110622. [Google Scholar] [CrossRef]













| Sensor | Condition | Mean (m/s²) | Max (m/s²) | Std Dev |
| Sensor 1 | Healthy | 0.12 | 0.89 | 0.14 |
| Sensor 1 | Faulty | 0.26 | 1.47 | 0.31 |
| Sensor 2 | Healthy | 0.10 | 0.81 | 0.13 |
| Sensor 2 | Faulty | 0.24 | 1.52 | 0.29 |
| Sensor 3 | Healthy | 0.09 | 0.77 | 0.12 |
| Sensor 3 | Faulty | 0.21 | 1.33 | 0.26 |
| Sensor 4 | Healthy | 0.11 | 0.85 | 0.14 |
| Sensor 4 | Faulty | 0.23 | 1.42 | 0.28 |
| Condition | Dominant Frequencies (Hz) | Key Characteristics | Interpretation |
| Healthy | 28 (Shaft Freq. - f_s), 56 (GMF) | Strong GMF peak; No sidebands; Low noise floor | Normal meshing; No amplitude modulation |
| Faulty | 28 (f_s), 56 (GMF), 84 (f_s + GMF), 112 (2×GMF) | Presence of sidebands at f_s ± GMF; Higher noise floor; Broader spectral energy | Amplitude modulation due to localized fault (e.g., broken tooth) |
| Wind Speed Range (m/s) | Theoretical Power (kW) | Actual Power (kW) | Deviation (%) |
| 4–6 | 120 | 100 | -16.7 |
| 6–8 | 300 | 260 | -13.3 |
| 8–10 | 550 | 490 | -10.9 |
| 10–12 | 800 | 750 | -6.3 |
| 12–14 | 1000 | 940 | -6.0 |
| Timestamp | Power Output (kW) | Wind Speed (m/s) | Z-score |
| 2023-02-05 14:10 | 320 | 7.1 | 3.52 |
| 2023-02-18 09:40 | 280 | 8.4 | -3.18 |
| 2023-03-01 16:00 | 910 | 12.6 | 3.70 |
| 2023-03-15 11:30 | 150 | 5.8 | -3.45 |
| 2023-04-02 08:20 | 670 | 10.9 | 3.20 |
| Dataset | Accuracy (%) | Precision (%) | Recall (%) | F1-score (%) |
| Gearbox Vibration | 96.8 | 95.5 | 97.2 | 96.3 |
| SCADA Anomalies | 93.2 | 91.0 | 94.1 | 92.5 |
| Method | Accuracy (%) | F1-score (%) | Data Type |
| SVM + FFT Features | 86.4 | 84.5 | Vibration |
| LSTM (Standalone) | 90.1 | 89.0 | Time-Series |
| CNN (2D Spectrogram) | 93.7 | 92.3 | Spectrogram |
| CNN-LSTM (Proposed Model) | 96.8 | 96.3 | Multimodal |
| Method | Accuracy | Precision | Recall | F1-Score | AUC-ROC | Training Time (s) |
| SVM + FFT | 81.0% | 80.8% | 80.0% | 80.4% | 0.876 | 45.2 |
| Random Forest | 84.5% | 83.9% | 84.5% | 84.2% | 0.901 | 32.7 |
| CNN Only | 86.5% | 87.6% | 85.0% | 86.3% | 0.928 | 156.3 |
| LSTM Only | 89.0% | 89.8% | 88.0% | 88.9% | 0.941 | 189.6 |
| CNN-LSTM (Proposed) | 95.0% | 95.0% | 95.0% | 95.0% | 0.982 | 234.8 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).