Submitted:
08 April 2026
Posted:
13 April 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- This study used a hybrid combination ensemble approach using feature-level fusion.
- This study used weighted ensemble optimization.
- This study applied a transfer learning approach to transfer and combine the feature-level fusion ensemble result with the weighted ensemble method to increase the performance result.
- This study experimented and found the best combination of algorithms to combine in the ensemble method to improve the performance result.
- This study achieved better performance results compared to the existing studies.
2. Related Works
2.1. Gaps and Contributions
- Novel Hybrid Ensemble Architecture: We propose a novel feature-level fusion ensemble combining MobileNetV2, ResNet50, and EfficientNetB0, selected based on explicit criteria of complementarity, computational efficiency, and pediatric-specific pattern recognition; unlike previous studies that selected models arbitrarily.
- Balanced Clinical Performance: Our framework achieves an unprecedented balance between precision (94.10%) and recall (96.92%), addressing the critical clinical requirement of minimizing both false negatives and false positives simultaneously as a trade-off not achieved by prior works.
- Zero-Shot Generalization Validation: We also validated our ensemble on an external NIH pediatric dataset (94.73% accuracy) without fine-tuning, demonstrating true clinical transportability beyond benchmark dataset performance.
- Architectural Explainability Integration: We use Grad-CAM visualizations to provide clinical interpretability, enabling radiologists to understand and trust model predictions.
- Computational Efficiency for Resource-Constrained Settings: Our framework maintains high diagnostic accuracy while achieving inference speeds suitable for deployment in resource-constrained clinical environments.
3. Research Methods
3.1. Dataset Description
3.2. Data Preprocessing
3.3. Data Augmentation
3.4. Model Architecture of Hybrid Convolutional Neural Network (CNN) Ensemble
3.5. Performance Evaluation and Clinical Significance of the Proposed Ensemble Model
- It introduces a novel hybrid ensemble framework that leverages the complementary strengths of lightweight (MobileNetV2) and deep semantic (ResNet50, EfficientNetB0) networks to balance computational efficiency with high-level feature representation.
- The model is uniquely tailored to a pediatric diagnostic setting, focusing on a sensitive and underrepresented population often overlooked in mainstream AI medical research.
- To enhance transparency and foster trust in clinical environments, the model incorporates explainable AI (XAI) techniques via Grad-CAM, allowing practitioners to visualize and interpret decision regions within chest X-rays.
- A fully reproducible and well-documented pipeline has been developed, covering every stage from data preprocessing and augmentation to model training and evaluation, ensuring scientific rigor and practical deployment readiness.
- The exceptional F1-score of 94.97% confirms the model’s potential for real-world application in automated pneumonia screening tools, especially in resource-constrained healthcare environments.
4. Results and Discussions
4.1. Experimental Setup
4.2. Performance Metrics
4.3. Classification and Explanation
4.4. Comparative Analysis
- MobileNetV2: Employs depth-wise separable convolutions and linear bottlenecks. It is highly parameter-efficient (3.4M parameters) and fast, making it suitable for edge deployment. Its lower-level features capture local textures and edges, useful for detecting small consolidations.
- ResNet50: Introduces residual connections that enable training of very deep networks. Its 25.6M parameters allow learning of hierarchical, semantically rich features, particularly effective for identifying diffuse interstitial patterns characteristic of viral pneumonia.
- EfficientNetB0: Achieves state-of-the-art accuracy with compound scaling (depth, width, resolution). Its 5.3M parameters and balanced receptive field provide a complementary middle ground between the lightweight MobileNetV2 and the deeper ResNet50.
4.5. Clinical Relevance
5. Conclusions and Future Work
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Radočaj, P.; Martinović, G. Interpretable Deep Learning for Pediatric Pneumonia Diagnosis Through Multi-Phase Feature Learning and Activation Patterns. Electronics vol. 14(no. 9), pp. 1899, 2025. [CrossRef]
- Rudan; O’Brien, K. L.; Nair, H.; Liu, L.; Theodoratou, E.; Qazi, S.; Lukšić, I.; Fischer Walker, C. L.; Black, R. E.; Campbell, H. Epidemiology and etiology of childhood pneumonia in 2010: estimates of incidence, severe morbidity, mortality, underlying risk factors and causative pathogens for 192 countries. J Glob Health 2013, vol. 3(no. 1), pp. 010401. [Google Scholar]
- Tavares, L. P.; Galvão, I.; Ferrero, M. R. 5.30 - Novel Immunomodulatory Therapies for Respiratory Pathologies. In Comprehensive Pharmacology; Kenakin, T., Ed.; Elsevier: Oxford, 2022; pp. 554–594. [Google Scholar]
- W. H. Organization, Pneumonia in children. 13 May 2025, 2022. Available online: https://www.who.int/news-room/fact-sheets/detail/pneumonia.
- Zhang, Z. X.; Yong, Y.; Tan, W. C.; Shen, L.; Ng, H. S.; Fong, K. Y. Prognostic factors for mortality due to pneumonia among adults from different age groups in Singapore and mortality predictions based on PSI and CURB-65. Singapore Med J 2018, vol. 59(no. 4), 190–198. [Google Scholar] [CrossRef] [PubMed]
- Eurich, D. T.; Marrie, T. J.; Minhas-Sandhu, J. K.; Majumdar, S. R. Risk of heart failure after community acquired pneumonia: prospective controlled study with 10 years of follow-up. Bmj 2017, vol. 356, pp. j413. [Google Scholar] [CrossRef]
- Metlay, J. P.; Fine, M. J. Testing strategies in the initial management of patients with community-acquired pneumonia. Ann Intern Med 2003, vol. 138(no. 2), 109–18. [Google Scholar] [CrossRef]
- Khalifa, M.; Albadawy, M. Artificial Intelligence for Clinical Prediction: Exploring Key Domains and Essential Functions. Computer Methods and Programs in Biomedicine Update 2024/01/01/, 2024, vol. 5, pp. 100148. [Google Scholar] [CrossRef]
- Panteli, D.; Adib, K.; Buttigieg, S.; Goiana-da-Silva, F.; Ladewig, K.; Azzopardi-Muscat, N.; Figueras, J.; Novillo-Ortiz, D.; McKee, M. Artificial intelligence in public health: promises, challenges, and an agenda for policy makers and public health institutions. The Lancet Public Health 2025/05/01/, 2025, vol. 10(no. 5), e428–e432. [Google Scholar] [CrossRef] [PubMed]
- Yunianta. A Novel Advanced Performance Ensemble-Based Model (APEM) Framework: A Case Study on Diabetes Prediction. Journal of Advances in Information Technology 2024, vol. 15(no. 10), 1193–1204. [Google Scholar] [CrossRef]
- Bajwa, J.; Munir, U.; Nori, A.; Williams, B. Artificial intelligence in healthcare: transforming the practice of medicine. Future Healthcare Journal 2021/07/01/, 2021, vol. 8(no. 2), e188–e194. [Google Scholar] [CrossRef]
- Tsuneki, M. Deep learning models in medical image analysis. Journal of Oral Biosciences 2022/09/01/, 2022, vol. 64(no. 3), 312–320. [Google Scholar] [CrossRef]
- Kaya, M.; Çetin-Kaya, Y. A novel ensemble learning framework based on a genetic algorithm for the classification of pneumonia. Engineering Applications of Artificial Intelligence 2024/07/01/, 2024, vol. 133, pp. 108494. [Google Scholar] [CrossRef]
- Sotirov, S.; Orozova, D.; Angelov, B.; Sotirova, E.; Vylcheva, M. Transforming Pediatric Healthcare with Generative AI: A Hybrid CNN Approach for Pneumonia Detection. Electronics vol. 14(no. 9), pp. 1878, 2025. [CrossRef]
- Rao, S.; Zeng, Z.; Zhang, J. Robust Multiclass Pneumonia Classification via Multi-Head Attention and Transfer Learning Ensemble. Applied Sciences vol. 15(no. 21), pp. 11426, 2025. [CrossRef]
- Bhatt, H.; Shah, M. A Convolutional Neural Network ensemble model for Pneumonia Detection using chest X-ray images. Healthcare Analytics 2023/11/01/, 2023, vol. 3, pp. 100176. [Google Scholar] [CrossRef]
- Yue, Z.; Ma, L.; Zhang, R. Comparison and Validation of Deep Learning Models for the Diagnosis of Pneumonia. Computational Intelligence and Neuroscience 2020, vol. 2020(no. 1), pp. 8876798. [Google Scholar] [CrossRef]
- Rajaraman, S.; Kim, I.; Antani, S. K. Detection and visualization of abnormality in chest radiographs using modality-specific convolutional neural network ensembles. PeerJ 2020, vol. 8, pp. e8693. [Google Scholar] [CrossRef]
- Islam, M. N. Classification of pediatric pneumonia using chest X-rays by functional regression. 2020. Available online: https://arxiv.org/abs/2005.03243.
- Alsharif, R.; Al-Issa, Y.; Alqudah, A. M.; Qasmieh, I. A.; Mustafa, W. A.; Alquran, H. PneumoniaNet: Automated Detection and Classification of Pediatric Pneumonia Using Chest X-ray Images and CNN Approach. Electronics 2021, vol. 10(no. 23), pp. 2949. [Google Scholar] [CrossRef]
- Ravi, V.; Narasimhan, H.; Pham, T. D. A cost-sensitive deep learning-based meta-classifier for pediatric pneumonia classification using chest X-rays. Expert Systems 2022, vol. 39(no. 7), pp. e12966. [Google Scholar] [CrossRef]
- Mohammed; Kora, R. A comprehensive review on ensemble deep learning: Opportunities and challenges. Journal of King Saud University - Computer and Information Sciences 2023/02/01/, 2023, vol. 35(no. 2), 757–774. [Google Scholar] [CrossRef]
- Prakash, J. Arun; Asswin, C. R.; Ravi, V.; Sowmya, V.; Soman, K. P. Pediatric pneumonia diagnosis using stacked ensemble learning on multi-model deep CNN architectures. Multimedia Tools and Applications 2023/06/01, 2023, vol. 82(no. 14), 21311–21351. [Google Scholar] [CrossRef]
- Arulananth, T. S.; Prakash, S. W.; Ayyasamy, R. K.; Kavitha, V. P.; Kuppusamy, P. G.; Chinnasamy, P. Classification of Paediatric Pneumonia Using Modified DenseNet-121 Deep-Learning Model. IEEE Access 2024, vol. 12, 35716–35727. [Google Scholar] [CrossRef]
- Pan, Z.; Wang, H.; Wan, J.; Zhang, L.; Huang, J.; Shen, Y. Efficient federated learning for pediatric pneumonia on chest X-ray classification. Scientific Reports 2024, vol. 14(no. 1), pp. 23272, 2024/10/07. [Google Scholar] [CrossRef]
- Yoon, T.; Kang, D. Enhancing pediatric pneumonia diagnosis through masked autoencoders. Scientific Reports 2024, vol. 14(no. 1), pp. 6150, 2024/03/14. [Google Scholar] [CrossRef]
- Galvis Ruiz, G. E.; Benavides-Cruz, J.; Corredor, D. M.; Morales-Mendoza, E.; Cotrino Palma, H. D. A.; Cely-Jiménez, A. Development of deep learning-based classification models for opacity differentiation in pediatric chest radiography. Informatics in Medicine Unlocked vol. 52, pp. 101605, 2025/01/01/, 2025. [CrossRef]
- R, P.; Gajendran, G.; Boulaaras, S.; Tantawy, S. S. PediaPulmoDx: Harnessing cutting edge preprocessing and explainable AI for pediatric chest X-ray classification with DenseNet121. Results in Engineering 2025/03/01/, 2025, vol. 25, pp. 104320. [Google Scholar] [CrossRef]
- Katreddi, S.; Midatani, A.; Roy, A. P.; Velpuri, U.; Kasani, S. Pediatric pneumonia X-ray image classification: predictive model development with DenseNet-169 transfer learning. Journal of Medical Artificial Intelligence 2025. [Google Scholar] [CrossRef]
- Nazir, S.; Dickson, D. M.; Akram, M. U. Survey of explainable artificial intelligence techniques for biomedical imaging with deep neural networks. Computers in Biology and Medicine 2023/04/01/, 2023, vol. 156, pp. 106668. [Google Scholar] [CrossRef]
- Selvaraju, R. R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization 618–626.
- Kermany, D.; Zhang, K.; Goldbaum, M. Labeled Optical Coherence Tomography (OCT) and Chest X-Ray Images for Classification. In Mendeley Data; 2018. [Google Scholar]
- Mujahid, M.; Rustam, F.; Álvarez, R.; Luis Vidal Mazón, J.; Díez, I. T.; Ashraf, I. Pneumonia Classification from X-ray Images with Inception-V3 and Convolutional Neural Network. Diagnostics (Basel) 2022, vol. 12(no. 5). [Google Scholar] [CrossRef] [PubMed]
- Ke; Ellsworth, W.; Banerjee, O.; Ng, A. Y.; Rajpurkar, P. CheXtransfer: performance and parameter efficiency of ImageNet models for chest X-Ray interpretation. In in Proceedings of the Conference on Health, Inference, and Learning, Virtual Event, USA, 2021; pp. 116–124. [Google Scholar]
- Goodfellow, Y. Bengio, and A. Courville; The MIT Press: Deep Learning, 2016. [Google Scholar]
- C. Aggarwal, Neural Networks and Deep Learning; Springer Cham, 2023.
- Santurkar, S.; Tsipras, D.; Ilyas, A.; Madry, A. How Does Batch Normalization Help Optimization? in 32nd Conference on Neural Information Processing Systems (NIPS 2018), Montréal, Canada, 2018. [Google Scholar]
- Ioffe, S.; Szegedy, C. Batch normalization: accelerating deep network training by reducing internal covariate shift. In Proceedings of the 32nd International Conference on International Conference on Machine Learning -, Lille, France, 2015; Volume 37, pp. 448–456. [Google Scholar]
- Mumuni; Mumuni, F. Data augmentation: A comprehensive survey of modern approaches. Array 2022/12/01/, 2022, vol. 16, pp. 100258. [Google Scholar] [CrossRef]










| Component | Configuration / Description |
| Optimizer | Adam (Adaptive Moment Estimation) with decoupled weight decay for stable and efficient updates |
| Learning Rate | 1 × 10−4 — fine-tuned to ensure steady convergence without overshooting minima |
| Loss Function | Binary Cross-Entropy — appropriate for probabilistic outputs in binary classification |
| Epochs | 30 — capped with early stopping (patience = 10) to prevent overfitting |
| Batch Size | 32 — balanced for computational efficiency and learning stability |
| Regularization | Dropout with p = 0.5 applied in the fully connected layers to mitigate overfitting |
| Hardware | NVIDIA Tesla T4 GPU via Google Colab Pro for accelerated parallel training |
| Training Parameters | Values/Types |
| Model Architecture | MobileNetV2, VGG16, ResNet-50, DenseNet-201, EfficientNet-B0, InceptionV3, Xception (Pre-trained) |
| Optimizer | Adam (Learning Rate: 1× 10−4) |
| Loss Function | Categorical Crossentropy |
| Batch Size | 32 |
| Epochs | 30 |
| Dropout Rate (Layer 1) | 0.5 |
| Dropout Rate (Layer 2) | 0.3 |
| Learning Rate | 1× 10−4 |
| Weight Initialization | He Initialization |
| Activation Function | ReLU |
| Final Activation Function | Softmax |
| Input Size | 224 × 224 × 3 |
| Model Combination | Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) |
| MobileNetV2 + ResNet50 + EfficientNetB0 | 96.14 | 94.10 | 96.92 | 94.97 |
| DenseNet201 + EfficientNetB0 + MobileNetV2 | 92.31 | 91.11 | 97.18 | 94.04 |
| EfficientNetB0 + InceptionV3 + Xception | 92.63 | 92.56 | 91.62 | 92.05 |
| EfficientNetB0 + ResNet50 + VGG16 | 89.74 | 87.73 | 97.18 | 92.21 |
| InceptionV3 + ResNet101 + EfficientNet | 81.41 | 80.58 | 92.56 | 86.16 |
| Ensemble Model | Accuracy (%) |
| Without MobileNetV2 | 94.23 |
| Without ResNet50 | 93.87 |
| Without EfficientNetB0 | 94.56 |
| MobileNetV2 + ResNet50 + EfficientNetB0 | 96.14 |
| Model | Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) |
| MobileNetV2 | 93.18 | 92.86 | 95.26 | 93.18 |
| ResNet-50 | 93.11 | 92.24 | 94.97 | 92.87 |
| DenseNet-201 | 92.64 | 91.76 | 94.68 | 92.47 |
| EfficientNet-B0 | 91.36 | 90.89 | 92.93 | 91.48 |
| VGG16 | 74.29 | 55.19 | 74.29 | 63.33 |
| InceptionV3 | 90.72 | 89.46 | 90.35 | 89.82 |
| Xception | 91.94 | 90.79 | 91.58 | 90.73 |
| Model | Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) |
| MobileNetV2 | 86.16 | 84.43 | 88.17 | 85.47 |
| ResNet-50 | 88.38 | 86.47 | 89.72 | 87.62 |
| EfficientNet-B0 | 87.85 | 85.79 | 88.39 | 85.96 |
| Static Ensemble | 89.14 | 87.28 | 90.48 | 88.35 |
| Proposed Work | 94.73 | 91.03 | 96.12 | 93.47 |
| Study | Dataset | Model | Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) | Notes |
| Rajaraman et al. (2020) [18] | 24000+ Chest X-rays | Custom Ensemble Model | 91.63 | 92.49 | 88.42 | 92.86 | High performance with deep residual learning. |
| Yue et al. (2020) [17] | 5863 Chest X-rays | MobileNet | 92.98 | - | 98.98 | - | Balanced metrics suitable for clinical applications. |
| Bhatt and Shah (2023) [16] | 5863 Chest X-rays | ensemble network of 3 CNN models | 84.12 | 80.04 | 99.23 | 88.56 | Combines CNN feature extraction with machine learning classifier. |
| Sotirov et al. (2025) [14] | 5863 Chest X-rays | (CNN) with intuitionistic fuzzy estimation (IFE) | 94.93 | 93.00 | 91.00 | 91.00 | Combines convolutional neural networks with intuitionistic fuzzy estimators. |
| Rao et al. (2025) [15] | 5863 Chest X-rays | Ensemble DenseNet-121, ResNet-50, and VGG-19 | 91.67 | 92.19 | 90.00 | 90.89 | Proposes multimodel ensemble learning framework based on multi-head attention mechanism. |
|
Proposed Work |
5863 Chest X-rays | MobileNetV2 + ResNet50 + EfficientNetB0 | 96.14 | 94.10 | 96.92 | 94.97 | Achieve superior accuracy and recall, ensuring robust and balanced. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).