Submitted:
18 July 2025
Posted:
22 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. The Dataset
2.1.1. Image Preprocessing
2.1.2. Anomaly Detection
2.1.3. Final Dataset and Class Creation
2.2. Deep Learning Architectures
2.2.1. Residual Network
2.2.2. Densely Connected Network
2.2.3. Vision Transformer
2.2.4. Swin Transformer
2.3. Training
2.3.1. Cross-Validation
2.3.2. Data Augmentation
2.3.3. Class Weighting
- N is the total number of training samples,
- C is the number of classes,
- is the number of samples in class c.
2.3.4. Batch-wise Class Balancing
2.3.5. Focal Loss
- is the predicted probability for the correct class,
- is a positive scalar that controls the strength of the focusing effect.
2.3.6. Fine-Tuning
2.3.7. Evaluation Metrics
- TP (True Positives): correct predictions of the positive class;
- FP (False Positives): incorrect predictions of the positive class;
- FN (False Negatives): missed predictions of the positive class.
- C is the number of classes,
- is the F1-score for class i.
2.3.8. Ensemble Prediction Strategy
3. Results and Discussion
- Learning rate: logarithmic range between and ;
- Number of transformations applied by RandAugment: from 2 to 4;
- Transformation magnitude applied by RandAugment: from 3 to 15.
4. Conclusions and Future Works
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Schön, J. Physical properties of rocks: a workbook; Number 8 in Handbook of petroleum exploration and production, Elsevier: Amsterdam ; Boston, 2011. OCLC: ocn751711883.
- Lawal, A.; Yang, Y.; He, H.; Baisa, N.L. Machine Learning in Oil and Gas Exploration: A Review. IEEE Access 2024, 12, 19035–19058. [Google Scholar] [CrossRef]
- Ben-Awuah, J.; Padmanabhan, E. An enhanced approach to predict permeability in reservoir sandstones using artificial neural networks (ANN). Arabian Journal of Geosciences 2017, 10, 1–15. [Google Scholar] [CrossRef]
- Huang, Z.H.; Shimeld, J.; Williamson, M.; Katsube, J. Permeability prediction with artificial neural network modeling in the Venture gas field, offshore eastern Canada. GEOPHYSICS 1996, 61, 422–436. [Google Scholar] [CrossRef]
- Helle, H.B.; Bhatt, A.; Ursin, B. Porosity and permeability prediction from wireline logs using artificial neural networks: a North Sea case study. GEOPHYSICAL PROSPECTING 2001, 49, 431–444. [Google Scholar] [CrossRef]
- Liu, J.J.; Liu, J.C. Permeability Predictions for Tight Sandstone Reservoir Using Explainable Machine Learning and Particle Swarm Optimization. Geofluids 2022, 2022, 1–15. [Google Scholar] [CrossRef]
- Zhao, X.; Chen, X.; Huang, Q.; Lan, Z.; Wang, X.; Yao, G. Logging-data-driven permeability prediction in low-permeable sandstones based on machine learning with pattern visualization: A case study in Wenchang A Sag, Pearl River Mouth Basin. Journal of Petroleum Science and Engineering 2022, 214, 110517. [Google Scholar] [CrossRef]
- Matinkia, M.; Hashami, R.; Mehrad, M.; Hajsaeedi, M.R.; Velayati, A. Prediction of permeability from well logs using a new hybrid machine learning algorithm. Petroleum 2023, 9, 108–123. [Google Scholar] [CrossRef]
- Masroor, M.; Emami Niri, M.; Rajabi-Ghozloo, A.H.; Sharifinasab, M.H.; Sajjadi, M. Application of machine and deep learning techniques to estimate NMR-derived permeability from conventional well logs and artificial 2D feature maps. JOURNAL OF PETROLEUM EXPLORATION AND PRODUCTION TECHNOLOGY 2022, 12, 2937–2953. [Google Scholar] [CrossRef]
- Tariq, Z.; Gudala, M.; Yan, B.; Sun, S.; Mahmoud, M. A fast method to infer Nuclear Magnetic Resonance based effective porosity in carbonate rocks using machine learning techniques. GEOENERGY SCIENCE AND ENGINEERING 2023, 222, 211333. [Google Scholar] [CrossRef]
- Zhao, J.; Wang, Q.; Rong, W.; Zeng, J.; Ren, Y.; Chen, H. Permeability Prediction of Carbonate Reservoir Based on Nuclear Magnetic Resonance (NMR) Logging and Machine Learning. ENERGIES 2024, 17, 1458. [Google Scholar] [CrossRef]
- Araya-Polo, M.; Alpak, F.O.; Hunter, S.; Hofmann, R.; Saxena, N. Deep learning-driven permeability estimation from 2D images. COMPUTATIONAL GEOSCIENCES 2020, 24, 571–580. [Google Scholar] [CrossRef]
- Tembely, M.; AlSumaiti, A.M.; Alameri, W.S. Machine and deep learning for estimating the permeability of complex carbonate rock from X-ray micro-computed tomography. ENERGY REPORTS 2021, 7, 1460–1472. [Google Scholar] [CrossRef]
- dos Anjos, C.E.M.; de Matos, T.F.; Avila, M.R.V.; Fernandes, J.d.C.V.; Surmas, R.; Evsukoff, A.G. Permeability estimation on raw micro-CT of carbonate rock samples using deep learning. GEOENERGY SCIENCE AND ENGINEERING 2023, 222, 211335. [Google Scholar] [CrossRef]
- Mohyeddini, A.; Rasaei, M. Calculating porosity and permeability from synthetic micro-CT scan images based on a hybrid artificial intelligence. Canadian Journal of Chemical Engineering 2023, 101, 6591–6612. [Google Scholar] [CrossRef]
- da Silva, P.N.; Goncalves, E.C.; Rios, E.H.; Muhammad, A.; Moss, A.; Pritchard, T.; Glassborow, B.; Plastino, A.; de Vasconcellos Azeredo, R.B. Automatic classification of carbonate rocks permeability from 1H NMR relaxation data. EXPERT SYSTEMS WITH APPLICATIONS 2015, 42, 4299–4309. [Google Scholar] [CrossRef]
- Favacho de Freitas, K.L.; da Silva, P.N.; Faria, B.M.; Goncalves, E.C.; Rios, E.H.; Nobre-Lopes, J.; Rabe, C.; Plastino, A.; de Vasconcelos Azeredo, R.B. A data mining approach for automatic classification of rock permeability. JOURNAL OF APPLIED GEOPHYSICS 2022, 196, 104514. [Google Scholar] [CrossRef]
- Bedi, J.; Toshniwal, D. Features denoising-based learning for porosity classification. NEURAL COMPUTING & APPLICATIONS 2020, 32, 16519–16532. [Google Scholar] [CrossRef]
- A comprehensive survey on design and application of autoencoder in deep learning. Applied Soft Computing 2023, 138, 110176. [CrossRef]
- Zhao, Y.; Nasrullah, Z.; Li, Z. PyOD: A Python Toolbox for Scalable Outlier Detection. JOURNAL OF MACHINE LEARNING RESEARCH 2019, 20, 96. [Google Scholar]
- Oquab, M.; Darcet, T.; Moutakanni, T.; Vo, H.; Szafraniec, M.; Khalidov, V.; Fernandez, P.; Haziza, D.; Massa, F.; El-Nouby, A.; et al. DINOv2: Learning Robust Visual Features without Supervision. arXiv 2024. [Google Scholar] [CrossRef]
- Zhao, F.; Zhang, C.; Geng, B. Deep Multimodal Data Fusion. ACM Comput. Surv. 2024, 56, 216–1. [Google Scholar] [CrossRef]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. NATURE 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
- Bengio, Y.; Lecun, Y.; Hinton, G. Deep Learning for AI. COMMUNICATIONS OF THE ACM 2021, 64, 58–65. [Google Scholar] [CrossRef]
- Ballard, D.H.; Brown, C.M. Computer Vision, 1st ed.; Prentice Hall Professional Technical Reference, 1982.
- Stockman, G.; Shapiro, L.G. Computer Vision, 1st ed.; Prentice Hall PTR: USA, 2001. [Google Scholar]
- Wightman, R. PyTorch Image Models, 2025. [CrossRef]
- Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, M.; et al. ImageNet Large Scale Visual Recognition Challenge. arXiv 2015, arXiv:1409.0575. [Google Scholar] [CrossRef]
- Ridnik, T.; Ben-Baruch, E.; Noy, A.; Zelnik-Manor, L. ImageNet-21K Pretraining for the Masses. arXiv 2021, arXiv:2104.10972. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. arXiv 2015. [Google Scholar] [CrossRef]
- Wightman, R.; Touvron, H.; Jégou, H. ResNet strikes back: An improved training procedure in timm. arXiv 2021. [Google Scholar] [CrossRef]
- Huang, G.; Liu, Z.; Maaten, L.v.d.; Weinberger, K.Q. Densely Connected Convolutional Networks. arXiv 2018, arXiv:1608.06993. [Google Scholar] [CrossRef] [PubMed]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE. 2021.
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.; Kaiser, .; Polosukhin, I. Attention is all you need. 2017, Vol. 2017-December, pp. 5999–6009. ISSN: 1049-5258.
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. arXiv 2021. [Google Scholar] [CrossRef]
- Cubuk, E.D.; Zoph, B.; Shlens, J.; Le, Q.V. RandAugment: Practical automated data augmentation with a reduced search space. arXiv 2019. [Google Scholar] [CrossRef]
- Ansel, J.; Yang, E.; He, H.; Gimelshein, N.; Jain, A.; Voznesensky, M.; Bao, B.; Bell, P.; Berard, D.; Burovski, E.; et al. PyTorch 2: Faster Machine Learning Through Dynamic Python Bytecode Transformation and Graph Compilation. In Proceedings of the PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, ASPLOS 2024, VOL 2, New York, 2024; pp. 929–947. Num Pages: 19 Web of Science ID: WOS:001229041700057. [CrossRef]
- Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. arXiv 2018. [Google Scholar] [CrossRef]
- Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2017. [Google Scholar] [CrossRef]
- Loshchilov, I.; Hutter, F. SGDR: Stochastic Gradient Descent with Warm Restarts. arXiv 2017. [Google Scholar] [CrossRef]
- Rokach, L. Ensemble-based classifiers. Artificial Intelligence Review 2010, 33, 1–39. [Google Scholar] [CrossRef]
- Akiba, T.; Sano, S.; Yanase, T.; Ohta, T.; Koyama, M. Optuna: A Next-generation Hyperparameter Optimization Framework. In Proceedings of the Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery &. [CrossRef]
- Loshchilov, I.; Hutter, F. Decoupled Weight Decay Regularization. arXiv 2019. [Google Scholar] [CrossRef]
- Deb, K.; Pratap, A.; Agarwal, S.; Meyarivan, T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Transactions on Evolutionary Computation 2002, 6, 182–197. [Google Scholar] [CrossRef]
| 1 | The F-Measure, or F1-score, is the harmonic mean of Precision and Recall, balancing the trade-off between false positives and false negatives. In imbalanced datasets, there are different ways to average this metric across classes. Silva et al. [16] adopted the weighted F-Measure, which accounts for class frequency, giving more influence to classes with more samples. In contrast, the present work employs the macro F1-score, which calculates the unweighted mean of the F1-scores for all classes, treating them equally regardless of class size |
| 2 | G-mean, or geometric mean, is particularly useful in imbalanced classification problems, as it balances performance across classes by combining sensitivity (recall) and specificity into a single measure. |
| 3 |
defines the number of epochs before the first learning rate restart, affecting how frequently the learning rate is reset. |









| Permeability (md) | Class | Samples | Percentage (%) |
|---|---|---|---|
| 0 – 0.1 | Low | 153 | 23.72 |
| 0.1 – 10 | Intermediate | 232 | 35.97 |
| 10 – 100 | High | 123 | 19.07 |
| ≥100 | Excellent | 137 | 21.24 |
| Total | — | 645 | 100.00 |
| Model | Learning Rate | Number of Transformations | Magnitude |
|---|---|---|---|
| ResNet-50 | 0.00586 | 3 | 12 |
| DenseNet-201 | 0.00781 | 3 | 9 |
| ViT-B/8 | 0.00373 | 2 | 6 |
| Swin-L | 0.00628 | 2 | 14 |
| Model | Average Macro F1-Score |
|---|---|
| ResNet-50 | 0.64074 ± 0.02296 |
| DenseNet-201 | 0.57408 ± 0.06085 |
| ViT-B/8 | 0.66084 ± 0.05254 |
| Swin-L | 0.64053 ± 0.02473 |
| Class | ResNet-50 | DenseNet-201 | ViT-B/8 | Swin-L | Support1 | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Prec. | Recall | F1 | Prec. | Recall | F1 | Prec. | Recall | F1 | Prec. | Recall | F1 | ||
| Low | 0.81 | 0.84 | 0.83 | 0.76 | 0.81 | 0.78 | 0.88 | 0.74 | 0.81 | 0.82 | 0.74 | 0.78 | 31 |
| Intermediate | 0.85 | 0.63 | 0.73 | 0.76 | 0.61 | 0.67 | 0.72 | 0.78 | 0.75 | 0.75 | 0.65 | 0.70 | 46 |
| High | 0.43 | 0.55 | 0.48 | 0.40 | 0.45 | 0.43 | 0.57 | 0.59 | 0.58 | 0.47 | 0.68 | 0.56 | 22 |
| Excellent | 0.60 | 0.72 | 0.65 | 0.66 | 0.76 | 0.70 | 0.80 | 0.80 | 0.80 | 0.71 | 0.68 | 0.69 | 25 |
| Accuracy | 0.69 | 0.66 | 0.74 | 0.69 | |||||||||
| Macro F1 | 0.67 | 0.65 | 0.73 | 0.68 | |||||||||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).