Submitted:
20 May 2026
Posted:
21 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Study Area and Data Description
2.1.1. Sample Preparation and Analytical Methods
2.1.2. Rationale for Element Selection
2.2. Data Preprocessing
2.2.1. Treatment of Below-Detection-Limit Values
2.2.2. Closure and CLR Transformation
2.2.3. Outlier Detection
2.3. Target Variable Definition
2.4. Exploratory Compositional Analysis and Predictor Selection
2.5. Machine Learning Models
2.5.1. Linear Baseline Models
2.5.2. Nonlinear Machine Learning Models
2.6. Model Training and Evaluation
2.7. Reconstruction of Gold Concentration
2.8. Model Interpretability Using SHAP
3. Results
3.1. Model Performance in Revised Target Space
3.2. Prediction Accuracy in the Revised Target Space
3.3. Residual Behaviour
3.4. Model Interpretation Using SHAP
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| ML | Machine Learning |
| RF | Random Forest |
| SVR | Support Vector Regression |
| MLP | Multi-Layer Perceptron |
| kNN | k-Nearest Neighbours |
| OLS | Ordinary Least Squares |
| SHAP | SHapley Additive exPlanations |
| CoDA | Compositional Data Analysis |
| CLR | Centered Log-Ratio |
| MCD | Minimum Covariance Determinant |
| PCA | Principal Component Analysis |
| RMSE | Root Mean Square Error |
| MAE | Mean Absolute Error |
| XAI | Explainable Artificial Intelligence |
| AAS | Atomic Absorption Spectrometry |
| XRF | X-ray Fluorescence |
| LA-ICP-MS | Laser Ablation Inductively Coupled Plasma Mass Spectrometry |
| OGD | Orogenic Gold Deposit |
| BDL | Below Detection Limit |
References
- Groves, D.I.; Santosh, M.; Zhang, L. A scale-integrated exploration model for orogenic gold deposits based on a mineral system approach. Geoscience Frontiers 2020, 11, 719–738. [CrossRef]
- Perret, J.; Jessell, M.W.; Masurel, Q.; et al. Review of Paleoproterozoic tectonics in the southern West African Craton: Insights from multi-disciplinary data integration. Precambrian Research 2025, 422, 107707. [CrossRef]
- Djagre, L.; Ali, K.; Kra, L.K.; Koffi, B.G. Geological controls on gold mineralization of the Nyangboué prospect in the southern part of the Boundiali-Syama belt, northwest Ivory Coast. Scientific African 2025, 27, e02584. [CrossRef]
- Beaudin, D.; Partin, C.A.; Ansdell, K.; Yang, P. Comparative lithology and alteration mineral chemistry of host rocks at the Seabee Gold Operation. Ore Geology Reviews 2024, 166, 105950. [CrossRef]
- Naumov, E.; Kalinin, Y.; Palyanova, G.; et al. Combined study of Au-bearing arsenopyrite of orogenic gold deposits (NE Asia). Geoscience Frontiers 2025, 16, 101953. [CrossRef]
- Rusk, B. Cathodoluminescent Textures and Trace Elements in Hydrothermal Quartz. In Springer Geology 2012, pp. 307–329. [CrossRef]
- Adama, A.; Eric, B.E.; Bertrant, B.S.; et al. Geochemical dataset of laterites soils in Koubou gold district (Zone A) East Cameroon. Data in Brief 2024, 57, 111039. [CrossRef]
- Campos, L.M.; Toledo, C.L.B.; Silva, A.M.; et al. The hydrothermal footprint of the Crixás deposit. Ore Geology Reviews 2022, 146, 104925. [CrossRef]
- Chen, B.; Zuo, Y.; Zheng, L.; et al. Relationship between silicification and gold mineralization. Ore Geology Reviews 2025, 176, 106394. [CrossRef]
- Li, J.; Yang, Z.M.; Wang, C.W.; et al. Metallogeny of the Xiaotongjiapuzi gold deposit. Ore Geology Reviews 2023, 157, 105455. [CrossRef]
- Ge, Y.Z.; Zhang, Z.J.; Zhou, Y.Z.; et al. Explainable machine learning reveals apatite fertility and porphyry copper mineralization. Ore Geology Reviews 2025, 183, 106679. [CrossRef]
- Boadi, B.; Raju, P.S.V.; Wemegah, D.D. Lode-gold prospectivity mapping in the Ahafo gold district. Ore Geology Reviews 2022, 148, 105059. [CrossRef]
- Behera, R.C.; Singh, S.; Srivastava, S.; et al. Trace elemental systematics of auriferous sulfides in dolerites. Ore Geology Reviews 2025, 180, 106569. [CrossRef]
- Davies, R.S.; Trott, M.; Georgi, J.; Farrar, A. AI and ML to enhance critical mineral deposit discovery. Geosystems and Geoenvironment 2025, 4, 100361. [CrossRef]
- Ahmed, A.A.; Sayed, S.; Abdoulhalik, A.; et al. Applications of machine learning to water resources management. Journal of Cleaner Production 2024, 441, 140715. [CrossRef]
- Hansen, T.F.; Erharter, G.H.; Liu, Z.; Torresen, J. ML approaches for rock mass classification. Applied Computing and Geosciences 2024, 24, 100199. [CrossRef]
- Mantilla-Dulcey, A.; Goyes-Peñafiel, P.; Báez-Rodríguez, R.; Khurama, S. Porphyry-type mineral prospectivity mapping. Gondwana Research 2024, 136, 236–250. [CrossRef]
- Zou, X.; et al. Ore fluid pathways at the giant Lannigou Carlin-type gold deposit. Ore Geology Reviews 2025, 179, 106523. [CrossRef]
- Sumail, T.; Thébaud, N.; Masurel, Q.; et al. Temporal constraints on gold mineralisation at the Jundee deposit. Precambrian Research 2024, 410, 107479. [CrossRef]
- Liu, J.; Bao, X.; Kou, S.; et al. LA-ICP-MS U-Pb geochronology of monazite in the Xinjiazui gold deposit. Ore Geology Reviews 2023, 161, 105626. [CrossRef]
- Mahboob, M.A.; Celik, T.; Genc, B. Predictive modelling of mineral prospectivity using ML. Remote Sensing Applications: Society and Environment 2024, 36, 101316. [CrossRef]
- Xue, X.F.; Feng, Y.C.; Tamer, M.T.; et al. Comparison of gold precipitation processes between disseminated and quartz vein ores of orogenic gold deposits: Insights from the Linglong gold field, Jiaodong Peninsula, China. Ore Geology Reviews 2025, 183, 106639. [CrossRef]
- Liang, Y.; Xue, W.; Li, L.; et al. Multi-stage evolution of a gold mineralization from southern China: Implications for the ore-forming processes. Ore Geology Reviews 2025, 181, 106618. [CrossRef]
- Chehreh Chelgani, S.; Nasiri, H.; Alidokht, M. Interpretable modeling of metallurgical responses for an industrial coal column flotation circuit by XGBoost and SHAP. International Journal of Mining Science and Technology 2021, 31, 1135–1144. [CrossRef]
- Antonini, A.S.; Tanzola, J.; Asiain, L.; et al. Machine learning model interpretability using SHAP values: Application to Igneous Rock Classification task. Applied Computing and Geosciences 2024, 23, 100178. [CrossRef]
- Zhang, S.; Chen, C.; Xu, J.; et al. Deterministic modelling for driving factors of mineralization in Shanggong gold deposit (China). Ore and Energy Resource Geology 2024, 17, 100062. [CrossRef]
- Raič, S.; Molnár, F.; O’Brien, H.; et al. Building geochemical vectors with trace element compositions of sulfides in orogenic gold mineral systems in northern Finland. Journal of Geochemical Exploration 2023, 251, 107252. [CrossRef]
- Zhao, H.; Wang, Q.; Groves, D.I.; et al. Genesis of orogenic gold systems in the Daduhe belt. Ore Geology Reviews 2022, 145, 104861. [CrossRef]
- Abraham, E.; Usman, A.; Amano, I. Machine learning-based classification of geological structures from magnetic anomaly data. Machine Learning with Applications 2025, 20, 100678. [CrossRef]
- Zhou, S.; Cheng, Z.; Wang, J.; et al. Uncover implicit associations among geochemical elements using machine learning. Ore Geology Reviews 2025, 179, 106506. [CrossRef]
- Cui, Q.Y.; Li, J.; Cai, W.Y.; et al. Episodic fluid pulses in the Baiyun gold deposit, Liaodong Peninsula, Eastern China. Ore Geology Reviews 2024, 174, 106313. [CrossRef]
- Ma, G.; Zhao, X.; Mao, Q.; et al. Gold and antimony metallogenic relationship of the Awanda Au (Sb) deposit. Ore Geology Reviews 2025, 176, 106384. [CrossRef]
- Buccianti, A.; Grunsky, E.C. Compositional data analysis in geochemistry: Are we sure to see what really occurs during natural processes? Journal of Geochemical Exploration 2014, 141, 1–5. [CrossRef]
- Filzmoser, P.; Hron, K.; Templ, M. Applied Compositional Data Analysis. With Worked Examples in R. Springer International Publishing 2018. [CrossRef]
- Yu, P.Y.; Li, C.; Fu, J.N.; et al. LA-ICP-MS/MS Rb-Sr sericite geochronology in orogenic gold deposits. Ore Geology Reviews 2025, 180, 106543. [CrossRef]
- Nassabeh, M.; You, Z.; Keshavarz, A.; Iglauer, S. Sub-surface geospatial intelligence in carbon storage using ML. Energy 2024, 305, 132086. [CrossRef]
- Negrello Bergami, G.; de Souza Filho, C.R.; Haddad-Martim, M.P.; Carranza, E.J.M. The multifractal nature of world-class orogenic gold systems in greenstone belts. Ore Geology Reviews 2024, 165, 105909. [CrossRef]
- Morgan, H.; Elgendy, A.; Said, A.; et al. Enhanced lithological mapping using explainable AI and remote sensing. Computers & Geosciences 2024, 193, 105738. [CrossRef]
- Lundberg, S.M.; Lee, S.I. A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems 2017, 30, 4765–4774. [CrossRef]
- Zhu, D.; Wang, J.; Kuwatani, T.; Tsuchiya, N. ML applications for magmatic-hydrothermal systems: Quartz trace-element insights. Applied Geochemistry 2025, 189, 106431. [CrossRef]
- Zhu, C.; Liu, Y.; Wang, D.; et al. Exploration of highly stable and efficient lead-free halide perovskite solar cells by ML. Cell Reports Physical Science 2024, 5, 102321. [CrossRef]
- Wang, H.; Wu, Y.; Zhang, Y.; et al. Uncertainty and Explainable Analysis of Machine Learning Model for Reconstruction of Sonic Slowness Logs. Artificial Intelligence in Geosciences 2023, 4, 182–198. [CrossRef]
- Chen, M.; Wang, H. Explainable machine learning model for prediction of ground motion parameters with uncertainty quantification. Chinese Journal of Geophysics 2022, 65, 3386–3404. [CrossRef]
- Sharapatov, A.; Saduov, A.; Assirbek, N.; et al. Prediction of rare and anomalous minerals using anomaly detection and ML. Applied Computing and Geosciences 2025, 26, 100250. [CrossRef]
- Sun, B.; Cui, W.; Liu, G.; et al. A hybrid strategy of AutoML and SHAP for explainable concrete strength prediction. Case Studies in Construction Materials 2023, 19, e02405. [CrossRef]
- Fang, X.; Gu, F.H.; Tang, J.X.; et al. Mesozoic orogenic gold metallogenesis in Tibet. Ore Geology Reviews 2024, 170, 106135. [CrossRef]
- Dai, Q.Y.; Zhang, L.M.; Zhang, K.; et al. Integrated optimization of reservoir production using ML. Petroleum Science 2025. [CrossRef]
- Aitchison, J. The statistical analysis of compositional data. Journal of the Royal Statistical Society: Series B (Methodological) 1982, 44, 139–177. https://www.jstor.org/stable/2345821.
- Pawlowsky-Glahn, V.; Egozcue, J.J.; Tolosana-Delgado, R. Modeling and Analysis of Compositional Data. Wiley 2015. [CrossRef]
- Quinn, T.P.; Erb, I.; Gloor, G.; et al. A field guide for the compositional analysis of any-omics data. GigaScience 2019, 8, giz107. [CrossRef]






| Element | Min (ppm) | Max (ppm) | Mean (ppm) | Std Dev (ppm) | 25th (ppm) | Median (ppm) | 75th (ppm) |
|---|---|---|---|---|---|---|---|
| Au | 0.01 | 2,240 | 0.82 | 11.80 | 0.01 | 0.02 | 0.09 |
| Al | 1,300 | 139,000 | 70,103 | 13,786 | 63,000 | 71,000 | 79,000 |
| Cu | 5 | 5,790 | 49.95 | 51.22 | 30 | 40 | 50 |
| Fe | 15 | 170,000 | 41,909 | 12,345 | 35,700 | 42,600 | 48,700 |
| K | 300 | 46,100 | 18,856 | 5,283 | 16,000 | 18,900 | 21,900 |
| Mn | 15 | 62,500 | 818.46 | 1,482.43 | 460 | 560 | 670 |
| Ni | 10 | 2,530 | 36.23 | 24.86 | 20 | 40 | 50 |
| S | 100 | 440,000 | 3,518 | 19,330 | 500 | 1,200 | 3,900 |
| Si | 1,150 | 550,000 | 238,281 | 42,271 | 216,000 | 233,000 | 253,000 |
| Sr | 2.5 | 1,100 | 257.01 | 80.74 | 213 | 246 | 287 |
| Ti | 50 | 22,000 | 3,580 | 1,105 | 3,300 | 3,700 | 4,000 |
| Model | Validation R2 | Validation RMSE | Validation MAE | ΔR2 vs OLS |
|---|---|---|---|---|
| RF | 0.505 | 1.264 | 1.001 | +0.198 |
| MLP | 0.505 | 1.264 | 0.999 | +0.198 |
| SVR | 0.479 | 1.297 | 0.994 | +0.172 |
| kNN | 0.476 | 1.300 | 1.028 | +0.169 |
| OLS | 0.307 | 1.495 | 1.196 | 0.000 |
| Ridge | 0.307 | 1.495 | 1.196 | 0.000 |
| Lasso | 0.307 | 1.495 | 1.196 | 0.000 |
| Huber | 0.306 | 1.497 | 1.191 | −0.002 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).