Submitted:
16 May 2025
Posted:
19 May 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Typical Method Classification and Data Processing Workflow
2.1. Common Machine Learning Algorithms
2.1.1. Random Forest Algorithm
2.1.2. Support Vector Machine (SVM) Algorithm
2.1.3. Principles of Deep Neural Networks
2.2. Data Processing Workflow of Machine Learning in Mineral Prediction
2.2.1. Data Cleaning and Standardization
2.2.2. Feature Selection and Extraction
2.2.3. Supervised Feature Selection
2.2.4. Unsupervised Feature Extraction
2.2.5. Multi—Source Data Fusion
3. Typical Applications of Machine Learning in Large-Scale Mineral Prediction
3.1. Research Progress in Mineral Prospecting Prediction
3.2. Case Study of Mineral Resource Estimation in Makeng Iron Deposit, Southwestern Fujian

4. Future Research Directions and Conclusions
4.1. Future Research Directions
4.1.1. Algorithm Innovation and Cross-Modal Integration
4.1.2. Big Data Platforms and Cloud-Edge Collaborative Computing
4.1.3. Integrated Geological-Machine Learning Modeling
4.2. Conclusions and Challenges
4.2.1. Summary of Research Progress
4.2.2. Existing Challenges and Countermeasures
4.2.3. Outlook on Future Development Prospects
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| ML | Machine Learning |
| AUC | Area Under The Curve |
| ROC | Receiver operating characteristic |
| GANs | Generative Adversarial Networks |
| SVM | Support Vector Machine |
| DNNs | Deep neural networks |
| RFE | Recursive Feature Elimination |
| PCA | Principal Component Analysis |
References
- XIAO Hang, ZHANG Zhansong, GUO Jianhong, et al. Coal structure identification method based on random forest combined with geophysical logging data and its application. Science, Technology and Engineering (in Chinese). 2021, 21, 10174–10180.
- KANG Qiankun. Application of random forest algorithm in sandstone type uranium mineral anomaly identification based on logging data [D]. Changchun: Jilin University, 2020.
- GUO Jianhong, ZHANG Zhansong, ZHANG Chaomo, et al. Case study on coalbed methane content prediction based on geophysical logging data using slope correlation - degree - random forest method. Exploration Geophysics and Geochemistry 2021, 45, 18–28.
- FAWAGREH K, GABER M M, ELYAN E. Random forests: from early developments to recent advancements. Systems Science & Control Engineering 2014, 2, 602–609.
- JAMES G, WITTEN D, HASTIE T, et al. An introduction to statistical learning: with applications in R [M/OL]. New York: Springer US, 2021.
- VLADIMIR N V. The nature of statistical learning theory [M]. New York: Springer, 1999.
- HAN Shili, XIAO Jian, LIU Wei. Research progress on application of machine learning in uranium resource exploration. Uranium Geology, 2021; 37, 123–130.
- ZEN H, SENIOR A, SCHUSTER M. Statistical parametric speech synthesis using deep neural networks [C]//2013 Institute of Electrical and Electronics Engineers international conference on acoustics, speech and signal processing. Vancouver: Institute of Electrical and Electronics Engineers, 2013: 7962 - 7966.
- HINTON G E. A practical guide to training restricted Boltzmann machines. Momentum 2010, 9, 926.
- WANG Hao, YAN Jiayong, FU Guangming, et al. Current status and application prospect of deep learning in geophysics. Progress in Geophysics (in Chinese). 2020, 35, 642–655.
- WANG Jingyi, WANG Zhiguo, CHEN Yumin, et al. Deep artificial neural network in seismic inversion. Progress in Geophysics (in Chinese). 2023, 38, 298–320.
- LUO Haibo, TANG Yandong. "Deep learning and its applications". Infrared and Laser Engineering (in Chinese). 2018, 47, 8–9.
- ZHAO Ming, CHEN Shi, DAVE Y. Waveform classification and seismic recognition by convolutional neural network. Chinese Journal of Geophysics (in Chinese). 2019, 62, 374–382.
- WANG Kunxi, HU Tianyue, LIU Xiaozhou, et al. Suppressing seismic multiples based on the deep neural network method with data augmentation training. Chinese Journal of Geophysics (in Chinese). 2021, 64, 4196–4214.
- LIU, G., et al. (2022). Integration of MT and hyperspectral data for 3D mineral exploration modeling. Remote Sensing of Environment 2022, 271, 112893.
- ZHOU, Y., et al. (2021). Deep feature fusion for iron ore prospectivity mapping in the Pilbara Craton. Ore Geology Reviews 2021, 134, 104187.
- TANG, C., et al. (2023). Bayesian model averaging for gold targeting in Central Asian Orogenic Belt. Natural Resources Research 2023, 32, 321–337.
- ZHANG Zhenjie, CHENG Qiuming, YANG Jie, et al. Machine learning and mineral prediction: a case study of Fe - polymetallic mineral prediction in southwestern Fujian [J]. Earth Science Frontiers, 2021, 28(03): 221 - 235. [CrossRef]
- XIANG Jie, CHEN Jianping, XIAO Keyan, et al. Three - dimensional quantitative mineral prediction based on machine learning: a case study of the Lala copper deposit in Sichuan. Geological Bulletin of China 2019, 38, 2010–2021.
- DONG Jianhui, LIU Huan, JIANG Sha, et al. Two - dimensional mineral prediction based on random forest: a case study of the Shilei - Tuhu area sedimentary manganese deposit [J]. Mineral Deposits, 2025, 44(01): 143 - 158. [CrossRef]
- Bektemyssova G ,Bykov A ,Moldagulova A , et al. Analysis of Spatial Aggregation and Activity of the Urban Population of Almaty Based on Cluster Analysis. Sustainability 2025, 17, 3243–3243. [CrossRef]
- Zhou R ,Gao Q ,Wang Q , et al. Machine Learning Optimization of Waste Salt Pyrolysis: Predicting Organic Pollutant Removal and Mass Loss. Sustainability 2025, 17, 3216. [CrossRef]
- DAHL G E, YU D, DENG L, et al. Context - dependent pre - trained deep neural networks for large - vocabulary speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on 2012, 20, 30–42. [CrossRef]
- CHEN Jin, MAO Xiancheng, LIU Zhanjun, et al. Three - dimensional mineral prediction of the Dayinziguan gold deposit based on the random forest algorithm [J]. Tectonophysics and Metallogeny, 2020, 44(02): 231 - 241.
- BI Chengxi, LIU Liangming, ZHOU Feihu. Three - dimensional mineral prediction based on machine learning combined with kinematic simulation: a case study of the Tongshan copper deposit in Anhui [J]. Tectonophysics and Metallogeny, 2025, 49(01): 103 - 116.
- ZUO Ren - guang. Deep - seated mineralization information mining and integration based on deep learning [J]. Bulletin of Mineralogy, Petrology and Geochemistry, 2019, 38(01): 53 - 60 + 203. [CrossRef]
- LIU Guan. Application of machine learning in flight delay prediction [J]. Journal of Civil Aviation, 2023, 7(06): 1 - 4.
- Leonardi S ,Distefano N ,Gruden C . Advancing Sustainable Mobility: Artificial Intelligence Approaches for Autonomous Vehicle Trajectories in Roundabouts. Sustainability 2025, 17, 2988. [CrossRef]
- WO Yi - feng. Application of machine learning in electricity spot trading price prediction. Electronic Technology 2024, 53, 238–239.
- Zhou Y ,Lv Y ,Dong J , et al. Factors Influencing Transparency in Urban Landscape Water Bodies in Taiyuan City Based on Machine Learning Approaches. Sustainability 2025, 17, 3126. [CrossRef]
- Pendar R M ,Cândido S ,Páscoa C J , et al. Enhancing Automotive Paint Curing Process Efficiency: Integration of Computational Fluid Dynamics and Variational Auto-Encoder Techniques. Sustainability 2025, 17, 3091. [CrossRef]



| Algorithm Type | Principles and Features | applicable scene |
| Random Forest | Integrates multiple decision trees, strong overfitting resistance; supports feature importance evaluation | Classification tasks, high-dimensional data |
| Support Vector Machine | Solves nonlinear problems via kernel function mapping; performs well with small samples but sensitive to noise | Small-sample classification, geochemical anomaly identification |
| Deep Learning | Automatically extracts complex features via multi-layer neural networks; requires large amounts of training data and computational power | Multi-source heterogeneous data fusion, 3D modeling |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).