State-of-the-Art Review of Artificial Intelligence in Environmental Geophysics and Geotechnical Engineering

Adedibu Sunny Akingboye; Andy Anderson Bery; Mbuotidem David Dick; Babangida Mohammed Ahmed; Temitayo Olamide Ale; Adeyemi Oludapo Olusola

doi:10.20944/preprints202605.1511.v1

Submitted: 21 May 2026

Posted: 22 May 2026


Abstract

Artificial intelligence (AI) is transforming environmental geophysics and geotechnical engineering (EGGE), shifting practice from empirical and deterministic workflows toward data-rich, physics-consistent, and decision-oriented subsurface intelligence. This review synthesizes advances in machine learning, deep learning, physics-informed and theory-guided modeling, multimodal data fusion, uncertainty-aware and explainable AI, and intelligent sensing for near-surface, environmental, and geotechnical systems. It presents an integrated framework linking physics-informed AI, multimodal fusion, and regulatory pathways for deployment in EGGE, bridging methodological innovation and operational adoption. These advances enable high-resolution subsurface characterization, lithological and geotechnical profiling, hydro-geomechanical parameter estimation, groundwater assessment, geohazard forecasting, and infrastructure monitoring, marking a transition to field-validated decision-support systems for early warning, risk-informed design, and climate-resilient management. Key challenges include data heterogeneity, cross-scale inconsistency, limited ground truth, fusion complexity, ill-posed inversion, weak generalization across geological and climatic regimes, and insufficient interpretability, uncertainty quantification, and validation for engineering decisions. Regulatory gaps further constrain adoption. A roadmap emphasizes scalable physics-integrated AI, next-generation multimodal fusion, interpretable and uncertainty-aware modeling, edge-cloud digital twins, adaptive data acquisition, and standardized benchmarking. AI can redefine EGGE through resilient, sustainability-aligned subsurface intelligence if scientific rigor and governance advance in parallel.

Keywords: artificial intelligence; machine/deep learning; physics-informed learning; multimodal data fusion; engineering geophysics; geotechnical engineering

Subject: Environmental and Earth Sciences - Geophysics and Geology

1. Introduction

Artificial intelligence (AI) is reshaping environmental geophysics and geotechnical engineering (EGGE) through the AI–EGGE paradigm, redefining how near-surface systems are characterized, modeled, and managed under increasing environmental and infrastructure pressures [1,2,3,4]. While conventional and physics-based approaches remain foundational, they often struggle to represent the nonlinear behavior, uncertainty, and multiscale heterogeneity that govern surface–subsurface processes [5,6,7]. The rapid growth of geophysical, geotechnical, and environmental datasets has positioned AI as a powerful complement, enabling the extraction of complex patterns from high-dimensional data and improving inference where traditional inversion and regression are poorly constrained [8,9]. By learning relationships among geophysical observables, mechanical properties, and environmental indicators, AI enhances prediction, interpretability, and decision support in subsurface characterization [10,11,12].

The adoption of AI in Earth sciences has progressed from early expert systems and regression-based models to modern machine learning (ML), including unsupervised clustering algorithms such as k-means, fuzzy c-means, and self-organizing maps, capable of extracting spatial, temporal, and structural features from complex datasets [13,14,15,16]. These foundations have further advanced into deep learning (DL) architectures, e.g., feedforward, convolutional, recurrent, autoencoder, generative, and graph-based networks, that autonomously learn hierarchical representations from high-dimensional data [17,18,19,20]. More recently, hybrid and physics-informed approaches have emerged, explicitly embedding governing physical constraints within data-driven learning to support multi-physics integration across geophysical and geotechnical domains [21,22,23]. These developments now enable coupled modeling of electrical (resistivity, induced polarization [IP], self-potential [SP]), electromagnetic (EM), seismic (refraction tomography and multichannel analysis of surface waves [SRT, MASW], reflection), ground-penetrating radar (GPR), magnetic, gravity, radiometric, and borehole datasets within unified AI-driven predictive frameworks.

EGGE are inherently complementary disciplines focused on imaging, characterizing, and engineering the near-surface critical zone, where hydrological, geological, geochemical, and geomechanical processes interact [24,25]. Environmental geophysics provides noninvasive, continuous subsurface imaging through the aforementioned geophysical datasets, enabling delineation of subsurface structure, fractures, contaminant pathways, voids, cavities, hydrogeological boundaries, and related features [26,27,28]. Geotechnical engineering complements these observations through direct characterization of soil and rock behavior using the standard penetration test (SPT/SPT-N), cone penetration test (CPT/CPT-qc), rock quality designation (RQD), rock mass quality (RMQ), and related indices [29,30]. Integrating these heterogeneous datasets within AI-enabled workflows, hereafter termed AI–EGGE, unites continuous subsurface imaging with discrete mechanical and material properties, enabling physics-consistent feature extraction, improved prediction of geotechnical indices from geophysical attributes, and enhanced spatial resolution of geomechanical parameters.

Applications of AI–EGGE are expanding rapidly across surface–subsurface and infrastructure systems. Supervised and ensemble models have been used to predict shear-wave velocity (Vs) and stiffness from ERT, MASW, or SRT [6,21,31], and infer P-wave velocity (Vp), dynamic moduli, and soil or rock integrity using ERT–SRT or ERT–IP correlations [13,27,32,33,34]. Unsupervised ML and DL further improve mapping of subsurface heterogeneity, facies, transition zones, and landslide precursors from integrated geophysical–borehole data [35,36,37]. Hybrid and physics-informed models, including Bayesian and metaheuristic–DL couplings, enhance estimation of integrity, deformation moduli, and stress–strain behavior for slope, tunnel, and foundation design [38,39]. Cross-disciplinary DL architectures, such as encoder–decoders, convolutional neural networks (CNNs), U-Nets, vision transformers (ViTFs), and graph neural networks (GNNs), extend AI–EGGE to multi-source spatiotemporal modeling [18,40,41]. By integrating gravity, thermal, and hydrogeological indicators, these models support improved contaminant tracking, groundwater dynamics, deformation monitoring, and resilience planning [42,43]. Collectively, they are advancing intelligent digital-twin–like systems and scenario-based simulation for infrastructure resilience and environmental risk management [3,44].

Given this background, this review consolidates recent theoretical and practical advances in AI–EGGE within a unified workflow framework (Figure 1). Specifically, the review: (i) synthesizes the historical and conceptual development of AI methodologies relevant to EGGE; (ii) examines strategies for data integration, feature engineering, and cross-domain coupling between geophysical and geotechnical datasets; (iii) reviews representative applications and case studies linking geophysical attributes with geotechnical indices; (iv) critically assesses persistent challenges related to data scarcity, generalization, interpretability, and validation; and (v) outlines pathways toward physically grounded, uncertainty-aware, and explainable AI (XAI) systems. Guided by this structure, the paper progresses from methodological foundations to integrated applications and forward-looking perspectives on environmental-engineering-grade AI for near-surface characterization.

2. Environmental Geophysics and Geotechnical Engineering: Foundations of AI–EGGE

The evolution of AI, from early neuron models in the 1940s through the first “golden age” of the 1950s–1960s, subsequent stagnation cycles, and the modern DL era, has progressively driven the convergence of geophysical and geotechnical domains by enabling multimodal data fusion, hierarchical feature learning, and predictive modeling across heterogeneous surface–subsurface datasets [45]. Conventional EGGE workflows, however, struggle to resolve nonlinear interactions and spatiotemporal coupling governing near-surface processes. These limitations motivate AI–EGGE as an integrative paradigm augmenting physics-based methods with data-driven learning for holistic subsurface characterization and resilience-oriented engineering. This Section outlines the core principles, data structures, and methodological foundations underpinning AI–EGGE.

2.1. Environmental Geophysics: Methods, Capabilities, and Integrated Applications

Environmental geophysics employs noninvasive near-surface techniques to characterize subsurface conditions controlling groundwater flow, contamination, land-use planning, and ecosystem resilience [12,46]. Core methods include ERT/IP, GPR, time- and frequency-domain EM, seismic approaches (SRT, MASW), and remote sensing. ERT delineates conductivity contrasts linked to aquifers, saline intrusion, and contaminant plumes but remains sensitive to saturation, temperature, and pore-fluid chemistry [28,36,47]. IP complements ERT by capturing capacitive responses associated with clay content and mineral surface properties, enhancing lithological and contaminant discrimination [27,48]. GPR provides high-resolution imaging in resistive environments yet attenuates in conductive media [25,49], while EM offers rapid coverage at lower vertical resolution [47,50]. Seismic methods (Vs, Vp) yield elastic, stiffness, and rippability properties, strengthening interpretation when integrated with electrical properties [13,33]. Remote sensing supplies regional constraints on land use, vegetation stress, and recharge variability [51,52]. Multi-method integration, joint inversion, and AI-driven fusion reduce non-uniqueness, improve uncertainty quantification (UQ), and advance physics-informed multi-physics and ML/DL frameworks [4,9].

Computational advances and autonomous sensing technologies are accelerating operational deployment [53,54]. Two dominant trends emerge: (1) fusion of geophysical and Earth observation (EO) datasets using statistical, geostatistical, and ML/DL frameworks for time-lapse characterization and forecasting, and (2) adoption of UAVs, robotic platforms, and dense sensor networks delivering high-spatiotemporal data streams. Multispectral and hyperspectral imagery detect land-surface dynamics, while field geophysics constrains subsurface structure [55,56]. Multi-sensor fusion is increasingly demonstrated in practice: Gholizadeh et al. [57] merged satellite data with ERT for regional soil-contamination mapping; Omolaiye et al. [58] integrated geophysical and remote-sensing datasets with hydrogeological models to predict groundwater availability in a semi-arid Nigerian basin; and Davis et al. [51] reviewed the convergence of sensor-based monitoring, computational techniques, and Internet of Things (IoT) for soil and groundwater contamination assessment. These hybrid approaches support integrated hydro-environmental modeling and watershed-scale decision-making. Physics-informed ML, joint inversion, and cloud-based processing further reduce interpretive ambiguity, enable transfer learning under data scarcity, and support adaptive survey design. The convergence of autonomous sensing, AI-driven processing, and cloud orchestration now underpins AI–EGGE, shifting practice from method-centric surveys toward integrated subsurface intelligence [39,59].

2.2. Geotechnical Engineering: Testing, Instabilities, Monitoring, and Computational Integration

Geotechnical engineering focuses on the mechanical and hydraulic behavior of soils and rocks under natural and engineered loading. Conventional in-situ and laboratory tests—SPT, CPT, RQD/RMQ, consolidation and shear testing, seismic profiling, and electrical methods—remain central to evaluating bearing capacity, settlement, and slope stability [14,30]. While deterministic datasets historically supported empirical correlations and factor-of-safety approaches, sensitivity to heterogeneity and uncertainty has driven the adoption of probabilistic and performance-based frameworks [60,61]. Advanced seismic testing constrains dynamic properties essential for performance-based design, while reliability methods propagate parameter uncertainty through safety and serviceability assessments [62,63]. These developments reinforce the need for integrated multi-test characterization.

Advances in non-destructive testing (NDT) have expanded the geotechnical toolkit [64]. GPR, ERT, and IP enable high-resolution subsurface imaging without disturbance, enhancing data triangulation when combined with conventional tests. Probabilistic slope-stability modeling, spatial random fields, and Bayesian updating now support quantified, scenario-based risk assessment for design and mitigation [65], moving practice beyond deterministic parameters. NDT and time-lapse geophysical monitoring also capture construction effects and seasonal hydromechanical variability [26,66]. These capabilities support improved foundation design, slope-stability evaluation, and excavation planning [60,67]. Distributed fiber-optic sensing (DFOS), including Brillouin-based methods, optical time-domain reflectometry, distributed temperature sensing, and distributed acoustic sensing (DAS), provides dense, continuous measurements of strain, temperature, and acoustic response along structural elements and slopes [68,69]. These streams enable early detection of deformation and instability precursors, although installation, calibration, and data-management challenges persist [70]. Coupled with AI-driven anomaly detection and explainable ML, DFOS supports adaptive early-warning systems.

Climate-driven variability within the near-surface critical zone is redefining boundary conditions and increasing service-life uncertainty, necessitating coupled hydrometeorological–geotechnical models [51,71]. AI and optimization approaches increasingly support scenario-based slope-failure assessment under future climatic pathways by integrating testing, monitoring, and physical modeling, which are central to computational geotechnics [8,72]. These enable the prediction of soil classification, shear strength, liquefaction potential, bearing capacity, and rock-mass indices [6,73,74]. Ensemble learners, e.g., random forest (RF), perform robustly under noisy, heterogeneous datasets [75], while DL captures nonlinear and spatial dependencies when adequately trained [76,77]. Coupling probabilistic modeling with optimization techniques such as particle swarm optimization (PSO) and genetic algorithms (GAs) improves slope-stability analysis through enhanced parameter calibration and sensitivity assessment [78]. Statistical and data-driven analytics further improve soil behavior interpretation under varying loading and environmental conditions [6,42]. Concerns regarding generalization and interpretability are accelerating hybrid and physics-informed ML frameworks embedding constitutive constraints [11,41].

2.3. Cross-Domain Synthesis in AI–EGGE

AI–EGGE paradigms integrate environmental geophysics and geotechnical engineering within a unified intelligence-driven decision-support framework anchored on three pillars: (i) observational layers (multi-method geophysics, geotechnical testing, DFOS), (ii) computational cores (joint inversion, probabilistic inference, physics-informed/hybrid ML/DL, metaheuristic optimization), and (iii) operational outputs (forecasting, anomaly detection, early warning, decision support). Convergence emerges when geophysical, EO, and monitoring datasets are fused with well logs, CPT/SPT, and DFOS into harmonized feature spaces powering physics–ML hybrid models. These systems enhance subsurface interpolation, contaminant-pathway mapping, slope-instability forecasting, and foundation assessment under transient hydro-mechanical and climatic forcing. Three frontiers accelerate this transition: (a) multi-physics joint inversion reducing ambiguity and improving UQ relative to sequential workflows; (b) physics-informed ML embedding conservation and constitutive laws to preserve realism under sparse data; and (c) cloud–edge infrastructures enabling real-time ingestion, adaptive updating, and automated alerts. Collectively, these advances shift practice toward continuous, intelligence-driven subsurface monitoring, the hallmark of AI–EGGE.

Operationalization depends on interpretability, data integrity, and stakeholder trust. XAI and UQ remain essential where outputs inform safety-critical or regulatory decisions, such as contaminant-transport risk to water sources, foundation reliability assessments, or slope failure warnings in inhabited areas [79,80,81]. Current research highlights the need for transparent hybrid models (e.g., surrogate modeling with uncertainty bounds), standardized data-provenance protocols, and harmonized sensor-calibration and inversion workflows. Socio-technical factors, such as capacity building, data governance, interoperability, and regulatory alignment [82,83], will ultimately govern AI–EGGE’s real-world impact. Establishing robust, traceable pipelines from sensing to decision-making is, therefore, as critical as improving predictive accuracy.

3. AI Techniques in EGGE

AI enables capture of nonlinear, heterogeneous, and spatiotemporally variable subsurface relationships that physics-only methods struggle to resolve, especially under data scarcity [10,84]. Multimodal data fusion strengthens AI–EGGE by harmonizing information from diverse sensing and testing modalities into a unified analytical framework for subsurface characterization. Figure 2 summarizes the three main fusion strategies—early, middle, and late—and their roles in enhancing robustness, generalization, and interpretability across supervised, unsupervised, hybrid, and physics-informed ML/DL workflows. Early fusion concatenates calibrated inputs (e.g., resistivity, Vp, Vs, SPT-N, RQD) into a single feature space; middle fusion integrates learned representations; and late fusion combines modality-specific predictions via averaging, stacking, or uncertainty-weighted ensembling. Early fusion suits co-registered data, middle fusion handles heterogeneous inputs, and late fusion is preferred when modalities differ and uncertainty must be addressed. AI-driven fusion improves efficiency and sustainability by reducing investigation time and cost while enhancing spatial coverage and interpretational consistency.
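The contrast between early and late fusion can be illustrated with a minimal sketch. It assumes each modality-specific model reports a predicted mean and a variance, and it uses inverse-variance weighting as one possible form of uncertainty-weighted ensembling. The function names `early_fusion` and `late_fusion`, and the example values, are hypothetical and are not drawn from the reviewed studies.

```python
def early_fusion(*feature_vectors):
    """Early fusion: concatenate calibrated per-modality inputs into one feature vector."""
    return [value for vector in feature_vectors for value in vector]


def late_fusion(predictions):
    """Late fusion: combine (mean, variance) predictions by inverse-variance weighting.

    Assumption: the modality-specific errors are independent, so the more
    certain prediction (smaller variance) receives the larger weight.
    """
    weights = [1.0 / variance for _, variance in predictions]
    total = sum(weights)
    fused_mean = sum(w * mean for w, (mean, _) in zip(weights, predictions)) / total
    fused_variance = 1.0 / total
    return fused_mean, fused_variance


if __name__ == "__main__":
    # Hypothetical inputs: resistivity and Vp from geophysics, SPT-N from a borehole.
    print(early_fusion([120.0, 1800.0], [25.0]))
    # Hypothetical Vs estimates (m/s) with variances from two single-modality models.
    print(late_fusion([(300.0, 400.0), (340.0, 100.0)]))
```

In this sketch the fused estimate is pulled toward the lower-variance prediction, which is the behavior that makes late fusion attractive when modalities differ in reliability.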

Supervised, clustering, and ensemble ML methods now link geophysical attributes with mechanical, hydrogeological, and geochemical soil–rock properties, improving spatial coverage, consistency, and investigation efficiency. DL architectures, including deep neural networks (DNNs), CNNs, recurrent neural networks (RNNs), long short-term memory networks (LSTMs), autoencoders (AEs), transformers (TFs), generative algorithms, generative adversarial networks (GANs), and physics-informed neural networks, further extract latent features from multimodal datasets for high-resolution 2D/3D subsurface modeling, temporal analysis, anomaly detection, contaminant mapping, and deformation forecasting [10]. AI-driven analytics integrate multi-temporal satellite, geophysical, and sensor data to assess contamination, groundwater vulnerability, and soil degradation, enabling early warning and decision support [72]. Increasing engineering complexity has accelerated hybrid models that fuse geophysical insight with geotechnical parameterization. Feedforward neural networks, especially multilayer perceptrons (MLPs), remain effective for regression and classification of subsurface properties [85]. Recent advances include GAN-based synthetic data generation for inversion and physics-informed models that embed governing constraints to improve generalization and physical consistency [30].

3.1. Supervised and Unsupervised Learning in AI–EGGE

Supervised and unsupervised learning underpin AI–EGGE by enabling automated prediction, pattern discovery, subsurface characterization, and decision support across integrated geophysical–geotechnical investigations. They are widely applied to model and predict Vs, Vp, elastic moduli, SPT-N and CPT-qc values, compressive strength, bedrock depth, fracture density, porosity, weathering grade, and geohazard susceptibility using geophysical, remote sensing, and geospatial methods.

3.1.1. Supervised Learning

Supervised learning establishes quantitative relationships between input variables and labeled outputs, enabling the estimation of key geotechnical and subsurface integrity parameters from surface and borehole datasets [78,86,87]. Common approaches include linear, multiple, polynomial, and regularized regression methods, namely simple linear regression (SLR), multiple linear regression (MLR), least absolute shrinkage and selection operator (LASSO), and Ridge regression, alongside support vector machines (SVMs), k-nearest neighbors (KNN), decision trees (DT), RF, M5 model trees, extremely randomized trees (XRT), gradient boosting machines (GBMs) and variants (GBoost, XGBoost, sGBM, LightGBM), categorical boosting (CatBoost), and artificial neural networks (ANNs), including shallow and deep architectures (SNN, DNN). These models capture linear to highly nonlinear relationships and complex feature interactions, while gene expression programming (GEP) supports interpretable symbolic regression and equation-based prediction. In general, regression-based methods model linear–polynomial trends; SVMs optimize class separation using kernel-based margins; KNN applies instance-based learning; tree-based and ensemble models enable rule-based decision modeling; boosting frameworks improve sequential ensemble performance; and SNN/DNN learn hierarchical nonlinear mappings through multi-layer feature extraction [13,31,32,42,88]. GEP is particularly beneficial in EGGE as it yields explicit analytical equations that support interpretability, parameter sensitivity assessment, and engineering adoption [24,85].

The models illustrated in Figure 3 summarize key supervised ML approaches widely used in AI–EGGE. The ANN (a) comprises an input layer, one or more hidden layers, and an output layer, where weighted connections and nonlinear activation functions learn complex input–output relationships for regression and classification in geophysical–geotechnical datasets. SVM (b) constructs an optimal separating hyperplane for classification or regression by maximizing the margin between classes, where training samples are represented as vectors $x_{i}$ with corresponding labels $y_{i}$, and the decision function is defined as $f(x) = w^{T} x + b$. Kernel functions, $K(x_{i}, x_{j})$ (e.g., linear, polynomial, RBF), map nonlinearly separable data into a higher-dimensional feature space, enabling SVM to handle complex geophysical–geotechnical patterns; support vectors, the samples closest to the margin, control the model’s decision boundary. DT (c) models split data into hierarchical decision rules at internal nodes to minimize impurity or prediction error, with leaves producing final class labels or numeric outputs; DTs are intuitive, transparent, and effective for baseline geotechnical assessments. RF (d) constructs an ensemble of $k$ decision trees using bootstrap sampling and random feature selection, with individual predictions aggregated through majority voting (classification) or averaging (regression) to reduce overfitting and improve generalization. GBM (e) builds trees sequentially, with each new tree fitting the residuals of the previous learner to minimize a chosen loss function; the final prediction is the additive combination of all weak learners, $W_{1} + W_{2} + \dots + W_{n}$, enabling strong nonlinear predictive capability. CatBoost (f) is an advanced gradient-boosting algorithm optimized for heterogeneous datasets with categorical variables, transforming categorical features using ordered target statistics to prevent leakage and overfitting, and building symmetric (oblivious) trees at each iteration to enhance training stability, speed, and generalization for diverse environmental/geotechnical predictors.
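The sequential residual-fitting idea behind gradient boosting can be sketched in a few lines. The example below is a simplified illustration under stated assumptions: squared-error loss, a single input feature, depth-one trees (stumps) as weak learners, and a fixed learning rate. The names `fit_stump` and `gradient_boost` and the toy data are hypothetical. This is not the implementation used by XGBoost, LightGBM, or CatBoost.

```python
def fit_stump(x, r):
    """Fit a one-split regression stump to residuals r by minimizing squared error."""
    best = None
    xs = sorted(set(x))
    for lo, hi in zip(xs, xs[1:]):
        threshold = 0.5 * (lo + hi)
        left = [ri for xi, ri in zip(x, r) if xi <= threshold]
        right = [ri for xi, ri in zip(x, r) if xi > threshold]
        mean_left = sum(left) / len(left)
        mean_right = sum(right) / len(right)
        sse = (sum((ri - mean_left) ** 2 for ri in left)
               + sum((ri - mean_right) ** 2 for ri in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, mean_left, mean_right)
    _, threshold, mean_left, mean_right = best
    return lambda xi: mean_left if xi <= threshold else mean_right


def gradient_boost(x, y, n_trees=50, lr=0.3):
    """Build an additive model: base value plus lr-scaled stumps fitted to residuals."""
    base = sum(y) / len(y)
    trees = []
    pred = [base] * len(y)
    for _ in range(n_trees):
        # For squared-error loss, the negative gradient is the current residual.
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        stump = fit_stump(x, residuals)
        trees.append(stump)
        pred = [pi + lr * stump(xi) for pi, xi in zip(pred, x)]
    return lambda xi: base + lr * sum(tree(xi) for tree in trees)


if __name__ == "__main__":
    # Toy nonlinear relationship standing in for an attribute-to-index mapping.
    x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
    y = [1.0, 4.0, 9.0, 16.0, 25.0, 36.0]
    model = gradient_boost(x, y)
    print([round(model(xi), 2) for xi in x])
```

Each stump corrects what the previous ensemble left unexplained, so the final prediction is the additive combination of weak learners described above.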

Model performance is evaluated using R² score, mean absolute error (MAE), mean squared error (MSE), root mean squared error (RMSE), root mean squared logarithmic error (RMSLE), mean absolute percentage error (MAPE), median absolute percentage error (MdAPE), Nash–Sutcliffe Efficiency (NSE), and Willmott’s Index of Agreement, with robustness assessed through repeated k-fold cross-validation, hold-out testing, residual analysis, and bias–variance diagnostics [32,75,89]. XAI techniques such as SHapley Additive exPlanations (SHAP), Local Interpretable Model-Agnostic Explanations (LIME), and Accumulated Local Effects (ALE) are increasingly used to interpret model behavior, quantify feature influence, and enhance transparency and trust in geoscience-focused machine learning applications [9].
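Several of the error metrics listed above can be computed directly from paired observed and predicted values. The sketch below uses their standard textbook definitions; the function name `regression_metrics` and the sample values are hypothetical. Note that NSE, as defined here, has the same algebraic form as the R² score computed as one minus the ratio of residual to total sum of squares.

```python
import math


def regression_metrics(obs, pred):
    """Return MAE, MSE, RMSE, MAPE (%), and NSE for paired observations and predictions."""
    n = len(obs)
    errors = [p - o for o, p in zip(obs, pred)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)
    # MAPE is undefined when an observation equals zero; inputs are assumed nonzero.
    mape = 100.0 * sum(abs(e / o) for e, o in zip(errors, obs)) / n
    mean_obs = sum(obs) / n
    ss_tot = sum((o - mean_obs) ** 2 for o in obs)
    nse = 1.0 - sum(e * e for e in errors) / ss_tot
    return {"MAE": mae, "MSE": mse, "RMSE": rmse, "MAPE": mape, "NSE": nse}


if __name__ == "__main__":
    # Hypothetical observed and predicted SPT-N values.
    observed = [10.0, 20.0, 30.0, 40.0]
    predicted = [12.0, 18.0, 33.0, 39.0]
    for name, value in regression_metrics(observed, predicted).items():
        print(f"{name}: {value:.3f}")
```

In practice these metrics would be computed on held-out folds rather than on the training data, in line with the cross-validation and hold-out procedures noted above.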

3.1.2. Unsupervised Learning

Unsupervised learning reveals inherent structure in unlabeled datasets, enabling lithological zoning, subsurface pattern recognition, facies classification, anomaly detection, hydrogeophysical domain mapping, and structural boundary delineation without ground truth [37,90]. Unlike supervised learning, which relies on error-based metrics, unsupervised model quality is assessed using cluster validity and topology-preservation indices. Clustering is the most widely applied unsupervised approach in AI–EGGE: k-means partitions data into $k$ clusters by minimizing within-cluster variance, while fuzzy c-means assigns membership degrees to better capture transitional boundaries in weathered, fractured, or hydrogeologically gradational zones [91]. As shown in Figure 4, k-means clustering (a) partitions a dataset into $k$ clusters by assigning each input vector $x_{j}$ to the cluster with the nearest centroid $c_{i}$ based on a distance metric (commonly Euclidean), then updates centroids iteratively until convergence to reveal latent subsurface patterns [86].
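The assign-and-update loop of k-means can be sketched as follows. This is a minimal illustration assuming Euclidean distance, random initialization from the data points, and termination when centroids stop changing. The function name `kmeans`, the seed, and the two-attribute sample points are hypothetical.

```python
import math
import random


def kmeans(points, k, n_iter=100, seed=0):
    """Cluster points (tuples of floats) into k groups; return (centroids, labels)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    labels = []
    for _ in range(n_iter):
        # Assignment step: each point joins the cluster with the nearest centroid.
        labels = [min(range(k), key=lambda i: math.dist(p, centroids[i]))
                  for p in points]
        # Update step: each centroid moves to the mean of its assigned points.
        new_centroids = []
        for i in range(k):
            members = [p for p, lab in zip(points, labels) if lab == i]
            if members:
                new_centroids.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members)))
            else:
                new_centroids.append(centroids[i])  # keep an empty cluster in place
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, labels


if __name__ == "__main__":
    # Hypothetical normalized attribute pairs forming two well-separated groups.
    data = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
    centroids, labels = kmeans(data, k=2)
    print(centroids, labels)
```

Because the result depends on initialization, applied studies typically repeat the procedure from several starting points and scale the input attributes beforehand.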

On the other hand, self-organizing maps (SOMs) provide a topology-preserving projection of high-dimensional geophysical attributes onto a 2D neuron lattice, with 3D extensions for enhanced topology retention, supporting visualization of structural trends, material contrasts, and alteration zones [90]. In SOM (b), each neuron ($i$) has an associated weight vector $\varepsilon_{i}$. For each input vector $x_{j}$, neurons compete based on a similarity metric (typically Euclidean distance), and the closest neuron is identified as the Best Matching Unit (BMU) [92]. The BMU and its neighboring neurons are iteratively updated using a neighborhood function and learning rate to preserve topological structure. A Unified Distance Matrix (U-matrix, U) is commonly used to visualize the trained SOM, where low values indicate cluster interiors and high values highlight cluster boundaries. This enables SOM to effectively cluster geophysical–geotechnical signatures for lithological classification, anomaly detection, and environmental pattern recognition.
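A single SOM training step, comprising BMU selection followed by a neighborhood-weighted update, can be sketched as below. The sketch assumes Euclidean distance for the competition and a Gaussian neighborhood function with fixed learning rate and width; the source does not prescribe these choices. The names `bmu_index` and `som_update` and the three-neuron lattice are hypothetical.

```python
import math


def bmu_index(weights, x):
    """Return the index of the neuron whose weight vector is closest to input x."""
    return min(range(len(weights)), key=lambda i: math.dist(weights[i], x))


def som_update(weights, grid, x, lr=0.5, sigma=1.0):
    """Move the BMU and its lattice neighbors toward x; return (bmu, new_weights).

    weights: list of weight vectors, one per neuron
    grid:    list of lattice coordinates, one per neuron
    """
    b = bmu_index(weights, x)
    updated = []
    for w, pos in zip(weights, grid):
        # Gaussian neighborhood: influence decays with lattice distance to the BMU.
        d2 = sum((a - c) ** 2 for a, c in zip(pos, grid[b]))
        h = math.exp(-d2 / (2.0 * sigma ** 2))
        updated.append([wi + lr * h * (xi - wi) for wi, xi in zip(w, x)])
    return b, updated


if __name__ == "__main__":
    # Three neurons on a 1D lattice with two-attribute weight vectors.
    grid = [(0.0,), (1.0,), (2.0,)]
    weights = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]
    bmu, weights = som_update(weights, grid, x=[2.2, 1.9])
    print(bmu, weights)
```

In a full training run, the learning rate and neighborhood width are usually decreased over iterations so that the map first organizes globally and then refines locally.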

Advanced variants such as growing self-organizing networks (GSONs) and adaptive resonance theory networks refine cluster boundaries to represent heterogeneous and evolving subsurface terrains [93]. Common clustering and dimensionality-reduction tools include hierarchical clustering for multiscale geological grouping; density-based spatial clustering of applications with noise (DBSCAN) and ordering points to identify the clustering structure (OPTICS) for density-based anomaly detection in irregular data; Gaussian mixture models for probabilistic litho-facies characterization; k-medoids for noise-resilient clustering; principal component analysis (PCA) for feature decorrelation; independent component analysis (ICA) for source isolation; factor analysis for latent structure extraction; and nonlinear manifold learning tools such as t-Distributed Stochastic Neighbor Embedding (t-SNE) and uniform manifold approximation and projection (UMAP) for high-dimensional pattern visualization and structural trend discovery [94,95]. Cluster quality is typically evaluated using internal indices such as Silhouette, Calinski–Harabasz, Davies–Bouldin, Dunn, and S_Dbw, while fuzzy/probabilistic methods employ the Partition Coefficient, Partition Entropy, Xie–Beni, Fuzzy Silhouette, and Kwon indices [96,97]. When partial labels exist, external indices, including Adjusted Rand Index, Normalized Mutual Information, Purity, and Jaccard Similarity, enable benchmarking against known geological or geotechnical classes [98].

AI–EGGE now integrates semi-supervised, self-supervised, hybrid, and physics-informed learning to overcome limited labeled data and improve model generalization. Semi-supervised learning leverages small labeled and large unlabeled datasets through self-training, co-training, and graph-based approaches [87,99]. Self-supervised learning extracts feature representations from unlabeled geophysical data via masked reconstruction and contrastive or patch-prediction tasks [100]. Hybrid workflows combine unsupervised clustering or dimensionality reduction with supervised training to enhance accuracy while preserving geological heterogeneity [25]. Physics-informed learning, including physics-informed machine learning (PIML), physics-informed simple-to-ensemble regressors (PISERs), and physics-informed neural networks (PINNs), embeds governing laws, constitutive relationships, and boundary constraints into model architectures or loss functions to suppress non-physical solutions [100,101], and can be coupled with geostatistics, inversion, and simulation for physical consistency. Model optimization further strengthens AI–EGGE through hyperparameter tuning using Bayesian optimization, evolutionary, and swarm algorithms such as GAs, PSO, differential evolution, Gray Wolf Optimizer (GWO), and hybrid Bayesian optimization–deep learning (BO-DL) strategies for architecture search and loss balancing in physics-informed models [31,100,102]. Together, these approaches deliver more data-efficient, physically consistent, and scalable AI solutions for subsurface characterization, hazard assessment, and infrastructure resilience.
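The idea of embedding governing laws in the loss function, and the associated loss-balancing problem, can be made concrete with a generic composite objective. The form below is a common textbook formulation given as an illustrative assumption, not the specific loss of any study cited here. The symbols are introduced only for this sketch: $u_\theta$ is the network prediction with parameters $\theta$, $(x_i, u_i)$ are $N_d$ labeled observations, $x_j$ are $N_r$ collocation points, $\mathcal{R}$ is the residual operator of the governing equation, and $\lambda$ is the weight balancing data fit against physical consistency.

```latex
\mathcal{L}(\theta)
  = \underbrace{\frac{1}{N_d}\sum_{i=1}^{N_d}\bigl(u_\theta(x_i) - u_i\bigr)^2}_{\text{data misfit}}
  + \lambda\,
    \underbrace{\frac{1}{N_r}\sum_{j=1}^{N_r}\bigl(\mathcal{R}[u_\theta](x_j)\bigr)^2}_{\text{physics residual}}
```

A large $\lambda$ pushes the solution toward physical consistency even where labels are sparse, while a small $\lambda$ lets the data dominate; tuning this weight is one instance of the loss balancing mentioned above.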

3.2. Deep Learning Approach

Deep learning (DL), a branch of ML built on multi-layer ANNs, has become a dominant framework for high-dimensional analytics by autonomously learning hierarchical, task-specific features from raw or minimally processed data [103,104]. Its adoption within EGGE continues to expand, improving high-fidelity analysis and prediction across geological and structural feature detection [105], landslide forecasting [106], seismic interpretation [107], groundwater modeling [108], infrastructure monitoring [109], soil and rock classification [13,72], geospatial analytics [110], mineral resource evaluation [111], and environmental impact assessment [112].

Building on the ANN backbone, modern DL increases network depth and architectural complexity to enhance representational capacity, abstraction, and generalization. ANNs provide the foundational neural-network paradigm from which contemporary DL architectures are derived, with advanced models, such as DNNs, CNNs, RNNs, LSTMs, AEs, and deep belief networks (DBNs), representing specialized extensions tailored to spatial, temporal, dimensionality-reduction, and hierarchical feature-learning tasks [105,113,114]. Within EGGE, CNNs and AEs support subsurface imaging, signal refinement, inversion enhancement, and lithological/facies classification [115,116]; RNN/LSTMs model time-dependent deformation, settlement, and slope-movement behavior [117,118]; and DNNs model constitutive behavior, stiffness, and strength for engineering design and risk assessment [119]. Generative and uncertainty-aware architectures, including variational autoencoders (VAEs) and GANs for synthetic data augmentation, scenario simulation, and uncertainty representation, and Bayesian neural networks (BNNs) for probabilistic prediction with quantified epistemic and aleatoric uncertainty, are increasingly enabling more robust and risk-informed decision-making [41,113,117]. DL performance is evaluated using standard ML metrics, alongside training–validation loss dynamics, convergence stability, efficiency, and uncertainty diagnostics for probabilistic and generative models.

3.2.1. ANN Architecture

ANNs are widely used in EGGE for classification and prediction, offering a robust alternative to traditional mathematical formulations for modeling nonlinear soil/rock structure interactions [120,121]. The ANN architecture (an input layer, one or more hidden layers, and an output layer linked by weighted neurons) forms the foundation of modern DL architectures [78]. As shown in Figure 5, ANNs learn nonlinear input–output relationships by adjusting synaptic weights during training, enabling the mapping of geoenvironmental and geotechnical variables (e.g., loading, soil parameters, environmental influences) to mechanical responses. Model complexity, driven by the number of hidden layers and neurons, strongly affects accuracy [122]. A key extension is the Fuzzy Inference System (FIS), which uses fuzzy logic and rule-based reasoning to handle uncertainty in EGGE data [42,123]. Its hybrid evolution, the Adaptive Neuro-Fuzzy Inference System (ANFIS), integrates ANN learning with fuzzy logic for nonlinear modeling with improved interpretability [124]. Another classical variant, the Probabilistic Neural Network (PNN), applies Bayesian decision theory with radial basis activation functions for reliable pattern recognition under uncertainty [125]. ANN training commonly relies on backpropagation, as in the Backpropagation Neural Network (BPNN), which updates weights via gradient-based optimization to minimize loss, with optimizers such as Adam and RMSProp improving convergence and generalization [123]. Metaheuristic-based training further reduces error in complex applications [56].

Recent ANN-derived architectures, namely GNNs for spatial-relational learning, TFs for long-range dependency modeling, and PINNs that embed physical laws, are accelerating a shift toward physics-aware DL [18,39]. This promotes domain-informed, interpretable, and more generalizable models that integrate geoscientific and geotechnical priors. Project DeepGeo [62] exemplifies this by embedding geological knowledge into ANN/DNN workflows through structured training-image databases for subsurface characterization in data-scarce settings; 54 expert-interpreted cross-sections enabled Bayesian ensemble learning and stratigraphic UQ, demonstrating the value of knowledge-informed DL. Karpatne et al. [126] likewise showed that ANN-based models outperform empirical and statistical approaches by learning latent features governing stress–strain behavior, stiffness, and strength under variable loading and environmental conditions. Despite progress, limitations such as data scarcity, spatial bias, overfitting, weak cross-site generalization, and black-box opacity still hinder large-scale EGGE deployment. Emerging solutions integrate XAI (e.g., SHAP, LIME, Gradient-weighted Class Activation Mapping [Grad-CAM]), uncertainty-aware learning, hybrid physics-informed models, and attention-based architectures to improve reliability, transparency, and engineering adoption. Table 1 further summarizes selected reviews and research on ANN applications.

Figure 5. Typical ANN framework for DL–based prediction of geomechanical properties (after [127]).

Table 1. Summary of selected reviews and research on ANN with its variants in EGGE.

Authors	Methods used	Results	Relevance	Limitations
[128]	Time Delay Neural Networks (TDNNs) applied to cyclic swelling data from the powerhouse cavern (Iran)	TDNN successfully modeled cyclic swelling pressure with good accuracy, capturing time-dependent behavior	Demonstrates ANN capability in modeling cyclic swelling/shrinkage of weak rocks (mudrock), critical for underground structures	Dependent on site-specific data; generalizability to other rock formations is uncertain
[129]	ANN (MLP, RBF) compared with multivariate linear and non-linear regression (MLR, MNR) to predict shear strength parameters in part of Iran	MLP-ANN outperformed RBF-ANN; MLR outperformed nonlinear regression; ANN captured complex nonlinear soil behavior	Demonstrated ANN’s accuracy in predicting soil cohesion and friction angle using soil index properties	Limited dataset (200 samples); model performance dependent on input combinations
[123]	Review of ANN modeling and application issues	Outlined ANN architectures (BPNN, RNN, PNN, and SOM), input selection, training/testing, and data preprocessing; highlighted practical examples in liquefaction, pile capacity, soil classification, and slope stability	Provides a methodological framework for applying ANNs in geotechnical engineering	Issues: network geometry selection, data division, overfitting, computational demands
[24]	Review of AI optimization techniques (ANN, Fuzzy Logic, GEP, ANFIS, GA, etc.) in geotechnical applications	AI methods shown to improve prediction of soil behavior, pile capacity, swelling potential, foundation settlement, liquefaction, and more	Comprehensive overview showing how AI enhances geotechnical modeling, sustainability, and precision	Review article—no new experimental validation; heavy reliance on secondary data
[130]	Bootstrapping DL-ANN with airborne EM and borehole data for saltwater investigation in the Mississippi River Valley (USA)	Developed resistivity-to-lithology and resistivity-to-concentration models, while DL-ANN estimated the total dissolved solute	Results indicate salinity upconing due to excessive pumping	Reliance on used water quality data for training and validating the DL-ANN model. Inherent uncertainties with the transformation of resistivity values to lithologies and chloride concentrations
[131]	Review of ANNs in soil science	ANNs prove to be effective in predicting soil properties (pH, organic carbon, clay content, permeability, compaction, shear strength); useful for soil classification, fertility assessment, erosion prediction, and moisture estimation	Highlights ANN potential for soil modeling, land-use planning, and precision agriculture; relevant to geotechnical soil behavior predictions	Review only; lacks detailed experimental validation; applications are mostly limited to the soil science context

3.2.2. Convolutional Neural Networks (CNNs)

CNNs are a leading DL architecture inspired by the visual cortex and widely applied in EGGE for spatial feature learning, enabling automatic multiscale pattern extraction from raw geospatial, geophysical, and imagery data [117]. As shown in Figure 6a–b, a typical AI–EGGE CNN workflow includes data preprocessing, augmentation, and transfer learning using pretrained models such as AlexNet, GoogLeNet, Inception, ResNet, U-Net, and DenseNet to enhance feature extraction and reduce training time and data needs. A standard CNN comprises convolutional and pooling layers with activation functions (e.g., ReLU), followed by either a flatten layer or global average pooling (GAP) before dense layers and a Softmax classifier, with Flatten adopted in Figure 6b. Flatten increases trainable parameters, whereas GAP reduces complexity and mitigates overfitting—beneficial in EGGE where labeled data are limited and heterogeneous [10,13].
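The effect of the Flatten versus GAP choice on trainable parameters can be illustrated with simple arithmetic. The sketch below is not taken from the reviewed studies; the feature-map size (7 × 7 × 512) and dense-layer width (256) are hypothetical values chosen only for illustration.

```python
def dense_params(n_inputs: int, n_units: int) -> int:
    """Trainable parameters (weights plus biases) of one fully connected layer."""
    return n_inputs * n_units + n_units


# Hypothetical final convolutional feature map: 7 x 7 grid with 512 channels.
height, width, channels = 7, 7, 512
n_units = 256  # hypothetical width of the first dense layer

flatten_inputs = height * width * channels  # Flatten keeps every spatial position
gap_inputs = channels                       # GAP averages each channel to one value

flatten_params = dense_params(flatten_inputs, n_units)
gap_params = dense_params(gap_inputs, n_units)

print(f"Flatten head: {flatten_params:,} parameters")
print(f"GAP head:     {gap_params:,} parameters")
print(f"Ratio:        {flatten_params / gap_params:.1f}x")
```

Under these assumed dimensions, the first dense layer after Flatten carries about 49 times more parameters than the same layer after GAP, which is the mechanism behind the reduced overfitting risk noted above.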

CNNs outperform traditional ML methods based on handcrafted features by enabling end-to-end, noise-resilient representation learning [104,116]. Performance has been further enhanced through variants such as Visual Geometry Group (VGG), ResNet, and DenseNet for deep residual feature extraction [99,132]; Inception/GoogLeNet and EfficientNet for multiscale and computation-efficient learning [133]; MobileNet for lightweight deployment [106]; 3D-CNNs for volumetric subsurface data [134]; and U-Net, fully convolutional network (FCN), and SegNet for pixel-level segmentation [19,110]. Model evaluation typically uses accuracy, precision, recall, F1-score, and confusion matrices [13], while segmentation and generative CNNs additionally employ Intersection over Union (IoU), the Dice coefficient, peak signal-to-noise ratio (PSNR), and the Structural Similarity Index Measure (SSIM) [40,135]. High annotation needs, compute cost, and low interpretability are driving progress in transfer learning, few-shot learning, lightweight CNNs, and XAI/physics-informed CNNs [134,136].
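As a minimal sketch of the two segmentation metrics named above, IoU and the Dice coefficient can be computed from binary masks as follows. The function name `iou_and_dice` and the toy 3 × 3 masks are illustrative assumptions, not data from the cited studies.

```python
import numpy as np


def iou_and_dice(pred: np.ndarray, target: np.ndarray) -> tuple[float, float]:
    """Return (IoU, Dice) for two binary masks of equal shape."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    union = np.logical_or(pred, target).sum()
    total = pred.sum() + target.sum()
    # Two empty masks agree perfectly, so both scores default to 1.0.
    iou = intersection / union if union > 0 else 1.0
    dice = 2 * intersection / total if total > 0 else 1.0
    return float(iou), float(dice)


# Toy masks (hypothetical): 1 marks pixels assigned to a lithological unit.
pred_mask = np.array([[1, 1, 0],
                      [0, 1, 0],
                      [0, 0, 0]])
true_mask = np.array([[1, 0, 0],
                      [0, 1, 1],
                      [0, 0, 0]])

iou, dice = iou_and_dice(pred_mask, true_mask)
print(f"IoU = {iou:.3f}, Dice = {dice:.3f}")  # IoU = 0.500, Dice = 0.667
```

The two scores are monotonically related (Dice = 2·IoU / (1 + IoU)), so they rank models identically on a single mask, although Dice penalizes small overlaps less severely.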

CNNs show strong versatility across EGGE for material characterization, geospatial mapping, hazard assessment, and infrastructure monitoring. Reported accuracies typically reach 95–99% in soil and rock classification, texture analysis, and mineral discrimination using core, thin-section, UAV, and hyperspectral imagery [137,138]. Hemdan and Al-Atroush [127] demonstrated that CNN-based feature extraction combined with SVM improves soil-image recognition accuracy while reducing training cost. U-Net, SegNet, and FCN enable high-resolution segmentation of lithological boundaries and soil horizons, supporting digital soil mapping, fertility and pH assessment, and large-area soil property prediction [139,140]. In remote sensing, CNNs applied to ASTER, Landsat, Sentinel, and UAV data enhance lithological mapping, alteration detection, and landslide susceptibility analysis for early warning and resilience planning [111,141]. CNNs also deliver automated, high-accuracy crack and defect detection in tunnels, retaining walls, pavements, and embankments, outperforming manual inspection and traditional computer vision [142,143]. Hybrid CNN–LSTM/RNN models fuse spatial and temporal learning for slope-movement forecasting, deformation monitoring, and tunnel boring machine (TBM) ground-type recognition, with ResNet-18 and GoogLeNet reporting >96% accuracy for real-time TBM operations [54,133]. Compared with handcrafted image-processing techniques, CNNs provide superior multilevel feature learning and robust end-to-end classification; for instance, ResNet50 and VGG16 exceeded 98% accuracy in soil-aggregate classification [137], and 3D-CNNs delivered very high accuracy for hyperspectral soil imaging [144].

Despite the rapid progress, key gaps persist: cross-site generalization, label imbalance, and domain-shift sensitivity limit deployment, particularly in data-scarce or region-specific studies [13,145]. Emerging solutions include multimodal CNNs integrating geophysical, remote-sensing, and geotechnical imagery; CNN–TF hybrids for long-range spatial dependency learning; and physics-informed CNNs that embed geoscience constraints to suppress non-physical outputs [41,79]. Research is also advancing toward self-supervised and few-shot learning, synthetic data augmentation, and explainable CNNs to reduce labeling burden, improve robustness, and accelerate engineering adoption.

3.2.3. Recurrent Neural Networks (RNNs)

RNNs are DL architectures designed for sequential data, maintaining a hidden state that evolves over time to learn temporal dependencies between observations, an advantage over feedforward and CNN models, which process inputs independently or within finite receptive fields [117,144]. As shown in Figure 7 and expressed in Equations (1) and (2), each input $x_{t}$ is combined with the previous hidden state $h_{t-1}$ to produce an updated state $h_{t}$ [146], enabling RNNs to retain temporal memory essential for modeling evolving subsurface and geo-infrastructure behavior. This capability has driven their adoption in EGGE applications involving time-varying responses, including time-lapse geophysics, seismic site-response monitoring, pore-pressure and settlement prediction, slope-stability early warning from sensor streams, and tunnel and embankment health monitoring [147]. Practical deployment requires sequence normalization, handling irregular sampling, and managing concept drift in long-term monitoring [148,149]. To overcome vanishing and exploding gradients, gated variants such as LSTMs and Gated Recurrent Units (GRUs) enable long-range dependency learning and have become the dominant RNN models for sequential geotechnical data [73,150]. Despite their strengths, RNNs struggle with long-sequence cost, noise sensitivity, data drift, limited interpretability, and weak cross-site generalization. These limitations are driving advances such as hybrid CNN–RNNs, multimodal and self-supervised sequence models, and physics-informed RNNs for more robust engineering use. Their performance is typically assessed using time-series metrics (MSE, RMSE, MAE, MAPE) and correlation-based indices.

$$h_{t} = f(W_{xh} x_{t} + W_{hh} h_{t-1} + b) \tag{1}$$

$$o_{t} = g(W_{ho} h_{t} + c) \tag{2}$$
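The recurrence in Equations (1) and (2) can be sketched in a few lines of NumPy. This is a minimal illustration rather than an implementation from the cited works: it assumes $f = \tanh$ and an identity output function $g$, and the dimensions and random weights are hypothetical.

```python
import numpy as np


def rnn_forward(x_seq, W_xh, W_hh, W_ho, b, c, h0=None):
    """Unroll a simple RNN over a sequence, following Equations (1) and (2).

    Assumptions: f = tanh and g = identity (a linear read-out).
    """
    h = np.zeros(W_hh.shape[0]) if h0 is None else h0
    hidden_states, outputs = [], []
    for x_t in x_seq:
        h = np.tanh(W_xh @ x_t + W_hh @ h + b)  # Eq. (1): update hidden state
        o = W_ho @ h + c                        # Eq. (2): read out the output
        hidden_states.append(h)
        outputs.append(o)
    return np.stack(hidden_states), np.stack(outputs)


# Hypothetical sizes: 3 sensor channels, 4 hidden units, 1 output, 5 time steps.
rng = np.random.default_rng(0)
n_in, n_hidden, n_out, T = 3, 4, 1, 5
W_xh = rng.normal(scale=0.5, size=(n_hidden, n_in))
W_hh = rng.normal(scale=0.5, size=(n_hidden, n_hidden))
W_ho = rng.normal(scale=0.5, size=(n_out, n_hidden))
b = np.zeros(n_hidden)
c = np.zeros(n_out)
x_seq = rng.normal(size=(T, n_in))

H, O = rnn_forward(x_seq, W_xh, W_hh, W_ho, b, c)
print(H.shape, O.shape)  # (5, 4) (5, 1)
```

Because the same weight matrices are reused at every time step, the hidden state at step $t$ depends on all earlier inputs, which is the temporal memory described above.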

3.2.4. Deep RNNs (DRNNs)

DRNNs extend conventional RNNs by stacking multiple recurrent layers to strengthen temporal feature learning and improve predictive accuracy [24,89,114]. In a typical DRNN architecture, inputs pass through successive RNN/LSTM/GRU layers, with hidden states propagated across time, enabling hierarchical learning of temporal patterns: lower layers capture short-term dynamics, while upper layers extract long-range and more abstract dependencies [149]. This depth advantage makes DRNNs well-suited for environmental and geotechnical problems involving nonlinear, multivariate time-series behavior. Applications include modeling rainfall–pore-pressure–suction interactions for slope-stability assessment, forecasting ground settlement under cyclic or construction loading, and fusing multi-sensor data for condition monitoring of tunnels, dams, and embankments [149,151]. By capturing both immediate and long-term dependencies in evolving geotechnical signals, DRNNs provide a more reliable basis for time-series prediction, early-warning analytics, and risk-informed geotechnical management.

3.2.5. RNN–Autoencoder (RNN–AE) and Bidirectional RNN (BiRNN)

In a typical RNN–Autoencoder (RNN–AE), the encoder compresses multivariate time-series into a latent vector that captures short- and long-term temporal dependencies, and the decoder reconstructs or forecasts sequences from this latent state [24,152]. RNN–AEs integrate the sequential learning ability of LSTM/GRU-based recurrent networks with the feature-learning and dimensionality-reduction strengths of autoencoders, enabling unsupervised representation learning from time-series data [149,153]. For example, the architecture described by Yu et al. [154] and Santoso et al. [153] stacks LSTM/GRU layers for encoding, applies a “RepeatVector” to expand the latent state to the forecast horizon, and employs a mirrored LSTM/GRU decoder with a “TimeDistributed” dense head for output generation. Model capacity is governed by hyperparameters N₁, N₂, and N₃ (units per layer), while dropout and layer normalization enhance training stability and generalization. Training is conducted using a composite loss that combines reconstruction mean squared error (MSE) with multi-step forecasting losses (MSE/MAE), enabling denoising, gap-filling, anomaly detection through reconstruction error, and multi-step nowcasting.

The other variant, the Bidirectional RNN (BiRNN), processes sequences in both temporal directions to enhance prediction accuracy [149]. It employs forward and backward hidden states that capture past and future inputs concurrently, with the outputs merged in a shared layer [114]. This dual perspective alleviates vanishing-gradient limitations and improves context awareness, making BiRNNs particularly effective for sequential learning tasks [133]. These capabilities make RNN–AEs and BiRNNs well-suited to EGGE applications involving noisy, irregular, and multivariate monitoring data, including time-lapse ERT/IP, pore-pressure and settlement logs, rainfall–infiltration–pore-pressure responses, geophysical monitoring, infrastructure-health sensing, and early-warning prediction in geophysical–geotechnical systems.
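The bidirectional idea can be illustrated with a minimal NumPy sketch. It assumes simple tanh recurrent cells and merges the two directions by concatenation; the merge operation, the function names, and all sizes are illustrative assumptions rather than the configuration used in the cited studies.

```python
import numpy as np


def run_rnn(x_seq, W_x, W_h, b):
    """Run a simple tanh RNN over x_seq and return all hidden states."""
    h = np.zeros(W_h.shape[0])
    states = []
    for x_t in x_seq:
        h = np.tanh(W_x @ x_t + W_h @ h + b)
        states.append(h)
    return np.stack(states)


def birnn_states(x_seq, fwd_params, bwd_params):
    """Concatenate forward and backward hidden states at every time step."""
    h_fwd = run_rnn(x_seq, *fwd_params)
    # Process the reversed sequence, then flip back to the original time order.
    h_bwd = run_rnn(x_seq[::-1], *bwd_params)[::-1]
    return np.concatenate([h_fwd, h_bwd], axis=1)


# Hypothetical sizes: 2 input channels, 3 hidden units per direction, 6 steps.
rng = np.random.default_rng(0)
n_in, n_hidden, T = 2, 3, 6


def make_params():
    return (rng.normal(scale=0.3, size=(n_hidden, n_in)),
            rng.normal(scale=0.3, size=(n_hidden, n_hidden)),
            np.zeros(n_hidden))


fwd_params, bwd_params = make_params(), make_params()
x_seq = rng.normal(size=(T, n_in))

H = birnn_states(x_seq, fwd_params, bwd_params)
print(H.shape)  # (6, 6): each step holds 3 forward + 3 backward units
```

Because the backward pass needs the whole sequence, this layout suits offline analysis of complete monitoring records rather than strictly causal, real-time prediction.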

3.2.6. Gated Recurrent Unit (GRU)

GRUs are an advanced RNN variant for efficient time-series modeling, using gating mechanisms to retain relevant historical information while mitigating vanishing-gradient issues [114]. A GRU cell uses two gates (update and reset) to regulate information flow: the update gate controls how much past state is retained, while the reset gate determines how new input is combined with previous memory. Compared with LSTMs, which use three gates, GRUs have a simpler architecture with fewer parameters, enabling faster training and lower computational cost with minimal accuracy loss [155]. This parsimony makes GRUs suitable for real-time or resource-constrained EGGE workflows involving sensor-based monitoring, dynamic responses, and short- to medium-term forecasting. However, reduced gating complexity can limit long-term dependency modeling, where LSTMs often perform better. Overall, GRUs offer an effective balance of accuracy and efficiency, with the GRU–LSTM choice depending on sequence length, data characteristics, and computational constraints.
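A single GRU update and the parameter saving relative to an LSTM can be sketched as follows. The gate equations follow one widely used GRU formulation (conventions for the update gate differ between references), and the weight names, sizes, and random values are hypothetical.

```python
import numpy as np


def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))


def gru_step(x_t, h_prev, p):
    """One GRU update; p is a dict of weights with hypothetical names."""
    z = sigmoid(p["W_z"] @ x_t + p["U_z"] @ h_prev + p["b_z"])  # update gate
    r = sigmoid(p["W_r"] @ x_t + p["U_r"] @ h_prev + p["b_r"])  # reset gate
    # Candidate state: the reset gate scales how much past memory is mixed in.
    h_cand = np.tanh(p["W_h"] @ x_t + p["U_h"] @ (r * h_prev) + p["b_h"])
    # Convention assumed here: z near 0 keeps the previous state.
    return (1.0 - z) * h_prev + z * h_cand


def recurrent_param_count(n_in, n_hidden, n_blocks):
    """Parameters in n_blocks weight blocks (input weights, recurrent weights, bias)."""
    return n_blocks * (n_hidden * (n_in + n_hidden) + n_hidden)


# Hypothetical sizes: 6 sensor channels, 32 hidden units.
n_in, n_hidden = 6, 32
gru_params = recurrent_param_count(n_in, n_hidden, 3)   # z, r, candidate
lstm_params = recurrent_param_count(n_in, n_hidden, 4)  # i, f, o, candidate
print(gru_params, lstm_params)  # 3744 4992

rng = np.random.default_rng(0)
params = {}
for gate in ("z", "r", "h"):
    params[f"W_{gate}"] = rng.normal(scale=0.1, size=(n_hidden, n_in))
    params[f"U_{gate}"] = rng.normal(scale=0.1, size=(n_hidden, n_hidden))
    params[f"b_{gate}"] = np.zeros(n_hidden)

h_next = gru_step(rng.normal(size=n_in), np.zeros(n_hidden), params)
print(h_next.shape)  # (32,)
```

With equal hidden width, the GRU in this sketch uses three quarters of the LSTM's recurrent parameters, consistent with the efficiency argument above.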

3.2.7. Long Short-Term Memory (LSTM)

LSTM networks are an advanced RNN architecture designed to capture long-range temporal dependencies and overcome vanishing-gradient limitations in standard RNNs [112,118,156]. They address the problem of the backpropagated error either blowing up or decaying exponentially over long time lags in conventional RNNs. An LSTM cell maintains an internal cell state $c_{t}$ regulated by three gating mechanisms, the input ($i_{t}$), forget ($f_{t}$), and output ($o_{t}$) gates, which control how information is written, retained, and retrieved over time [117] (Figure 8). This gated structure stabilizes the cell state and enables selective preservation of relevant patterns, allowing LSTMs to model long-term sequential behavior more effectively than traditional RNNs and earlier models such as Hidden Markov Models [157,158]. LSTMs are particularly valuable for dynamic processes (e.g., rainfall–infiltration–suction cycles, pore-pressure and settlement evolution, and structural health trends), where retaining long temporal context improves forecasting accuracy [114]. However, under irregular or non-uniform time steps, simpler RNN variants may occasionally outperform LSTMs [133]. As shown in Figure 8, an LSTM cell receives the current input $x_{t}$ and previous hidden state $h_{t-1}$, which pass through the forget, input, and output sigmoid gates, as well as a candidate state generated via a $\tanh$ activation. The cell state is updated as (Equation (3)):

$$c_{t} = f_{t} \odot c_{t-1} + i_{t} \odot \tilde{c}_{t}, \tag{3}$$

while the hidden output is computed as (Equation (4)):

$$h_{t} = o_{t} \odot \tanh(c_{t}) \tag{4}$$
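Equations (3) and (4) can be placed in context with a minimal NumPy sketch of one LSTM step. The gate pre-activations use the standard formulation (a single stacked weight matrix acting on the concatenated input and previous hidden state), which is an assumption here because the gate equations are not written out in the text; the sizes and weights are hypothetical.

```python
import numpy as np


def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))


def lstm_step(x_t, h_prev, c_prev, W, b):
    """One LSTM update.

    W has shape (4 * n_hidden, n_in + n_hidden) and b has shape (4 * n_hidden,),
    stacked in the (assumed) order: input, forget, output, candidate.
    """
    n = h_prev.shape[0]
    pre = W @ np.concatenate([x_t, h_prev]) + b
    i_t = sigmoid(pre[0:n])            # input gate
    f_t = sigmoid(pre[n:2 * n])        # forget gate
    o_t = sigmoid(pre[2 * n:3 * n])    # output gate
    c_tilde = np.tanh(pre[3 * n:])     # candidate state
    c_t = f_t * c_prev + i_t * c_tilde  # Eq. (3)
    h_t = o_t * np.tanh(c_t)            # Eq. (4)
    return h_t, c_t


# Hypothetical sizes: 3 input channels, 4 hidden units, 10 time steps.
rng = np.random.default_rng(0)
n_in, n_hidden, T = 3, 4, 10
W = rng.normal(scale=0.3, size=(4 * n_hidden, n_in + n_hidden))
b = np.zeros(4 * n_hidden)

h, c = np.zeros(n_hidden), np.zeros(n_hidden)
for x_t in rng.normal(size=(T, n_in)):
    h, c = lstm_step(x_t, h, c, W, b)
print(h.shape, c.shape)  # (4,) (4,)
```

The additive form of Equation (3) is what lets information (and gradients) pass through many time steps when the forget gate stays close to one.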

The gated memory pathway preserves gradients across long sequences, enabling stable training and reliable multi-step forecasting in noisy, nonlinear EGGE time-series data. Building on this capability, Bidirectional LSTM (BiLSTM) and Bidirectional GRU (BiGRU) extend the architecture by processing sequences in both forward and backward directions, allowing the model to learn from past and future context simultaneously [89]. At each time step, BiRNNs generate forward and backward hidden states that are fused to produce a context-enriched representation, particularly advantageous when complete sequences are available prior to inference, and when delayed outcomes depend on earlier and later events within the time window [133,154]. In EGGE, BiLSTM/BiGRU models have demonstrated superior performance compared to unidirectional variants by capturing fuller temporal dependencies in geotechnical signals, improving forecasting accuracy for slope-failure precursors, settlement evolution, seismic site response, TBM-induced vibrations, and structural-health trends in buried and surface infrastructure [133,149,159].

3.2.8. Generative Adversarial Networks (GANs)

GANs, introduced by Goodfellow et al. [160], are a key DL architecture for generative modeling. A GAN trains a generator $G$ to produce synthetic samples and a discriminator $D$ to distinguish them from real data through an adversarial minimax process [117] (Figure 9a). Originally developed for unsupervised learning, GANs now support semi-supervised, supervised, and reinforcement learning, enabling data synthesis, super-resolution, domain translation, and anomaly detection [161,162]. Several variants enhance stability and performance: Deep Convolutional GAN (DCGAN) improves image quality using convolutional blocks; Wasserstein GAN (WGAN) stabilizes training and reduces mode collapse; Auxiliary Classifier GAN (ACGAN) enables label-conditioned generation; Variational Autoencoder GAN (VAE-GAN) fuses latent regularization with adversarial learning for sharper samples; Conditional GAN (cGAN) incorporates auxiliary data (e.g., lithology); CycleGAN performs unpaired image-to-image translation; and PatchGAN evaluates local patches to enhance fine-scale texture and spatial detail [30,161].
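For reference, the adversarial minimax process can be written as the value function of the original GAN formulation [160]. The symbols $p_{data}$ (distribution of real samples) and $p_{z}$ (prior over the generator's latent input $z$) are introduced here for illustration; the variants listed above modify this objective or the network architectures.

```latex
\min_{G} \max_{D} V(D, G) =
  \mathbb{E}_{x \sim p_{data}(x)}\left[\log D(x)\right]
  + \mathbb{E}_{z \sim p_{z}(z)}\left[\log\left(1 - D(G(z))\right)\right]
```

The discriminator $D$ is trained to maximize $V$ by assigning high probability to real samples and low probability to generated ones, while the generator $G$ is trained to minimize it by producing samples that $D$ cannot separate from real data.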

In EGGE, GANs help overcome data scarcity, improve model generalization, and enhance subsurface characterization. Key applications (Table 2) include synthetic augmentation of soil/rock datasets, reconstruction of incomplete borehole, seismic, and monitoring time series, pore-scale microstructure generation, and super-resolution of seismic and GPR imagery. Conditional and cycle-based GANs also enable cross-domain mapping (e.g., geophysical-to-lithology translation) [163]. Challenges persist—training instability, mode collapse, computational cost, and validation—but GANs remain promising for data enrichment, geo-imaging enhancement, and inversion and simulation support [30,164]. To address some of these limitations, Yan et al. [165] integrate CycleGAN-based cross-modal enhancement with hierarchical feature fusion to improve multi-sensor image integration (e.g., Vis–SAR) (Figure 9b), marking a shift toward reducing modality gaps and improving fusion consistency for high-fidelity geo-imaging.

Figure 9. (a) Schematic diagram of the GAN (adapted from [117]). (b) Typical advanced GAN-based cross-modal fusion framework integrating CycleGAN-driven cross-modal enhancement with hierarchical deep feature fusion (adapted from [165]). The model learns bidirectional mappings between visible (Vis) and SAR modalities via generators $G_{V \to S}$ and $G_{S \to V}$, with adversarial feedback from modality-specific discriminators.

Table 2. Summary of selected studies on GAN with its variants and their relevance in EGGE.

Author	Methods used	Results	Relevance	Limitations
[160]	Original GAN framework using generator + discriminator (MLP)	Demonstrated competitive sample generation on MNIST, CIFAR-10, and TFD datasets	Foundational to geotechnical applications (later adopted for soil/rock modeling, subsurface imaging)	Training instability, lack of explicit probability density, and mode collapse
[166]	GANs, cGANs, WGANs; applied to CIFAR-10 and medical images	Augmented datasets improved accuracy (e.g., CIFAR-10: +7.3%; medical imaging: +6.7%); significant FID/IS improvements	Framework directly transferable to geotechnical datasets (soil/rock images, seismic/GPR data)	Training instability, quality control of synthetic data, and high compute costs
[161]	Variants: DCGAN, WGAN, ACGAN, VAE-GAN; review of structural and loss-function improvements	Summarized enhancements that stabilize training & improve diagnostic accuracy	Shows potential of GANs in generating synthetic geotechnical signals (e.g., vibration data for fault diagnosis)	Training instability, domain transferability issues
[167]	Image-to-Image GAN (Pix2Pix) trained on synthetic RF data representing soil layers (sand, silt, clay); Soil behavior type index (Ic) used as input	Achieved mean absolute error (MAE) of 0.039 vs. 0.096 for nearest-neighbor interpolation; accurate for Ic < 3; demonstrated the feasibility of GANs for 2D soil schematization	Provides a novel AI-based approach for subsurface schematization, outperforming traditional interpolation; supports efficient soil classification and modeling with limited data	Based solely on synthetic data; performance biased toward datasets dominated by Ic < 3; requires balanced datasets; validation with real field data still needed
[163]	Multi-scale GAN (MS-GAN) for 3D geological modeling	Generated multiple 3D realizations capturing stratigraphy with quantified uncertainty	Highly relevant for site characterization, 3D subsurface modeling in geotechnics	Computationally intensive; depends on the quality of the training image; irregular boreholes require additional processing
[30]	SchemaGAN: cGAN with U-Net generator & PatchGAN discriminator; trained on 24,000 synthetic cross-sections and CPT-like data	Outperformed interpolation methods (MAE ≈ 0.039 vs. 0.096 for nearest-neighbor; better layer boundaries, anisotropy, and complex geometries); validated through blind expert survey and Dutch field case studies	Demonstrated robust, realistic subsurface schematization from sparse CPT data; scalable tool for geotechnical site characterization and digital twin applications	High computational training cost (95h on supercomputer); still reliant on synthetic training data; limited to 2D (needs extension to 3D); may struggle if field conditions deviate from training database

3.3. Physics-Informed Neural Networks (PINNs) in AI–EGGE

PINNs are neural networks that solve partial differential equations (PDEs) by embedding physical laws or other governing constraints directly into the loss function, ensuring physics-consistent learning [168,169,170]. By converting these constraints into additional loss terms, PINNs integrate observational data with PDE residuals, initial, and boundary conditions to guide training, thereby enhancing generalization, reducing dependence on large labeled datasets, and improving interpretability relative to purely data-driven DL [23,171]. This capability is particularly relevant to EGGE, where subsurface behavior is governed by PDE-based geophysical and geotechnical laws [59,172]. Within the AI–EGGE paradigm, PINNs mark a shift from conventional pattern-recognition DL toward physics-informed learning, enabling models to retain mechanistic fidelity while leveraging data-driven flexibility [39]. Figure 10 illustrates a multi-receptive-field PINN (MRF–PINN) [171]. The initialized input field ($u_{0}$) is processed through parallel encoder–decoder branches with different receptive fields ($RF_{1}$–$RF_{6}$), enabling multi-scale feature extraction. Each branch produces intermediate feature maps ($u_{1}$–$u_{6}$) and corresponding predictions ($p_{1}$–$p_{6}$), which are recombined via feature-map aggregation and linear superposition to form the MRF–PINN prediction field. Training is guided by a composite loss function, where the weighting coefficients $\lambda_{data}$, $\lambda_{BC}$, and $\lambda_{PDEs}$ balance data misfit ($L_{data}$), boundary-condition constraints ($L_{BC}$), and physics-based PDE residuals ($L_{PDEs}$). This design supports stable learning of multi-scale, heterogeneous, and coupled subsurface processes characteristic of EGGE systems (e.g., [11,173]).
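The composite loss described above is commonly written as a weighted sum of its three terms. The form below is a sketch under that assumption and is not necessarily the exact expression used in [171]. The PDE term is illustrated with Terzaghi's 1D consolidation equation for excess pore pressure $u(z,t)$ with coefficient of consolidation $c_{v}$, chosen here only as a familiar geotechnical example; $\hat{u}$ denotes the network prediction, and $N_{d}$ and $N_{r}$ are the (hypothetical) numbers of observation and collocation points.

```latex
\mathcal{L} = \lambda_{data}\, L_{data} + \lambda_{BC}\, L_{BC} + \lambda_{PDEs}\, L_{PDEs}

% Illustrative terms (assumed mean-squared forms):
L_{data} = \frac{1}{N_{d}} \sum_{k=1}^{N_{d}} \left( \hat{u}(z_{k}, t_{k}) - u^{obs}_{k} \right)^{2},
\qquad
L_{PDEs} = \frac{1}{N_{r}} \sum_{j=1}^{N_{r}} r(z_{j}, t_{j})^{2},
\qquad
r(z, t) = \frac{\partial \hat{u}}{\partial t} - c_{v} \frac{\partial^{2} \hat{u}}{\partial z^{2}}
```

The boundary term $L_{BC}$ takes the same mean-squared form, evaluated on the mismatch between $\hat{u}$ and the prescribed boundary values. In inverse problems, $c_{v}$ can be treated as a trainable parameter so that minimizing $\mathcal{L}$ fits the field and the material property together.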

PINNs have shown strong capability for forward and inverse modeling in EGGE, with applications in soil/rock consolidation [174], groundwater [59], seismic wave propagation [38], subsurface stress distribution [22], tunneling [172], and pile–soil interaction [175]. They deliver physically credible predictions under sparse or imperfect data and achieve accuracy comparable to analytical and numerical solutions, with growing value for parameter inference [138,176]. For example, Ito et al. [177] estimated the coefficient of consolidation for Terzaghi’s 1D consolidation and unsaturated hydraulic properties directly from laboratory data. However, widespread adoption remains limited by reliance on synthetic data, reduced robustness under field noise and heterogeneity, training instability, slow convergence, loss-term sensitivity, and difficulty modeling highly nonlinear or coupled hydro-geomechanical processes [22,178,179]. Emerging advances, such as hybrid architectures (e.g., MRF-PINNs, operator-learning PINNs), adaptive loss balancing, integrated field–lab data training, uncertainty- and XAI-enhanced PINNs, and PINN–numerical solver coupling [171,180], are accelerating the shift toward trustworthy, physics-informed, and field-deployable AI for EGGE, laying the foundation for neural-operator models and real-time digital-twin subsurface systems. For example, Arif et al. [10] (Figure 11) integrate an optimum GBM classifier with a PINN for circular slope-stability diagnosis: the GBM is trained on a 221-case dataset (inputs: $H/h$, slope angle $\beta$, cohesion $c$, unit weight $\gamma$, friction angle $\varphi$, and pore-water pressure $r_{u}$) with 5-fold cross validation (CV), while the PINN enforces physics constraints to regularize learning. This hybrid achieved the strongest F1/κ–AUC performance and outperformed ANN, SVM, and RF baselines, demonstrating how physics guidance enhances generalization under limited data.

4. Emerging Trends in AI–EGGE

The increasing complexity and nonlinearity of subsurface systems now require analytical frameworks that can integrate diverse geophysical, geotechnical, remote-sensing, and environmental datasets into unified, physically consistent models. This need is driving a shift in AI–EGGE toward physics-informed, multimodal, explainable, and increasingly autonomous intelligence capable of capturing coupled subsurface processes and supporting real-time decision-making [181,182]. This shift is being enabled by the integration of ML/DL with physics-based modeling, multimodal data fusion, UQ, and edge-intelligent sensing, aimed at improving model reliability, transparency, and operational deployment [183,184]. At the core of this evolution is multimodal fusion, which integrates heterogeneous datasets to reconcile differences in sensitivity, resolution, and noise [4,185], thereby reducing interpretational ambiguity and strengthening predictive robustness for environmental and geotechnical decision-making.

4.1. Explainable AI (XAI): Concept and Model Framework

As AI systems become embedded in geophysical–geotechnical workflows, interpretability is essential for scientific credibility, regulatory acceptance, and geoenvironmental and geoengineering deployment. In XAI, the terms white-box, gray-box, and black-box denote increasing opacity in a model’s internal logic and decision-making process [186], influencing model selection in EGGE applications. White-box models are inherently interpretable but often less accurate; gray-box models balance interpretability and performance; and black-box models typically achieve higher accuracy at the expense of transparency. XAI promotes transparency by linking model predictions to physical properties, uncertainty sources, and domain knowledge, converting opaque outputs into defensible analytical insights. It aims to make AI results understandable and trustworthy for both domain specialists and system developers, supporting error diagnosis, model refinement, and responsible use in high-consequence contexts [9,79,186]. XAI methods fall into two groups: intrinsically interpretable models designed for transparency (e.g., EBM, NAM, CBM) and post-hoc techniques that explain complex models (e.g., SHAP, LIME, IG, Grad-CAM, ALE) [187,188]. Beyond transparency, XAI supports scientific consistency by ensuring that learned representations align with physical principles and empirical evidence, which is critical in EGGE, where decisions must withstand scrutiny under partial observability and high impact [189]. Table 3 summarizes the key XAI and uncertainty evaluation methods essential for interpreting multimodal models and assessing their predictive reliability.

4.1.1. Post-Hoc Methods

Post-hoc methods explain trained models at local or global levels. Key feature-attribution tools include Integrated Gradients and its refinement, Guided IG, which reduces attribution noise, and Shapley-value methods, with dependence-aware variants preferred when predictors are correlated [188,189]. For global effects, ALE is generally favored over Partial Dependence Plots (PDPs) as it avoids extrapolation under correlation, with recent work improving its efficiency in high-dimensional settings. These methods support factor-effect interpretation in geospatial workflows involving terrain, soil, or geophysical attributes [56]. However, post-hoc explanations may be unstable, inconsistent, or unfaithful to model logic; thus, quantitative tests of faithfulness and stability are recommended over visual inspection alone [81,190]. Demonstrated value in EGGE includes landslide susceptibility modeling, where SHAP-based analyses reveal the influence of slope, precipitation, and lithology, with similar usage in hazard mapping, subsoil classification, and monitoring [80,191].
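To make the Shapley-value idea concrete, the sketch below computes exact Shapley attributions for a toy three-feature model by enumerating all feature subsets. It is an illustration only: it does not use the SHAP library, it replaces absent features with a fixed baseline (one of several possible conventions), and the toy score function with its slope, precipitation, and lithology inputs is hypothetical.

```python
from itertools import combinations
from math import factorial

import numpy as np


def exact_shapley(f, x, baseline):
    """Exact Shapley values of f at x, with absent features set to baseline."""
    n = len(x)

    def value(subset):
        z = baseline.copy()
        for j in subset:  # features in the coalition take their actual values
            z[j] = x[j]
        return f(z)

    phi = np.zeros(n)
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                phi[i] += weight * (value(subset + (i,)) - value(subset))
    return phi


def toy_score(z):
    """Hypothetical susceptibility score: additive slope term plus an
    interaction between precipitation and lithology."""
    slope, precipitation, lithology = z
    return 2.0 * slope + precipitation * lithology


x = np.array([1.0, 2.0, 3.0])
baseline = np.zeros(3)
phi = exact_shapley(toy_score, x, baseline)
print(phi)  # [2. 3. 3.]
print(phi.sum(), toy_score(x) - toy_score(baseline))  # 8.0 8.0
```

The attributions sum to the difference between the prediction and the baseline prediction, and the interaction term is split equally between the two interacting features. Exact enumeration scales exponentially with the number of features, which is why practical tools rely on approximations.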

4.1.2. Intrinsic (Ante-Hoc) Methods

Intrinsic approaches embed interpretability in the model form and are advantageous for validation in safety-critical engineering contexts. Modern additive models include NAMs, which retain GAM-like clarity with neural flexibility, and EBMs, which provide transparent, shape-constrained feature effects with competitive accuracy [180,187]. CBMs introduce human-auditable intermediate concepts to enable inspection and correction at the concept layer [192]. A related direction leverages physics-informed ML (PIML) or physics-guided ML, embedding physical laws, constraints, or forward operators into the architecture or loss to yield predictions interpretable through constraint satisfaction and residuals—useful for inversion and property mapping in EGGE [101,169,180].
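The idea of embedding a physical law in the loss can be sketched as a composite objective that adds a physics-residual penalty to the data misfit. The example below is illustrative only and assumes a deliberately simple governing equation, steady one-dimensional flow in a homogeneous medium (d²h/dx² = 0), discretized by central differences; the function name, grid, and weighting are hypothetical.

```python
def physics_informed_loss(h, observations, dx=1.0, weight=1.0):
    """Data misfit plus a penalty on the residual of d2h/dx2 = 0.

    h            : predicted hydraulic heads on a uniform grid
    observations : {grid index: measured head} at sparse locations
    """
    data = sum((h[i] - v) ** 2 for i, v in observations.items()) / len(observations)
    # Central-difference residual of the assumed governing equation.
    residuals = [(h[i - 1] - 2 * h[i] + h[i + 1]) / dx ** 2
                 for i in range(1, len(h) - 1)]
    physics = sum(r * r for r in residuals) / len(residuals)
    return data + weight * physics, residuals

observations = {0: 10.0, 4: 6.0}                 # hypothetical piezometer readings
consistent = [10.0, 9.0, 8.0, 7.0, 6.0]          # linear profile, satisfies the physics
bumpy = [10.0, 9.0, 9.0, 7.0, 6.0]               # fits the data but violates the physics
loss_ok, res_ok = physics_informed_loss(consistent, observations)
loss_bad, res_bad = physics_informed_loss(bumpy, observations)
```

Both profiles match the two observations exactly, yet only the second is penalized. The returned residuals indicate where the prediction departs from the assumed physics, which is the sense in which such models are diagnosable.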

4.1.3. Integrated Applications

Applications in EGGE demonstrate the value of XAI for improving model credibility, interpretability, and deployment. For instance, Wang et al. [80] employed an XAI–DL framework to derive global and local insights into hydro-morphological processes (HMP) across China to aid in hazard and risk assessment, achieving ten-fold cross-validated AUC scores of 0.83–0.86. The SHAP-based interpretations revealed that spatially varying feature contributions to HMP predictions can diagnose model behavior, validate physically meaningful drivers, and support regional-scale environmental analysis. In landslide susceptibility modeling, e.g., [80,188], interpretable neural networks and SHAP-augmented ensembles similarly expose dominant predictors and interactions (e.g., slope–precipitation coupling, lithology, aspect), improving user trust and guiding model refinement. In geotechnical property prediction and soil classification, SHAP, ALE, and partial-dependence diagnostics clarify how CPT-derived indices and soil parameters influence outputs [9,16]. Similarly, Degen et al. [180] show that architectures that embed domain constraints into feature learning enhance classification performance while preserving the interpretability of the physical cues used. Overall, XAI strengthens due diligence in EGGE, supporting sensitivity and scenario analysis, defensible decision-making, and model-risk management across heterogeneous datasets [189].

Practical guidance for XAI interpretation in EGGE includes:

Use intrinsic models (EBM, NAM, CBM) where interpretability is essential; apply post-hoc methods mainly for auditing, debugging, and communicating complex models, and validate explanations with faithfulness and stability checks.
In correlated multimodal settings, favor ALE and dependence-aware SHAP over naïve PDP/SHAP pipelines.
Where physical laws are well established, adopt physics-informed hybrids to couple accuracy with mechanistic interpretability and diagnosable residuals.
Quantify uncertainty in explanations (e.g., confidence intervals for feature effects or attribution variability across perturbations) to avoid overconfident interpretation in high-stakes EGGE decisions.
Communicate explanations at the appropriate level of abstraction for the audience (engineers, regulators, field practitioners), emphasizing actionable insights rather than raw attribution maps.
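One simple way to attach uncertainty to an explanation, as recommended above, is to repeat a perturbation-based attribution and report its spread. The sketch below uses permutation importance as a stand-in attribution method; the function name, the toy data, and the choice of 20 repeats are assumptions made for illustration.

```python
import random
from statistics import mean, stdev

def permutation_importance(predict, X, y, j, repeats=20, seed=0):
    """Mean and spread of the MSE increase when feature j is shuffled."""
    rng = random.Random(seed)

    def mse(rows):
        return mean((predict(r) - t) ** 2 for r, t in zip(rows, y))

    baseline = mse(X)
    increases = []
    for _ in range(repeats):
        column = [r[j] for r in X]
        rng.shuffle(column)                      # break the feature-target link
        shuffled = [r[:j] + (v,) + r[j + 1:] for r, v in zip(X, column)]
        increases.append(mse(shuffled) - baseline)
    return mean(increases), stdev(increases)

# Hypothetical data: the target depends on feature 0 only.
X = [(float(i), float(i % 3)) for i in range(10)]
y = [2.0 * r[0] for r in X]
model = lambda r: 2.0 * r[0]                     # stand-in for a trained model
used = permutation_importance(model, X, y, j=0)
unused = permutation_importance(model, X, y, j=1)
```

Reporting the standard deviation alongside the mean gives a first indication of whether a ranking of predictors is stable or an artifact of a single perturbation.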

4.2. Multimodal Fusion: Concept, Hierarchy, Strategies, and Methodological Approaches

Multimodal fusion in AI–EGGE refers to the systematic integration of heterogeneous datasets to achieve a more complete, reliable, and physically consistent representation of subsurface and engineered environments [4,193]. The conceptual hierarchy of multimodal fusion spans three integration levels: low-, mid-, and high-level fusion. Low-level (data-level) fusion combines raw or minimally processed datasets to preserve intrinsic spatial and physical relationships; mid-level (feature-level) fusion generates shared latent representations by combining engineered or learned features to improve discrimination of lithological boundaries, hydrostratigraphic units, geomechanical heterogeneities, and deformation precursors; and high-level (decision-level) fusion aggregates outputs from independently trained models through ensemble learning, Bayesian model averaging, and probabilistic late-fusion schemes to strengthen robustness, interpretability, and out-of-distribution generalization [4,21,182,183,194]. This conceptual multimodal fusion framework is illustrated in Figure 12, summarizing the flow from data acquisition and preprocessing through fusion levels and AI modeling to interpretability, uncertainty assessment, and end-use applications.
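As an illustration of high-level (decision-level) fusion, the sketch below combines class probabilities from two independently trained, modality-specific models by weighted averaging. The weights, class ordering, and probability values are hypothetical; in practice the weights might be derived from validation skill or posterior model probabilities.

```python
def late_fusion(prob_sets, weights):
    """Weighted average of per-model class probabilities (decision-level fusion)."""
    total = sum(weights)
    n_classes = len(prob_sets[0])
    return [sum(w * p[c] for w, p in zip(weights, prob_sets)) / total
            for c in range(n_classes)]

# Hypothetical outputs for classes [clay, sand] at one location.
p_geophysics = [0.6, 0.4]    # e.g., from a resistivity-based classifier
p_geotechnics = [0.2, 0.8]   # e.g., from a CPT-based classifier
fused = late_fusion([p_geophysics, p_geotechnics], weights=[1.0, 3.0])
```

Because only model outputs are combined, each modality can keep its own preprocessing and architecture, at the cost of ignoring cross-modal interactions that low- and mid-level fusion can exploit.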

Relevant data sources include geophysical surveys (resistivity, seismic, GPR, magnetic, gravity), geotechnical investigations (borehole logs, SPT-N, CPT-qc, laboratory tests), and remote-sensing and environmental observations such as UAV imagery, light detection and ranging (LiDAR), synthetic aperture radar (SAR), interferometric SAR (InSAR), and soil chemistry. Each modality contributes distinct yet complementary strengths: geophysical methods provide broad spatial coverage but often suffer from non-uniqueness and depth-resolution limitations, whereas geotechnical data offer high-fidelity ground truth but are spatially sparse. Their fusion reduces interpretational ambiguity, tightens model uncertainty, and enhances the effective spatial resolution and structural interpretability of subsurface models for geoenvironmental and geoengineering decision-making [193,195,196]. Table 4 provides a complementary overview focusing specifically on the hierarchy of fusion levels, representative techniques, and computational enablers adopted in EGGE.

Methodologically, multimodal integration in EGGE is enabled through five overarching strategy classes: physics-governed fusion (e.g., joint and cooperative inversion) that embeds physical constraints; data-driven fusion using ML/DL architectures; hybrid physics–AI fusion that couples domain knowledge with learning-based models; cross-modal alignment strategies that learn shared latent spaces for correlated modalities; and probabilistic and uncertainty-aware fusion to quantify confidence and support risk-informed decision-making [45,182]. Recent advances further incorporate AI-driven approaches, including physics-informed ML (PIML/PINNs) for physics-consistent multimodal learning, transformer-based and contrastive cross-modal fusion for spatial-temporal representation alignment, diffusion and GAN models to infer or reconstruct missing modalities, and GNN- and spatiotemporal fusion frameworks for continuous deformation assessment, infrastructure monitoring, and hazard early warning [165,185,194]. Deep attention-enhanced fusion architectures are also emerging, exemplified by Yan et al. [165], who integrate multi-layer deep feature extraction, hierarchical feature fusion with attention, and a Layered Strictly Nested Framework (LSNF) to achieve robust SAR–visible image fusion with preserved structural integrity and high-frequency detail. Recent evidence shows that integrating borehole and laboratory data with ground and airborne geophysics sharpens hydrostratigraphic and deformation models and strengthens prediction in EGGE [195,196]. Physics-consistent joint inversion further improves structural coherence in integrated subsurface interpretation [193,198]. Remote-sensing fusion (optical/SAR/LiDAR) adds terrain, deformation, and environmental constraints at scale, supporting hazard-prone and urban-infrastructure applications [4,64,199].

5. Applications and Case Studies of AI–EGGE

5.1. Applications

The practical deployment of AI–EGGE has advanced from isolated algorithmic trials to robust, field-validated decision-support systems. Beyond conventional supervised and unsupervised ML, recent developments integrate DL architectures, physics-informed and mechanistic modeling, multimodal and cross-domain data fusion, and XAI to enhance trust, interpretability, and engineering uptake. These advances now enable high-resolution subsurface characterization, soil and lithological classification, geotechnical parameter prediction, groundwater assessment, geohazard forecasting, underground infrastructure monitoring, and real-time sensing for resilient engineering design. This section presents application cases demonstrating how AI–EGGE is transforming data acquisition, interpretation, and risk-informed decision-making across environmental and engineering contexts.

5.1.1. Site Characterization and Subsurface Profiling

AI–EGGE strengthens site characterization by improving assessment of soil–rock interactions and subsurface conditions, reducing trial-and-error investigations. It enables reliable prediction of groundwater levels, soil texture, particle-size fractions, organic carbon, pore-water pressure, and soil movement, lowering exploration costs and improving design efficiency. Accurate subsurface characterization is critical because physical, mechanical, and hydraulic properties govern structural performance [6,14,87]. Compaction, shear strength, and permeability control foundation support, while interactions—such as water content, viscosity, and earth pressure in clays—affect strength and deformation. Data-driven and statistical methods enhance characterization by revealing patterns often missed by conventional approaches.

ML models have been widely applied to estimate soil hydraulic properties [100,200], soil organic carbon and matter [201], and soil pore structures [53,77]. Notable examples include ANNs and the Group Method of Data Handling (GMDH), a self-organizing polynomial-based ML method, which have provided deeper insight into site properties [75,108,202]. Licznar and Nearing [203] applied ANNs to predict hydraulic behavior, and Mishra et al. [204] used RF and SVM to estimate bulk density and cation exchange capacity at different depths. Extreme learning machine (ELM), RBF, modified GMDH, and M5 tree methods have proven effective for predicting pore-water pressure, groundwater table elevation, and soil-water retention curves (SWRC), which are central to hydrological processes, soil moisture dynamics, and agro-hydrology [205]. A wide range of soil attributes, from texture and organic matter to hydraulic and retention properties, serves as key inputs for ML-based SWRC estimation. ANN- and DNN-based models have also been used to predict soil temperature [42,206].

Despite these advances, site-specific soil and rock heterogeneity still necessitates complementary laboratory testing. Consistent with this, Wang et al. [207] assessed RF, SVM, gradient-boosted regression trees, MLP, and least angle regression (LAR) for hyperspectral-based soil salinity prediction, with LAR showing superior stability and accuracy. Elaziz et al. [208] also applied XGBoost, RF, and GBM to 120 soil samples for predicting soil salinity in semi-arid regions, with reported success rates between 0.99 and 1.0. Zhang et al. [209] reported that XGBoost achieved the highest accuracy for total soil nitrogen prediction, RF performed best with SAR data, and GBM outperformed other models when using Landsat-8 and Sentinel-2 data compared to Sentinel-1. These findings highlight the value of integrating ML with multisource satellite data for precise soil nutrient mapping and improved geotechnical prediction, risk assessment, design optimization, and decision-making. Table 5 provides additional AI–EGGE studies on soil prediction and classification.

5.1.2. Landslides

Landslide research is grouped into susceptibility assessment, displacement prediction, and detection. Landslides may be shallow or deep-seated and are commonly triggered by rainfall, earthquakes, typhoons, or deforestation [212]. Landslide susceptibility mapping (LSM) uses conditioning factors such as elevation, slope, lithology, soil type, aspect, and distance to roads, faults, and rivers, with steep slopes, weak soils, low shear strength, and river proximity increasing susceptibility [12,213]. Detection maps event location, type, boundary, volume, date, and soil/rock triggers, supporting regional “soft” risk assessment, whereas displacement prediction represents “hard” risk evaluation for specific slopes or assets [214,215]. Traditional detection relied on visual imagery and field surveys [216]. The Frequency Ratio method remains widely used to quantify links between conditioning factors and landslide occurrence, with correlation/multicollinearity checks applied to reduce redundant predictors [217].
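For reference, the Frequency Ratio of a factor class is the share of landslide cells falling in that class divided by the share of the study area occupied by that class, so values above 1 indicate a stronger-than-average association with landsliding. A minimal sketch follows; the slope classes and cell counts are hypothetical.

```python
def frequency_ratio(class_cells, landslide_cells):
    """FR per class = (landslide share in class) / (area share of class)."""
    total_cells = sum(class_cells.values())
    total_slides = sum(landslide_cells.values())
    return {c: (landslide_cells.get(c, 0) / total_slides) / (class_cells[c] / total_cells)
            for c in class_cells}

# Hypothetical raster counts per slope class (degrees).
class_cells = {"0-15": 600, "15-30": 300, ">30": 100}      # all cells in the study area
landslide_cells = {"0-15": 10, "15-30": 30, ">30": 60}     # cells mapped as landslide
fr = frequency_ratio(class_cells, landslide_cells)
```

In this toy case the steepest class occupies 10% of the area but hosts 60% of the landslide cells, giving a ratio of about 6, whereas the gentlest class falls well below 1.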

AI-driven semi- and fully automated workflows now reduce manual effort and enhance detection and LSM efficiency, especially for pixel-based and object-based image analysis (OBIA) methods [110,213,215]. OBIA generally outperforms pixel-based methods in complex terrain by detecting object-level changes from multi-temporal imagery. For instance, Ghorbanzadeh et al. [110] employed a hybrid ResU-Net–OBIA workflow for landslide detection. In the study, Sentinel-2, NDVI, and slope layers are first segmented using multiresolution segmentation, followed by rule-based OBIA classification to generate object-level candidates. In parallel, ResU-Net performs pixel-wise segmentation to produce probability heatmaps, which are thresholded and fused with OBIA outputs. The hybrid ResU-Net–OBIA approach reduces spectral confusion, enhances boundary delineation, and suppresses false positives compared with standalone ResU-Net or OBIA, demonstrating the value of combining deep semantic segmentation with object-based geomorphological reasoning for landslide mapping.
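The thresholding-and-fusion step can be illustrated with a simple object-level rule. The source does not specify the fusion rule used by Ghorbanzadeh et al. [110], so the sketch below assumes one plausible choice: an OBIA candidate object is retained when the mean pixel probability inside it reaches a threshold. The function name, the label convention (0 for background), and the toy grids are hypothetical.

```python
def fuse_heatmap_with_objects(prob, labels, threshold=0.5):
    """Keep OBIA objects whose mean pixel probability reaches the threshold."""
    sums, counts = {}, {}
    for prob_row, label_row in zip(prob, labels):
        for p, obj in zip(prob_row, label_row):
            if obj == 0:                 # 0 marks pixels outside any candidate object
                continue
            sums[obj] = sums.get(obj, 0.0) + p
            counts[obj] = counts.get(obj, 0) + 1
    kept = {obj for obj in sums if sums[obj] / counts[obj] >= threshold}
    # Binary landslide mask that follows object boundaries, not single pixels.
    return [[1 if obj in kept else 0 for obj in row] for row in labels]

# Hypothetical 2 x 3 scene: network probabilities and OBIA object labels.
prob = [[0.9, 0.8, 0.1],
        [0.7, 0.2, 0.1]]
labels = [[1, 1, 2],
          [1, 0, 2]]
mask = fuse_heatmap_with_objects(prob, labels)
```

Aggregating probabilities per object is one way in which isolated high-probability pixels can be suppressed while object boundaries from the segmentation are preserved.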

In other studies, Liu et al. [218] demonstrated that the YOLOv7 model enhanced with a Squeeze-and-Excitation attention mechanism provided higher generalization capability and detected landslides more accurately with fewer missed events. CNNs and RNNs consistently outperform SVM, DT, and RF for LSM [219,220,221]. Althuwaynee et al. [222] found that a hybrid evidential-AHP (analytical hierarchy process) model outperformed logistic regression. Wang et al. [223] reported that an ensemble of SVM, ANN, and GBoost improved AUC by 0.11–0.35 over single models. Kim and Lee [206] compared CNNs with ML models (RF and bagging) and showed that CNNs consistently achieved higher LSM accuracy. Balogun et al. [102] used meta-heuristic algorithms—GWO, Bat Algorithm (BA), and Cuckoo Optimization Algorithm (COA)—to optimize SVM hyperparameters, improving LSM prediction accuracy. GRU, CNN, and hybrid models (CNN–AEIO [Age of Exploration-Inspired Optimizer], PSO–SVM, SVM–GWO) have also shown strong LSM performance in Lushan, Majiagou, northwest Sichuan, and Jiuxianping [76,106,119,224].

For time-series-based landslide modeling, DNNs, TF-based temporal models, and physics-informed hybrids have improved landslide displacement forecasting; however, models such as GEP, SVM, and RBF often perform poorly when external triggers (e.g., rainfall and reservoir water-level fluctuations) are excluded, limiting their long-term predictive ability [215,225]. GEP has shown superior performance to SVM and RBF in displacement prediction [85]. Comparative DL studies report that transformer models yield the highest accuracy, followed by LSTM, improved Elman networks, RNNs, and other DL architectures [136,158]. The improved Elman network, which incorporates a piecewise time-weighted gradient function, enhances the ability to learn both current and historical temporal dependencies [225].

5.1.3. Sinkholes

Sinkholes are vertical depressions formed by the collapse of overlying soil due to chemical dissolution of underlying karstic bedrock, particularly in carbonate and evaporite terrains [9,214,226]. They display distinct morphologies, such as depth, diameter, circumference, and area, that vary with formation environment and lithology (e.g., basement rock, plateau, and basin sinkholes). Efficient detection and analysis of sinkhole patterns are essential for disaster mitigation and sustainable development [19,227]. ML/DL applied to subsurface datasets provides a robust means of characterizing sinkhole morphology and distinguishing sinkholes from surrounding terrain. Integrated ML/DL models have leveraged diverse data sources, including aerial photographs, GPR, thermal imagery, digital elevation models (DEMs), satellite data, and RGB images [228,229,230,231]. In West-Central Florida, Muili and Babaie [232] showed that RF outperformed MLP, SVM, and KNN for sinkhole prediction, and revealed that shallow bedrock depth, land-use patterns, and NW–SE-trending faults were the dominant controls on sinkhole development. Table 6 lists additional studies on AI applications for sinkhole detection using remote sensing data.

5.2. Case Studies

This section presents real-world cases showing how AI enhances subsurface characterization, hazard prediction, and engineering decisions in EGGE. Each case summarizes: (i) the problem context, (ii) the AI/ML/DL, hybrid, or physics-informed models applied, (iii) multimodal or multisensor data used, and (iv) the value-added gains over conventional methods. These cases reflect the AI–EGGE shift toward data-driven, physics-consistent, and multimodal intelligence for more accurate and decision-ready solutions.

5.2.1. Case 1: Hybrid CNN–ViTF for High-Fidelity, Robust, and Real-Time ERT Inversion

ERT inversion has long been constrained by smoothing artifacts, non-uniqueness, and high computational demand inherent to Gauss–Newton (GN)–based solvers. Yin et al. [41] introduced a hybrid ViTF-based inversion framework, with spatial CNN blocks and TF self-attention, that transitions ERT imaging from iterative physics-based inversion to a direct, data-driven, image-to-image prediction paradigm with high structural fidelity and real-time performance. The workflow (Figure 13) integrates: (i) large-scale synthetic paired apparent–true resistivity datasets generated through finite-element forward modeling for 1–5 anomalous targets, enabling curriculum training to progressively learn structural complexity; (ii) a residual CNN-based colormap calibration module to predict resistivity ranges for field data where true values are unavailable, ensuring consistent pixel–resistivity mapping; and (iii) the ViTF inversion model that maps calibrated apparent resistivity images to true resistivity distributions (Figure 13a). The detailed ViTF architecture combines convolutional blocks (Figure 13b) for localized anomaly feature extraction with TF self-attention for capturing long-range resistivity dependencies when mapping observed apparent-resistivity pseudo-sections to true resistivity distributions. This architecture overcomes the locality bias and scale sensitivity of CNN-only models and enables generalization across anomaly geometries, contrasts, and spatial configurations. A comparative performance summary of ViTF against CNN-AE, U-Net, Latent Diffusion Model (LDM), and GN inversion confirmed its superior accuracy, efficiency, boundary preservation, and robustness across all complexity levels.

The ViTF achieved SSIM up to 0.912, MSE as low as $1.12 \times 10^{-3}$, and the shortest training time of 706 s, outperforming CNN-AE and U-Net, which produced blurred anomaly boundaries and higher errors, and outperforming LDM, which showed instability beyond one-target scenarios. It maintained R² > 0.96 and RMSE < 260 Ωm across 1–5 targets, demonstrating stable high-fidelity inversion with sharp boundary retention. In Figure 14, the colormap calibration module (Panel 1) delivered R² > 0.98, MAE < 150 Ωm, and RMSE < 260 Ωm, confirming negligible deviation between calibrated and ground-truth colormaps, while inversion results remained statistically unchanged (Panel 2), validating field readiness. Against GN inversion, the ViTF (Panel 3) consistently produced lower pixel-level errors, sharper target geometries, and reduced artifact zones across all five complexity cases, whereas GN exhibited smearing and centroid misplacement of anomalies. Notably, ViTF achieved ~20 ms per inversion, compared to ≥ 5 s per iteration for GN, marking a transition toward real-time ERT imaging. In field validation at the U.S. DOE Hanford Site, ViTF successfully reconstructed key resistivity contrasts associated with stratified vadose-zone heterogeneity and contamination pathways, outperforming GN, which underestimated boundary sharpness and mispositioned anomaly interfaces. While current limitations include reliance on synthetic training data, future integration with PINNs, uncertainty-aware diffusion models, and multimodal joint inversion (e.g., ERT–seismic–GPR) will further enhance generalizability across lithological regimes. This TF-based framework exemplifies the AI-EGGE paradigm shift, where DL models evolve from post-processing aids to primary inversion engines capable of augmenting or replacing conventional physics-based solvers.
The ViTF establishes a new benchmark for rapid, high-resolution, and operationally deployable ERT inversion, enabling real-time decision support for environmental, geotechnical, and subsurface hazard monitoring applications.

5.2.2. Case 2: Multimodal CNN–TF Fusion for Enhanced Urban Scene and Functional Mapping

Urban scene understanding is critical for land-use mapping, infrastructure planning, and exposure assessment in complex environments; however, remote sensing imagery (RSI) alone often struggles to discriminate morphologically similar urban classes. Su et al. [4] addressed this limitation through a multimodal CNN–Transformer fusion framework that integrates RSI with Points of Interest (POIs) and building footprint data, enabling semantically enriched urban functional mapping. The model employs a dual-branch architecture with attention-weighted fusion and multiscale feature extraction, while a transformer-based interaction module aligns cross-modal representations to enhance feature consistency and contextual awareness.

Experimental results on the Chengdu and Wuhan datasets demonstrate that multimodal integration improves overall accuracy by approximately 6–7% over RSI-only baselines, with consistent gains in Cohen’s Kappa, κ, and F1-score. Ablation analysis confirms that attention weighting, modality interaction, and multiscale feature extraction each contribute measurably to performance, while cross-city validation further indicates improved generalization relative to single-modality approaches. Comparative evaluations show that the framework achieves competitive or superior accuracy relative to recent multimodal architectures, with lower model complexity and improved interpretability. Interpretability analysis using Class Activation Maps reveals that multimodal fusion produces more spatially coherent and semantically meaningful attention patterns, particularly for complex functional classes such as commercial, public service, and high-density residential zones, where RSI-only models exhibit ambiguous or diffuse responses. The lightweight design (~6.23M parameters) supports scalable deployment for large-area urban monitoring; however, reliance on static datasets limits temporal adaptability, highlighting the need for integrating dynamic data sources such as mobility patterns and multi-temporal imagery. This case study demonstrates how multimodal fusion within the AI-EGGE paradigm integrates heterogeneous spatial data to improve classification reliability, interpretability, and generalization, strengthening urban analytics for infrastructure exposure assessment and climate–geohazard-informed planning.
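Since overall accuracy and Cohen's κ are reported side by side, it may help to recall how κ discounts agreement expected by chance: κ = (p_o − p_e) / (1 − p_e), where p_o is the observed agreement and p_e the agreement expected from the marginal class frequencies. The sketch below uses hypothetical labels and is not drawn from the Chengdu or Wuhan experiments.

```python
from collections import Counter

def cohens_kappa(y_true, y_pred):
    """Chance-corrected agreement between reference and predicted labels."""
    n = len(y_true)
    p_o = sum(t == p for t, p in zip(y_true, y_pred)) / n      # observed agreement
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    # Agreement expected if labels were assigned independently at the same rates.
    p_e = sum(true_counts[c] * pred_counts[c] for c in true_counts) / n ** 2
    return (p_o - p_e) / (1 - p_e)

# Hypothetical functional-zone labels for four parcels.
reference = ["residential", "residential", "commercial", "commercial"]
predicted = ["residential", "residential", "commercial", "residential"]
kappa = cohens_kappa(reference, predicted)
```

Here the overall accuracy is 0.75 but κ is 0.5, which illustrates why κ is usually reported alongside accuracy when class frequencies are uneven.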

6. Challenges and Limitations

Despite major advances, AI–EGGE faces persistent data and fusion-related barriers that limit reliable deployment. Data scarcity, uneven quality, and limited labeled samples—particularly in environmental geophysics, near-surface imaging, and hazard-related applications where ground truth is costly or uncertain—continue to constrain model performance [41,199]. Multimodal fusion must reconcile heterogeneity in spatial support, sampling density, sensor footprint, and noise, especially in shallow, heterogeneous environments, complicating co-registration, scaling, and correlation handling and potentially biasing fused estimates [1,4]. Class imbalance and spatial autocorrelation can inflate apparent accuracy and obscure generalization, while inconsistent metadata, weak documentation, and the lack of standardized multimodal datasets hinder reproducibility, benchmarking, and cross-site transferability [30].
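One common safeguard against accuracy inflation from spatial autocorrelation is to hold out whole spatial blocks rather than random samples, so that test points are not immediate neighbors of training points. The following is a minimal sketch of such block assignment; the function name, block size, coordinates, and round-robin fold allocation are illustrative assumptions.

```python
def spatial_block_folds(coords, block_size, n_folds):
    """Group samples into square blocks and assign whole blocks to folds."""
    blocks = {}
    for idx, (x, y) in enumerate(coords):
        key = (int(x // block_size), int(y // block_size))
        blocks.setdefault(key, []).append(idx)
    folds = [[] for _ in range(n_folds)]
    # Round-robin over blocks, so samples of one block never span two folds.
    for k, key in enumerate(sorted(blocks)):
        folds[k % n_folds].extend(blocks[key])
    return folds

# Hypothetical sample coordinates in metres.
coords = [(1, 1), (2, 3), (12, 1), (13, 4), (1, 15), (3, 12)]
folds = spatial_block_folds(coords, block_size=10, n_folds=2)
```

The block size would normally be chosen with reference to the autocorrelation range of the target variable, which this sketch leaves to the user.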

Model robustness, physical consistency, and interpretability remain insufficient for engineering-grade decision-making. Many models trained on site-specific datasets degrade under domain shift, non-stationarity, or extreme triggers, with limited cross-site validation and no widely accepted benchmark datasets for fair comparison [4,100]. Purely data-driven models may violate physical constraints and fail outside the training envelope; although physics-informed, hybrid, and joint inversion approaches improve realism, they remain computationally intensive, often ill-posed, and susceptible to non-unique solutions without informative coupling or priors [115,176]. Modal fusion and deep fusion models raise additional interpretability and black-box concerns for engineering decisions, reinforcing the need for physics-informed and XAI-enhanced mechanisms to produce physically meaningful reasoning for spatiotemporal and multimodal systems—especially for environmental geophysics applications where shallow targets demand high sensitivity to noise, heterogeneity, and uncertainty [186,192]. UQ is rarely calibrated, limiting confidence and safety-margin assessment for risk-sensitive EGGE applications [165,182].

Operational, regulatory, and institutional barriers continue to slow the translation of AI–EGGE from research to practice. Developing, validating, and maintaining deep or physics-informed models requires specialized expertise, computing resources, and data-annotation capacity that many engineering and environmental agencies lack, while the absence of lightweight, real-time, edge-deployable tools limits early-warning, digital-twin, and on-site applications [17,134,194]. Data sharing restrictions, unclear responsibility for AI-assisted decisions, and limited long-term field pilots further reduce confidence and uptake [186]. In addition, widely relied-upon environmental and engineering standards currently provide no formal provisions for AI-assisted analyses, constraining regulatory acceptance in design, compliance, and risk management. These include the European Standards for Structural and Geotechnical Design (Eurocode), the American Association of State Highway and Transportation Officials specifications, the Japanese Geotechnical Society Standards, the Australian Standards for geotechnical and structural design (including AS 1726 and AS 5100), and Environmental Impact Assessment (EIA) regulatory frameworks. Similar gaps exist across regional systems, including EIA regimes in Asia, Environmental Management Acts in Africa, and the Environmental Protection and Biodiversity Conservation Act in Australia, which also lack defined pathways for AI-enabled evaluation and decision support. Ethical, cybersecurity, and misuse risks, especially for automated or safety-critical environmental and infrastructure systems, remain unresolved, collectively limiting safe and regulated deployment of AI–EGGE.

7. Future Directions

Advancing AI–EGGE requires moving beyond data-driven prediction toward physics-grounded, uncertainty-aware, real-time, and decision-support AI suitable for field and engineering deployment. Priority lies in maturing physics-integrated learning—through PIML/PINNs, operator-learning networks, and theory-guided hybrids—to reduce non-uniqueness, improve generalization, and stabilize multimodal inversion by embedding governing equations, constitutive relations, and conservation laws directly into learning, even under sparse or noisy data [23,39,171]. Parallel advances in multimodal fusion are expected, with cross-modal attention, foundation-scale multimodal TFs, and GNNs enabling robust integration of geophysical, geotechnical, environmental, and remote-sensing data with differing spatial supports and noise characteristics [4,136,181]. These architectures encode spatial topology, physical neighborhoods, and causal interactions while preserving subsurface structural integrity [108]. When coupled with calibrated uncertainty quantification, interpretability, and physics-based regularization, these architectures can deliver trustworthy models for engineering design, hazard assessment, environmental monitoring, and subsurface digital twins [51,238].

A further frontier is real-time, autonomous, and adaptive geosensing. Edge–cloud computing, IoT-enabled instrumentation, UAV/robotic survey automation, and streaming geophysics will allow continuous assimilation of data for early warning, infrastructure monitoring, and dynamic model updating [53,149,239]. Active learning and adaptive survey designs—guided by model uncertainty—will optimize field campaigns by identifying where to drill, scan, or image to maximize information gain while minimizing cost [1,148]. Progress will also depend on building community-driven multimodal benchmarks, open datasets, and validation protocols tailored to EGGE tasks, along with regulatory and standards development to support safe engineering adoption. Ultimately, the integration of interpretable AI, physics-informed modeling, and real-time digital-twin ecosystems will position AI–EGGE as a core enabler of climate resilience, sustainable urban development, and risk-aware geotechnical and environmental engineering.
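Uncertainty-guided survey design can be sketched as a simple acquisition rule: among candidate locations, choose the one where an ensemble of models disagrees most, optionally scaled by acquisition cost. The example below is a hypothetical illustration; the site names, ensemble predictions, cost values, and the variance-per-cost criterion are assumptions rather than a method taken from the cited studies.

```python
from statistics import pvariance

def next_survey_site(ensemble_predictions, costs=None):
    """Pick the candidate with the largest ensemble variance per unit cost."""
    best_site, best_score = None, float("-inf")
    for site, preds in ensemble_predictions.items():
        score = pvariance(preds) / (costs[site] if costs else 1.0)
        if score > best_score:
            best_site, best_score = site, score
    return best_site, best_score

# Hypothetical predictions of bedrock depth (m) from three ensemble members.
candidates = {
    "BH-A": [10.0, 10.0, 10.0],   # members agree: little to learn here
    "BH-B": [5.0, 10.0, 15.0],    # members disagree strongly
    "BH-C": [9.0, 10.0, 11.0],
}
site_by_variance, _ = next_survey_site(candidates)
site_by_value, _ = next_survey_site(candidates, costs={"BH-A": 1.0, "BH-B": 50.0, "BH-C": 1.0})
```

With equal costs the rule selects the most uncertain location. Once drilling there is made fifty times more expensive, a cheaper and moderately uncertain site becomes the better choice, which reflects the trade-off between information gain and cost.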

Strategic Roadmap for Advancing AI–EGGE:

1. Physics-aligned and trustworthy intelligence: Unify physics-integrated AI (PIML, PINNs, neural operators) with calibrated UQ and interpretable/XAI frameworks to ensure physically consistent, reliable, and audit-ready predictions for engineering-grade deployment.
2. Multimodal fusion and adaptive data ecosystems: Develop next-generation fusion using TFs, GNNs, and cross-modal learning to harmonize geophysical, geotechnical, environmental, and remote-sensing data, supported by adaptive, uncertainty-guided field acquisition that maximizes information efficiency.
3. Autonomous, real-time, and digital-twin EGGE systems: Establish edge–cloud AI platforms, IoT sensing networks, UAV/robotic acquisition, and continuous monitoring to enable real-time hazard detection, early warning, and autonomous digital-twin subsurface systems for resilient infrastructure and environmental management.
4. Standardization, benchmarking, and engineering integration: Create open multimodal benchmarks and validation protocols and co-develop practice-ready AI–EGGE workflows with industry and regulators, embedding safety, due diligence, and codes-of-practice alignment to accelerate formal adoption into engineering standards.

8. Conclusions

AI has catalyzed a decisive shift in EGGE from empirical and deterministic approaches toward data-rich, physics-consistent, and decision-oriented subsurface intelligence. This review synthesizes advances in ML/DL, physics-informed and theory-guided modeling, multimodal data fusion, uncertainty-aware and XAI, and intelligent sensing. Together, these developments enable more accurate and scalable solutions for site characterization, lithological and geotechnical profiling, joint hydrogeomechanical estimation, groundwater and hydro-environmental assessment, geohazard detection, and infrastructure condition monitoring. This trajectory reflects the maturation of AI–EGGE from exploratory algorithms to validated, field-deployable systems supporting early warning, risk-informed design, sustainable infrastructure, and climate-resilient environmental management. The convergence of geophysical, geotechnical, environmental, and data-centric domains now positions AI–EGGE as a core pillar of next-generation subsurface characterization and hazard mitigation.

Responsible, regulated, and engineering-grade deployment, however, remains a critical frontier. Enduring limitations, including data heterogeneity, cross-scale incompatibility, sparse ground truth, lack of standardized multimodal benchmarks, weak generalization across geological and climatic regimes, inadequate interpretability, and limited UQ, continue to constrain confidence, regulatory acceptance, and industry-wide adoption. The absence of formal validation pathways for AI-assisted analyses within environmental and engineering codes and standards underscores the need for auditable modeling frameworks and safety-critical oversight. Future progress must accelerate the integration of physics-based and uncertainty-aware architectures, cross-modal attention and graph-based fusion, digital-twin ecosystems with edge–cloud intelligence, and adaptive sensing strategies that reduce data redundancy while maximizing information value. Realizing the full potential of AI–EGGE requires coordinated efforts across research, practice, regulatory bodies, and international standard-setting organizations to embed rigor, transparency, ethics, and public trust, enabling its transition from a promising innovation to a blueprint for resilient, autonomous, and sustainability-aligned subsurface intelligence.

Acknowledgments

The funding support provided by all contributing agencies that enabled the successful execution of this review is gratefully acknowledged.

Data Availability

All data analyzed during this study are included in this published article.

Ethical Approval

All ethical standards have been duly followed during the research.

Consent to Participate

Not Applicable.

Consent to Publish

Not Applicable.

Financial Interests

The authors declare no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

CRediT Author Statement

ASA: Conceptualization, Investigation, Data curation, Resources, Validation, Writing – original draft, Writing – review & editing, Funding acquisition, and Supervision. AAB, MDD, BMA, TOA, and AOO: Validation, Resources, Writing – original draft, and Writing – review & editing.

Notations and Abbreviations

AE	Autoencoder	LIME	Local Interpretable Model-Agnostic Explanations
AI	Artificial Intelligence	LSM	Landslide Susceptibility Mapping
ALE	Accumulated Local Effects	LSTM	Long Short-Term Memory
ANFIS	Adaptive Neuro-Fuzzy Inference System	MASW	Multichannel Analysis of Surface Waves
ANN	Artificial Neural Network	ML	Machine Learning
BNN	Bayesian Neural Network	MLP	Multilayer Perceptron
BPNN	Backpropagation Neural Network	MLR	Multiple Linear Regression
CatBoost	Categorical Boosting	MRF	Multi-Receptive-Field
CNN	Convolutional Neural Network	PIML	Physics-Informed Machine Learning
COA	Cuckoo Optimization Algorithm	PINN	Physics-Informed Neural Network
CPT/CPT-qc	Cone Penetration Test	PISER	Physics-Informed Simple-to-Ensemble Regressor
DAS	Distributed Acoustic Sensing	PNN	Probabilistic Neural Network
DBN	Deep Belief Network	PSO	Particle Swarm Optimization
DFOS	Distributed Fiber-Optic Sensing	RF	Random Forest
DL	Deep Learning	RMQ	Rock Mass Quality
DNN	Deep Neural Network	RNN	Recurrent Neural Network
DRNN	Deep Recurrent Neural Network	RQD	Rock Quality Designation
DT	Decision Trees	SHAP	Shapley Additive Explanations
EBM	Explainable Boosting Machine	SLR	Simple Linear Regression
EGGE	Environmental Geophysics and Geotechnical Engineering	SNN	Shallow Neural Network
EM	Electromagnetic Methods	SOM	Self-Organizing Map
ERT	Electrical Resistivity Tomography	SP	Self-Potential
FCN	Fully Convolutional Network	SPT/SPT-N	Standard Penetration Test
FIS	Fuzzy Inference System	SRT	Seismic Refraction Tomography
GA	Genetic Algorithm	SVM	Support Vector Machine
GAN	Generative Adversarial Network	TBM	Tunnel Boring Machine
GBM	Gradient Boosting Machine	TDR	Time Domain Reflectometry
GBoost	Gradient Boosting	TEM	Transient Electromagnetic
GEP	Gene Expression Programming	TF	Transformer
GNN	Graph Neural Network	UAV	Unmanned Aerial Vehicle
GPR	Ground-Penetrating Radar	UQ	Uncertainty Quantification
Grad-CAM	Gradient-Weighted Class Activation Mapping	VAE	Variational Autoencoder
GRU	Gated Recurrent Unit	VGG	Visual Geometry Group Network
GSON	Growing Self-Organizing Network	ViTF	Vision Transformer
GWO	Grey Wolf Optimizer	Vp	P-Wave Velocity
IoT	Internet of Things	Vs	Shear-Wave Velocity
IoU	Intersection over Union	XAI	Explainable AI
IP	Induced Polarization	XGBoost	Extreme Gradient Boosting
KNN	K-Nearest Neighbors	κ	Cohen’s Kappa

References

Yu, S.; Ma, J. Deep Learning for Geophysics: Current and Future Trends. Rev. Geophys. 2021, 59, 1–36. [Google Scholar] [CrossRef]
Kiran Pandiri, D.N.; Murugan, R.; Goel, T. Smart soil image classification system using lightweight convolutional neural network. Expert Syst. With Appl. 2024, 238, 122185. [Google Scholar] [CrossRef]
Sheil, B.; Anagnostopoulos, C.; Buckley, R.; et al. Artificial intelligence transformations in geotechnics: progress, challenges and future enablers. Comput. Geotech. 2026, 189, 107604. [Google Scholar] [CrossRef]
Su, C.; Hu, X.; Meng, Q.; et al. A multimodal fusion framework for urban scene understanding and functional identification using geospatial data. Int. J. Appl. Earth Obs. Geoinf. 2024, 127, 103696. [Google Scholar] [CrossRef]
Meju, M.A.; Gallardo, L.A. Structural Coupling Approaches in Integrated Geophysical Imaging. In Integrated Imaging of the Earth: Theory and Applications; Wiley, 2016; pp. 49–67. [Google Scholar]
Balarabe, B.; Bery, A.A.; Teoh, Y.J.; Khalil, A.E. New Empirical Approach for the Estimation of Soil Cohesion and Friction Angle in 2D Form for Site Investigations. Sains Malays. 2022, 51, 405–419. [Google Scholar] [CrossRef]
Kuras, O.; Wilkinson, P.B.; Meldrum, P.I.; et al. Geoelectrical monitoring of simulated subsurface leakage to support high-hazard nuclear decommissioning at the Sellafield Site, UK. Sci. Total Environ. 2016, 566–567, 350–359. [Google Scholar] [CrossRef]
Saadati, G.; Javankhoshdel, S.; Mohebbi Najm Abad, J.; et al. AI-Powered Geotechnics: Enhancing Rock Mass Classification for Safer Engineering Practices. In Rock Mechanics and Rock Engineering; 2024. [Google Scholar] [CrossRef]
Bilgilioğlu, S.S.; Gezgin, C.; Iban, M.C.; et al. Explainable Sinkhole Susceptibility Mapping Using Machine-Learning-Based SHAP: Quantifying and Comparing the Effects of Contributing Factors in Konya, Türkiye. Appl. Sci. 2025, 15, 3139. [Google Scholar] [CrossRef]
Arif, A.; Zhang, C.; Sajib, M.H.; et al. Rock Slope Stability Prediction: A Review of Machine Learning Techniques. Geotech. Geol. Eng. 2025, 43, 124. [Google Scholar] [CrossRef]
Liu, C.; Macedo, J.; Rodríguez, A. Leveraging physics-informed neural networks in geotechnical earthquake engineering: An assessment on seismic site response analyses. Comput. Geotech. 2025, 182, 107137. [Google Scholar] [CrossRef]
Shafapourtehrany, M.; Batur, M.; Özener, H.; et al. Conventional and advanced geospatial techniques for landslide detection and modeling: a comprehensive overview. Geoenvironmental Disasters 2026, 13, 3. [Google Scholar] [CrossRef]
Akingboye, A.S.; Bery, A.A.; Tang, H.; et al. Advancing resistivity–chargeability modeling for complex subsurface characterization using machine learning and deep learning. arXiv (Preprint) arXiv 2025, 2509:1–22. [Google Scholar] [CrossRef]
Dick, M.; Bery, A.A.; Akingboye, A.S.; et al. Integrated Machine Learning Modeling of Seismic, Electrical Resistivity, Induced Polarization, and SPT-N Data for Subsurface Integrity Assessment in Granitic Terrain. In Earth Systems and Environment; 2025. [Google Scholar] [CrossRef]
Ali, M.; Zhu, P.; Huolin, M.; et al. Data-driven machine learning approaches for precise lithofacies identification in complex geological environments. In Geo-spatial Information Science; 2024; pp. 1–21. [Google Scholar] [CrossRef]
Aydın, Y.; Işıkdağ, Ü.; Bekdaş, G.; et al. Use of Machine Learning Techniques in Soil Classification. Sustainability 2023, 15, 2374. [Google Scholar] [CrossRef]
Lu, P.; Morris, M.; Brazell, S.; et al. Using generative adversarial networks to improve deep-learning fault interpretation networks. Lead. Edge 2018, 37, 578–583. [Google Scholar] [CrossRef]
Zhao, S.; Chen, Z.; Xiong, Z.; et al. Beyond Grid Data: Exploring graph neural networks for Earth observation. IEEE Geosci. Remote Sens. Mag. 2025, 13, 175–208. [Google Scholar] [CrossRef]
Alrabayah, O.; Caus, D.; Watson, R.A.; et al. Deep-Learning-Based Automatic Sinkhole Recognition: Application to the Eastern Dead Sea. Remote Sens. 2024, 16. [Google Scholar] [CrossRef]
Zhang, Z.; Hu, Q.; Fang, H.; et al. TriGEFNet: A Tri-Stream Multimodal Enhanced Fusion Network for Landslide Segmentation from Remote Sensing Imagery. Remote Sens. 2026, 18, 186. [Google Scholar] [CrossRef]
Zhou, Z.; Gerstoft, P.; Olsen, K. Graph-learning approach to combine multiresolution seismic velocity models. Geophys. J. Int. 2024, 238, 1353–1365. [Google Scholar] [CrossRef]
Ding, Y.; Chen, S.; Li, X.; et al. Physics-constrained neural networks for half-space seismic wave modeling. Comput. Geosci. 2023, 181, 105477. [Google Scholar] [CrossRef]
Chen, X.-X.; Zhang, P.; Yin, Z.-Y. Physics-Informed neural network solver for numerical analysis in geoengineering. Georisk Assess. Manag. Risk Eng. Syst. Geohazards 2024, 18, 33–51. [Google Scholar] [CrossRef]
Onyelowe, K.C.; Mojtahedi, F.F.; Ebid, A.M.; et al. Selected AI optimization techniques and applications in geotechnical engineering. Cogent Eng. 2023, 10. [Google Scholar] [CrossRef]
Khatti, J.; Grover, D.K.S. Prediction of Geotechnical Properties of Soil using Artificial Intelligence Framework. Int. J. Recent Technol. Eng. (IJRTE) 2021, 10, 218–227. [Google Scholar] [CrossRef]
Linck, R.; Kale, M.; Stele, A.; Schlechtriem, J. Testing the Applicability of Drone-Based Ground-Penetrating Radar for Archaeological Prospection. Remote Sens. 2025, 17, 1498. [Google Scholar] [CrossRef]
Bala, G.A.; Bery, A.A.; Dick, M.D.; et al. Modeling subsurface geotechnical integrity via interpolated resistivity–chargeability and SPT datasets with machine learning: A case study from Perak, Malaysia. Phys. Chem. Earth Parts A/B/C 2025, 141, 104093. [Google Scholar] [CrossRef]
Hasan, M.; Su, L.; Cui, P.; Shang, Y. Development of deep-underground engineering structures via 2D and 3D RQD prediction using non-invasive CSAMT. Sci. Rep. 2025, 15, 1403. [Google Scholar] [CrossRef]
Hoek, E.; Diederichs, M.S. Empirical estimation of rock mass modulus. Int. J. Rock. Mech. Min. Sci. 2006, 43, 203–215. [Google Scholar] [CrossRef]
Campos Montero, F.A.; Zuada Coelho, B.; Smyrniou, E.; et al. SchemaGAN: A conditional Generative Adversarial Network for geotechnical subsurface schematisation. Comput. Geotech. 2025, 183, 107177. [Google Scholar] [CrossRef]
Wu, L.; Li, J.; Zhang, J.; et al. Prediction model for the compressive strength of rock based on stacking ensemble learning and shapley additive explanations. Bull. Eng. Geol. Environ. 2024, 83, 439. [Google Scholar] [CrossRef]
Shahani, N.M.; Zheng, X.; Guo, X.; Wei, X. Machine Learning-Based Intelligent Prediction of Elastic Modulus of Rocks at Thar Coalfield. Sustainability 2022, 14, 3689. [Google Scholar] [CrossRef]
Akingboye, A.S.; Bery, A.A. Rock mass quality evaluation via statistically optimized geophysical datasets. Bull. Eng. Geol. Environ. 2023, 82, 376. [Google Scholar] [CrossRef]
Akinlalu, A.A.; Futai, M.M.; Afolabi, D.O.; Abraham-A, R.M. A review on the application of geophysical methods in civil engineering studies. Geosystems Geoenvironment 2026, 5, 100453. [Google Scholar] [CrossRef]
Giraud, J.; Lindsay, M.; Jessell, M.; Ogarko, V. Towards plausible lithological classification from geophysical inversion: Honouring geological principles in subsurface imaging. Solid Earth 2020, 11, 419–436. [Google Scholar] [CrossRef]
Akingboye, A.S. Electrical and seismic refraction methods: Fundamental concepts, current trends, and emerging machine learning prospects. Discov. Geosci. 2025, 3, 87. [Google Scholar] [CrossRef]
Asadi, A.; Baise, L.G.; Sanon, C.; et al. Semi-Supervised Learning Method for the Augmentation of an Incomplete Image-Based Inventory of Earthquake-Induced Soil Liquefaction Surface Effects. Remote Sens. 2023, 15. [Google Scholar] [CrossRef]
Rasht-Behesht, M.; Huber, C.; Shukla, K.; Karniadakis, G.E. Physics-Informed Neural Networks (PINNs) for Wave Propagation and Full Waveform Inversions. J. Geophys. Res. Solid Earth 2022, 127, 1–21. [Google Scholar] [CrossRef]
Zhou, H.; Wu, H.; Sheil, B.; Wang, Z. A self-adaptive physics-informed neural networks method for large strain consolidation analysis. Comput. Geotech. 2025, 181, 107131. [Google Scholar] [CrossRef]
Zhang, R.; Zhu, W.; Li, Z.; et al. Re-Net: Multibranch Network With Structural Reparameterization for Landslide Detection in Optical Imagery. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 2828–2837. [Google Scholar] [CrossRef]
Yin, H.; Carroll, K.C.; Yuan, Y.; et al. Hybrid Vision Transformer With Convolutional Blocks Approach for Subsurface Electrical Resistivity Tomography Inversion. J. Geophys. Res. Mach. Learn. Comput. 2025, 2, 1–24. [Google Scholar] [CrossRef]
Baghbani, A.; Choudhury, T.; Costa, S.; Reiner, J. Application of artificial intelligence in geotechnical engineering: A state-of-the-art review. Earth-Sci. Rev. 2022, 228, 103991. [Google Scholar] [CrossRef]
Al-Aghbary, M.; Sobh, M.; Gerhards, C. A geothermal heat flow model of Africa based on random forest regression. Front. Earth Sci. 2022, 10. [Google Scholar] [CrossRef]
Ge, C.; Qin, S. Urban flooding digital twin system framework. Syst. Sci. Control Eng. 2025, 13. [Google Scholar] [CrossRef]
Li, Y.; Liu, X.; Zhou, J.; et al. Artificial intelligence in traditional Chinese medicine: advances in multi-metabolite multi-target interaction modeling. Front. Pharmacol. 2025, 16, 1–18. [Google Scholar] [CrossRef]
Saneiyan, S.; Mansourian, D. Locating undocumented orphaned oil and gas wells with smartphones. J. Appl. Geophys. 2023, 219. [Google Scholar] [CrossRef]
Akingboye, A.S.; Bery, A.A.; Kayode, J.S.; et al. Near-Surface Crustal Architecture and Geohydrodynamics of the Crystalline Basement Terrain of Araromi, Akungba-Akoko, SW Nigeria, Derived from Multi-Geophysical Methods. Nat. Resour. Res. 2022, 31, 215–236. [Google Scholar] [CrossRef]
Kemna, A.; Binley, A.; Slater, L. Crosshole IP imaging for engineering and environmental applications. GEOPHYSICS 2004, 69, 97–107. [Google Scholar] [CrossRef]
Chalikakis, K.; Plagnes, V.; Guerin, R.; et al. Contribution of geophysical methods to karst-system exploration: an overview. Hydrogeol. J. 2011, 19, 1169–1180. [Google Scholar] [CrossRef]
Müller, D.; Kwan, K.; Groves, D.I. Geophysical implications for the exploration of concealed orogenic gold deposits: A case study in the Sandy Lake and Favourable Lake Archean greenstone belts, Superior Province, Ontario, Canada. Ore Geol. Rev. 2021, 128, 103892. [Google Scholar] [CrossRef]
Davis, G.B.; Rayner, J.L.; Donn, M.J. Advancing “Autonomous” sensing and prediction of the subsurface environment: a review and exploration of the challenges for soil and groundwater contamination. Environ. Sci. Pollut. Res. 2023, 30, 19520–19535. [Google Scholar] [CrossRef]
Sreelakshmi, S.; Vinod Chandra, S. ~S. A hybrid fusion network using convolutional vision transformers for landslide identification. Expert Syst. With Appl. 2026, 298, 129688. [Google Scholar] [CrossRef]
Sharma, S.; Ahmed, S.; Naseem, M.; et al. A Survey on Applications of Artificial Intelligence for Pre-Parametric Project Cost and Soil Shear-Strength Estimation in Construction and Geotechnical Engineering. Sensors 2021, 21, 463. [Google Scholar] [CrossRef]
Rane, N.; Choudhary, S.; Rane, J. Leading-edge Artificial Intelligence (AI) and Internet of Things (IoT) technologies for enhanced geotechnical site characterization. SSRN Electron. J. 2023. [Google Scholar] [CrossRef]
Jackisch, R.; Heincke, B.H.; Zimmermann, R.; et al. Drone-based magnetic and multispectral surveys to develop a 3D model for mineral exploration at Qullissat, Disko Island, Greenland. Solid Earth 2022, 13, 793–825. [Google Scholar] [CrossRef]
Razavi-Termeh, S.V.; Pourzangbar, A.; Sadeghi-Niaraki, A.; et al. Metaheuristic-driven enhancement of categorical boosting algorithm for flood-prone areas mapping. Int. J. Appl. Earth Obs. Geoinf. 2025, 136, 104357. [Google Scholar] [CrossRef]
Gholizadeh, A.; Saberioon, M.; Ben-Dor, E.; Borůvka, L. Monitoring of selected soil contaminants using proximal and remote sensing techniques: Background, state-of-the-art and future perspectives. Crit. Rev. Environ. Sci. Technol. 2018, 48, 243–278. [Google Scholar] [CrossRef]
Omolaiye, G.E.; Oladapo, I.M.; Ayolabi, A.E.; et al. Integration of remote sensing, GIS and 2D resistivity methods in groundwater development. Appl. Water Sci. 2020, 10, 1–24. [Google Scholar] [CrossRef]
Depina, I.; Jain, S.; Mar Valsson, S.; Gotovac, H. Application of physics-informed neural networks to inverse problems in unsaturated groundwater flow. Georisk Assess. Manag. Risk Eng. Syst. Geohazards 2022, 16, 21–36. [Google Scholar] [CrossRef]
Phoon, K.-K. The story of statistics in geotechnical engineering. Georisk Assess. Manag. Risk Eng. Syst. Geohazards 2020, 14, 3–25. [Google Scholar] [CrossRef]
Pradhan, B.; Lee, S. Delineation of landslide hazard areas on Penang Island, Malaysia, by using frequency ratio, logistic regression, and artificial neural network models. Environ. Earth Sci. 2010, 60, 1037–1054. [Google Scholar] [CrossRef]
Phoon, K.K.; Zhang, L.M.; Cao, Z.J. Special issue on “Machine learning and AI in geotechnics.”. Georisk Assess. Manag. Risk Eng. Syst. Geohazards 2023, 17, 1–6. [Google Scholar] [CrossRef]
Pradhan, B.; Lee, S. Regional landslide susceptibility analysis using back-propagation neural network model at Cameron Highland, Malaysia. Landslides 2010, 7, 13–30. [Google Scholar] [CrossRef]
Elseicy, A.; Solla, M.; Balado, J.; et al. Enhancing Reinforced Concrete Bridge Health Monitoring: A Case Study on the Integration of InSAR, GPR, and LiDAR within 3D GIS Environment. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2024, X-4/W5-202, 155–161. [Google Scholar] [CrossRef]
Yin, Z.Y.; Jin, Y.F.; Shen, J.S.; Hicher, P.Y. Optimization techniques for identifying soil parameters in geotechnical engineering: Comparative study and enhancement. Int. J. Numer. Anal. Methods Geomech. 2018, 42, 70–94. [Google Scholar] [CrossRef]
Akingboye, A.S.; Bery, A.A.; Tang, H.; et al. Deciphering near-surface architecture and landslide triggers in granitic environments: A regression-driven multiphysics modeling framework for geoengineering integrity. Phys. Chem. Earth Parts A/B/C 2025, 140, 104040. [Google Scholar] [CrossRef]
Patel, A. Geotechnical Investigations and Improvement of Ground Conditions; Elsevier, 2019. [Google Scholar]
Habel, W.R.; Krebber, K. Fiber-optic sensor applications in civil and geotechnical engineering. Photonic Sens. 2011, 1, 268–280. [Google Scholar] [CrossRef]
Hong, C.; Luo, G.; Chen, W. Safety analysis of a deep foundation ditch using deep learning methods. Gondwana Res. 2023, 123, 16–26. [Google Scholar] [CrossRef]
Lv, P.; Ma, L.; Li, Q.; Du, F. ShapeFormer: A Shape-Enhanced Vision Transformer Model for Optical Remote Sensing Image Landslide Detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2023, 16, 2681–2689. [Google Scholar] [CrossRef]
Chowdhury, R.; Bhattacharya, G.; Metya, S. Geotechnical Slope Analysis; CRC Press: London, 2023. [Google Scholar]
Shahin, M.A. State-of-the-art review of some artificial intelligence applications in pile foundations. Geosci. Front. 2016, 7, 33–44. [Google Scholar] [CrossRef]
Shao, W.; Yue, W.; Zhang, Y.; et al. The Application of Machine Learning Techniques in Geotechnical Engineering: A Review and Comparison. Mathematics 2023, 11, 1–16. [Google Scholar] [CrossRef]
Akingboye, A.S.; Bery, A.A.; Aminu, M.B.; et al. Surface–subsurface characterization via interfaced geophysical–geotechnical and optimized regression modeling. Model. Earth Syst. Environ. 2024, 10, 5121–5143. [Google Scholar] [CrossRef]
Akingboye, A.S.; Bery, A.A.; Tang, H.; et al. Machine learning-driven velocity–resistivity modeling: A novel explicit framework for near-surface characterization. J. Appl. Geophys. 2025, 243, 105955. [Google Scholar] [CrossRef]
Zhang, W.; Li, H.; Tang, L.; et al. Displacement prediction of Jiuxianping landslide using gated recurrent unit (GRU) networks. Acta Geotech. 2022, 17, 1367–1382. [Google Scholar] [CrossRef]
Jong, S.C.; Ong, D.E.L.; Oh, E. State-of-the-art review of geotechnical-driven artificial intelligence techniques in underground soil-structure interaction. Tunn. Undergr. Space Technol. 2021, 113, 103946. [Google Scholar] [CrossRef]
Harle, S.M.; Wankhade, R.L. Machine learning techniques for predictive modelling in geotechnical engineering: a succinct review. Discov. Civ. Eng. 2025, 2, 86. [Google Scholar] [CrossRef]
Dikshit, A.; Pradhan, B.; Alamri, A.M. Pathways and challenges of the application of artificial intelligence to geohazards modelling. Gondwana Res. 2021, 100, 290–301. [Google Scholar] [CrossRef]
Wang, N.; Zhang, H.; Dahal, A.; et al. On the use of explainable AI for susceptibility modeling: Examining the spatial pattern of SHAP values. Geosci. Front. 2024, 15, 101800. [Google Scholar] [CrossRef]
Gevaert, C.M. Explainable AI for earth observation: A review including societal and regulatory perspectives. Int. J. Appl. Earth Obs. Geoinf. 2022, 112, 102869. [Google Scholar] [CrossRef]
Guan, Z.; Wang, Y. Fusion of three-dimensional geotechnical and geophysical data for developing digital twin of underground space. Soils Found. 2024, 64, 101528. [Google Scholar] [CrossRef]
Searle, R.; McBratney, A.; Grundy, M.; et al. Digital soil mapping and assessment for Australia and beyond: A propitious future. Geoderma Reg. 2021, 24. [Google Scholar] [CrossRef]
Gallardo, L.A.; Meju, M.A. Characterization of heterogeneous near-surface materials by joint 2D inversion of DC resistivity and seismic data. Geophys. Res. Lett. 2003, 30. [Google Scholar] [CrossRef]
Huang, S.; Zhou, J. Cutting—Edge Soft Computing Technologies for Rock Mass Excavatability: Transforming Prediction with Hybrid GA—MLP and GEP—Based Criteria. In Rock Mechanics and Rock Engineering; 2025. [Google Scholar] [CrossRef]
Doyoro, Y.G.; Gelena, S.K.; Lin, C. Improving subsurface structural interpretation in complex geological settings through geophysical imaging and machine learning. Eng. Geol. 2025, 344, 107839. [Google Scholar] [CrossRef]
Radwan, A.E.; Wood, D.A.; Radwan, A.A. Machine learning and data-driven prediction of pore pressure from geophysical logs: A case study for the Mangahewa gas field, New Zealand. J. Rock. Mech. Geotech. Eng. 2022, 14, 1799–1809. [Google Scholar] [CrossRef]
Shahin, M.A. Artificial Intelligence in Geotechnical Engineering: Applications, Modeling Aspects, and Future Directions. In Metaheuristics in Water, Geotechnical and Transport Engineering, First Edit. ed; Elsevier, 2013; pp. 169–204. [Google Scholar]
Jalal, F.E.; Iqbal, M.; Khan, W.A.; et al. ANN-based swarm intelligence for predicting expansive soil swell pressure and compression strength. Sci. Rep. 2024, 14, 14597. [Google Scholar] [CrossRef]
Swetha, R.K.; Bende, P.; Singh, K.; et al. Predicting soil texture from smartphone-captured digital images and an application. Geoderma 2020, 376, 114562. [Google Scholar] [CrossRef]
Penta de Peppo, G.; Cercato, M.; De Donno, G. Cross-gradient joint inversion and clustering of ERT and SRT data on structured meshes incorporating topography. Geophys. J. Int. 2024, 239, 1155–1169. [Google Scholar] [CrossRef]
Steuer, A.; Smirnova, M.; Becken, M.; et al. Comparison of novel semi-airborne electromagnetic data with multi-scale geophysical, petrophysical and geological data from Schleiz, Germany. J. Appl. Geophys. 2020, 182. [Google Scholar] [CrossRef]
Brito da Silva, L.E.; Elnabarawy, I.; Wunsch, D.C. A survey of adaptive resonance theory neural network models for engineering applications. Neural Netw. 2019, 120, 167–203. [Google Scholar] [CrossRef]
Duan, L.; Xiong, D.; Lee, J.; Guo, F. A Local Density Based Spatial Clustering Algorithm with Noise. In 2006 IEEE International Conference on Systems, Man and Cybernetics; IEEE, 2006; pp. 4061–4066. [Google Scholar]
Shebl, A.; Abdellatif, M.; Hissen, M.; et al. Lithological mapping enhancement by integrating Sentinel 2 and gamma-ray data utilizing support vector machine: A case study from Egypt. Int. J. Appl. Earth Obs. Geoinf. 2021, 105, 102619. [Google Scholar] [CrossRef]
Delforge, D.; Watlet, A.; Kaufmann, O.; et al. Time-series clustering approaches for subsurface zonation and hydrofacies detection using a real time-lapse electrical resistivity dataset. J. Appl. Geophys. 2021, 184. [Google Scholar] [CrossRef]
Charles Komadja, G.; Westman, E.; Rana, A.; Vitalis, A. Predicting rock mass strength from drilling data using synergistic unsupervised and supervised machine learning approaches. 2025, 18, 325. [Google Scholar] [CrossRef]
Warrens, M.J.; van der Hoef, H. Understanding the Adjusted Rand Index and Other Partition Comparison Indices Based on Counting Object Pairs. J. Classif. 2022, 39, 487–509. [Google Scholar] [CrossRef]
Zhao, Z.; Feng, W.; Xiao, J.; et al. Rapid and Accurate Prediction of Soil Texture Using an Image-Based Deep Learning Autoencoder Convolutional Neural Network Random Forest (DLAC-CNN-RF) Algorithm. Agronomy 2022, 12, 3063. [Google Scholar] [CrossRef]
Yuan, B.; Choo, C.S.; Yeo, L.Y.; et al. Physics-informed machine learning in geotechnical engineering: a direction paper. Geomech. Geoengin. 2025, 20, 1128–1159. [Google Scholar] [CrossRef]
Schuster, G.T.; Chen, Y.; Feng, S. Review of physics-informed machine-learning inversion of geophysical data. GEOPHYSICS 2024, 89, T337–T356. [Google Scholar] [CrossRef]
Balogun, A.-L.L.; Rezaie, F.; Pham, Q.B.; et al. Spatial prediction of landslide susceptibility in western Serbia using hybrid support vector regression (SVR) with GWO, BAT and COA algorithms. Geosci. Front. 2021, 12, 101104. [Google Scholar] [CrossRef]
Neupane, B.; Horanont, T.; Aryal, J. Deep Learning-Based Semantic Segmentation of Urban Features in Satellite Images: A Review and Meta-Analysis. Remote Sens. 2021, 13, 808. [Google Scholar] [CrossRef]
Yaghoubi, E.; Yaghoubi, E.; Khamees, A.; et al. A systematic review and meta-analysis of machine learning, deep learning, and ensemble learning approaches in predicting EV charging behavior. 2024. [Google Scholar] [CrossRef]
Sang, X.; Xue, L.; Ran, X.; et al. Intelligent High-Resolution Geological Mapping Based on SLIC-CNN. ISPRS Int. J. Geo-Inf. 2020, 9, 99. [Google Scholar] [CrossRef]
Chou, J.-S.; Nguyen, H.-M.; Phan, H.-P.; Wang, K.-L. Predicting deep-seated landslide displacement on Taiwan’s Lushan through the integration of convolutional neural networks and the Age of Exploration-Inspired Optimizer. Nat. Hazards Earth Syst. Sci. 2025, 25, 119–146. [Google Scholar] [CrossRef]
Zhong, Z.; Sun, A.Y.; Wu, X. Inversion of Time-Lapse Seismic Reservoir Monitoring Data Using CycleGAN: A Deep Learning-Based Approach for Estimating Dynamic Reservoir Property Changes. J. Geophys. Res. Solid Earth 2020, 125, 1–27. [Google Scholar] [CrossRef]
Abbaszadeh Shahri, A.; Chunling, S.; Larsson, S. A hybrid ensemble-based automated deep learning approach to generate 3D geo-models and uncertainty analysis. Eng. With Comput. 2024, 40, 1501–1516. [Google Scholar] [CrossRef]
Jianliang, W.; Iqbal, I.; Sanxi, P.; et al. Integrated Geophysical Survey in Defining Subsidence Features of Glauber’s Salt Mine, Gansu Province in China. Geotech. Geol. Eng. 2022, 40, 325–334. [Google Scholar] [CrossRef]
Ghorbanzadeh, O.; Shahabi, H.; Crivellari, A.; et al. Landslide detection using deep learning and object-based image analysis. Landslides 2022, 19, 929–939. [Google Scholar] [CrossRef]
Shirmard, H.; Farahbakhsh, E.; Heidari, E.; et al. A Comparative Study of Convolutional Neural Networks and Conventional Machine Learning Models for Lithological Mapping Using Remote Sensing Data. Remote Sens. 2022, 14, 819. [Google Scholar] [CrossRef]
Yao, P.; Yu, Z.; Zhang, Y.; Xu, T. Application of machine learning in carbon capture and storage: An in-depth insight from the perspective of geoscience. Fuel 2023, 333, 126296. [Google Scholar] [CrossRef]
He, Z.; Liu, H.; Wang, Y.; Hu, J. Generative Adversarial Networks-Based Semi-Supervised Learning for Hyperspectral Image Classification. Remote Sens. 2017, 9, 1042. [Google Scholar] [CrossRef]
Mojtahedi, F.F.; Yousefpour, N.; Chow, S.H.; Cassidy, M. Deep Learning for Time Series Forecasting: Review and Applications in Geotechnics and Geosciences. Arch. Comput. Methods Eng. 2025, 32, 3415–3445. [Google Scholar] [CrossRef]
Aleardi, M.; Vinciguerra, A.; Hojat, A. A convolutional neural network approach to electrical resistivity tomography. J. Appl. Geophys. 2021, 193, 104434. [Google Scholar] [CrossRef]
Liu, B.; Guo, Q.; Li, S.; et al. Deep Learning Inversion of Electrical Resistivity Data. IEEE Trans. Geosci. Remote Sens. 2020, 58, 5715–5728. [Google Scholar] [CrossRef]
Zhang, W.; Li, H.; Li, Y.; et al. Application of deep learning algorithms in geotechnical engineering: a short critical review. Artif. Intell. Rev. 2021, 54, 5633–5673. [Google Scholar] [CrossRef]
Iqbal, N.; Rizwan, A.; Khan, A.N.; et al. Boreholes Data Analysis Architecture Based on Clustering and Prediction Models for Enhancing Underground Safety Verification. IEEE Access 2021, 9, 78428–78451. [Google Scholar] [CrossRef]
Zheng, H.; Liu, B.; Han, S.; et al. Research on landslide hazard spatial prediction models based on deep neural networks: a case study of northwest Sichuan, China. Environ. Earth Sci. 2022, 81, 258. [Google Scholar] [CrossRef]
Kundu, S.K.; Dey, A.K.; Sapkota, S.C.; et al. Advanced predictive modelling of electrical resistivity for geotechnical and geo-environmental applications using machine learning techniques. J. Appl. Geophys. 2024, 231, 105557. [Google Scholar] [CrossRef]
Fitz, S.; Romero, P. Neural Networks and Deep Learning: A Paradigm Shift in Information Processing, Machine Learning, and Artificial Intelligence. In The Palgrave Handbook of Technological Finance; Springer International Publishing: Cham, 2021; pp. 589–654. [Google Scholar]
Kurani, A.; Doshi, P.; Vakharia, A.; Shah, M. A Comprehensive Comparative Study of Artificial Neural Network (ANN) and Support Vector Machines (SVM) on Stock Forecasting. Ann. Data Sci. 2023, 10, 183–208. [Google Scholar] [CrossRef]
Das, S.K. Artificial Neural Networks in Geotechnical Engineering: Modeling and Application Issues. In Metaheuristics in Water, Geotechnical and Transport Engineering, First Edit. ed; Elsevier, 2013; pp. 231–270. [Google Scholar]
Kim, T.; Shin, J.-Y.; Kim, H.; et al. The Use of Large-Scale Climate Indices in Monthly Reservoir Inflow Forecasting and Its Application on Time Series and Artificial Intelligence Models. Water 2019, 11, 374. [Google Scholar] [CrossRef]
Mohebali, B.; Tahmassebi, A.; Meyer-Baese, A.; Gandomi, A.H. Probabilistic neural networks. In Handbook of Probabilistic Models; Elsevier, 2020; pp. 347–367. [Google Scholar]
Karpatne, A.; Ebert-Uphoff, I.; Ravela, S.; et al. Machine Learning for the Geosciences: Challenges and Opportunities. IEEE Trans. Knowl. Data Eng. 2019, 31, 1544–1554. [Google Scholar] [CrossRef]
Hemdan, E.E.-D.; Al-Atroush, M.E. An efficient IoT-based soil image recognition system using hybrid deep learning for smart geotechnical and geological engineering applications. Multimed. Tools Appl. 2024, 83, 66591–66612. [Google Scholar] [CrossRef]
Moosavi, M.; Yazdanpanah, M.J.; Doostmohammadi, R. Modeling the cyclic swelling pressure of mudrock using artificial neural networks. Eng. Geol. 2006, 87, 178–194. [Google Scholar] [CrossRef]
Khanlari, G.R.; Heidari, M.; Momeni, A.A.; Abdilor, Y. Prediction of shear strength parameters of soils using artificial neural networks and multivariate regression methods. Eng. Geol. 2012, 131–132, 11–18. [Google Scholar] [CrossRef]
Attia, M.; Tsai, F.T.C. Successive bootstrapping deep learning approach and airborne EM-borehole data fusion to understand salt water in the Mississippi River Valley Alluvial Aquifer. Sci. Total Environ. 2024, 932, 172950. [Google Scholar] [CrossRef]
Elakiya, N.; Keerthana, G. Application of Artificial Neural Networks in Soil Science Research. Arch. Curr. Res. Int. 2024, 24, 1–15. [Google Scholar] [CrossRef]
Ge, Y.; Liu, G.; Tang, H.; et al. Comparative analysis of five convolutional neural networks for landslide susceptibility assessment. Bull. Eng. Geol. Environ. 2023, 82, 377. [Google Scholar] [CrossRef]
Liu, M.; Liao, S.; Yang, Y.; et al. Tunnel boring machine vibration-based deep learning for the ground identification of working faces. J. Rock. Mech. Geotech. Eng. 2021, 13, 1340–1357. [Google Scholar] [CrossRef]
Pi, W.; Du, J.; Bi, Y.; et al. 3D-CNN based UAV hyperspectral imagery for grassland degradation indicator ground object classification research. Ecol. Inform. 2021, 62, 101278. [Google Scholar] [CrossRef]
Guo, Q.M.; Zhan, L.T.; Yin, Z.Y.; et al. Correlation of excavated soil multi-source heterogeneous data using multimodal diffusion model. Acta Geotech. 2025, 20, 4977–5005. [Google Scholar] [CrossRef]
Xi, N.; Yang, Q.; Sun, Y.; Mei, G. Machine Learning Approaches for Slope Deformation Prediction Based on Monitored Time-Series Displacement Data: A Comparative Investigation. Appl. Sci. 2023, 13, 4677. [Google Scholar] [CrossRef]
Azizi, A.; Gilandeh, Y.A.; Mesri-Gundoshmian, T.; et al. Classification of soil aggregates: A novel approach based on deep learning. Soil. Tillage Res. 2020, 199, 104586. [Google Scholar] [CrossRef]
Guerri, M.F.; Distante, C.; Spagnolo, P.; Taleb-Ahmed, A. Boosting hyperspectral image classification with Gate-Shift-Fuse mechanisms in a novel CNN-Transformer approach. Comput. Electron. Agric. 2025, 237, 110489. [Google Scholar] [CrossRef]
Ran, X.; Xue, L.; Zhang, Y.; et al. Rock classification from field image patches analyzed using a deep convolutional neural network. Mathematics 2019, 7, 1–16. [Google Scholar] [CrossRef]
Dey, B.; Ferdous, J.; Ahmed, R. Machine learning based recommendation of agricultural and horticultural crop farming in India under the regime of NPK, soil pH and three climatic variables. Heliyon 2024, 10, e25112. [Google Scholar] [CrossRef] [PubMed]
Latifovic, R.; Pouliot, D.; Campbell, J. Assessment of Convolution Neural Networks for Surficial Geology Mapping in the South Rae Geological Region, Northwest Territories, Canada. Remote Sens. 2018, 10, 307. [Google Scholar] [CrossRef]
Lin, C.S.; Chen, S.H.; Chang, C.M.; Shen, T.W. Crack detection on a retaining wall with an innovative, ensemble learning method in a dynamic imaging system. Sensors 2019, 19. [Google Scholar] [CrossRef]
Golding, V.P.; Gharineiat, Z.; Munawar, H.S.; Ullah, F. Crack Detection in Concrete Structures Using Deep Learning. Sustainability 2022, 14, 8117. [Google Scholar] [CrossRef]
Yu, Y.; Xu, T.; Shen, Z.; et al. Compressive spectral imaging system for soil classification with three-dimensional convolutional neural network. Opt. Express 2019, 27, 23029. [Google Scholar] [CrossRef] [PubMed]
Srivastava, P.; Shukla, A.; Bansal, A. A comprehensive review on soil classification using deep learning and computer vision techniques. Multimed. Tools Appl. 2021, 80, 14887–14914. [Google Scholar] [CrossRef]
Tai, L.; Zhang, J.; Liu, M.; et al. A Survey of Deep Network Solutions for Learning Control in Robotics: From Reinforcement to Imitation. 2016. [Google Scholar] [CrossRef]
Tsantekidis, A.; Passalis, N.; Tefas, A. Front Matter. In Deep Learning for Robot Perception and Cognition; Elsevier, 2022; pp. i–iii. [Google Scholar]
Liu, H.; Su, H.; Sun, L.; Dias-Da-Costa, D. State-of-the-art review on the use of AI-enhanced computational mechanics in geotechnical engineering. Artif. Intell. Rev. 2024, 57, 196. [Google Scholar] [CrossRef]
Mienye, I.D.; Swart, T.G.; Obaido, G. Recurrent Neural Networks: A Comprehensive Review of Architectures, Variants, and Applications. Information 2024, 15, 517. [Google Scholar] [CrossRef]
Wang, Z.Z.; Zhang, J.; Huang, H. Interpreting random fields through the U-Net architecture for failure mechanism and deformation predictions of geosystems. Geosci. Front. 2024, 15, 101720. [Google Scholar] [CrossRef]
He, Y.; Semnani, S.J. Machine learning based modeling of path-dependent materials for finite element analysis. Comput. Geotech. 2023, 156, 105254. [Google Scholar] [CrossRef]
Mittal, M.; Kumar, K.; Behal, S. Deep learning approaches for detecting DDoS attacks: a systematic review. Soft Comput. 2023, 27, 13039–13075. [Google Scholar] [CrossRef]
Santoso, B.; Anggraeni, W.; Pariaman, H.; Purnomo, M.H. RNN-Autoencoder Approach for Anomaly Detection in Power Plant Predictive Maintenance Systems. Int. J. Intell. Eng. Syst. 2022, 15, 363–381. [Google Scholar] [CrossRef]
Yu, W.; Kim, I.Y.; Mechefske, C. Analysis of different RNN autoencoder variants for time series classification and machine prognostics. Mech. Syst. Signal Process. 2021, 149, 107322. [Google Scholar] [CrossRef]
Jozefowicz, R.; Zaremba, W.; Sutskever, I. An empirical exploration of Recurrent Network architectures. In Proceedings of the 32nd Int. Conf. Mach. Learn. (ICML 2015), 2015; Volume 3, pp. 2332–2340. [Google Scholar]
Yang, C.; Yin, Y.; Zhang, J.; et al. A graph deep learning method for landslide displacement prediction based on global navigation satellite system positioning. Geosci. Front. 2024, 15, 101690. [Google Scholar] [CrossRef]
Yu, Y.; Si, X.; Hu, C.; Zhang, J. A Review of Recurrent Neural Networks: LSTM Cells and Network Architectures. Neural Comput. 2019, 31, 1235–1270. [Google Scholar] [CrossRef]
Niu, X.; Ma, J.; Wang, Y.; et al. A novel decomposition-ensemble learning model based on ensemble empirical mode decomposition and recurrent neural network for landslide displacement prediction. Appl. Sci. 2021, 11, 1–18. [Google Scholar] [CrossRef]
Li, J.; Tang, B.; Zhang, Y.; et al. Study on the influence of lead ions on soil fissure development and intelligent prediction model under dry–wet cycle conditions. Environ. Earth Sci. 2025, 84. [Google Scholar] [CrossRef]
Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; et al. Generative adversarial nets. Adv. Neural Inf. Process. Syst. 2014, 3, 2672–2680. [Google Scholar] [CrossRef]
Ruan, D.; Chen, X.; Gühmann, C.; Yan, J. Improvement of Generative Adversarial Network and Its Application in Bearing Fault Diagnosis: A Review. Lubricants 2023, 11, 1–21. [Google Scholar] [CrossRef]
Guo, S.; Wang, B.; Zhang, P.; et al. Influence analysis and relationship evolution between construction parameters and ground settlements induced by shield tunneling under soil-rock mixed-face conditions. Tunn. Undergr. Space Technol. 2023, 134, 105020. [Google Scholar] [CrossRef]
Lyu, B.; Wang, Y.; Shi, C. Multi-scale generative adversarial networks (GAN) for generation of three-dimensional subsurface geological models from limited boreholes and prior geological knowledge. Comput. Geotech. 2024, 170, 106336. [Google Scholar] [CrossRef]
Marano, G.C.; Rosso, M.M.; Aloisio, A.; Cirrincione, G. Generative adversarial networks review in earthquake-related engineering fields. Bull. Earthq. Eng. 2024, 22, 3511–3562. [Google Scholar] [CrossRef]
Yan, Y.; Jiang, L.; Li, J.; et al. Local Information-Driven Hierarchical Fusion of SAR and Visible Images via Refined Modal Salient Features. Remote Sens. 2025, 17, 2466. [Google Scholar] [CrossRef]
Biswas, A.; Md Abdullah Al, N.; Imran, A.; et al. Generative Adversarial Networks for Data Augmentation. In Data Driven Approaches on Medical Imaging; Springer Nature Switzerland: Cham, 2023; pp. 159–177. [Google Scholar]
Smyrniou, E.; Coelho, B.Z. Using Generative Adversarial Networks to create a 2D subsoil schematization. ISMLG 2023. [Google Scholar]
Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; et al. Physics-informed machine learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
Zhang, H.; Song, B.; Zuo, L.; Li, L. Domain-decomposed physics-informed neural network for one-dimensional soil consolidation under multi-step surcharge loading. Transp. Geotech. 2025, 55, 101722. [Google Scholar] [CrossRef]
Zhang, S.; Zhang, C.; Han, X.; Wang, B. MRF-PINN: a multi-receptive-field convolutional physics-informed neural network for solving partial differential equations. Comput. Mech. 2025, 75, 1137–1163. [Google Scholar] [CrossRef]
Wang, C.; Song, L.-h.; Yuan, Z.; Fan, J.-s. State-of-the-art AI-based computational analysis in civil engineering. J. Ind. Inf. Integr. 2023, 33, 100470. [Google Scholar] [CrossRef]
Okazaki, T.; Hirahara, K.; Ito, T.; et al. Physics-Informed Deep Learning for Forward and Inverse Modeling of Inplane Crustal Deformation. J. Geophys. Res. Mach. Learn. Comput. 2025, 2, 1–13. [Google Scholar] [CrossRef]
Lan, P.; Su, J.; Ma, X.; Zhang, S. Application of improved physics-informed neural networks for nonlinear consolidation problems with continuous drainage boundary conditions. Acta Geotech. 2024, 19, 495–508. [Google Scholar] [CrossRef]
Vahab, M.; Shahbodagh, B.; Haghighat, E.; Khalili, N. Application of Physics-Informed Neural Networks for forward and inverse analysis of pile–soil interaction. Int. J. Solids Struct. 2023, 277–278, 112319. [Google Scholar] [CrossRef]
Luo, K.; Zhao, J.; Wang, Y.; et al. Physics-informed neural networks for PDE problems: a comprehensive review. Artif. Intell. Rev. 2025, 58, 323. [Google Scholar] [CrossRef]
Ito, S.; Fukunaga, R.; Sako, K. Inverse analysis for estimating geotechnical parameters using physics-informed neural networks. Soils Found. 2024, 64, 101533. [Google Scholar] [CrossRef]
Horiguchi, I.; Shima, K.; Okano, Y. Physics-informed neural networks (PINNs) for high-resolutional prediction of shear stress on cells in suspension culture. AIChE J. 2025, 71. [Google Scholar] [CrossRef]
Feng, Y.; Eun, J.; Kim, S.; Kim, Y.-R. Application of physics-informed neural networks (PINNs) solution to coupled thermal and hydraulic processes in silty sands. Int. J. Geo-Eng. 2025, 16, 3. [Google Scholar] [CrossRef]
Degen, D.; Caviedes Voullième, D.; Buiter, S.; et al. Perspectives of physics-based machine learning strategies for geoscientific applications governed by partial differential equations. Geosci. Model Dev. 2023, 16, 7375–7409. [Google Scholar] [CrossRef]
Liang, R.; Zhang, C.; Huang, C.; et al. Multimodal data fusion for geo-hazard prediction in underground mining operation. Comput. Ind. Eng. 2024, 193, 110268. [Google Scholar] [CrossRef]
Castanedo, F. A Review of Data Fusion Techniques. Sci. World J. 2013. [Google Scholar] [CrossRef]
Guo, R.; Zhou, H.; Wei, X.; et al. Deep joint inversion of multiple geophysical data with U-net reparameterization. GEOPHYSICS 2025, 90, WA61–WA75. [Google Scholar] [CrossRef]
Jiang, Y.; Ma, J.; Ning, J.; et al. One-Fit-All Transformer for Multimodal Geophysical Inversion: Method and Application. J. Geophys. Res. Mach. Learn. Comput. 2025, 2. [Google Scholar] [CrossRef]
Kuo, P.C.; Chou, Y.T.; Li, K.Y.; et al. GNN-LSTM-based fusion model for structural dynamic responses prediction. Eng. Struct. 2024, 306, 117733. [Google Scholar] [CrossRef]
Ali, S.; Abuhmed, T.; El-Sappagh, S.; et al. Explainable Artificial Intelligence (XAI): What we know and what is left to attain Trustworthy Artificial Intelligence. Inf. Fusion 2023, 99, 101805. [Google Scholar] [CrossRef]
Nori, H.; Caruana, R.; Bu, Z.; et al. Accuracy, Interpretability, and Differential Privacy via Explainable Boosting. Proc. Mach. Learn. Res. 2021, 139, 8227–8237. [Google Scholar]
Xiao, X.; Zou, Y.; Huang, J.; et al. An interpretable model for landslide susceptibility assessment based on Optuna hyperparameter optimization and Random Forest. Geomat. Nat. Hazards Risk 2024, 15. [Google Scholar] [CrossRef]
Roscher, R.; Bohn, B.; Duarte, M.F.; Garcke, J. Explainable Machine Learning for Scientific Insights and Discoveries. IEEE Access 2020, 8, 42200–42216. [Google Scholar] [CrossRef]
Dasgupta, S.; Frost, N.; Moshkovitz, M. Framework for Evaluating Faithfulness of Local Explanations. Proc. Mach. Learn. Res. 2022, 162, 4794–4815. [Google Scholar]
Youssef, K.; Shao, K.; Moon, S.; Bouchard, L.-S. Landslide susceptibility modeling by interpretable neural network. Commun. Earth Environ. 2023, 4, 162. [Google Scholar] [CrossRef]
Agarwal, R.; Melnick, L.; Frosst, N.; et al. Neural Additive Models: Interpretable Machine Learning with Neural Nets. Adv. Neural Inf. Process. Syst. 2021, 6, 4699–4711. [Google Scholar] [CrossRef]
Bortolozo, C.A.; Campaña, J.D.R.; Santos, F.A.M.; Dos; et al. Joint Inversion of DC and TEM Methods for Geological Imaging. Pure Appl. Geophys. 2024, 181, 2541–2560. [Google Scholar] [CrossRef]
Lin, Y.; Yang, Q.; Li, X.; et al. Ice-kNN-South: A Lightweight Machine Learning Model for Antarctic Sea Ice Prediction. J. Geophys. Res. Mach. Learn. Comput. 2025, 2, 1–17. [Google Scholar] [CrossRef]
Attia, M.; Tsai, F.T.C. Airborne Geophysical and Borehole Data Fusion to Improve Mississippi River Valley Alluvial Aquifer Characterization. Water Resour. Res. 2025, 61. [Google Scholar] [CrossRef]
Lyu, B.; Wang, Y.; Miao, C.; et al. Fusion of Limited Site-Specific Borehole Logs and Geophysical Data from a Different Site for Three-Dimensional Subsurface Geological Modeling Using Multiscale Generative Adversarial Network. J. Geotech. Geoenvironmental Eng. 2025, 151. [Google Scholar] [CrossRef]
Ibraheem, I.M.; Yogeshwar, P.; Sharifi, F.; et al. Joint inversion of transient electromagnetic and radiomagnetotelluric data for enhanced subsurface characterization. Sci. Rep. 2025, 15, 25494. [Google Scholar] [CrossRef]
Ravasi, M.; Birnie, C. A joint inversion-segmentation approach to assisted seismic interpretation. Geophys. J. Int. 2021, 228, 893–912. [Google Scholar] [CrossRef]
Li, Y.; Xiao, X. Deep Learning-Based Fusion of Optical, Radar, and LiDAR Data for Advancing Land Monitoring. Sensors 2025, 25, 4991. [Google Scholar] [CrossRef]
Kouadio, K.L.; Liu, J.; Liu, R.; et al. K-Means Featurizer: A booster for intricate datasets. In Earth Science Informatics; 2024. [Google Scholar] [CrossRef]
Lima, A.A.J.; Lopes, J.C.; Lopes, R.P.; et al. Soil Organic Carbon Assessment Using Remote-Sensing Data and Machine Learning: A Systematic Literature Review. Remote Sens. 2025, 17, 882. [Google Scholar] [CrossRef]
Kalantary, F.; Ardalan, H.; Nariman-Zadeh, N. An investigation on the Su–NSPT correlation using GMDH type neural networks and genetic algorithms. Eng. Geol. 2009, 104, 144–155. [Google Scholar] [CrossRef]
Licznar, P.; Nearing, M. Artificial neural networks of soil erosion and runoff prediction at the plot scale. CATENA 2003, 51, 89–114. [Google Scholar] [CrossRef]
Mishra, G.; Sulieman, M.M.; Kaya, F.; et al. Machine learning for cation exchange capacity prediction in different land uses. CATENA 2022, 216, 106404. [Google Scholar] [CrossRef]
Rastgou, M.; Bayat, H.; Mansoorizadeh, M.; Gregory, A.S. Estimating Soil Water Retention Curve by Extreme Learning Machine, Radial Basis Function, M5 Tree and Modified Group Method of Data Handling Approaches. Water Resour. Res. 2022, 58, 1–26. [Google Scholar] [CrossRef]
Kim, J.-C.; Lee, S. Comparative Study of Deep Neural Networks for Landslide Susceptibility Assessment: A Case Study of Pyeongchang-gun, South Korea. Sustainability 2023, 16, 245. [Google Scholar] [CrossRef]
Wang, S.; Chen, Y.; Wang, M.; Li, J. Performance Comparison of Machine Learning Algorithms for Estimating the Soil Salinity of Salt-Affected Soil Using Field Spectral Data. Remote Sens. 2019, 11, 2605. [Google Scholar] [CrossRef]
Elaziz, A.E.A.A.; Goda Soliman, K.; Said Abu-Hashim, M.; et al. Enhancing soil salinity prediction in semi-arid regions using machine learning models technology. Ijcbs 2023, 24, 565–574. [Google Scholar]
Zhang, Q.; Liu, M.; Zhang, Y.; et al. Comparison of Machine Learning Methods for Predicting Soil Total Nitrogen Content Using Landsat-8, Sentinel-1, and Sentinel-2 Images. Remote Sens. 2023, 15, 2907. [Google Scholar] [CrossRef]
Lanjewar, M.G.; Gurav, O.L. Convolutional Neural Networks based classifications of soil images. Multimed. Tools Appl. 2022, 81, 10313–10336. [Google Scholar] [CrossRef]
Zhang, X.; Zhu, Y.; Wang, J.; et al. GW-PINN: A deep learning algorithm for solving groundwater flow equations. Adv. Water Resour. 2022, 165, 104243. [Google Scholar] [CrossRef]
Liu, B.; Guo, H.; Li, J.; et al. Application and interpretability of ensemble learning for landslide susceptibility mapping along the Three Gorges Reservoir area, China; Springer Netherlands, 2024. [Google Scholar]
Das, S.; Sharma, P.; Pain, A.; et al. Deep learning based landslide detection using open-source resources: Opportunities and challenges. Earth Sci. Inform. 2023, 16, 4035–4052. [Google Scholar] [CrossRef]
Gutiérrez, F.; Parise, M.; De Waele, J.; Jourde, H. A review on natural and human-induced geohazards and impacts in karst. Earth-Sci. Rev. 2014, 138, 61–88. [Google Scholar] [CrossRef]
He, R.; Zhang, W.; Dou, J.; et al. Application of artificial intelligence in three aspects of landslide risk assessment: A comprehensive review. Rock. Mech. Bull. 2024, 3, 100144. [Google Scholar] [CrossRef]
Carrara, A.; Cardinali, M.; Detti, R.; et al. GIS techniques and statistical models in evaluating landslide hazard. Earth Surf. Process. Landf. 1991, 16, 427–445. [Google Scholar] [CrossRef]
Mersha, T.; Meten, M. GIS-based landslide susceptibility mapping and assessment using bivariate statistical methods in Simada area, northwestern Ethiopia. Geoenvironmental Disasters 2020, 7. [Google Scholar] [CrossRef]
Liu, Q.; Wu, T.-t.; Deng, Y.-h.; Liu, Z.-h. Intelligent identification of landslides in loess areas based on the improved YOLO algorithm: a case study of loess landslides in Baoji City. J. Mt. Sci. 2023, 20, 3343–3359. [Google Scholar] [CrossRef]
Bui, D.T.; Tsangaratos, P.; Nguyen, V.T.; et al. Comparing the prediction performance of a Deep Learning Neural Network model with conventional machine learning models in landslide susceptibility assessment. Catena 2020, 188, 104426. [Google Scholar] [CrossRef]
Mandal, K.; Saha, S.; Mandal, S. Applying deep learning and benchmark machine learning algorithms for landslide susceptibility modelling in Rorachu river basin of Sikkim Himalaya, India. Geosci. Front. 2021, 12, 101203. [Google Scholar] [CrossRef]
Jiang, Z.; Wang, M.; Liu, K. Comparisons of Convolutional Neural Network and Other Machine Learning Methods in Landslide Susceptibility Assessment: A Case Study in Pingwu. Remote Sens. 2023, 15, 798. [Google Scholar] [CrossRef]
Althuwaynee, O.F.; Pradhan, B.; Park, H.-J.; Lee, J.H. A novel ensemble bivariate statistical evidential belief function with knowledge-based analytical hierarchy process and multivariate statistical logistic regression for landslide susceptibility mapping. CATENA 2014, 114, 21–36. [Google Scholar] [CrossRef]
Wang, Y.; Fang, Z.; Wang, M.; et al. Comparative study of landslide susceptibility mapping with different recurrent neural networks. Comput. Geosci. 2020, 138, 104445. [Google Scholar] [CrossRef]
Zhang, L.; Shi, B.; Zhu, H.; et al. PSO-SVM-based deep displacement prediction of Majiagou landslide considering the deformation hysteresis effect. Landslides 2021, 18, 179–193. [Google Scholar] [CrossRef]
Zhang, Y.; Wang, X.; Tang, H. An improved Elman neural network with piecewise weighted gradient for time series prediction. Neurocomputing 2019, 359, 199–208. [Google Scholar] [CrossRef]
Parise, M. Sinkholes, Subsidence and Related Mass Movements, 2nd ed.; Elsevier, 2022. [Google Scholar]
Chen, H.; Oguchi, T.; Wu, P. Morphometric analysis of sinkholes using a semi-automatic approach in Zhijin County, China. Arab. J. Geosci. 2018, 11. [Google Scholar] [CrossRef]
Rafique, M.U.; Zhu, J.; Jacobs, N. Automatic Segmentation of Sinkholes Using a Convolutional Neural Network. Earth Space Sci. 2022, 9, 1–15. [Google Scholar] [CrossRef]
Jang, B.; Yoon, H.-K. Application of infrared thermal images for sinkhole detection with time-series and time-difference index data through a convolution neural network. Int. J. Pavement Eng. 2024, 25. [Google Scholar] [CrossRef]
Kariminejad, N.; Shahabi, H.; Ghorbanzadeh, O.; et al. Evaluation of Various Deep Learning Algorithms for Landslide and Sinkhole Detection from UAV Imagery in a Semi-arid Environment. Earth Syst. Environ. 2024, 8, 1387–1398. [Google Scholar] [CrossRef]
Jiang, Z.; Hu, S.; Deng, H.; et al. Detection and automatic identification of loess sinkholes from the perspective of LiDAR point clouds and deep learning algorithm. Geomorphology 2024, 465, 109404. [Google Scholar] [CrossRef]
Muili, O.; Babaie, H.A. Sinkhole susceptibility analysis using machine learning for west central Florida. Appl. Comput. Geosci. 2025, 27, 100262. [Google Scholar] [CrossRef]
Lee, E.J.; Shin, S.Y.; Ko, B.C.; Chang, C. Early sinkhole detection using a drone-based thermal camera and image processing. Infrared Phys. Technol. 2016, 78, 223–232. [Google Scholar] [CrossRef]
Zhu, J.; Pierskalla, W.P. Applying a weighted random forests method to extract karst sinkholes from LiDAR data. J. Hydrol. 2016, 533, 343–352. [Google Scholar] [CrossRef]
Kang, M.-S.; Kim, N.; Im, S.B.; et al. 3D GPR Image-based UcNet for Enhancing Underground Cavity Detectability. Remote Sens. 2019, 11, 2545. [Google Scholar] [CrossRef]
Mihevc, A.; Mihevc, R. Morphological characteristics and distribution of dolines in Slovenia, a study of a lidar-based doline map of Slovenia. Acta Carsologica 2021, 50, 11–36. [Google Scholar] [CrossRef]
Nefeslioglu, H.A.; Tavus, B.; Er, M.; et al. Integration of an InSAR and ANN for Sinkhole Susceptibility Mapping: A Case Study from Kirikkale-Delice (Turkey). ISPRS Int. J. Geo-Inf. 2021, 10, 119. [Google Scholar] [CrossRef]
Xixi, L.; Zou, C.; Peng, C.; Wu, C. Uncertainty Quantification in Intelligent-Based Electrical Resistivity Tomography Image Reconstruction With Monte Carlo Dropout Strategy. IEEE Trans. Geosci. Remote Sens. 2023, 61, 1–16. [Google Scholar] [CrossRef]
Ünal, İ.; Kabaş, Ö.; Sözer, S. Real-time electrical resistivity measurement and mapping platform of the soils with an autonomous robot for precision farming applications. Sensors 2020, 20, 251. [Google Scholar] [CrossRef]

Figure 1. Systematic workflow of the AI–EGGE review framework.

Figure 2. AI-driven data fusion strategies (modified after [45]), illustrating data-, feature-, and decision-level fusion across supervised, unsupervised, hybrid, and physics-informed AI–EGGE workflows.

Figure 3. Schematic architectures of commonly used supervised ML models: (a) ANN, (b) SVM, (c) DT, (d) RF, (e) GBM, and (f) CatBoost.

Figure 4. Schematic architectures of unsupervised ML models: (a) k-means and (b) SOM.

Figure 6. CNN framework for AI–EGGE: (a) workflow integrating preprocessing, augmentation, and pretrained CNN models (AlexNet, GoogLeNet, Inception, ResNet variants), and (b) representative architecture with convolution, pooling, flatten, and fully connected layers for image-based classification.

Figure 7. The standard RNN and unfolded RNN (adapted from [146]).

Figure 8. LSTM architecture (adapted from [117]).

Figure 10. PINN framework with six encoder-decoder pairs for solving PDEs, each assigned a distinct convolutional receptive field [171].

Figure 11. Hybrid GBM with PINN framework for predicting circular slope stability (adapted from [10]).

Figure 12. Multimodal fusion framework in AI–EGGE, outlining the workflow from data source and preprocessing through fusion levels, AI modeling, interpretability, and uncertainty analysis to final applications.

Figure 13. ViTF-based ERT inversion framework with spatial convolutional blocks and TF self-attention for high-fidelity, real-time mapping of apparent resistivity pseudo-sections to true resistivity distributions (modified after [41]). (a) Image-based inversion using calibrated apparent resistivity profiles. (b) CNN–ViTF architecture trained on paired apparent–true resistivity images generated by forward modeling.

Figure 14. Hybrid ViTF model results and comparison with other DL models (modified after [41]). Panel 1: Predicted upper and lower resistivity colormap limits across the 1–5 target datasets, with green and blue scatters representing the estimated ranges. MAE and RMSE remain generally within 150 Ωm, and R² exceeds 0.98 across all complexity levels, indicating minimal calibration error. Sample images show that the predicted colormap (a) produces negligible visual distortion compared with the original apparent resistivity images (b). Panel 2: Scatter-density comparison of pixel-wise inversion resistivity values of ViTF outputs using calibrated versus original colormaps across all target complexities. R² values exceed 0.96, with MAE < 140 Ωm and RMSE < 260 Ωm, demonstrating that calibration has no significant impact on inversion accuracy. Panel 3: Example inversion results for 1–5 target datasets obtained using (a) ViTF and (b) GN method, with corresponding error maps for (c) ViTF and (d) GN. ViTF yields sharper anomaly boundaries and lower errors than GN across all complexity levels.

Table 3. Summary of XAI and uncertainty evaluation methods.

Category	Method	Full Name	Description	Applications
Post-hoc XAI	SHAP	SHapley Additive exPlanations	Quantifies each feature’s contribution (positive or negative) to a prediction using Shapley values	Best for local and global interpretability with strong theoretical grounding
	LIME	Local Interpretable Model-Agnostic Explanations	Generates a simple local surrogate model (often linear) to approximate how features influenced a specific prediction, providing fast, model-agnostic insight into the key drivers of that outcome	Useful for explaining individual geotechnical or environmental predictions (e.g., why a specific site was classified as high-risk)
	Grad-CAM	Gradient-weighted Class Activation Mapping	Produces gradient-based heatmaps that show which regions of an input image most influenced a model’s decision, providing a visual explanation of “where the model looked”	Relevant for seismic and other image-based data and for landslide mapping, to verify that the model attends to physically or geologically meaningful regions
	ALE	Accumulated Local Effects	Shows how a feature affects model predictions on average, accounting for feature interactions without bias from correlated features	Best for global interpretation when features are correlated
	IG	Integrated Gradients	Computes feature attributions by integrating model gradients from a baseline input to the actual input.	Best for interpreting DNNs with continuous features.
Intrinsic (Ante-Hoc) Models	EBM	Explainable Boosting Machine	A glass-box, boosted generalized additive model (GAM) that learns interpretable per-feature and pairwise interaction effects	When high accuracy and full transparency are both required for tabular geoscience data
	NAM	Neural Additive Model	A neural form of GAMs where each feature has its own subnetwork, preserving additive interpretability	When nonlinear patterns require neural network flexibility without sacrificing interpretability
	CBM	Concept Bottleneck Model	Predicts human-defined concepts first, then uses them for the final prediction, enforcing semantic reasoning	When explanations must align with domain concepts (e.g., soil type → stiffness → failure risk)
Uncertainty & Performance Metrics	PI	Prediction Intervals	Provide an estimated range within which the true value is expected to lie at a given confidence level.	Quantifies predictive uncertainty for risk-aware decision-making.
	Coverage	PI Coverage	Percentage of observed values falling within the prediction intervals	Evaluates calibration of uncertainty estimates (ideally ≈ target confidence)
	CRPS	Continuous Ranked Probability Score	Compares the full predictive distribution with the observation to assess probabilistic accuracy (lower = better).	Measures both calibration and sharpness of probabilistic predictions; more informative than RMSE for models that output uncertainty
	NRMSE	Normalized Root Mean Squared Error	Scale-independent measure of prediction error normalized by range, mean, or standard deviation (lower = better)	Allows fair model comparison across variables, sites, or units
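
The uncertainty and performance metrics in Table 3 can be illustrated with a minimal NumPy sketch. It is not taken from any of the reviewed studies: the function names, the choice of normalizers for NRMSE, and the ensemble (sample-based) estimator used for CRPS are assumptions made here for illustration, and the example values are arbitrary.

```python
import numpy as np

def pi_coverage(y_true, lower, upper):
    """Fraction of observations that fall inside their prediction interval."""
    y_true, lower, upper = (np.asarray(a, dtype=float) for a in (y_true, lower, upper))
    return float(np.mean((y_true >= lower) & (y_true <= upper)))

def nrmse(y_true, y_pred, norm="range"):
    """RMSE divided by the range, mean, or standard deviation of the observations."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    rmse = np.sqrt(np.mean((y_true - y_pred) ** 2))
    scales = {
        "range": y_true.max() - y_true.min(),
        "mean": y_true.mean(),
        "std": y_true.std(),
    }
    return float(rmse / scales[norm])

def crps_ensemble(y_obs, members):
    """Sample-based CRPS for one observation: E|X - y| - 0.5 * E|X - X'|."""
    members = np.asarray(members, dtype=float)
    spread_to_obs = np.mean(np.abs(members - y_obs))
    spread_within = np.mean(np.abs(members[:, None] - members[None, :]))
    return float(spread_to_obs - 0.5 * spread_within)

# Hypothetical values, for illustration only.
coverage = pi_coverage([1, 2, 3, 4], [0, 2.5, 2, 3], [2, 3, 4, 3.5])
error = nrmse([0.0, 10.0], [1.0, 9.0], norm="range")
score = crps_ensemble(1.0, [0.0, 2.0])
```

In practice, the empirical coverage would be compared against the nominal confidence level of the intervals (for example, 0.90 for 90% intervals), and CRPS would be averaged over all observations.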

Table 4. Conceptual hierarchy of multimodal fusion levels and representative methods in EGGE.

Level	Definition	Representation	EGGE Examples	References
Low level (data)	Fuse raw/minimally processed observations (joint inversion) to mitigate non-uniqueness and depth-resolution trade-offs	Joint inversion (DC–TEM (transient electromagnetic), TEM–RMT (radio-magnetotelluric), DC–gravity); deep joint inversion with U-Net reparameterization	DC–TEM joint inversion; TEM–RMT joint inversion; physics-constrained Swin Transformer for gravity inversion	[182,183,193,197]
Mid-level (feature)	Fuse engineered/learned features via statistics/ML; handles heterogeneous geophysical/geotechnical/monitoring data	PCA/ALE, AEs; feature concatenation; intermediate-fusion deep nets	Excavated soil classification (images + CPT + TDR); underground mining geo-hazards (multimodal)	[181,182,183]
High level (decision)	Fuse model outputs/decisions (ensembles, probabilistic combination) to improve robustness and UQ	Stacking/ensembles; Bayesian model averaging; conformal risk control for UQ	Ensembling susceptibility maps; combining geophysics-only and fused models for risk scoring with calibrated UQ	[165,182]
Advanced AI frameworks (EGGE-adjacent)	Flexible multimodal fusion and missing/misaligned modality handling	TF (Geometric Query); physics-constrained TF; spatiotemporal GNN; diffusion models for missing modalities/completion	Multimodal geophysical inversion (TF); gravity inversion (Swin-TF); structural sensing fusion (GNN); EMAG2 gap-fill & multi-view completion (diffusion)	[40,184,185]
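
The three conventional fusion levels in Table 4 differ mainly in where the modalities are combined. The sketch below is a simplified illustration under stated assumptions, not a method from the cited studies: all function names are hypothetical, the modalities are assumed to be co-registered arrays with one row per sample, column stacking stands in for data-level fusion (a true joint inversion couples the modalities through their forward models), and PCA via SVD stands in for feature extraction.

```python
import numpy as np

def data_level_fusion(*modalities):
    """Low level: stack co-registered raw observations column-wise."""
    return np.hstack([np.asarray(m, dtype=float) for m in modalities])

def pca_features(x, k):
    """Project centered data onto its first k principal components (via SVD)."""
    x = np.asarray(x, dtype=float)
    centered = x - x.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:k].T

def feature_level_fusion(modalities, k):
    """Mid level: extract k features per modality, then concatenate them."""
    return np.hstack([pca_features(m, k) for m in modalities])

def decision_level_fusion(probabilities, weights=None):
    """High level: (weighted) average of per-model probabilities."""
    p = np.asarray(probabilities, dtype=float)  # shape: (n_models, n_samples)
    return np.average(p, axis=0, weights=weights)

# Hypothetical inputs: 6 samples of a 4-channel and a 3-channel modality.
rng = np.random.default_rng(0)
modality_a = rng.normal(size=(6, 4))
modality_b = rng.normal(size=(6, 3))

raw_fused = data_level_fusion(modality_a, modality_b)
feat_fused = feature_level_fusion([modality_a, modality_b], k=2)
prob_fused = decision_level_fusion([[0.2, 0.8], [0.6, 0.4]], weights=[3, 1])
```

The fused arrays from the first two functions would feed a single downstream model, whereas decision-level fusion combines the outputs of models that were trained separately.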

Table 5. Selected research on the applications of AI for site characterization and soil–structure prediction.

Authors	Methods used	Results	Relevance	Limitations
[137]	Applied deep CNN models (ResNet50, VGG16 (Visual Geometry Group Network), InceptionV3) to classify soil aggregate sizes from digital images; the dataset contained soil aggregates of varying classes	Achieved high classification accuracy: ResNet50 (98.7%), VGG16 (97.9%), InceptionV3 (97.5%). Demonstrated strong generalization across aggregate size categories	Provides robust, automated classification of soil aggregates, which is vital for assessing soil structure, stability, and compaction relevant to geotechnical and environmental engineering	Limited dataset diversity (lab-prepared aggregates); requires extension to field conditions; performance may decline for highly heterogeneous soil samples
[90]	Smartphone-based imaging system + CNN & RF; 90 soil samples from India (sand to clay). Features extracted: color, local, and texture	High accuracy for clay (R² = 0.97–0.98) and sand (R² = 0.96–0.98); moderate accuracy for silt (R² = 0.62–0.75). Developed an Android app for soil texture prediction	Enables low-cost, portable soil classification, useful for field geotechnical surveys where rapid soil assessment is needed	Moderate accuracy for silt; model is sensitive to soil moisture and organic matter; requires controlled image capture
[133]	Field TBM vibration data; CNNs (GoogLeNet, ResNet) & RNNs (LSTM, BiLSTM); wavelet-based features	CNN (ResNet-18) achieved 98.28% accuracy, outperforming the RNNs (≈80%)	Real-time ground condition identification during tunneling, which is crucial for TBM safety and efficiency	Relies on site-specific vibration datasets; generalizability to different geologic terrains is uncertain
[143]	Deep learning (CNN with VGG), preprocessing (grayscale, thresholding, edge detection), 40,000 RGB images	Achieved very high accuracy (F1 ≈ 99.5%); grayscale models performed similarly to RGB, edge/thresholding slightly worse	Enables automated, reliable, fast crack detection in concrete infrastructure (bridges, pavements, buildings)	Limited study on real-world noisy/low-quality images; controlled dataset
[210]	CNN and six DCNNs (ResNet, VGGs, Inception-ResNetV2, Xception, DenseNet) on 903 soil images	CNN achieved 99.86% (train) and 97.68% (validation); ResNet: 99.15%; other DCNNs >97%	Validates CNN/DCNNs for soil classification in geotechnical and agricultural domains	Limited datasets and soil types (alluvial, black, clay, red); generalizability uncertain
[211]	Developed lightweight CNN models for soil image classification; compared performance with standard CNNs; dataset of soil images used for training/validation.	Lightweight CNN achieved comparable accuracy to deeper CNNs (~95–98%) with reduced computational demand; improved efficiency for real-time use	Provides efficient soil classification models suitable for geotechnical site surveys where rapid, on-site predictions are needed, especially with limited hardware resources.	Dataset size and diversity are limited; generalization across soil types and field conditions requires further validation; performance for complex textures (e.g., mixed soils) has not been fully tested.
[99]	Smartphone-based image acquisition system; particle, color, and texture features; hybrid deep learning autoencoder, CNN, and random forest (DLAE-CNN-RF) algorithm for soil texture prediction	Very high prediction accuracy: sand (R² = 0.99), clay (R² = 0.98), silt (R² = 0.98); outperformed KNN and VGG16-RF; GUI designed for practical soil texture prediction	Provides a low-cost, portable, and efficient alternative to conventional soil texture analysis, enabling rapid characterization crucial for soil mechanics and geotechnical site investigation	Not reported
[140]	ML algorithms (SVM, DT, RF, XGBoost, KNN) applied to a dataset of NPK, soil pH, rainfall, temperature, and humidity (2100 samples)	XGBoost achieved the highest accuracy: crops (99.09%), horticultural crops (99.3%), and the combined model (98.51%); demonstrated that crop-specific modeling improves accuracy	Highlights the potential of ML for soil fertility and crop suitability, relevant for optimizing soil-crop interactions and improving site-specific soil use	Focused on agriculture, with only an indirect link to geotechnical engineering; requires large curated datasets; may not generalize to all soil types
[2]	Custom lightweight CNN (Light-SoilNet), dataset of 392 soil samples (sieve + hydrometer verified), smartphone images	Accuracy 97.2% across 5 soil classes (sand, clay, loam, loamy sand, sandy loam)	Low-cost soil classification tool for geotechnical surveys in agriculture & construction	Small dataset; only 5 soil classes; imbalanced dataset handling needed
[138]	Hybrid CNN–TF with Gate-Shift-Fuse for hyperspectral imaging	Achieved state-of-the-art accuracy (up to 99.86%) on benchmark HSI datasets; superior feature fusion and robustness	Relevant to soil mapping, mineral exploration, and subsurface geotechnics via hyperspectral remote sensing	Computationally intensive; fixed patch sizes; generalization across diverse datasets is uncertain

Table 6. Selected studies on AI applications for sinkhole detection using remote sensing data.

Study	Method	Data Utilized	Core Contribution / Key Outcome	Performance	Limitations
[233]	3D CNN	Thermal drone imagery (640×480 px)	Demonstrated the feasibility of using a lightweight 3D CNN model on thermal UAV data to automatically identify artificially created sinkholes	Precision: 87.9%, Recall: 88.1%	Dependent on drone-based thermal surveys; potential omission of sinkholes due to flight speed and background interference; datasets lacked geological variability
[234]	RF	LiDAR (1 m point spacing) and DEM (1.5 m cell size)	One of the earliest works to apply ML to elevation-based sinkhole mapping using LiDAR-derived datasets	Precision: 84.71%, Recall: 65.17%	Poor spatial transferability—accuracy decreased significantly when the model was applied to different regions; high-resolution DEMs are expensive and not easily accessible
[235]	Modified AlexNet CNN	GPR B-scan (50×50 px) and C-scan (50×13 px), enhanced to 200×200 px	Showcased the successful use of CNN for interpreting GPR imagery to detect sinkholes	Precision: 100%, Recall: 100% (with enhanced resolution)	Focused on a localized area; generalization across other locations was not tested; GPR data acquisition is costly and challenging in many karst terrains
[236]	U-Net	LiDAR DEM (1 m)	Developed one of the first U-Net models capable of large-scale sinkhole extraction, mapping >470,000 sinkholes in Slovenia and later >400,000 in the USA	IoU: 60.4%, Dice: 72.36%	Requires high-resolution LiDAR; ~16% deviation from manual expert mapping; applicability to non-limestone terrains not evaluated
[237]	ANN	Optical satellite data + InSAR DEM (10 m)	Applied ANN for both sinkhole detection and susceptibility modeling, highlighting ANN effectiveness for karst hazard assessment	RMSE: 45.1%	DEM accuracy influenced by vegetation and land cover; model performance strongly dependent on DEM quality
[228]	U-Net	LiDAR DEM + aerial imagery (1.524 m/px)	Demonstrated that merging DEM with aerial optical imagery enhances U-Net performance; model transfer successfully applied across different karst regions	IoU: 45.38%, Precision: 66.29%	Limited access to high-resolution LiDAR; performance in non-carbonate terrains remains understudied; imagery alone is insufficient without DEM support
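
The performance column of Table 6 mixes precision, recall, IoU, and Dice, which are not directly comparable across rows. As a minimal sketch of how these scores relate, the Python snippet below computes all four from a pair of binary masks and derives an F1 score from a reported precision and recall. The function names `mask_scores` and `f1_from` and the toy masks are illustrative assumptions, not taken from any of the cited studies; the F1 value for [234] is derived here from the tabulated precision and recall and is not reported by that study.

```python
def mask_scores(pred, truth):
    """Precision, recall, Dice (F1), and IoU for two flat binary masks."""
    pairs = list(zip(pred, truth))
    tp = sum(1 for p, t in pairs if p and t)        # sinkhole pixels correctly detected
    fp = sum(1 for p, t in pairs if p and not t)    # false alarms
    fn = sum(1 for p, t in pairs if not p and t)    # missed sinkhole pixels
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    dice = 2 * tp / (2 * tp + fp + fn) if tp + fp + fn else 0.0
    iou = tp / (tp + fp + fn) if tp + fp + fn else 0.0
    return {"precision": precision, "recall": recall, "dice": dice, "iou": iou}


def f1_from(precision, recall):
    """Harmonic mean of precision and recall (both given as fractions)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


# Hypothetical toy masks (1 = sinkhole pixel), for illustration only.
pred = [1, 1, 1, 0, 0, 1, 0, 0]
truth = [1, 1, 0, 0, 1, 1, 0, 0]
scores = mask_scores(pred, truth)

# For a single mask pair, Dice and IoU are linked by Dice = 2*IoU / (1 + IoU).
dice_from_iou = 2 * scores["iou"] / (1 + scores["iou"])

# F1 derived from the precision and recall listed for [234] in Table 6.
f1_rf = f1_from(0.8471, 0.6517)
print(scores, round(dice_from_iou, 4), round(f1_rf, 4))
```

The Dice-IoU identity holds exactly only for a single confusion matrix; when a study averages each metric over many tiles or images, the reported pair need not satisfy it, so converting one reported score into the other should be treated as approximate.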

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.