Preprint
Article

This version is not peer-reviewed.

Integrating InSAR and Channel Steepness for AI-Based Coseismic Landslide Modeling in the Nepal Himalaya

Submitted:

03 March 2026

Posted:

05 March 2026

You are already at the latest version

Abstract
Earthquake-induced landslides in active orogens such as the Nepal Himalaya pose major threats to life, infrastructure, and post-disaster recovery. Although coseismic landslide susceptibility mapping increasingly uses machine learning (ML) and deep learning (DL), explicit integration of spaceborne interferometric synthetic aperture radar (InSAR) products, particularly line-of-sight (LOS) displacement and coherence-based damage proxy maps (DPM), remains limited in event-based frameworks. This study develops and evaluates a multi-factor coseismic landslide probability model that incorporates InSAR-derived deformation metrics with key geomorphic and hydrologic predictors to improve rapid post-earthquake hazard assessment. Using the 25 April 2015 Mw 7.8 Gorkha earthquake as a case study, LOS displacement was derived from ALOS-2 PALSAR-2 ScanSAR interferometry, and the normalized channel steepness index (Kₛₙ) was computed from a digital elevation model. Additional predictors included slope, aspect, curvature, elevation, drainage density, distance to river, log-transformed stream power index (logSPI), peak ground acceleration (PGA), rainfall, and land use/land cover. Five models: Random Forest, Extreme Gradient Boosting (XGBoost), a lightweight convolutional neural network, U-Net, and DeepLabV3 were trained using fourteen conditioning factors and a landslide inventory, with class imbalance addressed through majority undersampling for ML and weighted loss with patch oversampling for DL. Incorporating LOS and DPM improved model discrimination and calibration: XGBoost and Random Forest achieved the highest AUC-ROC values (0.972 and 0.969) and lowest Brier scores, while DeepLabV3 produced the highest AUC-PR (0.768) and CSI (0.49). Feature importance analysis identified Kₛₙ as the dominant predictor, and ablation tests confirmed the added value of InSAR metrics. These findings demonstrate the effectiveness of integrating InSAR products for rapid coseismic landslide hazard assessment in the Nepal Himalaya.
Keywords: 
;  ;  ;  ;  ;  ;  ;  

1. Introduction

The 25 April 2015 Mw 7.8 Gorkha earthquake and its major aftershocks, including the Mw 7.3 event of 12 May 2015, triggered tens of thousands of landslides across central Nepal, causing widespread destruction, blocking critical road networks, damming rivers, and contributing to significant casualties and economic losses [1,2,3]. In steep, rapidly uplifting mountain belts such as the Nepal Himalaya, earthquake-induced landslides amplify shaking impacts and prolong disaster cascades by disrupting access, damaging infrastructure, and mobilizing large volumes of sediment. Coseismic landslide susceptibility mapping (LSM) and probability modeling are therefore essential for post-event response, reconstruction planning, and corridor-scale risk management [4,5]. Recent case studies in Nepal have demonstrated that landslide occurrence is strongly associated with geological structures, lithological contrasts, and antecedent rainfall, underscoring the need for multi-factor conditioning frameworks that integrate tectonic, hydrological, and terrain controls [6].
The Nepal Himalaya is characterized by a complex tectonic architecture, with the Main Himalayan Thrust (MHT) accommodating convergence between the Indian and Eurasian plates and driving active uplift and erosion across the orogen [7,8]. Lithology varies systematically from south to north: Lesser Himalayan metasediments and granites, Higher Himalayan crystalline rocks, and Tethyan sedimentary sequences, each with distinct mechanical properties and susceptibility to slope failure [1,8]. Structural discontinuities including thrust faults, shear zones, and lithological contacts concentrate stress and weaken hillslopes, making them preferential sites for earthquake-triggered landslides [1,2]. Coseismic landslide inventories for the 2015 Gorkha event show that landslide density correlates with slope gradient, peak ground acceleration, surface deformation, and proximity to mechanically weak lithologies and large plutonic intrusions [1,2]. Antecedent rainfall further modulates slope stability by elevating pore-water pressure and reducing effective stress, particularly in weathered and fractured rock [9]. Monsoon-driven erosion and tectonic uplift sustain high relief and over-steepened hillslopes that are prone to failure during strong ground motion [7]. Integrating geological, geomorphic, hydrologic, and seismic conditioning factors is therefore essential for robust coseismic landslide probability modeling in the Nepal Himalaya [6,10].
Over the past two decades, statistically based and machine learning (ML) approaches have become standard tools for LSM [4,5,6,11,12,13,14]. Classical methods such as logistic regression, generalized additive models, and Bayesian approaches have been increasingly complemented or superseded by ensemble tree methods (e.g., Random Forest, gradient boosting, XGBoost) and, more recently, deep neural networks [11,12,13,14,15,16,17,18]. These models can flexibly exploit multi-source geospatial data, capture complex nonlinear relationships among conditioning factors, and produce spatially continuous probability fields suitable for operational decision-making under limited data availability and strong class imbalance [5,11,12,13,14,15,16,17,18,19]. Recent work has also emphasized the role of deep learning architectures, including convolutional and recurrent networks, in improving landslide detection, susceptibility mapping, and inventory compilation when combined with high-resolution remote sensing data [16,17,18,19,20,21].
In parallel, advances in satellite and airborne remote sensing have dramatically expanded the availability of high-resolution topography, land cover, and seismic shaking products for landslide modeling. Near-global DEMs, multispectral optical imagery, and long time series of SAR acquisitions now allow for systematic characterization of terrain, vegetation, hydrology, and ground deformation. In the context of event-based coseismic LSM, this enables models that not only rely on static conditioning factors but also incorporate transient signals related to deformation, coherence loss, and hydrometeorological forcing [5,22,23,24,25,26,27]. For example, recent studies have integrated multi-temporal InSAR deformation velocities or persistent scatterer (PS) deformation rates into ML-based susceptibility frameworks, demonstrating clear gains in model skill compared with purely static terrain-based approaches [20,25,26,27,28]. Such dynamic, InSAR-enhanced LSM frameworks are particularly powerful in alpine gorge and corridor settings where continuous monitoring of slope response is required but in situ observations are sparse [25,26,28].
Most LSM studies still rely heavily on static conditioning factors such as slope, aspect, curvature, elevation, lithology, distance to faults or drainage, PGA, rainfall, and land use/land cover (LULC) [4,5,29,30,31]. However, static terrain and environmental variables only partially capture the transient deformation and damage associated with large earthquakes. Spaceborne interferometric synthetic aperture radar (InSAR) offers unique capabilities to measure ground deformation and coherence loss at regional scales, even under cloud cover and during the monsoon [22,23,24,31,32]. InSAR has been successfully used to map coseismic crustal deformation for the Gorkha earthquake [31,32] and to investigate landslide processes and unstable slopes in other regions, including along major transportation corridors and in rapidly urbanizing basins [22,23,25,26,27]. Integrating InSAR-derived products such as line-of-sight (LOS) displacement and coherence-based damage proxy maps (DPM) into coseismic landslide models therefore has the potential to highlight zones of deformation, coherence loss, and surface disturbance that are not readily inferred from static terrain metrics alone. Yet, systematic comparisons of ML and deep learning (DL) models with and without InSAR inputs remain limited for the Nepal Himalaya.
Topographic characterization for landslide modeling and geomorphic analysis increasingly relies on near-global digital elevation models (DEMs) such as the Shuttle Radar Topography Mission (SRTM) 1 arc-second product, which provides approximately 30 m resolution elevation data for most land surfaces between 60°N and 56°S [33]. The SRTM 1 arc-second DEM has become a standard input for terrain derivatives (slope, aspect, curvature), stream network extraction, and channel steepness calculations, as well as for coregistering and geocoding interferometric synthetic aperture radar (InSAR) products in mountain regions where high-resolution local DEMs are scarce [33].
At the same time, geomorphic indices derived from DEMs, notably the normalized channel steepness index (Kₛₙ), provide insight into long-term tectonic forcing, base-level change, and hillslope stability [34,35]. High Kₛₙ values are often associated with steep, actively incising channels, knickzones, and over-steepened hillslopes that are prone to slope failure [34,35,36,37]. Recent work has highlighted the potential of combining Kₛₙ with remote sensing data and statistical or ML methods to improve detection and mapping of landslides and other mass movements [36,37,38]. Because Kₛₙ can be derived systematically from widely available DEMs such as SRTM, it offers a promising, physically interpretable proxy for longer-term tectonic–geomorphic predisposition to landsliding that can be integrated with more transient InSAR-based indicators of ongoing deformation [33].
In parallel with these physical and geomorphic advances, there has been a rapid growth in the use of deep learning and geospatial artificial intelligence for landslide studies. Recent reviews document how convolutional, recurrent, and hybrid neural networks, when combined with optical, LiDAR, and synthetic aperture radar (SAR) data, can improve landslide inventory mapping, susceptibility forecasting, and early warning, while also highlighting key challenges such as data imbalance, limited regional transferability, and model interpretability [11,15,16,17,18,19,20,21]. InSAR coherence and phase time series have similarly been exploited using deep learning for damage and deformation mapping, where recurrent networks act as probabilistic anomaly detectors to separate true damage from background surface changes [19,39,40]. A growing body of literature demonstrates that integrating InSAR-derived deformation velocities or active deformation with machine learning and deep learning algorithms, including graph convolutional networks, gradient-boosted decision trees, and window-based atmospheric correction methods, significantly enhance landslide susceptibility mapping in alpine gorge regions and geotectonically complex settings [26,28]. These developments motivate a closer integration of deep segmentation architectures (e.g., U-Net, DeepLab-style models) with physically meaningful InSAR and geomorphic predictors for coseismic landslide probability mapping.
This study addresses these gaps by jointly integrating InSAR-derived and geomorphic predictors into an event-based coseismic landslide probability modeling framework for the 2015 Gorkha earthquake. We derive LOS displacement from ALOS-2 PALSAR-2 ScanSAR interferometry and compute a normalized channel steepness index (Kₛₙ) from DEM data, then combine these predictors with a suite of 14 conditioning factors that capture topography, geomorphology, hydrology, seismic shaking, and land cover. We train and compare five representative models: Random Forest (RF), XGBoost, a fully convolutional CNN, U-Net, and DeepLabV3.
The specific objectives of this study are to:
(1) describe the derivation of LOS displacement and DPM from ALOS-2 ScanSAR and of Kₛₙ from a DEM, and integrate these into a multi-factor coseismic landslide probability framework for the Gorkha 2015 region;
(2) compare the performance of RF, XGBoost, CNN, U-Net, and DeepLabV3 using discrimination, detection, and calibration metrics (AUC-ROC, AUC-PR, CSI, Brier score);
(3) quantify the contribution of InSAR-derived layers by contrasting models with and without LOS and DPM inputs; and
(4) identify dominant conditioning factors, especially Kₛₙ, and discuss their implications for earthquake-induced landslide occurrence and geomorphic controls across the Nepal Himalaya.

2. Materials and Methods

2.1. Study Area

The study area encompasses the region strongly affected by the 2015 Gorkha earthquake sequence in central Nepal, including districts such as Gorkha, Sindhupalchok, Rasuwa, Nuwakot, Dhading, and parts of Kathmandu and surrounding valleys. The area lies within the Nepal Himalaya, characterized by high relief, steep slopes, active thrust faulting (e.g., Main Himalayan Thrust, MHT), and monsoon-driven erosion [7,8]. Lithology varies from Lesser Himalayan metasediments and granites to Higher Himalayan crystallines and Tethyan sediments, with structural and lithological controls on slope stability [9]. The Mw 7.8 mainshock and Mw 7.3 aftershock produced strong ground shaking and widespread slope failures, with landslide inventories documenting thousands of triggered landslides [1,2]. The spatial extent of the analysis is defined by the coverage of the ALOS-2 ScanSAR interferometric products and the available landslide inventory, yielding a raster domain of approximately 3,467 × 5,573 pixels at the chosen resolution (Section 2.2).
Figure 1. Study area map showing the Central Nepal Himalaya, the epicenter of the 2015 Gorkha earthquake, and the spatial extent of the landslide inventory. Landslide data are compiled from Roback et al. [2,41] and the U.S. Geological Survey (USGS) [3].
Figure 1. Study area map showing the Central Nepal Himalaya, the epicenter of the 2015 Gorkha earthquake, and the spatial extent of the landslide inventory. Landslide data are compiled from Roback et al. [2,41] and the U.S. Geological Survey (USGS) [3].
Preprints 201219 g001

2.2. Data

2.2.1. Landslide Inventory

The landslide inventory used in this study is a binary raster (landslide vs. non-landslide) derived from post-event satellite imagery and the landslide map data of Roback et al. [41] for the 2015 Gorkha earthquake, available via U.S. Geological Survey ScienceBase. Landslide pixels were assigned a value of 1 and non-landslide pixels 0; areas with no data (e.g., clouds, water bodies) were masked. The inventory exhibits severe class imbalance typical of event-based LSM: landslide pixels represent a small fraction (~0.47%) of the total valid pixels. This imbalance was explicitly addressed in both ML and DL pipelines (Section 2.4).

2.2.2. InSAR-Derived Products (ALOS-2 ScanSAR): Derivation of LOS Displacement and DPM

ALOS-2 L-band ScanSAR (WBDR1.1) single-look complex (SLC) imagery was acquired from the Japan Aerospace Exploration Agency (JAXA) on 22 February, 5 April, and 17 May 2015, covering frame 3050. The Mw 7.8 mainshock of 25 April 2015 occurred between the April and May acquisitions. Two interferometric pairs were formed: a pre-seismic pair (22 February–5 April 2015) to provide pre-event coherence for the damage proxy map, and a coseismic pair (5 April–17 May 2015) spanning the earthquake for line-of-sight (LOS) displacement and post-event coherence. The acquisitions used for LOS displacement and the DPM are summarized in Table 1 and Table 2, respectively. All InSAR processing was performed using the InSAR Scientific Computing Environment (ISCE) version 2.6.3 [42,43,44].
The processing workflow followed the ALOS-2 ScanSAR burst-mode InSAR chain implemented in ISCE 2 [42]. Burst synchronization was applied to remove unsynchronized signals, followed by SLC coregistration using water-body masking and DEM-based refinement with the SRTM 1 arc-second global product [33]. Interferograms were formed with 7 range and 2 azimuth looks, filtered using the Goldstein algorithm (strength 0.8, window 64, step 4), and unwrapped with water body masking. Ionospheric phase estimation and correction were applied to mitigate L-band ionospheric effects [45]. Final products were geocoded to geographic coordinates using bicubic interpolation. The InSAR processing flow diagram is shown in Figure 2.
The LOS displacement d_LOS (cm) was computed from the unwrapped interferometric phase φ (radians) as:
d L O S = φ × λ 4 π × 100
where λ = 23.6 cm is the ALOS-2 L-band wavelength. The negative sign conforms to the convention that positive phase corresponds to increasing range (motion away from the sensor). The geocoded LOS displacement from the coseismic pair is shown in Figure 3(a).
The Damage Proxy Map (DPM) was derived from the difference between pre-event and post-event coherence. Pre-event coherence γ_pre was obtained from the 22 February–5 April interferogram, and post-event coherence γ_post from the 5 April–17 May interferogram. The two coherence maps were registered to a common grid, and the DPM was computed as:
DPM = γ_pre − γ_post
DPM values were clipped to [No damage, High damage] for visualization, with higher values indicating greater coherence loss and thus stronger damage proxy. The DPM is shown in Figure 3(b).

2.2.3. Tectonic Geomorphology and the Channel Steepness Index (Kₛₙ)

The normalized channel steepness index (Kₛₙ) was computed to assess spatial variations in river-profile steepness and infer relative tectonic activity across the study area. The Kₛₙ is derived from the stream power incision model, which relates channel slope S to drainage area A through a power-law relationship [46,47]:
S   =   k   A θ
where ks is the channel steepness index and θ is the concavity index. Under steady-state conditions, steepness scales with rock uplift rate relative to erodibility [35,47]. To facilitate comparison across drainage basins with varying concavities, a reference concavity θ_ref is applied to define the normalized steepness index [34]:
k   =   S   A θ _ r e f
where S is channel slope (dimensionless, or in radians), A is drainage area (m²), and θ_ref is the reference concavity. A commonly used value is θ_ref = 0.45, representative of bedrock rivers in equilibrium [34].
Data and Processing
A digital elevation model (DEM) was used to extract the stream network and compute Kₛₙ. Processing was performed in MATLAB using TopoToolbox [48]. The workflow comprised: (1) filling sinks in the DEM; (2) computing flow direction and flow accumulation to obtain drainage area; (3) extracting the stream network above a drainage area threshold of 1 km²; (4) computing local channel slope using an 8-direction gradient (in radians); (5) sampling drainage area and slope at each stream node; and (6) calculating Kₛₙ at each node via Equation (4). The mean Kₛₙ per stream segment was then computed and exported as a polyline shapefile for mapping and overlay with other datasets.
The resulting Kₛₙ map highlights channels that are steeper than expected for their drainage area. Elevated Kₛₙ values are interpreted as indicating higher relative rock uplift or lower erodibility and may coincide with active structures or transient landscape response to tectonic forcing (Figure 4).

2.2.4. Other Conditioning Factors

The study utilizes fourteen conditioning factors representing topography, geomorphology, hydrology, land cover, and seismic intensity. These include CosAspect, SinAspect, DEM, Distance to River, DPM, Drainage Density, Kₛₙ, LULC, LogSPI, PGA, Rain, Slope, and the absolute values of Curvature and LOS displacement. Peak ground acceleration (PGA) was obtained from the USGS ShakeMap [49] and earthquake database [50]. Land use/land cover (LULC) was derived from the ICIMOD Regional Land Cover Monitoring System (RLCMS) for the Hindu Kush Himalaya (HKH) region (2015) [51]. The DEM used for Kₛₙ derivation and InSAR processing was the SRTM 1 arc-second global product [33]. The sine and cosine transformations of aspect were used to appropriately represent its circular nature and to avoid artificial discontinuities at the 0°/360° boundary, thereby enabling the machine-learning models to capture directional effects more effectively.
In addition to the gridded predictor layers, two auxiliary data visualizations were prepared to support interpretation of the physical controls on earthquake-triggered landslides and to justify the inclusion of selected conditioning factors. Figure 5a presents a circular (rose) diagram of landslide counts by slope aspect derived from the mapped inventory. This exploratory analysis was used to evaluate directional bias in landslide occurrence and to motivate the explicit encoding of aspect using its cosine and sine components within the modeling framework, consistent with previous earthquake-induced landslide studies [1,2,30,31].
Figure 5b shows the monthly rainfall time series from September 2014 to August 2015 for the major earthquake-affected districts. This hydrometeorological summary was compiled to characterize antecedent moisture conditions prior to the April 2015 coseismic period and to support the inclusion of rainfall as a conditioning factor. The district-wise rainfall data used to generate Figure 5b are provided in Appendix Table A1. Prior Himalayan research indicates that elevated antecedent precipitation can increase pore-water pressure and reduce slope stability [9].
Normalization and Absolute Value Transformations
To ensure effective training, all continuous factors were resampled to a common 30-meter grid and normalized using the Min-Max formula. All spatial analysis, raster processing, and map preparation were performed using ArcGIS Pro 3.6.0 (Esri, Redlands, CA, USA).
X n o r m = X X m i n X m a x X m i n
where:
X = original pixel value
Xmin = minimum value of the factor
Xmax = maximum value of the factor
Xnorm = normalized value scaled between 0 and 1
Calculating the absolute values for LOS and curvature was a critical refinement, as it collapsed bi-directional risks (toward/away or convex/concave) into a single magnitude-based susceptibility indicator.
Figure 6. Spatial distribution of the fourteen conditioning factors used for landslide modeling: (a) absolute LOS displacement, (b) Kₛₙ, (c) DPM, (d) DEM, (e) distance to river, (f) PGA, (g) cos(aspect), (h) sin(aspect), (i) absolute curvature, (j) LULC, (k) annual rainfall (2014–2015), (l) slope, (m) logSPI, and (n) drainage density.
Figure 6. Spatial distribution of the fourteen conditioning factors used for landslide modeling: (a) absolute LOS displacement, (b) Kₛₙ, (c) DPM, (d) DEM, (e) distance to river, (f) PGA, (g) cos(aspect), (h) sin(aspect), (i) absolute curvature, (j) LULC, (k) annual rainfall (2014–2015), (l) slope, (m) logSPI, and (n) drainage density.
Preprints 201219 g006

2.3. ML and DL Model Architectures

In this study, we evaluated a suite of machine learning (ML) and deep learning (DL) architectures for coseismic landslide probability mapping. Two complementary modeling tracks were used: (i) pixel-wise ML baselines (Random Forest and XGBoost) and (ii) patch-based DL segmentation models (CNN, U-Net, and DeepLabV3 with a ResNet-50 backbone). All models were trained using a multi-band raster stack of 14 conditioning factors, including topographic (slope, aspect, curvature, DEM), seismic (PGA), hydrological (drainage density, log-transformed SPI, rainfall), geomorphic (Kₛₙ), InSAR-derived (LOS displacement and DPM), distance to river, and LULC.

2.3.1. Machine Learning Models

Random Forest (RF):
RF was implemented as a per-pixel classifier using the 14-band predictor stack. Pixels containing NaN values in any predictor or in the landslide label were excluded. Because the dataset was highly imbalanced (≈19.2 million non-landslide pixels, 99.6%; 89,416 landslide pixels, 0.4%), majority-class undersampling was applied to enforce a 20:1 negative-to-positive ratio. This yielded 89,048 positive and 1,780,960 negative samples. The balanced data were split 80/20 for training and validation. RF was trained with 200 trees (n_estimators = 200) using inverse-frequency class weighting (class_weight = "balanced"). The model produced per-pixel landslide probabilities in the range (0, 1).
XGBoost:
XGBoost was implemented as a gradient-boosted tree benchmark using the same feature matrix and sampling strategy as RF. Majority undersampling maintained the 20:1 class ratio, followed by an 80/20 train–validation split. The model was trained with 200 boosting rounds (n_estimators = 200). Class imbalance was handled using scale_pos_weight = n(neg)/n(pos) computed from the training data. The final model outputs per-pixel landslide probabilities in (0, 1).

2.3.2. Deep Learning Models

Deep learning models were designed to generate pixel-wise landslide probability maps using spatial context. All networks accept input patches of size (B, 14, 128, 128) and produce full-resolution outputs. The first convolution in each architecture was modified to accommodate the 14-band predictor stack. Because landslide pixels remain sparse within patches, training used a combined weighted loss (binary cross-entropy + Dice, 0.5/0.5) with BCE pos_weight = 15.
CNN:
A lightweight fully convolutional CNN was implemented with four sequential convolutional stages that preserve the original 128×128 resolution (no pooling or downsampling). Each stage uses a ConvBlock with two 3×3 convolutions followed by batch normalization and ReLU activation. Channel depth increases progressively (14 → 64 → 128 → 128). A final 1×1 convolution produces the single-channel logit map. The network contains approximately 857,857 trainable parameters.
U-Net:
A standard encoder–decoder U-Net was implemented for pixel-wise segmentation. The architecture includes four encoder levels, a bottleneck, and a symmetric decoder with skip connections. Using a base width f = 64, the encoder increases feature depth through f, 2f, 4f, and 8f via 2×2 max pooling, with the bottleneck expanding to 16f. The decoder restores spatial resolution using 2×2 transposed convolutions and concatenates encoder features at each scale before applying ConvBlocks. A final 1×1 convolution generates the per-pixel logits. The resulting U-Net contains approximately 31.04 million trainable parameters.
DeepLabV3:
DeepLabV3 was implemented using the torchvision DeepLabV3-ResNet50 model (PyTorch) [52,53]. The pretrained architecture was adapted by modifying the first convolutional layer of the ResNet-50 backbone to accept the 14-band input stack (Conv2d(14, 64, 7×7, stride=2)). The model retains the standard ASPP head and produces a single-channel segmentation map.
Figure 7. Architectures of the deep learning models used for coseismic landslide mapping: (a) lightweight CNN, (b) U-Net encoder–decoder with skip connections, and (c) DeepLabV3 with a modified ResNet-50 backbone and ASPP module, all producing single-channel probability maps.
Figure 7. Architectures of the deep learning models used for coseismic landslide mapping: (a) lightweight CNN, (b) U-Net encoder–decoder with skip connections, and (c) DeepLabV3 with a modified ResNet-50 backbone and ASPP module, all producing single-channel probability maps.
Preprints 201219 g007

2.4. Training and Evaluation

2.4.1. Data Split and Sampling

Machine Learning Models (Random Forest, XGBoost)
Training samples were derived from the pixel-level predictor stacks and the binary landslide inventory. Because landslide pixels represent approximately 0.4% of the study area, the majority (non-landslide) class was undersampled to reduce bias toward the negative class while maintaining computational tractability. Only negative pixels were undersampled, whereas all positive pixels were retained. The target ratio was 20 negative pixels per positive pixel (e.g., 89,048 positive pixels and 1,780,960 negative pixels in the balanced dataset).
The balanced dataset was divided into training (80%) and validation (20%) subsets using stratified random sampling so that both subsets preserved the 20:1 negative-to-positive ratio. The split was performed with a fixed random seed (random_state = 42) to ensure reproducibility. Performance metrics were computed on the held-out validation subset that was not used during model fitting.
For the pixel-based ML models, stratified random sampling was adopted to enable stable model comparison under severe class imbalance. While spatial blocking is often desirable in landslide studies, the present work focuses on comparative model behavior under identical sampling conditions.
Deep learning segmentation models:
For the convolutional models, image patches were extracted from the predictor stacks using a sliding window approach (patch size = 128 × 128 pixels; stride = 64 pixels, i.e., 50% overlap), yielding dense spatial coverage of the study area. Each patch was assigned a binary label: positive if it contained at least one landslide pixel and negative otherwise.
To mitigate patch-level class imbalance, positive (landslide-containing) patches were oversampled by a factor of two (pos_patch_oversample = 2). The number of negative patches was optionally capped to keep the training set computationally manageable while still ensuring sufficient exposure to positive examples.
Training and validation patch sets were kept geographically independent through a spatial partition. Specifically, patches whose top-left row coordinate fell within the upper 80% of raster rows were assigned to the training set, whereas patches located within the lower 20% were reserved for validation. This north–south hold-out reduces spatial overlap between training and validation data and helps limit optimistic bias due to spatial autocorrelation in convolutional models. If this rule yielded fewer than 50 validation patches, the pipeline reverted to a stratified random patch split to maintain statistical reliability.
Within each patch, pixel-level imbalance was further addressed using a composite loss consisting of weighted binary cross-entropy (BCE) and Dice loss. The positive class weight in BCE was set to 15, and the Dice component was weighted at 0.5 (total loss = 0.5 × Dice + 0.5 × BCE). This formulation encourages sensitivity to rare landslide pixels while improving spatial overlap with the reference mask.
Together, the pixel-based balanced sampling for ML models and the spatially partitioned patch sampling for DL models allow a consistent comparison between paradigms while respecting their different learning units (pixels vs. patches).

2.4.2. Training Configuration

Traditional Machine Learning Models:
Random Forest was trained using bootstrap aggregation with class weighting (class_weight = “balanced”) to compensate for residual imbalance in the 20:1 training data. Standard hyperparameters (e.g., number of trees, depth, and minimum leaf size) followed the implementation settings; no exhaustive grid search was performed in the present experiments.
XGBoost was trained using gradient boosting with class weighting via scale_pos_weight set to the negative-to-positive ratio in the balanced training subset. To improve probabilistic reliability, probability calibration was applied using isotonic regression within a 5-fold cross-validation framework (CalibratedClassifierCV).
Both ML models were trained on the same stratified 80% training subset and evaluated on the identical held-out 20% validation subset.
Deep Learning Segmentation Models:
All three architectures (CNN, U-Net, DeepLabV3) were optimized using the Adam optimizer with an initial learning rate of 1 × 10⁻³ and weight decay of 1 × 10⁻⁴. Training progress was monitored using validation loss (composite BCE–Dice loss on the spatial validation patches). The learning rate was reduced on plateau (ReduceLROnPlateau), and early stopping with a patience of 12 epochs was applied to limit overfitting. The model checkpoint with the lowest validation loss was retained for final evaluation.

2.4.3. Evaluation Metrics

Model performance was assessed using complementary discrimination, detection, and calibration metrics:
  • AUC-ROC: Measures overall discrimination across thresholds and is relatively insensitive to class imbalance.
  • AUC-PR (Average Precision): More informative for rare-event detection, emphasizing performance on the landslide class.
  • Critical Success Index (CSI): Defined as TP / (TP + FP + FN), evaluating the balance between correct detections and false alarms.
  • Brier score: Quantifies probabilistic calibration; lower values indicate better-calibrated predictions.
  • Confusion matrix (row-normalized, %): Reports True No-LS predicted as No-LS, False alarms (No-LS → LS), Misses (LS → No-LS), and Correct detections (LS → LS).

2.5. Variable and Feature Importance

To identify the most influential conditioning factors controlling landslide occurrence, permutation-based importance analysis was performed for both ML and DL models using the 14 predictor bands (slope, aspect cos/sin, absolute curvature, DEM, Kₛₙ, drainage density, logSPI, PGA, distance to river, rainfall, LULC, LOS_abs, and DPM).
For the ML models (Random Forest and XGBoost), permutation importance was computed using the scikit-learn permutation_importance function with AUC-PR as the scoring metric. Each feature was randomly permuted in the validation set, and the resulting drop in AUC-PR was averaged over five repetitions to reduce variance.
For the DL models (CNN, U-Net, and DeepLabV3), a band-wise (block) permutation approach was used. Each input band was sequentially zeroed in the validation patches, and the corresponding decrease in AUC-PR was recorded. This ablation-based analysis quantifies the relative contribution of each conditioning factor to model performance.

3. Results

3.1. Model Performance Comparison

Model performance was evaluated using AUC-ROC, AUC-PR, CSI, and Brier score. Results are summarized in Table 3 and Figure 8, Figure 9, Figure 10, Figure 11 and Figure 12.

3.1.1. Discrimination and Ranking

ROC curves indicate strong discrimination for all models. The tree-based classifiers achieved the highest AUC-ROC (XGBoost = 0.972; Random Forest = 0.969), followed by DeepLabV3 (0.937), CNN (0.903), and U-Net (0.892). However, because ROC-AUC can appear optimistic under severe class imbalance, AUC-PR provides a more informative measure of landslide detection performance.

3.1.2. Precision–Recall and Threshold Performance

Based on AUC-PR, DeepLabV3 performed best (0.768), followed by U-Net (0.667), CNN (0.664), XGBoost (0.654), and Random Forest (0.643). At the fixed probability threshold of 0.5, DeepLabV3 also achieved the highest CSI (0.49), indicating the most favorable balance between detection and false alarms. Random Forest (0.40) and U-Net/XGBoost (~0.38) showed moderate performance, while CNN had the lowest CSI (0.30) due to higher false positives.
In terms of calibration, the tree-based models produced the lowest Brier scores (XGBoost = 0.025; RF = 0.043), whereas CNN showed the highest Brier score (0.276), indicating less reliable probability estimates.

3.1.3. Confusion Matrix Interpretation

Row-normalized confusion matrices highlight distinct model behaviors:
  • Random Forest: Strong detection with moderate false positives.
  • XGBoost: Conservative predictions at the 0.5 threshold, resulting in many missed landslides.
  • CNN: Very high landslide recall (~94%) but excessive false alarms.
  • U-Net: More balanced than CNN but with lower recall.
  • DeepLabV3: Best overall trade-off, combining high landslide recall (~90%) with strong non-landslide accuracy (~84%).

3.2. Spatial Probability Maps

The trained models were applied to the full study area to generate pixel-wise landslide probability maps for Random Forest, XGBoost, CNN, U-Net, and DeepLabV3. Each map represents the predicted likelihood of coseismic landslide occurrence (0–1), enabling visual comparison of spatial susceptibility patterns among models. Areas consistently identified as high probability indicate robust agreement on landslide-prone terrain, whereas differences reflect the distinct learning behavior of each architecture. The ML models (RF and XGBoost) produce full-coverage maps at native pixel resolution, while the DL outputs were tiled and mosaicked to obtain continuous probability surfaces.
Figure 11 presents a visual comparison between the zoomed in landslide probability maps generated by XGBoost, Random Forest, U-Net, CNN, and DeepLabV3 and the reference landslide inventory. All models successfully delineate the primary landslide concentration along the valley corridor; however, deep learning models, particularly CNN and DeepLabV3, better capture fine-scale spatial patterns and exhibit closer agreement with the ground truth. Tree-based models produce smoother susceptibility patterns and show relatively higher spatial overprediction in surrounding areas.

3.3. Predictor Importance, InSAR Role, and Kₛₙ Dominance

Permutation analyses (Section 2.5) show that multiple terrain and seismic factors contribute to landslide prediction, including DEM, PGA, slope, and drainage density, although their relative influence varies by model.

3.3.1. Role of InSAR

InSAR-derived variables (LOS displacement and DPM) consistently ranked among the most important predictors, particularly in the DL models. Their inclusion improved model discrimination and calibration, highlighting the value of integrating spaceborne deformation and coherence products for coseismic landslide mapping in high-relief, data-scarce regions such as the Nepal Himalaya.
In the illustrative example, we compare three panels: (a) a probability map generated without InSAR inputs, (b) a probability map generated with InSAR (LOS and DPM) included, and (c) the mapped landslide inventory (ground truth). In the no-InSAR case, the model largely fails to assign elevated probabilities to a known landslide zone, effectively missing an important cluster of failures. When LOS displacement and DPM are added, the same area is highlighted as a high-probability zone, and this pattern closely matches the observed landslides in the ground-truth panel. This contrast demonstrates that, without InSAR products, some coseismic landslide zones remain undetected in the probability maps, whereas incorporating InSAR substantially improves the model’s ability to capture real failures.
Figure 13. Effect of InSAR inputs on landslide prediction: (a) model output without InSAR variables, (b) model output with LOS displacement and DPM included, and (c) reference landslide inventory (ground truth).
Figure 13. Effect of InSAR inputs on landslide prediction: (a) model output without InSAR variables, (b) model output with LOS displacement and DPM included, and (c) reference landslide inventory (ground truth).
Preprints 201219 g013

3.3.2. Dominance of Kₛₙ

The normalized channel steepness index (Kₛₙ) emerged as the single most influential predictor across all ML and DL models. This result underscores the strong control of fluvial incision and tectonically driven hillslope steepening on earthquake-triggered landslides. High Kₛₙ values reliably delineate over-steepened, failure-prone terrain, supporting its use as a robust and transferable geomorphic indicator in active mountain belts.
To further quantify this relationship, we derived a Kₛₙ-density surface and classified it into five zones (very low, low, medium, high, and very high). We then extracted four south–north profile lines across the study area to examine how mapped landslides are distributed relative to these Kₛₙ zones as shown in Figure 14. The majority of landslides fall within the high-Kₛₙ zone along these profiles, providing additional, spatially coherent evidence that steep, high-Kₛₙ reaches concentrate earthquake-induced slope failures and reinforcing the dominant role of Kₛₙ in controlling landslide probability.

4. Discussion

This study demonstrates that integrating machine learning and deep learning with InSAR-derived products and geomorphic indices provides a powerful framework for event-based coseismic landslide probability modeling in the Nepal Himalaya. The comparative evaluation of Random Forest, XGBoost, CNN, U-Net, and DeepLabV3 shows that all models are capable of reproducing the broad spatial pattern of landslides triggered by the 2015 Gorkha earthquake, but they differ in discrimination, calibration, and spatial detail. In particular, the combination of InSAR line-of-sight (LOS) displacement, coherence-based damage proxy map (DPM), and normalized channel steepness index (Kₛₙ) emerges as a key driver of model performance.

4.1. Model Performance

Tree-based models (Random Forest and XGBoost) achieve the highest AUC-ROC and provide the best-calibrated probabilities. However, when class imbalance is considered, DeepLabV3 performs best based on AUC-PR and CSI, while CNN and U-Net show comparable skill. Spatially, the deep learning models, especially CNN and DeepLabV3, better capture fine-scale landslide clusters and boundaries. In contrast, tree-based models produce smoother maps and slightly overpredict surrounding areas. This reflects their different strengths: tree models excel in feature discrimination, while convolutional networks better exploit spatial context.

4.2. Role of InSAR-Derived Products

LOS displacement and DPM significantly improve model results. When these layers are removed, some known landslide zones are underestimated or missed. Including InSAR data improves both detection and calibration, particularly for DeepLabV3 and U-Net. This confirms that deformation and coherence information provide critical signals that static terrain variables alone cannot capture, especially in steep, data-scarce regions like the central Nepal Himalaya.

4.3. Dominance of Kₛₙ and Geomorphic Controls

Across all models, Kₛₙ consistently emerges as the most influential predictor, highlighting the strong geomorphic control on coseismic landsliding. High Kₛₙ values correspond to steep, actively incising channels and over-steepened hillslopes that are more susceptible to failure during strong shaking.
Density and profile analyses further support this relationship: most mapped landslides occur within high-Kₛₙ zones, and peaks in Kₛₙ density align with landslide clusters. This confirms that Kₛₙ is a physically meaningful proxy for long-term tectonic–geomorphic predisposition.

4.4. Implications, Limitations, and Future Work

The integration of InSAR metrics and Kₛₙ offers a practical pathway for regional hazard screening in data-poor mountain belts using only SAR and DEM data. Such event-based models can support rapid post-earthquake prioritization and corridor-scale risk assessment.
Nevertheless, several limitations remain. The models are trained on a single event (2015 Gorkha), and temporal transferability is untested. Uncertainties in the landslide inventory, InSAR processing, and DEM-derived Kₛₙ propagate into predictions. Deep learning models also require greater computational resources and careful handling of class imbalance.
Future research should therefore focus on (i) validating the models in other segments of the Nepal Himalaya and in different tectonic and climatic settings, (ii) assessing temporal transfer by applying the trained models to landslides from subsequent earthquakes, (iii) exploring additional InSAR time-series metrics and multi-orbit combinations, and (iv) systematically evaluating alternative geomorphic indices and Kₛₙ calculation schemes. Integrating physics-based ground motion and slope stability models with data-driven probability mapping also represents a promising direction for improving interpretability and robustness.

5. Conclusions

This study compared five ML and DL models for coseismic landslide probability mapping in the 2015 Gorkha earthquake using a 14-band predictor stack that includes InSAR LOS displacement, DPM, and DEM-based geomorphic indices such as Kₛₙ. All models capture the overall spatial pattern of landslides, but their performance characteristics differ. Tree-based models (RF and XGBoost) exhibit the highest AUC-ROC and best calibration (lowest Brier scores), while DeepLabV3 achieves the highest AUC-PR and CSI, reflecting superior performance under severe class imbalance and better spatial delineation of landslide clusters.
Deep learning segmentation models produce sharper and more spatially coherent probability maps, whereas tree-based models generate smoother but slightly overpredicted susceptibility fields. InSAR-derived LOS and DPM substantially improve discrimination and calibration across all architectures and are especially valuable for deep learning approaches that exploit spatial context in multi-band predictor stacks. Across models, the normalized channel steepness index (Kₛₙ) consistently emerges as the dominant predictor, confirming the strong geomorphic and tectonic control on earthquake-triggered landslides.
Overall, our results indicate that integrating InSAR products with Kₛₙ and related DEM-derived metrics provides a transferable, remotely sensed framework for coseismic landslide probability mapping in high-relief, data-scarce regions. Ensemble tree models offer efficiency, robustness, and interpretability for regional screening, while deep learning architectures add fine-scale spatial detail at higher computational cost. Further validation across different earthquakes and regions is needed to confirm the broader applicability of this framework and to refine best practices for combining InSAR, geomorphic metrics, and ML/DL methods in landslide hazard assessment.

Author Contributions

Conceptualization, R.S. and G.W.; methodology, R.S. and S.K.; software, R.S.; validation, R.S. and R.R.; formal analysis, R.S.; investigation, R.S. and R.R.; data curation, R.S.; writing—original draft preparation, R.S.; writing—review and editing, R.S., G.W., S.K., R.R. and S.R.; visualization, R.S.; supervision, G.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The landslide inventory is derived from map data of Roback et al. (2017) [41,54] (https://doi.org/10.5066/F7DZ06F9; ScienceBase: https://www.sciencebase.gov/catalog/item/582c74fbe4b04d580bd377e8). Peak ground acceleration (PGA) data were obtained from the USGS ShakeMap [49] and earthquake database [50]. Land use/land cover (LULC) was obtained from the ICIMOD Regional Database System (RDS) for the Hindu Kush Himalaya (HKH) region (2015) [51] (https://rds.icimod.org/Home/DataDetail?metadataId=1972511). The SRTM 1 arc-second global DEM [33] is available from NASA LP DAAC. ALOS-2 PALSAR-2 ScanSAR imagery was provided by the Japan Aerospace Exploration Agency (JAXA) and is available through the JAXA Earth Observation Research Center.

Acknowledgments

The authors thank the Japan Aerospace Exploration Agency (JAXA) for providing ALOS-2 PALSAR-2 ScanSAR imagery. Spatial mapping and analysis were conducted using ArcGIS Pro 3.6.0 (Esri, Redlands, CA, USA).

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AUC-PR Area Under the Precision-Recall Curve
AUC-ROC Area Under the Receiver Operating Characteristic Curve
CSI Critical Success Index
DPM Damage Proxy Map
HKH Hindu Kush Himalaya
InSAR Interferometric Synthetic Aperture Radar
Kₛₙ Normalized Channel Steepness Index
LOS Line-of-Sight
LULC Land Use/Land Cover
LSM Landslide Susceptibility Mapping
MHT Main Himalayan Thrust
PGA Peak Ground Acceleration
RLCMS Regional Land Cover Monitoring System
SLC Single-Look Complex
SPI Stream Power Index

Appendix A

Appendix A.1.

Table A1. Monthly rainfall data (mm) used to generate the graph in Figure 5b.
Table A1. Monthly rainfall data (mm) used to generate the graph in Figure 5b.
Monthly Rainfall in mm
Month Gorkha Dhading Rasuwa Nuwakot Kathmandu Lalitpur Sindhupalchok Kavre Dolakha Ramechhap
Sep-14 198.72 205.7 132.2 524.6 325.6 118.9 437 150.9 82.05 126
Oct-14 53.25 100.2 0.2 104.6 87.2 121.3 89.625 64.45 28.35 45.5
Nov-14 0 0 0 0 0 0 0 0 0.5 0
Dec-14 40.825 82.7 39.5 28.2 30.2 22.7 44.725 18.4 13.65 8.5
Jan-15 82.88 13.5 16.4 5 15.8 8.6 22.375 18.05 0.05 1
Feb-15 66.2 62.8 4.5 51.8 44.8 31.5 39.45 24.6 8.85 20
Mar-15 93.3 31.8 103.8 91.4 90.1 78.9 129.625 65.35 35.15 45.5
Apr-15 30.56 49.3 31 68 8.8 49.7 54.675 49.25 58.1 52.3
May-15 36.425 61.2 8.8 128.6 10 36 133.7 47.8 21.35 26.3
Jun-15 181.8 246.7 118.6 198.6 317.8 180.7 319.1 47.5 62.15 63.5
Jul-15 416.8 359.8 133.3 778 556.4 299.8 480.275 294.45 212.55 267.5
Aug-15 312.26 181.2 281.7 775.8 692.8 280.3 612.55 257.5 159.3 158

References

  1. Kargel, J.S.; Leonard, G.J.; Shugar, D.H.; Haritashya, U.K.; Bevington, A.; Fielding, E.J.; Fujita, K.; Geertsema, M.; Miles, E.S.; Steiner, J.; et al. Geomorphic and Geologic Controls of Geohazards Induced by Nepal’s 2015 Gorkha Earthquake. Science 2016, 351. [CrossRef]
  2. Roback, K.; Clark, M.K.; West, A.J.; Zekkos, D.; Li, G.; Gallen, S.F.; Chamlagain, D.; Godt, J.W. The Size, Distribution, and Mobility of Landslides Caused by the 2015 Mw7.8 Gorkha Earthquake, Nepal. Geomorphology 2018, 301, 121–138. [CrossRef]
  3. Collins, B.D.; Jibson, R.W. Assessment of Existing and Potential Landslide Hazards Resulting from the April 25, 2015 Gorkha, Nepal Earthquake Sequence; U.S. Geological Survey, 2015;
  4. Guzzetti, F.; Carrara, A.; Cardinali, M.; Reichenbach, P. Landslide Hazard Evaluation: A Review of Current Techniques and Their Application in a Multi-Scale Study, Central Italy. Geomorphology 1999, 31, 181–216. [CrossRef]
  5. Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A Review of Statistically-Based Landslide Susceptibility Models. Earth-Sci. Rev. 2018, 180, 60–91. [CrossRef]
  6. Rimal, R.; Dhital, M.; Rijal, M.; Silwal, R.; Yadav, V.; Azad, M.A.; KAINTHOLA, A. Association of Landslides with Geological Structures and Rainfall: A Case Study of Two Landslides in Sunkuda, Nepal. Nat. Hazards 2025, 121, 11423–11442. [CrossRef]
  7. Lavé, J.; Avouac, J.P. Fluvial Incision and Tectonic Uplift across the Himalayas of Central Nepal. J. Geophys. Res. Solid Earth 2001, 106, 26561–26591. [CrossRef]
  8. Upreti, B.N. An Overview of the Stratigraphy and Tectonics of the Nepal Himalaya. J. Asian Earth Sci. 1999, 17, 577–606. [CrossRef]
  9. Gabet, E.; Burbank, D.; Putkonen, J.; Pratt-Sitaula, B.; Ojha, T. Rainfall Thresholds for Landsliding in the Himalaya of Nepal. Geomorphology 2004, 63, 131–143. [CrossRef]
  10. Regmi, A.; Dhital, M.; Zhang, J.; Su, L.-J.; Chen, X. Landslide Susceptibility Assessment of the Region Affected by the 25 April 2015 Gorkha Earthquake of Nepal. J. Mt. Sci. 2016, 13, 1941–1957. [CrossRef]
  11. Wang, Y.; Fang, Z.; Wang, M.; Peng, L.; Hong, H. Comparative Study of Landslide Susceptibility Mapping with Different Recurrent Neural Networks. Comput. Geosci. 2020, 138, 104445. [CrossRef]
  12. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [CrossRef]
  13. Chen, C.; Breiman, L. Using Random Forest to Learn Imbalanced Data. Univ. Calif. Berkeley 2004.
  14. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System; 2016; p. 794;.
  15. Sameen, M.I.; Pradhan, B.; Lee, S. Application of Convolutional Neural Networks Featuring Bayesian Optimization for Landslide Susceptibility Assessment. CATENA 2020, 186, 104249. [CrossRef]
  16. Wang, H.; Zhang, L.; Yin, K.; Luo, H.; Li, J. Landslide Identification Using Machine Learning. Geosci. Front. 2021, 12, 351–364. [CrossRef]
  17. Yi, Y.; Zhang, Z.; Zhang, W.; Jia, H.; Zhang, J. Landslide Susceptibility Mapping Using Multiscale Sampling Strategy and Convolutional Neural Network: A Case Study in Jiuzhaigou Region. CATENA 2020, 195, 104851. [CrossRef]
  18. Zhang, Q.; Wang, T. Deep Learning for Exploring Landslides with Remote Sensing and Geo-Environmental Data: Frameworks, Progress, Challenges, and Opportunities. Remote Sens. 2024, 16, 1344. [CrossRef]
  19. Sahin, E. Assessing the Predictive Capability of Ensemble Tree Methods for Landslide Susceptibility Mapping Using XGBoost, Gradient Boosting Machine, and Random Forest. SN Appl. Sci. 2020, 2. [CrossRef]
  20. Wang, X.; Du, A.; Hu, F.; Liu, Z.; Zhang, X.; Wang, L.; Guo, H. Landslide Susceptibility Evaluation Based on Active Deformation and Graph Convolutional Network Algorithm. Front. Earth Sci. 2023, 11, 1132722. [CrossRef]
  21. Al-Najjar, H.A.H.; Pradhan, B.; Kalantar, B.; Sameen, M.I.; Santosh, M.; Alamri, A. Landslide Susceptibility Modeling: An Integrated Novel Method Based on Machine Learning Feature Transformation. Remote Sens. 2021, 13, 3281. [CrossRef]
  22. Wasowski, J.; Bovenga, F. Investigating Landslides and Unstable Slopes with Satellite Multi Temporal Interferometry: Current Issues and Future Perspectives. Eng. Geol. 2014, 174. [CrossRef]
  23. Intrieri, E.; Raspini, F.; Fumagalli, A.; Lu, P.; Del Conte, S.; Farina, P.; Allievi, J.; Ferretti, A.; Casagli, N. The Maoxian Landslide as Seen from Space: Detecting Precursors of Failure with Sentinel-1 Data. Landslides 2018, 15, 123–133. [CrossRef]
  24. Aimaiti, Y.; Liu, W.; Yamazaki, F.; Maruyama, Y. Earthquake-Induced Landslide Mapping for the 2018 Hokkaido Eastern Iburi Earthquake Using PALSAR-2 Data. Remote Sens. 2019, 11, 2351. [CrossRef]
  25. Zhu, Z.; Gan, S.; Yuan, X.; Zhang, J. Landslide Susceptibility Mapping with Integrated SBAS-InSAR Technique: A Case Study of Dongchuan District, Yunnan (China). Sensors 2022, 22, 5587. [CrossRef]
  26. Vaka, D.S.; Yaragunda, V.R.; Perdikou, S.; Papanicolaou, A. InSAR Integrated Machine Learning Approach for Landslide Susceptibility Mapping in California. Remote Sens. 2024, 16, 3574. [CrossRef]
  27. Hussain, M.A.; Chen, Z.; Zheng, Y.; Shoaib, M.; Shah, S.U.; Ali, N.; Afzal, Z. Landslide Susceptibility Mapping Using Machine Learning Algorithm Validated by Persistent Scatterer In-SAR Technique. Sensors 2022, 22, 3119. [CrossRef]
  28. Li, Z.; Xiang, J.; Zhuo, G.; Zhang, H.; Dai, K.; Shi, X. Dynamic Landslide Susceptibility Assessment in the Yalong River Alpine Gorge Region Integrating InSAR-Derived Deformation Velocity. Remote Sens. 2025, 17, 3210. [CrossRef]
  29. Bai, S.; Wang, J.; Zhang, Z.; C, C. Combined Landslide Susceptibility Mapping after WenChuan Earthquake at the Zhouqu Segment in the Bailongjiang Basin,China. Catena 2012, 99, 18–25. [CrossRef]
  30. Xu, C.; Xu, X.; Yao, X.; Dai, F. Three (Nearly) Complete Inventories of Landslides Triggered by the May 12, 2008 Wenchuan Mw 7.9 Earthquake of China and Their Spatial Distribution Statistical Analysis. Landslides 2013, 11. [CrossRef]
  31. Natsuaki, R.; Nagai, H.; Motohka, T.; Ohki, M.; Watanabe, M.; Thapa, R.B.; Tadono, T.; Shimada, M.; Suzuki, S. SAR Interferometry Using ALOS-2 PALSAR-2 Data for the Mw 7.8 Gorkha, Nepal Earthquake. Earth Planets Space 2016, 68, 15. [CrossRef]
  32. Lindsey, E.O.; Natsuaki, R.; Xu, X.; Shimada, M.; Hashimoto, M.; Melgar, D.; Sandwell, D.T. Line-of-Sight Displacement from ALOS-2 Interferometry: Mw 7.8 Gorkha Earthquake and Mw 7.3 Aftershock. Geophys. Res. Lett. 2015, 42, 6655–6661. [CrossRef]
  33. Farr, T.G.; Rosen, P.A.; Caro, E.; Crippen, R.; Duren, R.; Hensley, S.; Kobrick, M.; Paller, M.; Rodriguez, E.; Roth, L.; et al. The Shuttle Radar Topography Mission. Rev. Geophys. 2007, 45. [CrossRef]
  34. Willett, S.D.; Hovius, N.; Brandon, M.T.; Fisher, D.M.; Wobus, C.; Whipple, K.X.; Kirby, E.; Snyder, N.; Johnson, J.; Spyropolou, K.; et al. Tectonics from Topography: Procedures, Promise, and Pitfalls.; Geological Society of America, 2006.
  35. Kirby, E.; Whipple, K. Expression of Active Tectonics in Erosional Landscapes. J. Struct. Geol. 2012, 44, 54–75. [CrossRef]
  36. Liu, F.; Yao, X.; Li, L. Applicability of Geomorphic Index for the Potential Slope Instability in the Three River Region, Eastern Tibetan Plateau. Sensors 2021, 21, 6505. [CrossRef]
  37. Smith, A.G.G.; Fox, M.; Schwanghart, W.; Carter, A. Comparing Methods for Calculating Channel Steepness Index. Earth-Sci. Rev. 2022, 227, 103970. [CrossRef]
  38. Reyes-Carmona, C.; Galve, J.P.; Pérez-Peña, J.V.; Moreno-Sánchez, M.; Alfonso-Jorde, D.; Ballesteros, D.; Torre, D.; Azañón, J.M.; Mateos, R.M. Improving Landslide Inventories by Combining Satellite Interferometry and Landscape Analysis: The Case of Sierra Nevada (Southern Spain). Landslides 2023, 20, 1815–1835. [CrossRef]
  39. Stephenson, O.; Kohne, T.; Zhan, E.; Cahill, B.; Yun, S.-H.; Ross, Z.; Simons, M. Deep Learning-Based Damage Mapping With InSAR Coherence Time Series. IEEE Trans. Geosci. Remote Sens. 2021, PP, 1–17. [CrossRef]
  40. Rao, A.; Jung, J.; Silva, V.; Molinario, G.; Yun, S.-H. Earthquake Building Damage Detection Based on Synthetic-Aperture-Radar Imagery and Machine Learning. Nat. Hazards Earth Syst. Sci. 2023, 23, 789–807. [CrossRef]
  41. Dimitrios Zekkos; Marin K. Clark; Kevin Roback; Sean F. Gallen; A. Joshua West; Gen Li; Deepak Chamlagain; Jonathan W Godt Map Data of Landslides Triggered by the 25 April 2015 Mw 7.8 Gorkha, Nepal Earthquake 2017.
  42. Rosen, P.A.; Gurrola, E.; Sacco, G.F.; Zebker, H. The InSAR Scientific Computing Environment.; Nuremberg, April 23 2012.
  43. Liang, C.; Fielding, E. Interferometry with ALOS-2 Full-Aperture ScanSAR Data. IEEE Trans. Geosci. Remote Sens. 2017, PP. [CrossRef]
  44. Liang, C.; Fielding, E. Measuring Azimuth Deformation With L-Band ALOS-2 ScanSAR Interferometry. IEEE Trans. Geosci. Remote Sens. 2017, PP, 1–14. [CrossRef]
  45. Liang, C.; Liu, Z.; Fielding, E.; Burgmann, R. InSAR Time Series Analysis of L-Band Wide-Swath SAR Data Acquired by ALOS-2. IEEE Trans. Geosci. Remote Sens. 2018, PP, 1–15. [CrossRef]
  46. HOWARD, A.D.; KERBY, G. Channel Changes in Badlands. GSA Bull. 1983, 94, 739–752. [CrossRef]
  47. Whipple, K.X.; Tucker, G.E. Dynamics of the Stream-Power River Incision Model: Implications for Height Limits of Mountain Ranges, Landscape Response Timescales, and Research Needs. J. Geophys. Res. Solid Earth 1999, 104, 17661–17674. [CrossRef]
  48. Schwanghart, W.; Scherler, D. Short Communication: TopoToolbox 2 – MATLAB-Based Software for Topographic Analysis and Modeling in Earth Surface Sciences. Earth Surf. Dyn. 2014, 2, 1–7. [CrossRef]
  49. M 7.8 - 67 Km NNE of Bharatpur, Nepal Available online: https://earthquake.usgs.gov/earthquakes/eventpage/us20002926/executive (accessed on 27 February 2026).
  50. Search Earthquake Catalog Available online: https://earthquake.usgs.gov/earthquakes/search/ (accessed on 27 February 2026).
  51. ICIMOD | RDS Available online: https://rds.icimod.org/Home/DataDetail?metadataId=1972511 (accessed on 27 February 2026).
  52. Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation 2015.
  53. Chen, L.-C.; Zhu, Y.; Papandreou, G.; Schroff, F.; Adam, H. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. In Computer Vision – ECCV 2018; Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, 2018; Vol. 11211, pp. 833–851 ISBN 978-3-030-01233-5.
  54. Map Data of Landslides Triggered by the 25 April 2015 Mw 7.8 Gorkha, Nepal Earthquake | U.S. Geological Survey Available online: https://www.usgs.gov/data/map-data-landslides-triggered-25-april-2015-mw-78-gorkha-nepal-earthquake (accessed on 27 February 2026).
Figure 2. Workflow of the ALOS-2 ScanSAR InSAR processing used to derive coseismic LOS displacement and coherence products, including pre-processing, interferogram generation, phase correction, and post-processing steps.
Figure 2. Workflow of the ALOS-2 ScanSAR InSAR processing used to derive coseismic LOS displacement and coherence products, including pre-processing, interferogram generation, phase correction, and post-processing steps.
Preprints 201219 g002
Figure 3. InSAR-derived products for the 2015 Gorkha earthquake. (a) Coseismic line-of-sight (LOS) displacement from the ALOS-2 ScanSAR interferogram. (b) Coherence-based Damage Proxy Map (DPM) derived from pre- and post-event coherence differences, where higher values indicate greater surface disturbance.
Figure 3. InSAR-derived products for the 2015 Gorkha earthquake. (a) Coseismic line-of-sight (LOS) displacement from the ALOS-2 ScanSAR interferogram. (b) Coherence-based Damage Proxy Map (DPM) derived from pre- and post-event coherence differences, where higher values indicate greater surface disturbance.
Preprints 201219 g003
Figure 4. Spatial distribution of the normalized channel steepness index (Kₛₙ) in the study area, derived from the DEM. Higher Kₛₙ values indicate over-steepened channels associated with enhanced tectonic uplift.
Figure 4. Spatial distribution of the normalized channel steepness index (Kₛₙ) in the study area, derived from the DEM. Higher Kₛₙ values indicate over-steepened channels associated with enhanced tectonic uplift.
Preprints 201219 g004
Figure 5. Exploratory analyses supporting conditioning-factor selection. (a) Rose diagram (circular histogram) of landslide counts by slope aspect derived from the inventory, used to justify cosine-sine encoding of aspect. (b) Monthly rainfall time series (September 2014–August 2015) for major earthquake-affected districts, illustrating antecedent moisture conditions prior to the 2015 Gorkha earthquake.
Figure 5. Exploratory analyses supporting conditioning-factor selection. (a) Rose diagram (circular histogram) of landslide counts by slope aspect derived from the inventory, used to justify cosine-sine encoding of aspect. (b) Monthly rainfall time series (September 2014–August 2015) for major earthquake-affected districts, illustrating antecedent moisture conditions prior to the 2015 Gorkha earthquake.
Preprints 201219 g005
Figure 8. Model discrimination performance. (a) ROC curves comparing CNN, U-Net, DeepLabV3, Random Forest, and XGBoost. (b) Precision–recall curves highlighting landslide detection performance under class imbalance.
Figure 8. Model discrimination performance. (a) ROC curves comparing CNN, U-Net, DeepLabV3, Random Forest, and XGBoost. (b) Precision–recall curves highlighting landslide detection performance under class imbalance.
Preprints 201219 g008
Figure 9. Row-normalized confusion matrices (%) for the evaluated models at a 0.5 probability threshold: (a) Random Forest, (b) XGBoost, (c) CNN, (d) U-Net, and (e) DeepLabV3.
Figure 9. Row-normalized confusion matrices (%) for the evaluated models at a 0.5 probability threshold: (a) Random Forest, (b) XGBoost, (c) CNN, (d) U-Net, and (e) DeepLabV3.
Preprints 201219 g009
Figure 10. Landslide probability maps generated by the trained models across the study area: (a) Random Forest, (b) XGBoost, (c) CNN, (d) U-Net, (e) DeepLabV3, and (f) reference landslide inventory (ground truth). Values represent predicted coseismic landslide probability (0–1).
Figure 10. Landslide probability maps generated by the trained models across the study area: (a) Random Forest, (b) XGBoost, (c) CNN, (d) U-Net, (e) DeepLabV3, and (f) reference landslide inventory (ground truth). Values represent predicted coseismic landslide probability (0–1).
Preprints 201219 g010
Figure 11. Visual comparison of landslide probability predictions from XGBoost, Random Forest (RF), U-Net, CNN, and DeepLabV3 against the reference landslide inventory (ground truth) for representative subregions.
Figure 11. Visual comparison of landslide probability predictions from XGBoost, Random Forest (RF), U-Net, CNN, and DeepLabV3 against the reference landslide inventory (ground truth) for representative subregions.
Preprints 201219 g011
Figure 12. Permutation-based feature importance for the evaluated models: (a) Random Forest, (b) XGBoost, (c) CNN, (d) U-Net, and (e) DeepLabV3, showing the relative contribution of conditioning factors to landslide prediction.
Figure 12. Permutation-based feature importance for the evaluated models: (a) Random Forest, (b) XGBoost, (c) CNN, (d) U-Net, and (e) DeepLabV3, showing the relative contribution of conditioning factors to landslide prediction.
Preprints 201219 g012
Figure 14. Relationship between Kₛₙ density and landslide distribution. (a) Classified Kₛₙ-density map with south–north profile locations. (b–e) Elevation profiles along Profiles 1–4 showing the correspondence between high-Kₛₙ zones and mapped landslides.
Figure 14. Relationship between Kₛₙ density and landslide distribution. (a) Classified Kₛₙ-density map with south–north profile locations. (b–e) Elevation profiles along Profiles 1–4 showing the correspondence between high-Kₛₙ zones and mapped landslides.
Preprints 201219 g014
Table 1. ALOS-2 PALSAR-2 ScanSAR acquisitions used to generate the coseismic line-of-sight (LOS) displacement for the 2015 Gorkha earthquake.
Table 1. ALOS-2 PALSAR-2 ScanSAR acquisitions used to generate the coseismic line-of-sight (LOS) displacement for the 2015 Gorkha earthquake.
Acquisition Date Satellite Mode Frame Off-Nadir Angle Role
5 April 2015 ALOS-2 ScanSAR (HH) 3050 35.2° Reference
17 May 2015 Secondary
Table 2. Interferometric pairs used to compute the coherence-based Damage Proxy Map (DPM), showing the pre-event and post-event acquisition intervals.
Table 2. Interferometric pairs used to compute the coherence-based Damage Proxy Map (DPM), showing the pre-event and post-event acquisition intervals.
Pair Acquisition Dates
Pre-event (γ_pre) 22 February–5 April 2015
Post-event (γ_post) 5 April–17 May 2015
Table 3. Performance metrics for the evaluated ML and DL models based on AUC-ROC, AUC-PR, CSI, and Brier score for coseismic landslide probability mapping.
Table 3. Performance metrics for the evaluated ML and DL models based on AUC-ROC, AUC-PR, CSI, and Brier score for coseismic landslide probability mapping.
Model AUC-ROC AUC-PR CSI Brier
CNN 0.9029 0.6635 0.3007 0.2764
U-Net 0.8918 0.6675 0.3819 0.1409
DeepLabV3 0.9367 0.7677 0.4956 0.1229
Random Forest 0.9689 0.6425 0.3985 0.0428
XGBoost 0.9724 0.6535 0.3822 0.0250
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated