1. Introduction
Accurate monitoring of Land Use and Land Cover (LULC) dynamics is essential for achieving the Sustainable Development Goals (SDGs) and supporting climate mitigation strategies advanced by the Intergovernmental Panel on Climate Change (IPCC) [
1]. Over the past decade, the open-data policies of Earth observation missions such as Landsat and Copernicus Sentinel, together with the democratization of high-performance cloud computing through platforms like Google Earth Engine (GEE), have transformed global observation capabilities and enabled large-scale, multi-temporal analyses across spatial scales [
2].
Despite these advances, the production of systematic LULC maps in mountainous ecosystems remains severely constrained, even when machine-learning classifiers are employed [
3,
4]. Persistent cloud cover creates substantial data gaps in optical time series, while complex topography introduces radiometric distortions that degrade model performance [
5].
This observational deficit has produced a critical geographic and thematic bias within the national literature. A recent systematic review [
6] shows that LULC research in Chile is disproportionately concentrated in the central region, whereas areas such as Aysén and Magallanes account for less than 2% of published studies. This gap is particularly concerning given that the Chilean Patagonia represents a globally significant natural laboratory, characterized by a pronounced west–east ecological gradient encompassing temperate forests, shrublands, steppes, and wetlands [
7,
8]. Moreover, the limited monitoring efforts conducted to date have primarily focused on forest dynamics, leaving substantial gaps in the characterization of transitional land-cover classes. These classes are frequently misclassified in official static inventories and global products [
6,
9,
10,
11].
To overcome these limitations, it is necessary to move beyond traditional optical-only approaches. The combined use of robust statistics derived from the temporal distribution (e.g., medians and percentiles such as
and
) has become an effective strategy for mitigating atmospheric noise and capturing phenological variability in optical time series, even under limited observation availability [
12]. Nevertheless, exclusive reliance on optical data remains a vulnerability at austral latitudes [
13]. Consequently, the integration of Synthetic Aperture Radar (SAR), particularly Sentinel-1, emerges as a critical complementary solution by providing information on physical structure and surface roughness that is independent of atmospheric conditions [
14]. In environments characterized by abrupt relief, however, effective SAR use requires the explicit incorporation of topographic variables, not only to contextualize the radar signal as a function of terrain geometry but also to represent ecologically relevant altitudinal gradients [
15].
Within this context, the present study implements a systematic ablation experimental design in the Aysén River Basin to quantify the relative and complementary contributions of the optical, topographic, and radar domains to LULC classification. Three working hypotheses are formulated, linking sensor physical properties to landscape structure:
1.Phenological Hypothesis (H1): Distribution-based percentile metrics are expected to capture the amplitude of phenological signals more effectively than measures of central tendency, enabling the discrimination of spectrally similar land covers with contrasting intra-annual dynamics (e.g., deciduous versus evergreen vegetation).
2. Structural Hypothesis (H2): The inclusion of SAR backscatter from Sentinel-1 is hypothesized to provide information orthogonal to optical reflectance, facilitating land-cover discrimination based on surface roughness and volumetric structure, particularly for classes with contrasting architectures (e.g., urban areas versus bare soil).
3.Topo-Ecological Hypothesis (H3): Topographic variables are anticipated to act as environmental proxies of the altitudinal gradient, constraining the spatial probability of class occurrence and reducing thematic confusion in Andean transition zones (e.g., vegetation, snow, and wetland distributions).
2. Materials and Methods
2.1. Study Area
The Aysén River basin is located in the Aysén del General Carlos Ibáñez del Campo Region of southern Chilean Patagonia, situated approximately between 45°00′–46°16′ S and 71°20′–73°00′ W (
Figure 1). According to the official delimitation by the General Water Directorate’s (DGA) National Inventory of Hydrographic Basins, the basin covers a total area of 1,142,537 ha.
The territory is characterized by rugged topography, extending from sea level at the Aysén Fjord to the Andean summits. The basin’s relief rises progressively from the eastern valleys to the Andes Mountains in the west, where the highest altitudes and steepest slopes occur. Elevations in this sector reach 2,227 m a.s.l., with a mean slope of 32% [
16].
Climatically, the basin exhibits a marked west-to-east gradient. In the western sector, annual precipitation exceeds 3,000–4,000 mm, accompanied by persistent cloud cover, which fosters the development of temperate evergreen forests and Lenga (Nothofagus pumilio) stands [
16,
17]. Eastward, precipitation drops to 621 mm annually in Balmaceda, where Patagonian steppes predominate under a cold, dry climate [
7]. This environmental contrast drives an ecologically significant biogeographical transition from coastal rainforests to inland semi-arid environments.
Furthermore, the current landscape configuration of the Aysén River basin is the direct result of massive and singular anthropogenic disturbance. Historically, during the agro-livestock colonization period (late 19th to mid-20th century), the basin underwent massive and systematic burning of native forest to clear land for pasture [
18]. It is estimated that nearly 60% of the original forest was destroyed, causing severe fragmentation and loss of landscape connectivity [
7,
16]. The western sector was the most severely affected; here, the loss of vegetation cover promotes soil erosion and prompted reforestation programs using exotic species, primarily Pinus spp. These plantations altered the landscape structure and evolved into a relevant productive component for the region [
16]. This legacy of degradation has generated vast transition zones or anthropogenic ecotones, composed of dense secondary shrublands (Nothofagus antarctica), degraded grasslands, and matrices of standing deadwood.
As a result of the interaction between these natural and anthropogenic factors, the Aysén River basin currently exhibits a highly heterogeneous environmental mosaic. Temperate rainforests, peatlands, shrublands, agricultural grasslands, rocky areas, snow, and glaciers coexist within the territory alongside productive zones and dispersed settlements. This topographical, climatic, and historical diversity reflects the environmental and socio-ecological transformations of the 20th century, rendering the basin a representative landscape of Chilean Patagonia where processes of conservation, productive use, and ecological regeneration converge [
7,
18].
2.2. Data Acquisition
The satellite and topographic data employed in this study consist of public products from the Sentinel-2, Sentinel-1, and SRTM missions, accessed and processed via the Google Earth Engine platform [
19].
2.2.1. Sentinel-2 (Optical)
We employed images from the Multispectral Instrument (MSI) onboard the Sentinel-2A and Sentinel-2B satellites, part of the European Space Agency’s (ESA) Copernicus program. The products used correspond to Level-2A (Surface Reflectance), obtained via Sen2Cor atmospheric correction from Level-1C data [
20]. This processing level provides surface reflectance suitable for land use and land cover analysis [
21]. Each image consists of 13 spectral bands with spatial resolutions of 10, 20, and 60 m. In this study, we employed the 10 m bands (B2–B4, B8) and the 20 m SWIR bands (B11, B12), the latter resampled to 10 m to maintain spatial consistency. The SWIR bands are particularly useful for discriminating between snow and clouds due to the strong absorption of snow in the shortwave infrared region, in contrast to the high reflectance of clouds in this spectral range [
22], a key aspect in the context of the high cloud cover typical of Patagonia.
2.2.2. Sentinel 1 (SAR)
We incorporated C-band (5.405 GHz) Synthetic Aperture Radar (SAR) data acquired by the Sentinel-1 system, which operates as an active dual-polarization sensor providing observations in VV (vertical transmit/vertical receive) and VH (vertical transmit/horizontal receive) polarizations [
23]. Sentinel-1 allows for systematic image acquisition every six days, independent of weather conditions or time of day. Specifically, we used Ground Range Detected (GRD) products in Interferometric Wide Swath (IW) mode from the COPERNICUS/S1_GRD collection. These products are radiometrically calibrated, orthorectified, and have a spatial resolution of 10 m. SAR data were used to complement optical information given their sensitivity to physical surface properties such as structure, roughness, and moisture content, which are especially relevant in mountainous and environmentally complex landscapes [
23,
24].
2.2.3. Digital Elevation Model
The Digital Elevation Model (DEM) from the Shuttle Radar Topography Mission (SRTM) was used as the source of topographic information, with a spatial resolution of 30 m. This model was generated from C-band radar interferometry data and has a reported vertical accuracy of ±16 m [
25]. Although SRTM may exhibit higher uncertainties in areas of rugged relief due to radar shadow or decorrelation over snow-covered and glacial surfaces, Version 3 (used in this study) incorporates a void-filling process based on auxiliary data, providing nearly global, gap-free coverage [
25,
26]. The product is available in the GEE catalog under the identifier USGS/SRTMGL1_00314 [
27].
2.2.4. Auxiliary Data
a)PlanetScope High-Resolution Imagery
In the absence of field data, very high-resolution satellite imagery can serve as reference data for accuracy assessment [
28]. In this study, reference information was obtained from PlanetScope (3 m spatial resolution), which was visually interpreted to validate land use and land cover classes. These data were used exclusively for independent validation and were not included in the classifier training set.
b)CONAF Land Use Cadastre
The “Vegetational Resources and Land Use Cadastre” by CONAF constitutes official cartography at a 1:50,000 scale, generated through multi-temporal satellite image analysis, GIS-assisted photointerpretation, and field verification. In this study, the 2020–2022 update was used as auxiliary information to support the thematic validation and spatial consistency of coinciding LULC classes, particularly in forest ecosystems and areas of anthropogenic transition [
10].
c)Copernicus Land Cover products
Land cover products from the Copernicus program, developed within the Copernicus Global Land Service framework, provide a harmonized first-order classification at regional and global scales based on multi-temporal analysis of 10-meter resolution optical satellite imagery [
9]. In this study, Copernicus Land Cover maps were employed as auxiliary information to establish a general thematic reference framework. Upon this framework, we developed a more detailed, regionally focused classification that incorporates specific classes from the Patagonian context both ecological and productive such as Patagonian steppes and forage grasslands.
2.3. Pre-processing and Composite Generation
2.3.1. Temporal Filtering and Masking
All Sentinel-2 SR scenes from 2021 intersecting the study basin were retrieved. Given the persistent cloud cover characteristic of Chilean Patagonia, a scene-level inclusion threshold of 70% was applied to ensure sufficient temporal observation availability. Subsequently, clouds, cirrus, and shadows were masked at the pixel level using the Cloud Score+ (CS+) algorithm. This algorithm, developed by the Google Earth Engine team as an extension of the method by [
29], , estimates atmospheric visibility via weakly supervised deep learning and has demonstrated robust performance in recent LULC classification applications [
30].
2.3.2. Image Compositing
Following masking, multi-temporal mosaics were constructed at different aggregation scales to represent both the mean surface state and its temporal dynamics. The hierarchical compositing strategy is detailed below:
Valid Sentinel-2 scenes were grouped by austral quarter, corresponding to summer (DJF), autumn (MAM), winter (JJA), and spring (SON). For each period, the median reflectance per pixel was calculated. The use of the median reduces the influence of outliers and avoids biases associated with extreme phenological peaks, providing stable and comparable seasonal representations [
31,
32]. Each composite integrated between 5 and 15 observations per pixel, depending on temporal availability and cloud persistence. Key spectral indices (NDVI, EVI2, NDWI, NDSI, and NBR) were calculated from the reflectance composites, generating a multi-band set of seasonal composites.
-
b)
Percentile Composites
To characterize intra-seasonal variability, the 25th (
) and 75th (
) percentiles were calculated for the spectral indices NDVI, EVI2, NDWI, NDSI, and NBR. These percentiles describe the temporal amplitude of the phenological and hydrological signals and complement the seasonal median by providing additional information on relative conditions within the analyzed period. This complementary approach has been shown to improve the separability of dynamic classes in heterogeneous environments [
33]
-
c)
Annual Composite and Gap-filling
An annual composite for 2021 was generated using the median of all valid post-masking observations from the year. This product was used exclusively to fill data gaps in the seasonal mosaics caused by persistent cloud cover or shadows. This hierarchical compositing scheme, which prioritizes the seasonal level over the annual level, ensures spatial continuity without sacrificing the dominant phenological signal. This strategy is consistent with “best-available-pixel” algorithms described by [
34] and aligns methodologically with global products such as ESA WorldCover [
35].
-
d)
Annual SAR Composites
Since Synthetic Aperture Radar (SAR) is unaffected by cloud cover, gap-filling strategies were unnecessary, unlike in the optical case. In this study, Sentinel-1 data were integrated via an annual composite, calculated as the temporal median of the VV and VH backscatter coefficients, as well as the derived SAR indices.
2.4. Predictor Variable Extraction
Based on the generated composites, a multitemporal data cube was constructed at a 10 m spatial resolution. To evaluate the complementarity of the different information sources, variables were organized into four thematic blocks (A, P, T, R); their detailed mathematical formulations are presented in
Table 1.
2.4.1. Block A – Multispectral Optical (Baseline)
This block represents the mean phenological state and spectral composition of the surface. It integrates Sentinel-2 bands (B2, B3, B4, B8, B11, B12) and a set of spectral indices calculated for each season. In addition to standard vegetation (NDVI, EVI2), water (NDWI), and snow (NDSI) indices, specific indices for arid and anthropized zones were incorporated: SAVI to minimize soil background noise in sparse steppes, BSI to characterize bare soils, and NDBI for built-up areas. (Total: 56 layers).
2.4.2. Block P – Percentile (Temporal Dynamics)
Designed to capture the intra-seasonal variability that the median tends to smooth out [
6], this block consists of the 25th 25th (
) and 75th (
) percentiles calculated for the full set of eight spectral indices (NDVI, EVI2, SAVI, NDWI, NDSI, NBR, BSI, and NDBI) [
7]. These metrics describe the amplitude of the phenological signal, enabling discrimination between stable covers (e.g., evergreen forest) and those with high temporal variance (e.g., crops or ephemeral wetlands). (Total: 40 layers).
2.4.3. Block T – Topographic (Geomorphological Context)
Four static variables were derived from the elevation model (SRTM, resampled): elevation, slope, and the aspect components Northness and Eastness. These variables act as proxies for the thermal and insolation gradients that condition vegetation distribution in Andean environments. (Total: 4 layers).
2.4.4. Block R – Polarimetric Radar (Structure and Roughness)
Derived from annual Sentinel-1 composites, this block provides information on target geometry, independent of solar illumination. It includes backscatter intensities (VV, VH), local incidence angle, and a set of advanced polarimetric indices sensitive to canopy structure: the Cross-Polarization Ratio (CR), Degree of Polarization (DOP), dual-pol Radar Vegetation Index (RVI), and Normalized Polarimetric RVI (NPRVI). These variables allow for the characterization of biomass and structural complexity under any atmospheric condition. (Total: 8 layers).
Table 1.
Mathematical definitions and references for the predictor variables.
Table 1.
Mathematical definitions and references for the predictor variables.
Block |
Variable |
Name |
Mathematical formulation |
Reference |
| OPTICAL (A) |
NDVI |
Norm. Difference Veg. Index |
|
[36] |
| EVI2 |
Enhanced Veg. Index (2-band) |
|
[37] |
| SAVI |
Soil Adjusted Veg. Index |
|
[38] |
| NDWI |
Norm. Difference Water Index |
|
[39] |
| NDSI |
Norm. Difference Snow Index |
|
[40] |
| NBR |
Normalized Burn Ratio |
|
[41] |
| NDBI |
Norm. Difference Built-up Index |
|
[42] |
| BSI |
Bare Soil Index |
|
[43] |
| |
|
|
|
|
| TOPOGRAPHY (T). |
Elev |
Elevation (SRTM) |
Surface elevation above mean sea level (m) derived from the Shuttle Radar Topography Mission digital elevation model. |
[25] |
| Slope |
Slope Gradient |
First derivative of a continuous elevation surface, expressing the maximum rate of elevation change per unit horizontal distance. |
[44] |
| North |
Northness (Aspect Component) |
Continuous transformation of slope aspect expressing the degree to which a surface is oriented toward the north–south direction. |
[45] |
| East |
Eastness (Aspect Component) |
Continuous transformation of slope aspect expressing the degree to which a surface is oriented toward the east–west direction. |
[45] |
| |
|
|
|
|
| RADAR (R) |
VV y VH |
Backscatter Intensity |
Normalized radar cross-section (σ0) derived from SAR image intensity |
[46] |
| CR |
Cross-Polarization Ratio |
|
[47] |
| DOP |
Degree of Polarization |
|
[48] |
| RVI |
Radar Veg. Index (Dual-Pol) |
|
[49] |
| NPRVI |
Normalized Polarimetric RVI |
|
[48] |
2.5. Experimental Design: Modular Contribution Assessment
To quantify the specific contribution of the different information sources, we implemented a modular contribution assessment experimental design. The objective was to measure the performance gain provided by each individual thematic block (modules P, T, R described in
Section 2.4) when integrated into a backbone configuration.
The experiment was structured by defining a Baseline (Model A), composed exclusively of standard phenological information. Additional information modules were added to this base in a controlled manner. This approach allows for isolating the discriminative capacity of temporal dynamics, topography, and radar, unlike simple stacking strategies where the individual contribution of each source tends to be diluted or masked.
Reference: The optical baseline (A).
Marginal Contribution: Augmented models (A+P, A+T, A+R) to evaluate the specific complementarity of each module.
Total Synergy: Full integration (Full) to evaluate the maximum multi-sensor scenario.
Table 2.
Experimental configurations for the modular contribution assessment.
Table 2.
Experimental configurations for the modular contribution assessment.
| Model |
Feature Set Description |
Conceptual Role of the Block |
Examples of Potentially Discriminated Covers |
| A (Seasonal Optical) |
Multispectral variables and indices derived from Sentinel-2 (seasonal medians). |
Represents the reference optical spectral and phenological information. |
Vegetation vs. soil; deciduous vs. evergreen. |
| A + P (Optical + Percentiles) |
A + P25 and P75 percentiles for each band/index. |
Captures intra-seasonal variability of the optical signal (temporal dynamics). |
Crops and grasslands; temporary vs. permanent water. |
| A + T (optical + Topography) |
A + variables derived from the DEM (elevation, slope, Northness, Eastness). |
Incorporates static environmental gradients associated with relief and insolation. |
High-Andean vegetation; forest vs. shrubland. |
| A + R (Optico + Radar SAR) |
A + VV, VH backscatter and annual derived SAR metrics. |
Adds structural and moisture information independent of clouds. |
Wetlands; wet soils; dense vegetation. |
| Full (A + P + T + R) |
Full integration of all variables. |
Evaluates the complete multi-sensor synergy of the system. |
Fine-grained discrimination among the total set of classes. |
To ensure statistical comparability, the classification algorithm (Random Forest), hyperparameters, and training and validation sets were held constant across all runs. Consequently, variations in accuracy metrics (OA, F1-score) are attributable solely to the information contributed by the specific module under evaluation.
2.6. Sampling and Class Definition
Reference data were generated using a convergent evidence strategy that integrated three information sources. The year 2021 was selected as the study period to align with the most recent update of the Native Vegetational Resources Cadastre, ensuring maximum temporal synchrony between the validation data and the satellite signal.
The sources used were: (i) visual interpretation of high-spatial-resolution PlanetScope imagery (3 m/pixel), (ii) reference vectors from the aforementioned CONAF Cadastre, and (iii) Copernicus Global Land Cover products. Based on the delimitation of validated homogeneous polygons, we applied stratified random sampling, selecting 1,500 points per class to create a robust dataset of 16,500 samples.
To ensure statistical independence and minimize the effect of spatial autocorrelation, a spatial hold-out partition strategy was implemented. The total dataset was randomly divided into two disjoint subsets: 70% allocated for model training and 30% reserved exclusively for external validation.
The resulting classification scheme comprises 11 Land Use and Land Cover (LULC) classes; their biophysical definitions are detailed in
Table 3.
2.7. Classifier Configuration
Supervised classification was performed using the Random Forest (RF) algorithm, an ensemble learning method based on the aggregation of multiple decision trees. RF was selected for its ability to model nonlinear relationships and its robustness to noise in high-dimensional feature spaces [
50].
Systematic reviews have shown that RF consistently outperforms traditional parametric classifiers (e.g., Maximum Likelihood) and achieves accuracy that is comparable to or higher than more complex machine-learning methods such as Support Vector Machines (SVM). These advantages are coupled with reduced requirements for parameter tuning and lower computational cost [
21,
51].
Model implementation relied on the following hyperparameter settings to stabilize generalization error:
Number of trees (ntree): A total of 200 decision trees were grown.
Variables per split (mtry): The number of predictor variables evaluated at each node split was set to the square root of the total number of predictors .
Splitting criterion: Entropy (information gain) was used to optimize node partitioning.
Under this configuration, five independent models were trained, corresponding to the experimental blocks defined in
Section 2.5 (A, A+P, A+T, A+R, A+P+T+R). In all cases, a fixed random seed was used to ensure full reproducibility of the experiments.
2.8. Accuracy Assessment and Performance Metrics
Model reliability was assessed using an independent validation dataset (30%). For each experimental configuration, a confusion matrix was computed, from which accuracy metrics were derived following standard validation protocols [
28].
The following metrics were calculated:
Overall Accuracy (OA): The proportion of correctly classified samples relative to the total number of validation samples.
Kappa coefficient (κ): A chance-corrected measure of agreement, reported for historical comparability and interpreted in a complementary manner, given the recent debate regarding its suitability for thematic map evaluation [
52].
Producer’s Accuracy (PA) and User’s Accuracy (UA): Class-specific metrics used to quantify omission and commission errors, respectively.
In addition, given the class imbalance inherent to the study area, robust metrics recently recommended to reduce evaluation bias were also computed [
53]:
- 4.
Balanced Accuracy (BA): The arithmetic mean of class-wise sensitivity (recall), a critical metric to ensure that dominant classes do not mask errors associated with minority classes.
- 5.
Macro F1-score: The harmonic mean of precision and recall, averaged across all classes, assigning equal weight to each category in the final model evaluation.
Finally, a comparative analysis was conducted by quantifying the absolute differences in these metrics between the multisensor models (A+P, A+T, A+R, and Full) and the optical baseline model (A).
3. Results
3.1. Global Performance of the Ablation Models
Overall LULC classification performance improved progressively as additional variable blocks were incorporated into the seasonal optical baseline model (A).
Table 4 summarizes the global accuracy metrics obtained for each experimental configuration.
The optical baseline model (A), based exclusively on seasonal spectral information, achieved an Overall Accuracy (OA) of 89.2%, with a κ coefficient of 0.871, a Balanced Accuracy (BA) of 86.1%, and a Macro-F1 of 80.5%. Although these values indicate strong overall performance, the contrast between OA and Macro-F1 reveals heterogeneous class-wise performance, consistent with a land-cover imbalance scenario.
The subsequent inclusion of multi-temporal percentiles (A+P) yielded modest but consistent improvements relative to the baseline, with gains of +0.4% in OA and +1.2% in Macro-F1. This result indicates that intra-annual information contributes to stabilizing spectral responses across land covers, although its isolated impact on overall performance remains limited. In contrast, adding topographic variables (A+T) produced more pronounced improvements, particularly in metrics sensitive to class-level performance, such as BA (+1.3%) and Macro-F1 (+3.8%). This pattern suggests enhanced discrimination of land covers conditioned by topographic gradients.
The largest single contribution, however, was observed with the incorporation of radar information (A+R). Relative to the baseline model, A+R improved OA by 2.5%, κ by 0.028, and Macro-F1 by 3.8%, highlighting the substantial role of SAR data in separating structurally complex land covers that are spectrally similar.
Finally, the Full model (A+P+T+R) integrated all data sources and achieved the best overall performance across all evaluated metrics, with an OA of 92.5%, a BA of 89.0%, and a Macro-F1 of 86.0%. The cumulative gain of +5.5 percentage points in Macro-F1 relative to the optical baseline demonstrates that multisensor integration is both highly effective and synergistic. The contributions of topography and radar are not redundant but complementary, correcting misclassification in different underrepresented classes.
To visualize the marginal impact of each variable domain,
Figure 3 presents the relative gains with respect to the baseline model.
Taken together,
Figure 3 highlights the progressive and incremental nature of the observed performance gains, underscoring the dominant contribution of topographic information and, in particular, radar data, as well as the cumulative effect achieved by the Full model. These results establish the quantitative basis for the detailed class-wise analysis presented in the following subsection.
3.2. Class-wise Metrics and Confusion Patterns
The class-wise analysis reveals that the impact of the different variable blocks varies markedly across land-cover categories, as summarized in
Figure 4.
Classes with well-defined spectral signatures, such as Water, Snow/Ice, and forest covers, exhibit high F1 values (≥90%). In contrast, classes characterized by greater spectral and structural heterogeneity show substantially lower performance, most notably Natural Grasslands/Shrublands (48.4% F1) and Bare Soil/Alluvial Beaches (53.4% F1), followed by Forage Grassland (72.9% F1) and Wetlands (75.2% F1). For Natural Grasslands/Shrublands, the combination of high PA and very low UA indicates a predominance of commission errors, consistent with over-assignment of this class.
The inclusion of multi-temporal percentiles (A+P) results in limited and class-specific improvements, with a clear increase for Forage Grassland (+6.6 percentage points in F1) and a moderate improvement for the Urban class (+2.9 percentage points in F1). For the remaining categories, changes are minor and do not substantially alter the confusion patterns observed in the optical baseline model.
In contrast, the topographic block (A+T) produces more consistent gains for relief-conditioned classes, particularly Bare Soil/Alluvial Beaches (+15.6 percentage points in F1), Natural Grasslands/Shrublands (+13.7), and Rocky Terrain (+7.4), as well as moderate improvements for Wetlands (+6.3). Classes that were already well classified remain relatively stable.
Similarly, the addition of radar information (A+R) contributes to reducing persistent confusion among structurally complex classes. Under this configuration, Natural Grasslands/Shrublands shows the largest relative gain in F1 (≈+16 percentage points), while the Urban class (≈ +6 percentage points) and Forage Grassland (+4 percentage points) exhibit moderate improvements. Bare Soil/Alluvial Beaches shows a smaller gain (≈+3 percentage points), whereas Wetlands display an intermediate increase (≈+4 percentage points). In contrast, Water and Snow/Ice remain virtually unchanged, consistent with their high separability already achieved using optical information.
Finally, the Full model (A+P+T+R) consolidates the observed improvements, achieving high F1 values for most classes and substantially reducing the performance gaps of the baseline model. Nevertheless, Natural Grasslands/Shrublands remains the lowest-performing class (F1 ≈ 70%), followed by Bare Soil/Alluvial Beaches (F1 ≈ 76%) and Forage Grassland (F1 ≈ 81%).
3.3. Variable Importance
Variable importance analysis using the Random Forest classifier allowed for the identification of the most influential predictors in each model configuration and an evaluation of how their hierarchy shifts across the incremental scheme. To facilitate comparison between configurations,
Figure 5 presents the 15 variables with the highest relative importance for each case, estimated using the Mean Decrease in Gini Index.
In the seasonal optical baseline model (A), the importance hierarchy is dominated by short-wave infrared (SWIR) bands, specifically B11_son and B12_jja. These are accompanied by spectral indices associated with snow, moisture, and the contrast between vegetated and non-vegetated surfaces (NDSI, NBR, NDWI, NDBI y BSI). This pattern indicates that the classification relies primarily on spectral contrasts related to surface moisture conditions, seasonal snow presence, and the differentiation of land cover states.
The incorporation of multi-temporal percentiles (A+P) maintains the SWIR bands as pillars but introduces statistical metrics of intra-annual extremes. In particular, the high percentiles of indices related to snow and surface/vegetation cover state (such as NDSI_p75 and NBR_p75) gain greater relevance. This suggests that the persistence or maximum intensity of certain events throughout the year provides critical information that complements the data contained in seasonal medians.
In the model with topography (A+T), a marked shift in the importance hierarchy is observed, with elevation and slope occupying the top-ranking positions and significantly outperforming individual spectral predictors. Although seasonal optical variables remain in the list, their relative weight decreases. This pattern confirms the foundational role of relief and the altitudinal gradient in the spatial distribution of land covers within the study area.
Analysis of the radar-integrated model (A+R) shows that the importance hierarchy is headed by the acquisition geometry variable (angle), followed by VV and VH backscatter intensities, and derived SAR indices sensitive to structure and scattering, such as PRVI and NPRVI. These variables exceed the importance of the optical bands, which remain in intermediate ranking positions. This pattern suggests that, in this configuration, the primary contribution of SAR is linked to capturing the geometric and structural information of the terrain.
Finally, in the Full model (A+P+T+R), the set of most influential variables is dominated by physical landscape descriptors, with elevation and slope among the highest-weighted predictors, alongside the acquisition geometry variable (angle). In this context, there is a prominent presence of SAR variables within the top 10, including PRVI, NPRVI, VV, VH, CR, and RVI. Optical variables appear starting from the seventh position in the ranking (e.g., B12_djf and B11_son). Taken together, this pattern highlights the integrated contribution of topographic gradients, observation geometry, and SAR descriptors in achieving the highest global performance.
For further details, the complete list of the top 20 variables for each model configuration is provided in
Appendix A.
3.4. Spatial Consistency of the Mapping
The spatial distribution of land use and land cover produced by the best-performing configuration (Full model, A+P+T+R) for the entire study area is shown in
Figure 6. At the basin scale, the map adequately reproduces the main ecological and land-use gradients characteristic of the region, capturing the transition from steppe and shrubland in the eastern sector, through an intermediate agricultural–forestry mosaic, to evergreen forests in the western sector, as well as the location of the main urban centers
a) Case 1: Lacustrine–steppe transition in the Lago Misterioso sector
In the reference image, the interface between native forest and forest plantations is clearly distinguishable, associated with the reddish tones of native forest during its autumn phenological phase and the more homogeneous texture and regular geometry of evergreen plantations. This pattern is consistently reproduced in the classification maps.
In the optical baseline model (A) and the A+P configuration, persistent commission errors are observed for the Urban class, reflected in low User’s Accuracy values (UA = 72.0% and 75.4%, respectively), together with an over-assignment of the Wetlands class at higher elevations, consistent with UA = 73.4% in both configurations. These confusions are visually expressed as spurious Urban and Wetlands assignments in areas where such covers are not expected according to ground reference information.
The incorporation of topographic variables (A+T) leads to a clear reduction of Wetlands at higher elevations, with an increase in UA to 81.6% and in F1 to 81.5%, consistent with the visual removal of spurious wetlands on slopes and steep terrain. In this configuration, greater spatial stability is also observed for classes such as Steppe and Bare Soil; however, this improvement does not translate into a reduction of commission errors for the Urban class, whose UA decreases to 68.8%.
The inclusion of radar information (A+R) produces a clear reduction in Urban commission errors, with UA increasing to 77.6% and F1 to 86.8%, consistent with a more accurate delineation of lacustrine–terrestrial transitions. For Wetlands, improvements are more moderate (UA = 80.4%; F1 = 79.5%).
Finally, the Full model (A+P+T+R) consolidates the observed reductions, achieving UA = 88.9% and F1 = 93.0% for the Urban class, and UA = 83.8% and F1 = 83.1% for Wetlands, while classes that were already well characterized (e.g., Water, Native Forest, and Forest Plantations) remain stable (F1 ≥ 93%). Overall, these results are reflected in enhanced spatial stability of lacustrine–terrestrial interfaces within the analyzed area.
b) Case 2: Urban–fluvial environment of the city of Puerto Aysén
This case examines the spatial consistency of the mapping in a complex urban–fluvial setting characterized by compact urban areas, active alluvial plains, riparian wetlands, and bare-soil surfaces linked to fluvial bars.
In configurations A and A+P, persistent commission errors are observed for the Urban class, manifested as spurious expansion into riparian sectors and non-built surfaces, consistent with the low UA values reported (72.0% and 75.4%, respectively). These confusions are mainly concentrated along urban–fluvial interfaces and in transitions with Bare Soil/Sands and Wetlands. The incorporation of topography (A+T) contributes to greater spatial coherence of the fluvial corridor and adjacent alluvial surfaces but does not reduce Urban commission errors (UA = 68.8%), indicating that topographic information stabilizes the geomorphological context without directly discriminating urban cover.
In contrast, the A+R configuration shows a clear reduction in Urban commission errors, with UA increasing to 77.6% and F1 to 86.8%, reflected in a more precise delineation of the urban footprint. For Wetlands, improvements are moderate (UA = 80.4%; F1 = 79.5%), associated with a progressive stabilization of their spatial distribution. The Full model (A+P+T+R) consolidates these improvements, achieving the highest accuracy values for the Urban class (UA = 88.9%; F1 = 93.0%) and enhanced spatial stability along urban–riparian interfaces.
3.5. Sensitivity to the Temporal Aggregation of SAR Variables
As a complementary analysis, the sensitivity of the integrated model (A+P+T+R) to the temporal aggregation scheme of SAR variables was evaluated by comparing the annual aggregation used in the main experimental design with an alternative seasonal aggregation.
From a quantitative perspective, the SAR seasonal-aggregation variant yielded additional gains in global metrics relative to the Full model with annual aggregation (OA = 92.5%; Macro-F1 = 86.0%), reaching an OA of 92.9% and a Macro-F1 of 89.5%. These values correspond to increases of +0.4 and +3.5 percentage points, respectively.
However, qualitative inspection of spatial consistency reveals contrasting behavior.
Figure 7 illustrates this effect in the urban–periurban environment of Coyhaique, comparing the optical reference (
Figure 7A) with the Full model using annual SAR aggregation (
Figure 7B) and its seasonal-aggregation variant (
Figure 7C). While annual aggregation preserves a compact urban delineation that is consistent with the reference, seasonal aggregation induces spurious expansion of the Urban class into rural and periurban areas.
Figure 8.
Spatial comparison between the reference image (RGB composite acquired on 08 April 2021) and the classification maps corresponding to configurations A, A+P, A+T, A+R, and Full (A+P+T+R) in the urban–fluvial environment of the city of Puerto Aysén, characterized by the coexistence of compact urban areas, active alluvial plains, riparian wetlands, and bare-soil surfaces associated with fluvial bars.
Figure 8.
Spatial comparison between the reference image (RGB composite acquired on 08 April 2021) and the classification maps corresponding to configurations A, A+P, A+T, A+R, and Full (A+P+T+R) in the urban–fluvial environment of the city of Puerto Aysén, characterized by the coexistence of compact urban areas, active alluvial plains, riparian wetlands, and bare-soil surfaces associated with fluvial bars.
A similar, though less pronounced, pattern is observed for Forage Grasslands, which exhibit increased spatial fragmentation under seasonal aggregation.
5. Conclusions
The implemented ablation design demonstrated that multisensor fusion is indispensable for overcoming the limitations of optical remote sensing in complex Andean landscapes. Beyond improving global performance, this approach allowed for the decoupling and quantification of contributions from complementary information domains. While the central tendency baseline proved insufficient for capturing compositional heterogeneity (Macro-F1 = 80.5%), the inclusion of phenological metrics (P) refined vegetation discrimination, and the subsequent integration of topographic (T) and radar (R) variables generated critical, non-redundant gains (+3.8 points each), acting as geomorphological filters and structural descriptors. The integrated model (A+P+T+R) maximized global performance (OA = 92.5%; Macro-F1 = 86.0%), validating the hypotheses regarding the necessary complementarity between phenological dynamics (H1), physical structure (H2), and topographic landscape context (H3).
Methodologically, this study cautions against optimizing statistical metrics at the expense of the spatial plausibility of cartographic products. It was shown that although seasonal SAR data aggregation improved numerical metrics, it introduced noise and geometric artifacts; in contrast, annual composites operated as robust temporal regularizers, prioritizing cartographic coherence over marginal metric gains. Nonetheless, challenges persist in transitional classes such as Grasslands/Shrublands (F1 ≈ 70%), suggesting a future need to incorporate explicit spatial context (e.g., Deep Learning or GEOBIA) to resolve fine-scale ambiguities. From an operational perspective, the developed workflow based entirely on open data from the Copernicus program and cloud-based processing constitutes a cost-effective and scalable solution for the systematic production of LULC maps in vast and inaccessible regions. This approach is directly transferable to land management agencies, enabling frequent updates of land cover data, a key input for watershed management, fire monitoring, and planning under climate change scenarios.
Figure 1.
Location of the study area in southern Chile. (A) Regional context showing the Aysén Region of Chile highlighted in red along the Pacific coast. (B) Administrative and geographic context of the study basin within the Aysén Region. (C) Topographic overview of the study basin, including elevation, the main subantarctic river network, and urban areas.
Figure 1.
Location of the study area in southern Chile. (A) Regional context showing the Aysén Region of Chile highlighted in red along the Pacific coast. (B) Administrative and geographic context of the study basin within the Aysén Region. (C) Topographic overview of the study basin, including elevation, the main subantarctic river network, and urban areas.
Figure 2.
Workflow for LULC classification using Random Forest and block-wise variable evaluation (optical A, percentiles P, topography T, and radar R) in the Aysén River Basin.
Figure 2.
Workflow for LULC classification using Random Forest and block-wise variable evaluation (optical A, percentiles P, topography T, and radar R) in the Aysén River Basin.
Figure 3.
Performance gains relative to the optical baseline model (A) for the different experimental configurations. Bars represent improvements in Overall Accuracy (OA), Balanced Accuracy (BA), and Macro-F1 score.
Figure 3.
Performance gains relative to the optical baseline model (A) for the different experimental configurations. Bars represent improvements in Overall Accuracy (OA), Balanced Accuracy (BA), and Macro-F1 score.
Figure 4.
Heatmap of accuracy metrics (Producer’s Accuracy—PA, User’s Accuracy—UA, and F1 score) by land-cover class and evaluated model (A, A+P, A+T, A+R, and Full).
Figure 4.
Heatmap of accuracy metrics (Producer’s Accuracy—PA, User’s Accuracy—UA, and F1 score) by land-cover class and evaluated model (A, A+P, A+T, A+R, and Full).
Figure 5.
Ranking of the 15 most influential variables for each classification model configuration (A, A+P, A+T, A+R, and Full), based on the Mean Decrease in Gini Index.
Figure 5.
Ranking of the 15 most influential variables for each classification model configuration (A, A+P, A+T, A+R, and Full), based on the Mean Decrease in Gini Index.
Figure 6.
Spatial distribution of land use and land cover (LULC, 2021) in the Aysén River Basin derived from the best-performing configuration (Full model, A+P+T+R).
Figure 6.
Spatial distribution of land use and land cover (LULC, 2021) in the Aysén River Basin derived from the best-performing configuration (Full model, A+P+T+R).
Figure 7.
Spatial comparison between the reference image (RGB composite acquired on 08 April 2021) and the classification maps corresponding to configurations A, A+P, A+T, A+R, and Full (A+P+T+R) in the Lago Misterioso sector, characterized by a highly heterogeneous lacustrine–terrestrial mosaic (wetlands, steppe, native forest, and forest plantations).
Figure 7.
Spatial comparison between the reference image (RGB composite acquired on 08 April 2021) and the classification maps corresponding to configurations A, A+P, A+T, A+R, and Full (A+P+T+R) in the Lago Misterioso sector, characterized by a highly heterogeneous lacustrine–terrestrial mosaic (wetlands, steppe, native forest, and forest plantations).
Table 3.
Definition of land use and land cover classes.
Table 3.
Definition of land use and land cover classes.
| |
Class |
General description |
| 1 |
Snow |
Surfaces of persistent snow or ice (glaciers/ice fields). |
| 2 |
Urban |
Built-up areas, road infrastructure, and artificial surfaces. |
| 3 |
Water |
Inland water bodies (lakes, rivers, reservoirs). |
| 4 |
Forage grassland |
Managed herbaceous vegetation for livestock production (rotational pastures). |
| 5 |
Forest plantation |
Exotic fast-growing monocultures (pine, eucalyptus, or other species). |
| 6 |
Natural grasslands / shrublands |
Transitional natural shrub and herbaceous vegetation. |
| 7 |
Wetlands |
Areas saturated or flooded part of the year (wet meadows, peatlands, marshes, swamps) with hydrophilic vegetation. |
| 8 |
Rocky terrain |
Slopes, outcrops, and rocky massifs with sparse or absent vegetation; high reflectance and rough texture. |
| 9 |
Native / mixed forest |
Forest formations dominated by native species with dense canopy cover. |
| 10 |
Steppe |
Discontinuous xerophytic vegetation (coirón grass) associated with semi-arid conditions. |
| 11 |
Bare soil / sands / alluvial beaches |
Sediments, sand flats, fluvial beaches, and eroded areas lacking vegetation cover. |
Table 4.
Summary of global performance metrics (Overall Accuracy, Kappa coefficient, Balanced Accuracy, and Macro-F1) obtained for the five evaluated classification model configurations (A, A+P, A+T, A+R, and Full).
Table 4.
Summary of global performance metrics (Overall Accuracy, Kappa coefficient, Balanced Accuracy, and Macro-F1) obtained for the five evaluated classification model configurations (A, A+P, A+T, A+R, and Full).
| Model |
OA (%) |
κ |
BA (%) |
Macro-F1 (%) |
| A (optical) |
89.2 |
0.871 |
86.1 |
80.5 |
| A+P (optical+P) |
89.6 (+0.4) |
0.876 (+0.005) |
86.9 (+0.8) |
81.7 (+1.2) |
| A+T (optical+T) |
90.3 (+1.1) |
0.883 (+0.012) |
87.4 (+1.3) |
84.3 (+3.8) |
| A+R (optical+R) |
91.7 (+2.5) |
0.899 (+0.028) |
88.4 (+2.3) |
84.3 (+3.8) |
| Full (A+P+T+R) |
92.5 (+3.3) |
0.905 (+0.034) |
89.0 (+2.9) |
86.0 (+5.5) |