3.2. Model Performance and Region-Specific
The original SOC data showed mild asymmetry (skewness = 0.29). A Box–Cox transformation was applied before modeling, with λ estimated by maximum likelihood in
Section 2.2.2. After model predictions were obtained, the Duan smearing estimator was used to back-transform all predicted values to the original SOC content scale [
20].The accuracy metrics (R
2, RMSE, MAE, RPIQ and LCCC) in
Table 3 were all calculated from these back-transformed values.
Table 3 summarizes the metrics for each model on the training set and the test set, and
Figure 3 shows the corresponding scatter plots of predicted VS measured SOC on the test set.
Figure 3.
Scatter plots of measured versus predicted SOC content for the five machine learning algorithms. (a) Gradient Boosting Decision Tree (GBDT); (b) Partial Least Squares Regression (PLS); (c) Random Forest (RF); (d) Support Vector Machine (SVM); (e) Extreme Gradient Boosting (XGBoost).
Figure 3.
Scatter plots of measured versus predicted SOC content for the five machine learning algorithms. (a) Gradient Boosting Decision Tree (GBDT); (b) Partial Least Squares Regression (PLS); (c) Random Forest (RF); (d) Support Vector Machine (SVM); (e) Extreme Gradient Boosting (XGBoost).
On the training set, GBDT gave the highest R² (0.660), with an RMSE of 3.952 g·kg⁻¹ and an LCCC of 0.727, indicating that it effectively captured the nonlinear relationships between SOC and the predictor variables. RF and XGBoost followed, with R² values of 0.585 and 0.577. PLS produced the lowest R² (0.442). Linear models cannot adequately capture the complex soil–landscape interactions that characterize this heterogeneous agricultural region, a limitation clearly reflected in the PLS result. SVM, with an R² of 0.520, fell between the ensemble methods and PLS. A consistent drop in performance appeared when the models were applied to the test set. GBDT maintained the highest R² (0.518), the lowest RMSE (4.498 g·kg⁻¹), and the highest RPIQ (2.229), together with an LCCC of 0.621. SVM achieved a slightly higher LCCC (0.631), although its R² fell to 0.506. This pattern reveals two complementary strengths: GBDT explained a larger portion of the variance, while SVM constrained systematic bias more tightly. RF (R² = 0.484) and XGBoost (R² = 0.511) followed GBDT. The clear performance ranking across all models—from GBDT down to PLS—confirms that SOC prediction in this landscape requires models capable of capturing complex, high-order interactions among predictors. GBDT, whose core mechanism is sequential residual correction, held a clear advantage over the parallel bagging strategy employed by RF. XGBoost, although it also belongs to the boosting family, did not outperform RF substantially. Its stronger regularization may have suppressed noise during training but did not translate into better generalization on the given data distribution. PLS again ranked lowest. When prediction surfaces are strongly nonlinear, linear formulations encounter fundamental limits, and these limits were clearly visible in the test-set results.
These contrasting behaviors are directly visible in the prediction scatterplots (
Figure 3). GBDT produced predictions that cluster tightly around the 1:1 line, with few outliers and modest deviations across the entire SOC range. The SVM output shows a slight compression of predicted values. In contrast, RF and PLS exhibit considerably larger scatter, particularly at the low and high ends of the SOC distribution. For SOC values above 30 g·kg⁻¹, both models systematically underpredict, displaying a clear tail bias. XGBoost falls between these two extremes: its scatter is smaller than that of RF and PLS but does not match the concentration achieved by GBDT. Based on these consistent results, GBDT was selected as the foundational model and was subsequently used to produce the final SOC maps and to perform the SHAP interpretation and uncertainty analyses.
This clear performance ranking can be attributed to three sets of factors: the heterogeneous environmental conditions across Hungary’s croplands, the internal design logic of each algorithm, and the statistical character of the soil carbon dataset itself. The plain regions are predominantly covered by fertile Chernozems and meadow soils rich in SOC, while Luvisols and Cambisols dominate the surrounding hills and mountains, creating a spatially structured mosaic of soil properties that models must capture. Together with the spatially heterogeneous environment, these factors produce a strongly nonlinear relationship between predictor variables and SOC content, as well as a skewed SOC content distribution. Under these conditions, models have a clear advantage, which effectively reduce bias in complex regions of the predictor space. This is most apparent in the contrast between the two boosting implementations. . GBDT, through sequential residual fitting, concentrates on areas with the largest errors and adaptively captures local nonlinearities, a mechanism central to gradient boosting [
31]. The tight clustering of its predictions around the 1:1 line, including at high SOC, indicates that this mechanism effectively captures the nonlinear variation and the spatial heterogeneity of the study area. Although XGBoost is based on the same boosting principle, it consistently performs slightly worse than GBDT across all metrics. It incorporates L1 and L2 regularization along with a more aggressive tree pruning strategy than standard gradient boosting [
32]. Where SOC content varies sharply with soil type and topographic position, these constraints may slightly limit the model from fully capturing the complexity of the predictor-response surface. Given a training dataset of moderate size that nonetheless spans a wide SOC interval, the bias incurred by such penalization of model complexity may surpass the corresponding gain in variance reduction [
40]. The net effect is a performance gap that, though modest in magnitude, persists across all evaluation criteria.
The behavior of RF reinforces this interpretation. Bagging primarily averages predictions over decorrelated trees, which effectively suppresses variance but lacks a built-in mechanism for bias correction in highly nonlinear feature spaces [
30]. Consequently, RF predictions gravitate toward the global mean: low SOC sites are overestimated, while high SOC values are systematically underpredicted. RF records the lowest LCCC among all models, a diagnostic that points directly to bias, rather than variance, as the dominant source of error. The superiority of boosting over bagging in this study is therefore a direct consequence of bias reduction. Across Hungary’s croplands, where SOC patterns are spatially intricate and soil-forming conditions are heterogeneous, boosting algorithms offer a demonstrable advantage when training samples are limited. Bias is explicitly and iteratively corrected, which is precisely what the structure of the data demands.
SVM performed well on a complementary aspect of model evaluation. The highest LCCC among all candidates was delivered by this model, marginally exceeding the value recorded for GBDT. Although a larger fraction of the total variance was explained by GBDT, the systematic departures from the 1:1 identity line were more effectively suppressed by SVM. This behavior traces back to two defining features of the algorithm: the ε-insensitive loss function, which disregards errors smaller than a specified margin, and the structural risk minimization principle that explicitly balances model complexity against empirical risk [
29]. A mild compression of the predicted range follows from this design—penalties are applied only when residuals exceed the ε-threshold, so extreme outputs are curtailed while large systematic errors are largely avoided. For Hungarian croplands, this bias–variance trade-off carries practical implications. National-scale carbon stock assessments demand spatially unbiased estimates across regions that differ markedly in soil conditions. In such applications, the lower systematic bias of SVM’s aggregate estimates becomes a distinct asset. Field-level management, by contrast, places a premium on local precision. Here, the higher R² achieved by GBDT may make it the more suitable option. PLS proved to be the weakest learner overall. Its linear latent-variable formulation could not accommodate the nonlinear associations that link the predictors to SOC. The consequence was twofold: the most severe underestimation occurred at high SOC values, and the overall dispersion of predictions was the largest. The performance gap separating PLS from the nonlinear models indicates that imposing a linear structure introduces systematic error when the underlying predictor–response relationship departs from linearity—a problem that becomes especially acute in the extreme ranges. This finding aligns with earlier work in soil spectroscopy, where nonlinear techniques have consistently outperformed PLS [
41].
Model performance is never solely a function of the algorithm; it is also shaped by the environmental context where the training and test samples were collected. The data inherit the properties of that context, so a model’s predictive success is, to a large extent, conditioned by the region it is asked to represent. Local factors govern the statistical architecture of the data; the architecture, in turn, dictates how algorithms behave. In this study, five learners drawn from distinct families—partial least squares, support vector regression, random forest, gradient boosting, and its regularized variant XGBoost—were evaluated on Hungarian cropland. Gradient boosting (GBDT) attained the top rank. An equivalent outcome was recorded in the alpine meadows of Gannan [
42] and the arid Akesai region [
43], where boosting once again outperformed alternative tree ensembles. These three environments—high-elevation grassland, dryland, and temperate farmland—differ dramatically, yet converge on the same winner. Universality is absent from the broader record. Across 11 Latin American nations, the highest-performing algorithm varied spatially [
44]. In northwestern Iran, a bagging-based random forest surpassed both support vector regression and XGBoost, achieving an R² of 0.84 [
45]. Adjacent Andean catchments subjected to the same GWR model produced an R² of 0.99 in one and a negative coefficient of determination in the other [
46]. Taken together, these findings effectively preclude the notion that any single model class can claim universal superiority. Rather than committing in advance to a particular algorithm, model selection should be steered by empirical comparisons conducted at the local scale—an adaptive, data-driven strategy for which the Hungarian investigation serves as a concrete exemplar. Model candidates were systematically benchmarked against a set of environmental descriptors that included soil type diversity, topographic roughness, and the legacy of past cultivation. Distributional properties of the data—SOC dynamic range, skewness, and the degree of nonlinearity manifest in the predictor–response relationship—were also fed into the evaluation. What emerged from this benchmarking was a clear, unambiguous preference for one model. The approach, grounded from start to finish in site-specific evidence, demonstrated its fitness for purpose on Hungarian cropland. Whether the same methodology can be successfully transferred to novel environmental settings is a question that independent corroboration must yet resolve. A logical extension of the present work would widen the assessment to span a broader spectrum of environmental gradients, with predictor suites augmented by climatic variables, lithological information, and vegetation indices extracted from remote sensing imagery.
3.3. SHAP-Based Variable Contribution and Nonlinear Effects
According to the SHAP mean absolute value ranking (
Figure 4), the near-infrared band (B5) is by far the most important predictor, followed by Topographic Wetness Index (TWI), Normalized Red-Green Difference Index (NRGDI), elevation (H), red band (B4), and SWIR2 band (B7). Spectral bands together account for 42% of total contribution, confirming that soil reflectance properties dominate SOC prediction in this bare-soil composite dataset. The SHAP bee-swarm plot (
Figure 5) shows the direction of each feature’s contribution to the predicted SOC content: B5, NRGDI, H, B7, valley depth (VD), SWIR1 band B6, blue band B2, B4, and NDMI have negative contributions (i.e., higher variable values lead to lower predicted SOC content); TWI, bare soil index (BSI), and aspect have positive contributions. The remainder of this discussion focuses on interpreting the most prominent patterns and their implications.
Figure 4 presents the global predictor importance based on mean absolute SHAP values. The SHAP dependence plots (
Figure 6) further reveal complex nonlinear interactions between topographic and spectral covariates across the Hungarian study area. A consistent inverse relationship underlies these patterns: as SOC content increases, soil reflectance decreases across the optical range, which is a well-established spectral response in soil spectroscopy.
B5 (0.85–0.88 μm) makes a predominantly negative contribution to SOC prediction, consistent with this inverse relationship. Its dependence plot (
Figure 6) shows a threshold at a reflectance of approximately 0.16. Below this value, SHAP contributions are positive, indicating that low reflectance is associated with higher predicted SOC; above the threshold, contributions become negative, meaning the model interprets bright NIR signals as indicative of carbon-poor, mineral-dominated soils. This threshold marks the transition between two competing optical regimes: at low reflectance, absorption by organic matter suppresses the NIR signal, whereas at high reflectance, the spectral response is dominated by mineral components of inherently higher reflectance. B4 (0.64–0.67 μm), B6 (1.57–1.65 μm), and B7 (2.11–2.29 μm) all exhibit a comparable shift from positive to negative SHAP values at characteristic reflectance levels. B7 has a similar pattern to B5, and the two act together in the model [
47]. The consistent threshold behavior across these bands reflects their shared sensitivity to the spectral contrast between organic-rich and mineral-dominated soils; in the SWIR region, this sensitivity encompasses both soil organic matter and clay mineral assemblages [
48]. In the visible range, B3 (0.53–0.59 μm) and B2 (0.450–0.51 μm) also show negative SHAP contributions, although their influence is considerably smaller. The blue-band response is primarily attributable to absorption by humus-associated chromophores, which reduces reflectance in this spectral region. In the bare-soil composites, B3 reflectance is dominated by the combined effects of iron oxides and SOC content: darker, carbon-rich soils absorb more radiation in the green band, resulting in lower reflectance.
Among topographic variables, TWI contributes positively. High TWI points to converging water flow, leading to waterlogged, reducing soils that slow organic matter decomposition—a pattern observed in both cropland and grassland systems [
49]. Within the study area, the most fertile Chernozems and meadow soils occupy topographic depressions that coincide with regions of high TWI, lending empirical support to this interpretation. The SHAP dependence plot (
Figure 6) confirms this threshold-type response. For TWI values below approximately 10, SHAP values are negative, corresponding to well-drained, aerobic conditions on higher or steeper terrain, where rapid moisture loss limits organic matter accumulation. Above this threshold, SHAP values turn positive, reflecting low-lying, convergent positions where persistent water saturation restricts oxygen diffusion and promotes reducing conditions that slow decomposition. Despite this clear pattern, the effect size of TWI is substantially smaller than that of B5, indicating that topographic moisture control acts as a secondary modifying influence rather than a primary driver of SOC accumulation. H exerts a negative influence on predicted SOC and displays an equally abrupt decline in the dependence plot. Upland positions are subject to intensified erosion, and in the northern hills, thin soils overlying volcanic and metamorphic substrates impose an additional constraint on carbon retention. VD follows a similarly negative trajectory: larger VD values denote upper-slope settings where erosive potential is considerably amplified; its dependence plot reveals a gradual decline without a pronounced threshold, consistent with the subdued topographic gradients of the plain. Plan curvature also carries a negative association with predicted SOC, conforming to the classic geomorphic framework wherein convex slopes promote runoff and sediment removal, whereas concave slopes favor deposition and accumulation [
50]. TRI and Slope contributed only negligibly, as expected given the gentle gradients that dominate Hungarian croplands. Among all terrain-derived variables, aspect alone shows a positive influence. South-facing slopes receive greater solar radiation, which can stimulate vegetation productivity and increase organic matter input [
51]; however, enhanced radiation also accelerates mineralization. The net positive contribution suggests that, in this region, the gain from increased production outweighs the loss from faster decomposition, although this interpretation remains provisional without concurrent vegetation data.
Among spectral indices, NRGDI is negatively associated with SOC. This index characterizes soil redness: high NRGDI values correspond to redder soils, which typically indicate iron-oxide-rich subsoil or eroded surfaces with low organic carbon content. It therefore serves as a reasonable proxy for low-SOC conditions. BSI, a brightness indicator, contributes negatively because high values typically correspond to sandy or saline soils with low organic matter. In the present multi-temporal bare-soil composite, where all pixels represent bare soil or very sparse residue, BSI captures variations in soil surface brightness rather than functioning as a vegetation-vs-soil discriminator. Thus, higher BSI values indicate brighter surfaces associated with lower SOC, consistent with the inverse reflectance–SOC relationship. NDMI likewise exhibits a negative contribution. The bare-soil compositing approach removes vegetation and selects the driest, most exposed moment per pixel, eliminating rainfall and soil moisture signals. Consequently, NDMI no longer functions as a moisture indicator; instead, high NDMI values identify bright mineral surfaces—such as aeolian sands and loess-derived soils—that lack organic matter and produce strong mineral scattering.
Together, these dependence patterns demonstrate that the relationships between SOC and its key predictors are predominantly nonlinear. Many variables—both spectral and topographic—exert their influence through abrupt transitions rather than steady changes. This prevalence of nonlinear structure explains why the classical GBDT, with minimal regularization allowing it to closely track sharp threshold breaks without being penalized for model complexity, systematically outperformed the more strongly regularized XGBoost and the linear alternatives. The reflectance thresholds identified in the SHAP dependence plots hold practical value beyond model interpretation. Under bare-soil conditions, a spectral domain defined by B5 reflectance values below approximately 0.16 enables direct flagging of potentially high-SOC cropland from remote sensing imagery—a simple screening procedure that can guide targeted soil sampling and the spatial delineation of zones where conservation tillage should be prioritized [
52].
Several limitations should be noted. The interpretation of TWI relies on general hydrological reasoning; direct empirical evidence—such as soil redox potential or microbial activity measured across contrasting TWI sites—would strengthen the causal claims. The SHAP-derived thresholds are specific to the Hungarian cropland context and may not directly transfer to regions with different soil types, climate regimes, or management histories. Future work should test the generalizability of these thresholds across broader agroecological settings.
3.4. SOC Spatial Prediction and Uncertainty Quantification
We applied an optimized Gradient Boosting Decision Tree (GBDT) model (R² = 0.518, RMSE = 4.498 g·kg⁻¹, 10-fold cross-validation) to produce a 30 m resolution map of topsoil organic carbon (SOC) for Hungarian croplands (
Figure 7). Predicted SOC content ranged from 6.36 to 28.27 g·kg⁻¹, with a mean of 16.30 g·kg⁻¹ and a standard deviation of 4.65 g·kg⁻¹. SOC content was classified into five classes using 5 g·kg⁻¹ intervals: I (5–10), II (10–15), III (15–20), IV (20–25), and V (25–30 g·kg⁻¹).
Five elevation zones were derived from a DEM using natural breaks [
53]. The area and percentage of each SOC class within these zones are summarized in
Table 4. Hungarian cropland is heavily concentrated in the low-elevation zone and decreases markedly with increasing elevation, leaving only a small proportion at higher altitudes. SOC classes (Class IV and Class V) are similarly concentrated in the low-elevation zone. Class V occurs almost exclusively in Zone 1 (0–124 m), covering 1181.64 km², whereas the combined extent across all higher zones totals only 1.28 km². Class IV dominates Zone 1, and together with Class III it accounts for over 78% of the zone. As elevation increases, the composition of SOC classes undergoes a marked shift. In Zone 2 (124–195 m), Class II becomes dominant (53.13%), while Class IV declines to 7.26%. In Zones 3 to 5 (>195 m), Class II is overwhelmingly dominant, comprising 78.81%, 83.58%, and 74.06% of the respective zones. Overall, low-elevation cropland not only covers the largest area but also stores the highest SOC levels. With increasing elevation, cropland area diminishes and the SOC class composition shifts toward a Class II-dominated, lower-SOC profile, with high-value classes becoming rare.
The 30 m SOC map reveals a pronounced east–west zonation across Hungarian croplands: higher SOC values are concentrated in the eastern and southeastern plains, while lower values prevail across the western and hilly regions. Class II accounts for the largest areal extent, whereas SOC categories—Classes IV and V—are largely confined to low-elevation plains. Class I appears sporadically on aeolian sand dunes of the Great Plain and in high-elevation northern croplands, Class III stretches across the central region and reaches the northern foothills and western border, and Class V occurs as scattered patches in northeastern alluvial depressions and southern lowlands. This broad spatial pattern is consistent with both the Global Black Soil Map (GBSmap) [
54] and the national-level product of Szatmári et al. [
55], confirming that the model effectively captures the regional SOC distribution associated with Chernozems, the dominant high-SOC soil type in Hungarian croplands. Complementing this regional agreement, the 30 m resolution reveals within-class heterogeneity that coarser products cannot resolve. . In the eastern high-SOC belt, contiguous Class IV patches are interspersed with scattered Class V occurrences, whereas small Class I fragments appear on sandy dune landforms in the southern part of the study area. This finer spatial detail exposes local-scale SOC variations tied to soil type boundaries, microtopographic depressions, and isolated sand lenses—features inherently smoothed away in coarser mapping products. Such fine-grained differentiation has direct implications for site-specific management, as it delineates zones of contrasting carbon status within fields that would otherwise be treated as uniform.
Class V is almost entirely confined to Zone 1, even though it represents only a small fraction of that zone. Together with Class IV, these highest-SOC soils are concentrated in a narrow low-elevation range. As elevation increases, the proportion of these high-SOC classes declines sharply, and the elevational extent of each class narrows progressively from Class II through to Class V. This sequential contraction supports the inference that the highest carbon stocks are largely confined to flat, low-relief terrain. Similar elevational trends in SOC have been reported for temperate mountain regions [
56]. Within the study area, however, the elevational gradient reflects more than a straightforward climatic control. Fertile alluvial depressions and Chernozem-mantled plains are spatially concentrated at low elevations. In these settings, thick organic horizons have developed from deep, base-rich parent materials and have been sustained by sediment accretion and centuries of agricultural enrichment. The elevational signal is therefore mediated, at least in part, by the distribution of soil types and depositional landforms. Higher on the Transdanubian Hills, Luvisols and Cambisols become the dominant soil groups. Moderate organic inputs combined with relatively rapid decomposition maintain Class II as the prevalent SOC category there. Class I emerges in two sharply contrasting elevational contexts. Across the cool, humid northern uplands, steep slopes experience accelerated erosion that strips fertile topsoil. Organic inputs to the root zone are restricted by the consequent loss of soil volume and nutrients, offsetting any carbon gains that slower decomposition might otherwise have permitted. In the lowland sandy tracts, rapid drainage, low cation exchange capacity, and minimal mineral protection act together to severely suppress organic matter retention. These divergent settings demonstrate that SOC is not governed by elevation alone. Rather, it covaries with an ensemble of soil-forming factors, and it is their combined influence that ultimately shapes the carbon stock.
The spatial configuration of SOC across the study area is governed by the interplay of three primary factors: topography, soil type, and cultivation. In the northern mountains and hills, Luvisol–Chernozem landscapes are interspersed with closed depressions where elevated SOC pockets are concentrated. These topographic lows serve as local sinks, intercepting and retaining sediment and runoff from adjacent slopes. Such inputs sustain localized carbon enrichment; without them, these sites would bear only the moderate SOC levels common across the surrounding region. A distinctly different spatial arrangement characterizes the eastern Great Plain, where extensive Chernozems with deep, dark humus horizons reflect historically strong carbon accumulation. The model predictions confirm extensive and relatively continuous Class IV and Class V cover across this region, consistent with the high SOC levels expected from these fertile soils. Nonetheless, long-term intensive cultivation has likely diminished the original carbon stock to some degree. Repeated mechanical tillage and reduced residue return accelerate SOC mineralization, yet the Chernozem landscape still retains a high baseline. High-SOC areas persist most prominently within alluvial depressions, where elevated moisture and finer sediments retard decomposition and provide a measure of protection, allowing Class V patches to remain embedded within the broader Class IV matrix [
57]. The cultivation imprint is further modulated in the Danube–Tisza interfluve, where a fine-scale mosaic of aeolian sands and Chernozems imposes additional edaphic heterogeneity, producing a more varied local SOC pattern within the generally high-SOC plain [
58]. In the southern Danube–Tisza interfluve and the Kiskunság sand region, sandy and coarse skeletal soils dominate the landscape. Biomass production is restricted by the high sand content and low water-holding capacity of these substrates; organic matter decomposition is simultaneously hastened, resulting in an inherently low SOC status. On the marginal dunes, the decline to Class I represents the most extreme expression of this textural control, a pattern fully consistent with the poor retention capacity typical of sandy soils [
59]. The western part of the study area, corresponding to the Transdanubian Hills, is defined by rapid soil-type transitions from Cambisols to Fluvisols that occur along valley axes. An intimate juxtaposition of Class II and Class III is generated by these transitions, mirroring the patchwork of weakly developed alluvial soils intermingled with material sloughed from adjacent slopes. Carbon stored in this zone is relatively labile: the alluvial deposits are geologically young, and the coarse-textured Fluvisols offer only a limited capacity to stabilize organic matter.Across these contrasting environments, macro-scale SOC patterns are consistently shaped by soil–geomorphology interactions. These interactions establish a physical template upon which more localized processes—hydrological redistribution, cultivation history, and microtopography—exert their influence [
60]. The relative importance of these secondary controls varies geographically: cultivation acts as a moderate modifier in the eastern plains, texture and drainage dominate in the south, and alluvial dynamics play the primary role in the west.
A pixel-level coefficient of variation (CV) map derived from 100 bootstrap resampling runs was generated to quantify the spatial distribution of SOC prediction uncertainty across Hungarian croplands (
Figure 8). The lowest uncertainty (CV < 5%, dark green) is concentrated in the eastern Great Plain (~46.8–47.3°N, 18.5–21.0°E), closely overlapping the extensive high-SOC zones dominated by Chernozems. This overlap between high prediction confidence and high SOC is directly linked to the dominant role of B5. In this region, the deep, dark humus horizons of Chernozems produce persistently low NIR reflectance, providing a stable spectral signal for B5, the most influential predictor in the model. The subdued topography and gradual soil transitions further reduce the complexity of predictor–response relationships, contributing to greater prediction stability. Higher uncertainty appears in two main zones. The first encompasses the southern Danube–Tisza interfluve and the Kiskunság sand region (~46.0°N), where sandy and coarse-textured soils dominate. The high sand content and low water-holding capacity restrict biomass production and accelerate organic matter decomposition, resulting in inherently low SOC levels. Although the terrain appears flat, a fine-scale mosaic of aeolian sands and Chernozems creates high-frequency spatial variation that the current predictor set cannot fully resolve. The second zone lies in the western Transdanubian Hills (~17°E), where the landscape transitions from hills to plains and soil types shift abruptly from Cambisols to Fluvisols along valley axes. River terraces, floodplains, and hillslopes intermingle, producing rapid alternations of soil and topographic conditions that further elevate prediction uncertainty. The northern high-elevation region appears in light green tones, indicating moderate to moderately low uncertainty. This suggests that, despite the pronounced topographic relief, the existing terrain variables still fall short of fully explaining SOC variation in this complex landscape—particularly the localized carbon enrichment within closed depressions of the Luvisol–Chernozem mosaic, which cannot be entirely captured by regional-scale topographic proxies.
Overall, the spatial pattern of uncertainty reflects how well the environmental predictors match the spatial heterogeneity of SOC. In regions with relatively uniform soil and terrain, the stable signal of the dominant spectral predictor ensures high prediction confidence. Conversely, where soils form fine-scale mosaics or topography transitions abruptly, the current predictor suite is less capable of resolving high-frequency variation, leading to elevated prediction risk. This pattern aligns with the overarching framework in which soil–geomorphology interactions set the physical template for SOC distribution, while local-scale processes superimpose additional variability [
52]. This prediction framework, while robust at the Hungarian cropland scale, has notable limitations. Its performance remains constrained by the spatial representativeness of the training samples and by the inherent limitation of static covariates, which cannot fully capture the slow dynamics of soil carbon through real-time spectral signals. Nevertheless, the combined use of the 30 m SOC content map and its corresponding pixel-wise uncertainty map offers a distinct advantage over conventional single-product outputs: it equips decision-makers not only with SOC estimates for each field, but also with a spatially explicit quantification of their reliability. In low-uncertainty zones such as the eastern Great Plain, SOC predictions can directly inform variable-rate fertilization, organic matter amendments, and soil health monitoring. In high-uncertainty zones such as the southern sandy region and the western transitional hills, the confidence layer explicitly indicates where supplementary field sampling or more conservative management strategies should be prioritized, thereby reducing the risk of misguided decisions arising from model error. This “prediction plus confidence” dual-layer framework provides actionable, quantitative support for precision nutrient management, the spatial prioritization of conservation tillage, and the spatially informed implementation of agricultural carbon neutrality goals.