Preprint
Article

This version is not peer-reviewed.

Quantifying Categorical Information Loss in Forest Compositional Mapping: Implications for the Accuracy of Forest Assessment in Lualaba Province (DR Congo)

Submitted:

21 April 2026

Posted:

23 April 2026

You are already at the latest version

Abstract
Forests of Lualaba Province (DR Congo) form a compositionally complex mosaic of dry dense forest, gallery forest, and Miombo woodland. Yet, categorical land-cover maps impose discrete boundaries on these inherently continuous vegetation gradients, systematically discarding subpixel compositional information critical for forest monitoring and carbon accounting. The magnitude of this information loss at the landscape scale, however, remains largely unquantified. In this study, we train a Multi-Output Neural Network (MONN) using Sentinel-2 spectral and textural predictors (2025) to estimate the proportional cover of three forest types across the province. Model performance is benchmarked against a normalised Random Forest (RF) using spatial block cross-validation. Categorical information loss is quantified pixel-wise using two complementary metrics, dominant class proportion and Shannon compositional entropy, alongside a derived interpretive quantity, categorical information loss. The MONN slightly outperformed RF (R² = 0.648 vs 0.630; RMSE = 0.224 vs 0.229), yet the results reveal a fundamentally heterogeneous landscape structure. The mean dominant-class proportion was only 56.2%, indicating that categorical maps discard, on average, 43.8% of compositional information per pixel. Only 7.9% of forested pixels exceeded the 75% dominance threshold, while Shannon entropy reached 74.1% of its theoretical maximum, indicating that forest types coexist in near-equal proportions across most pixels. This renders categorical attribution structurally inadequate for most of the forested landscape. Across 92.1% of forested pixels, no single forest type achieved clear dominance. These results show that compositional mixing is the dominant structural condition of the landscape, and that compositional mapping is essential for representing tropical forest structure in heterogeneous drylands. By formally quantifying categorical information loss at the landscape scale, this study shows that continuous compositional mapping converts this structural ambiguity into a spatially explicit ecological signal, with direct implications for monitoring vegetation dynamics and biodiversity, highlighting a structural source of error in carbon stock estimation in tropical dry forests.
Keywords: 
;  ;  ;  ;  
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated