1. Introduction
The need to protect the ecological environment and restore ecosystems has grown in recent years due to the effects of human activity and global climate change [1,2,3,4]. As an important vegetation type, shrubs play a crucial role in ecosystems [5,6,7]. They not only maintain soil moisture balance and prevent soil erosion but also provide habitats and fodder [8,9,10], which are essential for preserving biodiversity and maintaining ecological balance [11]. Among their attributes, shrub biomass is a key indicator of shrub growth status and overall vegetation productivity [12,13]. Therefore, accurately estimating shrub biomass is a key task for understanding vegetation growth status, ecological functions, and ecosystem carbon storage [14,15,16].
Conventional field survey methods have limitations in accurately estimating shrub biomass [17]. Firstly, they are time-consuming and require substantial human and financial resources, particularly in expansive areas with intricate topography [18,19]. Secondly, field surveys yield only localized sample data, making it challenging to fully capture the spatial distribution and temporal patterns of shrubs. Thirdly, the destructive sampling techniques used to quantify shrub biomass may harm the surrounding ecosystem [20]. Furthermore, shrub biomass is influenced by various factors such as vegetation structure, growth environment, and climatic conditions [21,22], and it is challenging to account for these factors comprehensively using traditional methods.
To address the limitations of conventional methods, remote sensing technology has emerged as an effective tool for estimating shrub biomass [23,24,25]. This technology enables the acquisition of extensive and continuous data with high spatial and temporal resolution, facilitating large-scale estimation of shrub biomass [26,27]. Recent advancements in satellite remote sensing technology, along with the availability of data from satellites such as the Landsat and Sentinel series, have expanded the possibilities for biomass estimation [28,29]. However, using only the spectral bands of remote sensing images to estimate shrub biomass has its constraints [30]. For example, the visible and near-infrared bands have a narrow wavelength range [31], making it challenging to capture the differences in factors closely related to shrub biomass, such as vegetation structure, leaf area index, and chlorophyll content. To overcome these limitations, many studies have used vegetation indices as important indicators for biomass estimation [32]. Vegetation indices are numerical values calculated from spectral bands that reflect key information about vegetation growth status, chlorophyll content, leaf area index, and other related factors [33,34]. However, existing vegetation indices may saturate and struggle to differentiate higher-biomass shrubs in densely vegetated areas [35]. Therefore, it is necessary to evaluate which vegetation indices or spectral bands contribute most to shrub biomass estimation and to identify the optimal feature combination for accurate estimation.
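As an illustration of how a vegetation index is calculated from spectral bands, the following minimal sketch computes the widely used normalized difference vegetation index (NDVI) from near-infrared and red reflectance. NDVI is used here only as a familiar example; the text above does not state which indices are evaluated. The function name `ndvi` and the reflectance values are hypothetical.

```python
def ndvi(nir: float, red: float) -> float:
    """Normalized difference vegetation index from NIR and red reflectance."""
    total = nir + red
    if total == 0:
        # Both bands are zero, so the ratio is undefined; 0.0 is an
        # arbitrary convention chosen for this sketch.
        return 0.0
    return (nir - red) / total


# Made-up surface reflectance values (NIR, red) for three example pixels.
pixels = {
    "bare soil": (0.25, 0.20),
    "sparse shrub": (0.35, 0.12),
    "dense shrub": (0.50, 0.05),
}

for label, (nir, red) in pixels.items():
    print(f"{label}: NDVI = {ndvi(nir, red):.2f}")
```

Because NDVI cannot exceed 1, further increases in canopy density produce progressively smaller changes in the index, which is one way to picture the saturation problem noted above.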
Unmanned aerial vehicles (UAVs) are also valuable for estimating shrub biomass because they provide detailed, fine-scale information [36,37]. Equipped with high-resolution sensors and advanced image processing techniques, UAVs can capture high-quality remote sensing imagery, offering comprehensive data on shrub vegetation [38,39]. They are capable of capturing extensive areas of shrub coverage from a high-altitude overhead perspective, enabling the rapid and accurate identification and extraction of shrub objects [37,40,41,42]. Additionally, UAVs have higher spatial resolution and flexibility compared to traditional aerial remote sensing techniques, allowing for the capture of finer details of shrub structure and characteristics [43]. This capability is crucial for biomass estimation, as the spatial heterogeneity and subtle variations of shrubs can significantly impact the accuracy of estimations. Furthermore, UAVs can enhance workflow efficiency, reduce labor requirements, and save time and costs compared to traditional field surveys and plot measurements [44]. However, research on upscaling UAV-derived shrub biomass estimates to the satellite scale remains limited, highlighting the need for further investigation in this area.
Traditionally, biomass estimation models relied on statistical regression techniques to establish empirical relationships between field-measured biomass and various biophysical or remote sensing variables [45]. Over time, several modeling approaches have been developed to enhance the precision of estimating shrub biomass [37,46,47]. In recent times, machine learning methods have emerged as potent tools in this field, demonstrating superior performance compared to conventional modeling methods [48]. Conventional regression models have limitations in capturing complex nonlinear relationships and accounting for spatial heterogeneity [49], whereas machine learning methods, such as random forests, support vector machines, and neural networks, have gained traction due to their capability to handle intricate and nonlinear relationships [50,51]. Machine learning methods are also well suited to handling large volumes of remote sensing data, including multispectral and hyperspectral imagery, which provide rich spectral and spatial information for accurate biomass estimation [48,52,53].
This work aimed to propose a framework to estimate shrubland biomass at 10 m resolution over an arid and semi-arid mountain region based on multi-scale data from field measurements, UAV, Sentinel-1, Sentinel-2, and Landsat observations. The Helan Mountains in Ningxia, China, were selected as the study area. Firstly, a land cover classification was conducted to identify the shrublands and other land cover types. Secondly, a shrub biomass prediction model was developed using Random Forest Regression (RFR), driven by different predictor variable datasets derived from field measurements, UAV, and satellite images. The field measurement data were used to establish the allometric growth equation between shrub biomass and shrub structure parameters. Using this allometric equation, shrub biomass was then estimated from the UAV data. With the UAV-based shrub biomass as reference data, the optimal satellite-based biomass estimation model was developed by comparing different predictor variables from the Landsat, Sentinel-1, and Sentinel-2 satellites. Thirdly, with the best model, we created a map of the biomass distribution of shrubland in the Helan Mountains. The accuracy of the resultant map was evaluated over various ranges of shrub biomass and shrub coverage. Finally, we assessed the spatial characteristics of shrubland biomass based on the resultant biomass map and auxiliary datasets of climate and topography.
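To make the allometric step more concrete, the following minimal sketch fits a power-law equation between shrub biomass and a structural predictor by least squares on log-transformed values. The functional form B = a(CH)^b, with crown area C and height H, is an assumption made for illustration; the paragraph above does not specify the form of the equation or the structure parameters used. The function names and all data values are hypothetical and synthetic.

```python
import math


def fit_power_law(x, y):
    """Fit y = a * x**b by ordinary least squares on log-transformed values."""
    log_x = [math.log(v) for v in x]
    log_y = [math.log(v) for v in y]
    n = len(log_x)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n
    sxx = sum((lx - mean_x) ** 2 for lx in log_x)
    sxy = sum((lx - mean_x) * (ly - mean_y) for lx, ly in zip(log_x, log_y))
    b = sxy / sxx                      # slope in log-log space is the exponent
    a = math.exp(mean_y - b * mean_x)  # intercept in log-log space gives the coefficient
    return a, b


def predict_biomass(a, b, crown_area, height):
    """Apply the fitted equation to one shrub (assumed form: a * (C * H)**b)."""
    return a * (crown_area * height) ** b


# Synthetic "field" shrubs: crown area (m^2) and height (m); values are made up.
crown_area = [0.5, 1.0, 2.0, 4.0]
height = [0.4, 0.6, 0.9, 1.2]
structure = [c * h for c, h in zip(crown_area, height)]

# Synthetic biomass generated from a known equation (a = 0.8, b = 0.9),
# so the fit can be checked against the values used to create the data.
biomass = [0.8 * s ** 0.9 for s in structure]

a, b = fit_power_law(structure, biomass)
print(f"a = {a:.3f}, b = {b:.3f}")

# Apply the fitted equation to a shrub whose structure was measured from UAV data.
print(f"predicted biomass = {predict_biomass(a, b, 1.5, 0.8):.3f}")
```

In the workflow described above, per-shrub estimates of this kind would be derived from UAV data and then used as the reference for the satellite-based RFR model; that regression step is not shown in this sketch.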