4. Discussion
This review synthesised 757 peer-reviewed studies published between 2000 and 2025 to provide the most comprehensive evidence base yet assembled for satellite remote sensing of vegetation responses to climate change. The findings reported in
Section 3.1 through
3.11 reveal a field that has grown exponentially in output, become substantially more analytically sophisticated, and yet remains deeply constrained by geographic inequity, inconsistent performance reporting, and limited open science practice. The discussion that follows interprets the principal findings across five thematic dimensions: publication growth and its structural drivers; the sensor and spectral index landscape; the machine learning and deep learning transition; thematic and biome coverage patterns; and the open science deficit — before identifying the most critical knowledge gaps and priority research directions that emerge from the evidence.
4.1. Publication Growth and Its Structural Drivers
The annual publication trajectory documented in
Section 3.2 reveals two distinct phases of acceleration. The period from 2000 to approximately 2011 yielded fewer than 12 studies per year, consistent with the operational constraints of that era: satellite data archives were costly or restricted in access, cloud computing infrastructure for large geospatial datasets did not exist at scale, and machine learning methods had not yet been widely adopted in remote sensing contexts. A first inflection point around 2012–2015 coincided with the opening of the Landsat archive in 2008, which made decades of multispectral data freely and programmatically accessible for the first time. The most pronounced acceleration began after 2019, culminating in 137 studies in 2025 alone — the highest annual count in the corpus. The period 2021–2025 alone accounts for 480 studies, representing 63.4% of all included publications.
This surge is temporally coherent with three independently documented developments. First, the Sentinel-2 constellation, which achieved its full 5-day global revisit capacity from 2017 onwards, provided 10-metre multispectral imagery at no cost, enabling vegetation monitoring at spatial scales that had previously required expensive commercial data. Second, Google Earth Engine — cited as an analytical platform in 46 studies (6.1% of the corpus) — lowered the computational barrier to processing planetary-scale time series, allowing researchers without dedicated high-performance computing infrastructure to conduct analyses that would previously have required institutional supercomputing resources. Third, the maturation of deep learning architectures, particularly convolutional neural networks and long short-term memory networks from 2015 onwards, provided new analytical tools for exploiting the temporal and spatial richness of these expanded data archives. The quantitative record of publication growth documented here thus reflects not simply increased scientific interest, but the convergence of three enabling technologies that together democratised large-scale vegetation remote sensing.
The rapid growth trajectory also introduces structural concerns. As
Section 3.11 documents, only 78 studies (10.3%) reported the coefficient of determination (R²), and only 64 studies (8.5%) reported root mean square error (RMSE). The concentration of publications in the most recent five years, combined with persistently low rates of performance metric reporting, suggests that the field is expanding faster than its quality assurance mechanisms are consolidating. Journals, reviewers, and authors face a shared challenge: ensuring that the exponential increase in output does not outpace the development of methodological standards capable of benchmarking that output reliably.
4.2. Geographic Distribution and Research Equity
The geographic analysis presented in
Section 3.3 reveals a pronounced concentration of research output in three regions: East Asia (32.2% of studies, n = 244), Europe (27.5%, n = 208), and North America (19.4%, n = 147). Together, these three regions account for nearly 80% of total output. By the Global North/South classification, 81.1% of studies originated from Global North institutions, compared with only 16.4% from Global South institutions. China was the single most studied country (n = 87), followed by the United States (n = 40), Canada (n = 19), India (n = 17), and Brazil (n = 16). A further 19 studies (2.5%) could not be classified by Global North/South designation due to ambiguous or multi-institutional authorship metadata; provisional inspection of author countries suggests the majority of these would also fall within the Global North, indicating the 81.1% figure is likely a conservative lower bound.
The mismatch between where research is conducted and where the world's most ecologically significant and climate-sensitive vegetation systems are located is stark. Sub-Saharan Africa contributed only 21 studies (2.8%) despite containing the Congo Basin, the world's second largest tropical forest and a carbon sink of global significance; the studies that do exist from this region demonstrate both the feasibility and the scientific value of remote sensing approaches applied under these conditions (e.g., Bojer et al., 2025; Ikuemonisan et al., 2025; Bihon et al., 2025). Southeast Asia contributed 15 studies (2.0%), despite hosting some of the highest rates of deforestation and some of the most biodiverse forest ecosystems on Earth (e.g., Laosuwan et al., 2025; Vetrita et al., 2025; Zhou et al., 2025). Latin America contributed 27 studies (3.6%), with Brazil accounting for the majority of these despite the Amazon’s outsized role in the global carbon and water cycles (e.g., Souza et al., 2025; Romero-Sánchez et al., 2025). Peatlands — which appear in only three studies (Habib et al., 2024; Zhou et al., 2023) — are almost entirely absent from the corpus despite constituting a disproportionate fraction of terrestrial carbon stocks and being acutely vulnerable to drainage and fire.
This equity deficit is not merely a matter of scientific representativeness; it has direct consequences for the quality of global carbon accounting, climate projections, and ecosystem management. Models of global terrestrial carbon flux are calibrated with the observational data available to them: a corpus that systematically underrepresents tropical forests, savannas, dryland ecosystems, and peatland systems will propagate those absences into the projections used to inform climate policy. The growing availability of free, globally consistent satellite data from Sentinel-2, MODIS, and Landsat removes the data access barrier that historically justified geographic concentration. The remaining barriers — institutional capacity, research funding, and the structural dynamics of international scientific publishing — require deliberate policy responses rather than passive reliance on technological democratisation.
4.3. Sensor Platforms and Spectral Index Dominance
Section 3.4 documents 110 unique sensor terms across the corpus, reflecting genuine methodological pluralism. However, the distribution is highly concentrated: MODIS was employed in 40.0% of studies (n = 303), Landsat in 29.9% (n = 226), and Sentinel-2 in 28.1% (n = 213). SAR-based sensors collectively appeared in 189 studies (25%), with Sentinel-1 (n = 78) and PALSAR (n = 30) being the most frequently used radar platforms. LiDAR was employed in 66 studies (8.7%), primarily for biomass estimation and forest structure characterisation.
The temporal progression of sensor use reflects broader developments in data availability. MODIS dominance across the full 25-year period reflects its uninterrupted daily global coverage since 2000 and the maturity of its pre-processed vegetation products (MOD13, MOD15, MOD17), which dramatically reduce the preprocessing burden for research groups. The rising prevalence of Sentinel-2 from 2017 onwards — it already represents the third most commonly used sensor across the full corpus — signals the beginning of a platform transition that is likely to become more pronounced in subsequent years as the Sentinel-2 archive deepens. AVHRR (n = 51) maintained relevance specifically for long-term trend studies requiring data extending before 2000, a functionality that no currently operational sensor can replace.
The spectral index landscape is equally concentrated. NDVI dominated at 45.7% of studies (n = 346), a proportion that reflects both its scientific utility and its status as the de facto standard for vegetation monitoring since Tucker (1979). EVI ranked second (13.2%, n = 100), preferred in high-biomass tropical environments where NDVI saturates. LAI appeared in 56 studies (7.4%), serving as the primary link between spectral observations and process-based carbon and water flux models. The emergence of Solar-Induced Fluorescence (SIF, n = 16) — exclusively in post-2015 publications — is particularly significant: unlike structural indices such as NDVI and EVI, SIF is directly coupled to photosynthetic electron transport and provides a more mechanistically grounded measure of actual carbon assimilation. Representative applications include SIF-based phenology monitoring (Marsh et al., 2025; Dutra et al., 2025; Meng et al., 2021), carbon flux estimation (Wenyu et al., 2025; Morais Filho et al., 2021), and drought stress detection (Wang et al., 2025; Zhang et al., 2024). Its growing uptake, enabled by the availability of products from GOME-2, GOSAT, and OCO-2, represents the most important spectral index innovation of the recent period and warrants substantially greater systematic synthesis than it has yet received.
The continued dominance of NDVI, despite well-documented limitations including atmospheric sensitivity, saturation in high-biomass environments, and susceptibility to soil background effects, reflects the path-dependency of an index that benefits from four decades of continuous satellite records. Newer indices — EVI2, NDWI, NDMI, and the drought-specific indices VCI, VHI, and TCI — address specific limitations of NDVI but have not accumulated the temporal depth or methodological validation record needed to supplant it for trend analysis. This creates a structural tension: the index best suited for long-term change detection is also the one with the most significant biophysical limitations for detecting subtle phenological and productivity changes in specific biome contexts.
4.4. The Machine Learning and Deep Learning Transition
The analytical method distribution documented in
Section 3.5 reveals a field in methodological transition. Random Forest was the most commonly applied method (21.1%, n = 160), consistent with its established robustness for high-dimensional classification and regression tasks, its resistance to overfitting, and its intrinsic feature importance estimation capability — a property highly valued in vegetation-climate studies seeking to identify dominant drivers. Linear regression methods ranked second (17.2%, n = 130), reflecting their continued interpretability and utility for quantifying linear relationships between spectral indices and biophysical variables at scales where the linearity assumption holds.
Deep learning methods showed a marked increase in prevalence from 2019 onwards. CNNs (n = 12) and U-Net architectures (n = 10) were predominantly applied to image segmentation and land cover mapping tasks (e.g., Hanan et al., 2025; D’Amico et al., 2025; Abdelbaki et al., 2025). LSTM networks (n = 6) were specifically deployed for temporal modelling of vegetation time series, exploiting their capacity to capture long-range temporal dependencies of the kind that link antecedent climate conditions to lagged vegetation responses (e.g., Sun et al., 2025; Jeong et al., 2022). The GAN-CNN-LSTM model documented in one study on drought forecasting in Africa achieved the lowest RMSE, MAPE, and MAE values of any model in its comparison set, suggesting that hybrid architectures combining generative and sequential components may offer particular advantages for spatiotemporal prediction tasks.
The broader methodological literature — synthesised in the introduction of the review — anticipated exactly this evolution. Reichstein et al. (2019) argued in Nature that deep learning architectures, particularly those capable of exploiting spatial and temporal context simultaneously, offered transformative potential for Earth system science. The corpus analysed here provides empirical confirmation of that trajectory: deep learning methods now appear in the literature with sufficient frequency to be systematically trackable, though they remain a minority of the total analytical output. The continued dominance of Random Forest and linear regression reflects the reality that not all research questions require deep learning's complexity, and that simpler methods often offer superior interpretability at acceptable cost to predictive accuracy.
A persistent methodological limitation documented across the corpus is the inconsistency of performance metric reporting. The mean R² across the 78 studies that reported it was 0.72 (SD = 0.21, median 0.77, interquartile range 0.65–0.86), indicating moderate to good predictive performance on average. RMSE was reported in 64 studies, with a mean of 105.4 and SD of 427.4 — a variance so high that it reflects the diversity of predicted variables and their units rather than genuine performance heterogeneity. Only 11 studies reported MAE and 21 reported overall accuracy for classification tasks. This inconsistency makes cross-study benchmarking effectively impossible without unit normalisation and application-domain stratification. The field lacks the standardised reporting conventions that have been established in neighbouring disciplines such as computational biology and clinical prediction modelling, and the development of such conventions represents a priority task for the remote sensing community.
4.5. Application Domain and Biome Coverage
Phenology constituted the largest application domain in the corpus (30.9%, n = 234), reflecting the sensitivity of plant phenological cycles to inter-annual climate variability and the maturity of NDVI and EVI time series for detecting seasonal transitions. Land cover change (16.6%, n = 126) and biomass estimation (16.4%, n = 124) followed closely, together constituting one-third of the total corpus. Multi-domain studies (19.2%, n = 145) indicated a growing trend towards integrative research frameworks that link productivity, phenology, and carbon flux within unified analytical approaches. Drought stress (7.9%, n = 60) and greening/browning trends (5.3%, n = 40) formed coherent sub-corpora, while wildfire recovery (3.0%, n = 23) was notably small given the escalating global significance of fire as a vegetation disturbance agent.
The biome distribution documented in
Section 3.9 reveals systematic under-representation of several ecosystem types. Forests were the most studied biome (18.2%, n = 138), consistent with their prominence in global carbon accounting and the availability of forest-specific remote sensing products. Tundra was disproportionately well-represented (4.4%, n = 33) relative to its global land area, reflecting heightened scientific attention to Arctic warming and the potential tipping point dynamics associated with permafrost degradation. By contrast, dryland and desert ecosystems (2.2%, n = 17), grasslands (2.2%, n = 17), peatlands (three studies), and shrublands (six studies) were all severely under-represented relative to their global extent and ecological importance.
The under-representation of drylands is particularly consequential. Drylands cover approximately 40% of the global land surface, and the vegetation dynamics of dryland ecosystems — dominated by the interplay of precipitation variability, vapour pressure deficit, and soil moisture — differ fundamentally from those of humid forest systems. The methodological arsenal developed for humid forest monitoring translates imperfectly to these environments, where vegetation cover is sparse and spectrally mixed, NDVI saturates at the low end rather than the high end, and phenological cycles are driven by episodic rainfall rather than photoperiod. The few studies that have addressed dryland systems demonstrate the feasibility of satellite-based monitoring in these challenging environments, including phenological tracking in desert biomes (e.g., Dutra et al., 2025; Qader et al., 2022), above-ground biomass estimation in arid shrublands (Hernández-Martínez et al., 2025), and greening/browning trend analysis across arid regions (Tang et al., 2025; Chen et al., 2025). The development of dryland-specific remote sensing methods, validated against the growing network of eddy covariance flux towers in these environments, represents a research priority of high ecological and carbon accounting significance.
Peatland under-representation is similarly acute. Peatlands store approximately 500–600 Pg of carbon in organic soils accumulated over millennia — a stock that dwarfs the carbon in above-ground forest biomass. Their vulnerability to drainage, fire, and the permafrost degradation associated with Arctic warming makes them a critical system for long-term carbon cycle monitoring. Yet only three studies in the 757-study corpus explicitly addressed peatland ecosystems: Habib et al. (2024) examined land cover change and carbon flux dynamics in tropical peatlands; and Zhou et al. (2023) quantified carbon stocks using multi-sensor fusion in boreal peat systems. Wetlands more broadly (6.5%, n = 49) were better represented, largely through studies of mangroves and salt marshes, but the peat-forming wetlands that constitute the largest terrestrial carbon store were essentially absent.
4.6. Open Science Practices and Reproducibility
The open science audit documented in
Section 3.10 reveals a field in which reproducibility remains substantially constrained. Of the 757 included studies, only 91 (12.0%) explicitly reported sharing analysis code, and only 70 studies (9.2%) confirmed open data availability. Studies that did share both code and data provide replicable templates for the broader community — including, for example, Maurya and Mahajan (2025), Zhou et al. (2025), Kluczek and Zagajewski (2025), Chen et al. (2025), and Crişu et al. (2025), whose repositories are catalogued in
Table A2. Sixteen studies explicitly stated that code was not publicly available, and 37 stated that data were not shared. The large majority (85.9%, n = 650) made no statement about code or data availability, a pattern consistent with the absence of mandatory data and code sharing policies in many journals across the study period.
These rates are low by the standards of neighbouring disciplines that have invested more systematically in reproducibility infrastructure. The consequences are concrete: when methods cannot be inspected, replicated, or applied to new study regions, the cumulative scientific value of individual studies is substantially reduced. Systematic reviews like the present one must rely on reported results rather than re-analysed data, making it impossible to normalise performance metrics, harmonise variable definitions, or verify the robustness of reported findings. The adoption of platforms such as GitHub, Zenodo, and the Open Science Framework for code and data deposition — now standard practice in fields such as ecology and computational biology — would substantially enhance the scientific return on the investment represented by each published study.
The temporal trajectory of open science adoption, while not separately presented here due to data limitations, is likely positive: mandatory data availability statements are increasingly required by high-impact journals, and the culture of pre-registration and open materials is diffusing from experimental psychology and ecology into the remote sensing community. The 12.0% code sharing rate documented across the full 2000–2025 period almost certainly underestimates the current (2023–2025) rate, given that open science norms were effectively absent in the remote sensing literature before approximately 2015. Future systematic reviews with a shorter time horizon will be better positioned to document this trajectory quantitatively.
4.7. Limitations of This Review
Protocol registration. This review was not prospectively registered on PROSPERO or the Open Science Framework (OSF) prior to data collection. Prospective registration is a core component of PRISMA-compliant systematic reviews, as it distinguishes a priori hypotheses and analytical decisions from post-hoc adaptations, and provides an auditable record against which deviations can be assessed (Page et al., 2021). The absence of registration does not invalidate the findings, but it means that decisions regarding eligibility criteria, search strategy, and synthesis approach cannot be independently verified as having been made before results were known. Future updates to this review should be prospectively registered to strengthen methodological transparency.
Database coverage. This review searched a single bibliographic database (Scopus). Web of Science, PubMed, CAB Abstracts, and grey literature sources — including technical reports, pre-prints, and institutional repositories — were not systematically searched. Single-database searches are known to introduce selection bias, as coverage varies substantially across disciplines, journals, and publication types (Bramer et al., 2017). Scopus provides broad coverage of peer-reviewed remote sensing literature, and the final corpus of 757 studies represents a substantial evidence base; however, it is likely that a proportion of relevant studies indexed exclusively in Web of Science or published as grey literature were not captured. The findings, particularly those relating to geographic distribution and thematic coverage, should be interpreted with this constraint in mind. The underrepresentation of Global South research identified in
Section 3.3 may be partly attributable to database bias, since Scopus indexing is itself skewed toward journals from high-income countries.
4.8. Synthesis: Research Gaps and Priority Directions
The evidence base assembled across 757 studies and the systematic analysis of their geographic distribution, sensor use, analytical methods, application domains, and biome coverage converges on five priority research directions.
First, the geographic equity deficit demands targeted investment in research capacity in underrepresented regions, particularly Sub-Saharan Africa, Southeast Asia, and tropical Latin America. The barrier is no longer data access — Sentinel-2, Landsat, and MODIS are globally and freely available — but the institutional capacity to process and interpret those data in the context of local ecological knowledge. International collaboration programmes that pair Global North analytical capacity with Global South ecological expertise offer a partial solution, but must be structured to build sustainable local capacity rather than extracting intellectual value for external publication.
Second, the systematic under-representation of drylands, peatlands, and shrublands in the published literature requires deliberate community effort to develop and validate remote sensing methods appropriate for these ecosystems. This includes the development of dryland-specific spectral indices that perform robustly at low vegetation cover fractions, the extension of phenological monitoring frameworks to episodically vegetated systems driven by rainfall rather than photoperiod, and the integration of radar (SAR) and thermal infrared observations to penetrate the cloud cover that limits optical monitoring of tropical peatland systems.
Third, the performance metric inconsistency documented across the corpus — only 10.3% of studies reporting R² and 8.5% reporting RMSE — constitutes a barrier to cumulative scientific progress that requires a community-level response. The adoption of standardised reporting guidelines, analogous to the EQUATOR network's TRIPOD statement for clinical prediction models, would enable meaningful cross-study benchmarking and facilitate the meta-analytic synthesis that the volume of published studies now warrants.
Fourth, the under-utilisation of Solar-Induced Fluorescence as a vegetation monitoring tool — present in only 16 studies across the full corpus — represents a missed opportunity given SIF's mechanistic superiority to structural indices for capturing actual photosynthetic activity. As the OCO-3, TROPOMI, and future FLEX mission datasets accumulate, the systematic integration of SIF with structural indices such as NDVI and EVI in multi-index vegetation monitoring frameworks deserves substantially greater research investment than the current literature reflects.
Fifth, the open science deficit documented here — 12.0% code sharing and 9.2% data sharing across the corpus — must be addressed through a combination of journal policy, funder requirements, and community norm shifts. The scientific community has demonstrated, in fields from genomics to ecology, that open code and data sharing dramatically accelerate methodological progress and enable the type of cross-study synthesis on which evidence-based policy depends. The vegetation remote sensing community has the infrastructure — GitHub, Zenodo, the Google Earth Engine code sharing environment — to achieve substantially higher open science adoption rates than the current baseline reflects.
Taken together, the findings of this review affirm both the remarkable scientific productivity of the satellite remote sensing of vegetation responses to climate change over the 25-year study period and the structural constraints that limit the field's capacity to provide the globally consistent, biome-complete, methodologically reproducible evidence base that the urgency of climate change demands. Addressing these constraints — geographic, methodological, and institutional — represents the defining challenge for the field in the decade ahead.