3. Thematic Organization of Methods
The study found significant diversity in data sources regarding the benefits of urban green space and schoolyard greening on property valuation. Researchers typically use three types of data: (1) transactional data (real estate sales, assessed values, rental prices), (2) spatial and remote sensing data (satellite imagery, street view images, NDVI, canopy cover), and (3) survey and qualitative data (questions, interviews, field observations).
The framework illustrated in
Figure 1 situates schoolyard greening within the broader scope of urban green space research, explaining the connections among methodological approaches (e.g., statistical, geographical, qualitative), key outcome variables (property value, equity, health), and policy significance. This visual model emphasizes the specific methodological and policy challenges associated with schoolyard greening, which often overlaps with educational, social, and environmental issues.
Most of the quantitative research is based on transactional data. Sale prices per square meter, listing prices, and land price zoning data, which are commonly obtained from real estate agencies, government records, or online platforms, provide accuracy and objectivity for property valuation but can be difficult to obtain (Ben et al., 2023; Chen, Lin et al., 2022; Lieber, 2022; Moradi et al., 2022; Zhang et al., 2022).
Spatial and distant sensing data are becoming important. The widespread use of NDVI (Normalized Difference Vegetation Index), tree canopy coverage from satellite data, and the Green View Index (GVI) derived from street views enables an objective and scalable assessment of "greenness." Landsat, Google Earth Engine, and Baidu Maps are widely used for high-resolution research (Ben et al., 2023; Chen, L. et al., 2020; Ewane et al., 2023; Zhang et al., 2022). Several research use various spatial datasets for deep analysis, including transactional, administrative, and environmental datasets (Droj et al., 2024; Chen, L. et al., 2020).
Surveys and qualitative data enhance traditional sources, particularly where official records are unavailable or local viewpoints are important. Questionnaire-based studies (Aziz et al., 2021; Setiowati et al., 2024), interviews with real estate agents (Ledraa & Aldubikhi, 2025), and field observations all contribute to the empirical framework and contextualization of quantitative data. This data is especially useful for measuring subjective elements of green space value and community sentiments toward greening programs (Setiowati et al., 2024; Lieber, 2022).
The operationalization of "greenness" serves as a methodological focus point, which includes several major strategies:
1. Researchers typically use proximity-based metrics to evaluate greenness, such as straight-line (Euclidean) or network distances between properties and nearby green areas, parks, or schoolyards (Moradi et al., 2022; Lieber, 2022; Zhang et al., 2022). This method gives a baseline statistic, but it may oversimplify actual access in densely crowded metropolitan areas.
2. Common remote sensing metrics include NDVI, tree canopy cover, and green coverage percentages, which measure vegetation density and health. These indicators, which are used at several geographical sizes (parcel, neighborhood, and buffer zones), are recognized for their impartiality and replicability. The Green View Index (GVI), created from street-level pictures, provides a "human perspective" on visible greenery (Chen, L. et al., 2020; Ewane et al., 2023; Zhang et al., 2022).
3. Accessibility and Synthesis Indexes: Recent research has developed accessibility indices that assess green spaces based on size, population serviced, and potential congestion, often using floating catchment or kernel density approaches (Ben et al., 2023; Zhang et al., 2022). These approaches realize that not every green area provides equal access and benefits to all demographics.
4. Use qualitative and checklist-based techniques. Certain studies use checklists or established techniques to evaluate the quality, upkeep, or amenity value of green spaces, such as resident questionnaires or expert panels (Zhang et al., 2022). Nonetheless, subjective metrics are unusual in large valuation studies.
5. Tools specific to the schoolyard: The Green Schoolyard Evaluation Tool (GSET) and its associated tools are intended to promote greenness in educational settings (van den Bogerd & Maas, 2024), however they are rarely used in property value research.
The research consistently shows a lack of schoolyard-specific data and methodological focus. Due to data restrictions, most studies combine schoolyards with other green areas or remove them entirely (Clauzel et al., 2025; Ewane et al., 2023). There are numerous issues associated with maintaining schoolyard greening.
Data on schoolyard characteristics, vegetation condition, and accessibility vary.
Inadequate use of specialist approaches, such as GSET, leads to poor comparability across settings (van den Bogerd & Maas, 2024).
Sekulova and Ruiz Mallén (2024) contend that proximity measurements usually overlook specific school-community linkages and usage patterns.
These constraints are made worse in the Global South, where municipal records and remote sensing coverage are frequently insufficient (Setiowati et al., 2024; Ewane et al., 2023).
The hedonic pricing model (HPM) is widely used in urban green space valuation research, regularly breaking down real estate prices into structural, locational, and environmental components (Setiowati et al., 2024; Moradi et al., 2022; Ben et al., 2023). Traditional hedonic price models (HPMs) use ordinary least squares (OLS) regression in log-linear or semi-logarithmic format to determine the hidden value of green space attributes (Lieber, 2022; Aziz et al., 2021).
Recognizing the limits of classic models, particularly their incapacity to account for regional dependency and local variation, researchers are increasingly using complex spatial econometric models. Geographic lag and error models are common approaches for addressing autocorrelation in property pricing.
Geographically weighted regression (GWR) takes into consideration local changes in the link between greenness and property values (Chen, Lin et al., 2022; Chen, K. et al., 2022).
Machine Learning and Scenario Simulation Collective learning approaches, such as XGBoost, deep neural networks, and SHAP value interpretation, are widely used to improve prediction precision and simulate nonlinear interactions (Chen, L. et al., 2020; Droj et al., 2024). Nonetheless, criticisms of their "black box" character persist (Chen, L. et al., 2020).
Machine learning approaches such as random forests, XGBoost, and neural networks are increasingly being used to determine property value. These models often exceed OLS and spatial regression models in predicting accuracy due to their capacity to capture nonlinearities and high-dimensional interactions (Ben et al., 2023; Chen, L. et al., 2020). XGBoost explained 20% more variance in housing prices than OLS, particularly in places with diverse green space distributions (Chen et al., 2020).
To overcome the "black box" difficulty, recent research has adopted explainable AI approaches such as SHAP (Shapley Additive Explanations), which divide model predictions into progressive contributions from each input variable. This makes it easier to analyze the impact of variables such as NDVI or green view index at various spatial scales on property value, connecting forecasting capability to policy significance.
Although machine learning improves predictive accuracy, spatial econometric methods (e.g., GWR, spatial lag/error models) are still preferred for comprehending spatial dependency and heterogeneity. As a result, future research will increasingly combine machine learning into interpretable and spatially specific methodologies.
Although machine learning techniques such as random forests, XGBoost, and neural networks surpass ordinary least squares and traditional spatial regression, they possess limitations. The complex nature of numerous machine learning models often hinders interpretability, creating obstacles for policy implementation and restricting the transparency essential for public sector decision-making. Overfitting, especially in the context of limited sample numbers or biased distributions of greenness variables, might compromise external validity. Moreover, although explainable AI methods such as SHAP values assist in mitigating these challenges by quantifying variable significance, they may inadequately provide the nuanced causal explanations essential for policy formulation and strategic planning. The transferability of machine learning models across different geographies and contexts is inadequately comprehended, and numerous studies lack robust out-of-sample or cross-validation methodologies, hence raising concerns about generalizability.
Geographic Information Systems (GIS) are now an essential methodological tool for identifying properties and green areas, calculating distances, establishing buffer zones (typically 400-1000 meters), doing network modeling, and executing spatial overlays (Droj et al., 2024; Ben et al., 2023). Researchers utilize GIS to show the spatial distribution of facilities, educational institutions, and environmental dangers (Aziz et al., 2021).
Ben et al. (2023) propose using floating catchment areas and network analysis to simulate pedestrian accessibility in real-world scenarios.
Use many geographical data layers, such as green cover, demographics, and zoning, for multivariate analysis and scenario simulations (Chen, L. et al., 2020; Droj et al., 2024).
Some research use 3D GIS, remote sensing categorization (SVM, deep learning), and interaction with BIM or other digital systems to enable advanced modeling (Droj et al., 2024; Chen, L. et al., 2020).
Remote sensing (NDVI, canopy), spatial indicators (e.g., forest size-distance index), and field observations for characteristics such as tree species or ecosystem state are the primary sources of ecological evaluation. While these methods give objective metrics, few research have compared distant measurements to direct ecological assessments.
Behavioral techniques include surveys and interviews to gather information about residents' preferences and perceptions of the quality, safety, and amenities of their green spaces. However, the integration of such data with geographic analysis is rare, and questionnaire validation is inconsistent.
Despite the increasing number of quantitative methods in the literature, qualitative and governance-focused approaches are increasingly being applied to address issues such as urban justice, participation, and context-specificity. Here are a few typical methodologies:
Investigating urban greening policies, zoning regulations, and equity issues (Ledraa & Aldubikhi, 2025).
Stakeholder mapping and governance frameworks identify key actors and power relations, focusing on equality, procedural involvement, and equitable outcomes (Sekulova & Ruiz Mallén, 2024).
Case studies and semi-structured interviews provide context for quantitative findings, especially in distinct urban or socio-cultural settings (Ledraa & Aldubikhi, 2025; Setiowati et al., 2024). Thematic coding and qualitative comparative analysis emphasize local perspectives, sector-specific implications, and details like maintenance quality and safety concerns.
Delphi panels and expert workshops may occasionally help consensus on indicators or evaluation frameworks; however, their usefulness in property value investigations is limited.
Mixed methods design that combine HPM or GIS-based models with qualitative interviews help to explain both observed effects and their underlying causes in urban settings (Ledraa & Aldubikhi, 2025; Setiowati et al., 2024). This integration is especially useful in dry, fast urbanizing, or culturally diverse places, where Western-derived models may not fully reflect crucial phenomena.
Advantages and Common Challenges of Governance/Qualitative Analysis.
Advantages:
Recognize basic mechanisms such as privacy, safety, and local standards, which quantitative approaches may neglect.
Provide sector-specific and equality assessments to inform policy suggestions tailored to the local context (Ledraa & Aldubikhi, 2025; Sekulova & Ruiz Mallén, 2024).
Challenges: Interpretative bias and insufficient reproducibility.
Interview samples may be selected to represent specific demographics or stakeholder groups.
Insufficient integration with spatial-statistical models may limit the policy implications of qualitative insights (Setiowati et al., 2024).
Hedonic pricing models (HPM) continue to be the gold standard for determining the economic worth of green space due to their transparency and consistency across research (Setiowati et al., 2024; Ben et al., 2023). Their extensive application illustrates objectivity and economic theory. Traditional HPMs, on the other hand, are coming under fire for failing to account for spatial dependency, nonlinear effects, and context-specific complications (Chen, L. et al., 2020; Ewane et al., 2023).
Spatial econometric models and machine learning approaches provide methodological breakthroughs that improve precision and allow for the simulation of complicated, context-sensitive interactions. The use of spatial lag/error models, Geographically Weighted Regression (GWR), and artificial intelligence/machine learning techniques (e.g., XGBoost, deep learning) indicates a shift toward more advanced and complex research (Ben et al., 2023; Chen, L. et al., 2020; Droj et al., 2024).
GIS and remote sensing allow for objective and scalable assessments of greenness, as well as complete mapping. The merging of several data kinds (transactional, geographical, and social) exhibits methodological progress and has the potential to improve accuracy and policy significance (Droj et al., 2024; Chen, L.; et al., 2020).
Common Strengths:
Transactional, remote sensing, and GIS data are objective and reproducible, allowing for testing and comparison across cities and regions (Ewane et al., 2023; Droj et al., 2024). Relevant to policy: Quantitative estimates of the "green premium" benefit planners, municipalities, and real estate markets (Setiowati et al., 2024; Ledraa & Aldubikhi, 2025).
Potential for Integration: Recent improvements make it easier to integrate quantitative and qualitative methodologies, increasing explanatory power and practical significance (Ledraa & Aldubikhi, 2025; Setiowati et al., 2024).
Common weaknesses:
A lack of connection. Cross-sectional designs are popular, which limits the ability to deduce causality and comprehend long-term consequences or gentrification processes.
Limitations of universality. The emphasis on research undertaken in a single area, typically in the Global North or East Asia, reduces cross-context validity, particularly for small-scale or schoolyard interventions (Setiowati et al., 2024).
According to Sekulova and Ruiz Mallén (2024), most quantitative models fail to account for the implications of distributive and procedural equity in greening, as well as the distinct effects of wealth, race, and tenancy.
There is a scarcity of research that uses schoolyard-specific data and methodology, and methodological innovation for educational or child-centered greening processes is limited (Clauzel et al., 2025; van den Bogerd & Maas, 2024).
Despite frequent recommendations for increased emphasis, schoolyard greening is underrepresented in the methodological literature. The primary challenges are:
Insufficient data quality to distinguish schoolyard effects from overall green space trends.
The poor use of specialist methodologies such as GSET has resulted in a limited understanding of greening's effects on educational and local outcomes (van den Bogerd & Maas, 2024).
According to Sekulova and Ruiz Mallén (2024), quantitative and qualitative models fail to fully integrate the school community's perspectives.
In contrast to quantitative models, qualitative and governance-oriented methodologies offer essential context and capture mechanisms (e.g., equity, participation, governance configurations) that are often imperceptible to mere statistics models. Nevertheless, these approaches possess inherent limitations, such as subjectivity, limited sample sizes, and challenges in scalability or replicability of results. Although thematic coding, stakeholder interviews, and governance mapping may uncover mechanisms of inclusion or exclusion, the incorporation of quantitative spatial or statistical analyses is infrequent and methodologically challenging. Mixed-methods studies that explicitly connect qualitative findings to model outputs are highly valuable; however, they are underrepresented in the examined literature.
In summary, conventional regression models like HPM and OLS continue to be employed to assess the "green premium" in property values due to their simplicity and clarity. Nonetheless, these techniques typically neglect spatial autocorrelation and local variability. Spatial econometric methodologies (e.g., GWR, spatial lag/error models) address these deficiencies; nonetheless, they are computationally intensive and necessitate high-quality geographical data.
Machine learning models advance the field by encapsulating nonlinearities and intricate interactions, but at the cost of clarity and, in certain instances, causal inference. Explainable AI technologies offer advantages; yet their practical implementation by policymakers remains constrained. Qualitative and governance-oriented studies provide essential insights into context, process, and justice; yet they may be undermined unless carefully combined with quantitative data.
Consequently, the most methodologically rigorous and policy-relevant research increasingly integrates several approaches, leveraging their strengths while mitigating their weaknesses.