1. Introduction
China has entered a phase of high urbanization in which rapid economic expansion coexists with persistent “urban maladies”. By the end of 2023, the urbanization rate of permanent residents reached 66.16%, indicating a predominantly urban society(Organizational Chart, 2022). Yet, high GDP does not necessarily translate into a “healthy city” in residents’daily experience: even economically advanced cities may face severe congestion, recurrent pluvial flooding, public safety concerns, and environmental degradation, while some less affluent cities may offer more affordable housing, more manageable commuting burdens, and better everyday accessibility(Connor et al., 2025). In response, China has promoted the Urban Physical Examination (UPE) as a problem-oriented governance instrument since 2018, with pilots launched in 11 cities in 2019 and subsequent expansion(Liu & Fang, 2025; Zhang et al., 2021). A distinctive feature of UPE is the institutionalization of large-scale resident surveys, where city-level satisfaction serves as a key people-centred indicator to assess perceived “urban health” alongside objective diagnostics(Pazos-García et al., 2025; Shi et al., 2021). Therefore, it is an important initiative that can effectively reflect the health of the city.
From an academic standpoint, the link between economic development and perceived urban health is theoretically non-linear and empirically debated. Subjective well-being (SWB) research generally finds that income is positively associated with evaluative measures (e.g., life satisfaction), but the marginal returns may diminish due to adaptation and social comparison(Kahneman & Deaton, 2010; Killingsworth, 2021; Roszkowska et al., 2025). Cross-country evidence, however, often reports a robust positive association between national income and average SWB, suggesting that the growth–well-being relationship can be context-dependent rather than universally plateauing. These debates imply that the GDP–satisfaction relationship may exhibit threshold, plateau, or other non-linear patterns—functional forms that are substantively important for policy interpretation(Killingsworth et al., 2023). Meanwhile, urban perceived health/quality of life (QoL) is multidimensional and shaped by demographic pressure, the built environment, public service provision, and environmental management—domains emphasized in the “healthy city” paradigm(He et al., 2024; Nicolás-Martínez et al., 2024a). In the UPE context, these domains also tend to covary with economic level; therefore, empirical models that relate GDP to resident-rated city satisfaction should, at minimum, account for key city characteristics (e.g., population, density, industrial structure, fiscal capacity) to reduce omitted-variable bias(Yu et al., 2024). Methodologically, restricted cubic splines (RCS) provide a flexible and interpretable approach to diagnosing and testing non-linearity without imposing arbitrary cut-points(Desquilbet & Mariotti, 2010).
Against this background, this study examines whether and how city economic level (GDP) is associated with city-level overall satisfaction—used as a parsimonious proxy for perceived urban health—using the 2024 nationwide UPE resident survey dataset (details reported in the Data section). We estimate both a linear benchmark model and an RCS specification, and formally test non-linearity via nested-model comparisons, in both unadjusted and covariate-adjusted settings (
Figure 1). The analysis is organized around two core expectations: H1: higher GDP is associated with higher city-level satisfaction; H2: the GDP–satisfaction association is non-linear, consistent with diminishing marginal returns or threshold/plateau patterns suggested by SWB theory. By clarifying the functional form of the GDP–perceived urban health relationship under an institutionalized people-centred governance framework (UPE), the study provides evidence for differentiated policy interpretation beyond “more growth is always better”(Jebb et al., 2018).
2. Data and Methods
2.1. Data Sources
This study draws on the nationwide Urban Physical Examination (UPE) resident survey organized by the Ministry of Housing and Urban-Rural Development (MoHURD) in 2024. The questionnaire was designed to capture residents’ perceived urban conditions across four spatial scales—dwelling, neighborhood/community, block, and city—and operationalized through 45 indicators spanning eight domains (e.g., safety and durability, ecological livability, safety resilience, and jobs–housing balance). The survey was implemented between April and June 2024 within the main urban districts of the sampled cities. In total, 692,800 questionnaires were collected from 47 cities; after data cleaning, 499,500 valid responses were retained, yielding an effective response rate of 71.96%.
The target population comprised local permanent residents aged 16 years and above who had resided in the city for at least six months, excluding short-term visitors and transient workers with shorter stays. To enhance representativeness and comparability across cities, the fieldwork adopted multiple sampling and quota-control strategies, including proportional stratified sampling and cross-quota controls, to ensure broad and balanced coverage in respondents’ spatial distribution and socio-demographic structure (gender, age, occupation, and income).
The 47-city sample was selected to reflect heterogeneity in economic level, population scale, and geographical location, enabling cross-city comparison across diverse development contexts. This rationale is visualized in the national overview figure (
Figure 2), which maps the sample cities and highlights between-city variation in GDP and population size. In the empirical analysis, individual responses are aggregated to the city level to construct the study outcome (overall satisfaction as a proxy for perceived urban health), while GDP and other city characteristics are matched at the same administrative level. Descriptive statistics of all study variables are reported in
Table 1.
2.2. Methods and Preprocessing
- (1)
Data preprocessing and analytic sample
The MoHURD UPE microdata were first screened according to the survey’s eligibility criteria (permanent residents aged ≥16 years with a minimum six-month residence). We then aggregated individual responses to the city level to construct the dependent variable, overall city satisfaction (Satisfaction), using the arithmetic mean of valid responses within each city. City-level socioeconomic covariates were harmonized to the same administrative level and matched to the 47 sampled cities. Given the strong right-skewness typical of macroeconomic indicators, GDP was log-transformed (log(GDP)) prior to modeling. Control variables were transformed when appropriate (e.g., logarithmic transformation for population and fiscal expenditure per capita) to reduce skewness and to improve linearity in covariate–outcome relations. In addition, to ensure model stability, we diagnosed collinearity among candidate covariates (pairwise correlations and VIF) and retained the final set of covariates used in the adjusted models.
- (2)
Restricted cubic splines for non-linear GDP–satisfaction associations
To characterize potential non-linear associations between economic level and perceived urban health, we modeled log(GDP) using restricted cubic splines (RCS) (also referred to as natural cubic splines)(Austin, 2025; Gauthier et al., 2020). RCS offers a flexible yet parsimonious functional form: it allows curvature within the data range while constraining the relationship to be linear in the tails, reducing implausible extrapolation(Discacciati et al., 2025).
Let
denote the city-level satisfaction score for city
,
denote
, and
denote a vector of city-level covariates. The RCS regression can be written as:
where
is approximated by an RCS basis expansion:
with knots
, and
. A commonly used restricted basis is:
In practice, spline complexity can be parameterized either by the number of knots or by the degrees of freedom (df) of the spline basis. Consistent with best practices for small-to-moderate samples, we adopted an RCS specification with df = 4 for the main analysis and evaluated robustness using alternative df settings (df = 3 and df = 5) to ensure that substantive conclusions were not driven by overfitting or underfitting. The df choice reflects a standard trade-off: sufficient flexibility to capture plausible curvature while maintaining interpretability and statistical power given the city-level sample size(Arnes et al., 2023; Discacciati et al., 2025).
- (3)
Model specification: unadjusted vs adjusted
We estimated two nested specifications, each under both linear and RCS functional forms for :
(a) Unadjusted model
where
is either a linear term (
) or the RCS function
.
(b) Adjusted model
where
includes city demographic and structural characteristics (e.g., population size, population density, industrial structure, fiscal capacity) and broad regional indicators to account for spatially clustered heterogeneity.
- (4)
Inference on non-linearity: nested-model F test
To formally assess whether the GDP–satisfaction association departs from linearity, we conducted a nested-model F test comparing the RCS model (full) against the linear model (restricted)(Schuster et al., 2022). The null hypothesis is that all spline terms beyond the linear component jointly equal zero:
Let
and
denote residual sums of squares from the linear and RCS models, respectively; let
be the number of additional spline parameters beyond linearity; and let
be the number of parameters in the full model (including the intercept). The test statistic is:
which follows an
distribution under
. We applied this procedure to both the unadjusted and adjusted specifications, thereby testing non-linearity conditional on covariates.
- (5)
Reporting and visualization
For interpretability, we report model-predicted satisfaction across the observed range of log(GDP) and visualize the fitted RCS curve with 95% confidence intervals (CI). In addition to the spline plots, we present scatterplots of GDP versus satisfaction with an overlaid trend line to convey the raw association prior to functional-form modeling.
4. Discussion
4.1. Main Findings and Iinterpretation oC curSe Shape
This study provides a city-level assessment of how economic capacity (GDP) is associated with perceived urban health, proxied by UPE-based overall satisfaction aggregated to the city level. Across both the descriptive bivariate evidence and the regression-based specifications, the central empirical result is consistent: cities with higher GDP tend to report higher overall satisfaction, but the relationship is far from deterministic, as indicated by substantial between-city dispersion around the fitted trend.
(1) “No plateau within the observed range”: GDP as an enabling capacity for urban health.The near-linear increasing profile can be read as evidence that, for the sampled cities, additional economic capacity continues to translate into improvements in domains that residents experience directly and evaluate positively—such as infrastructure maintenance, public service coverage, environmental management capacity, and the city’s ability to respond to risks and disruptions. Under the UPE framework, overall satisfaction is a comprehensive, people-centred outcome that plausibly reflects the cumulative performance of these systems. If economic resources are effectively converted into public goods and urban management quality, then a plateau is not inevitable; rather, “economic upgrading” may continue to yield observable gains in perceived urban health.
(2) “Diminishing returns may exist but are difficult to detect at city level”: bounded outcomes and offsetting externalities. The absence of statistically detectable curvature does not rule out diminishing marginal returns in principle. Satisfaction is a bounded, composite perception; as baseline conditions improve, incremental gains become harder to achieve and to measure. Moreover, higher-GDP cities frequently face countervailing pressures—congestion, housing cost burdens, environmental externalities, and governance complexity—that can partially offset the benefits of economic affluence. Empirically, this tension is visible in two ways: (i) the fitted curve’s confidence band expands toward the distribution tails (especially at high ), indicating greater uncertainty where observations are sparse; and (ii) cities with similar GDP can differ markedly in satisfaction, suggesting that the conversion efficiency from economic resources to lived urban quality varies widely. In short, a “plateau story” may be conceptually plausible, but the current cross-sectional city-level design—particularly with limited density in the extreme GDP ranges—may be underpowered to distinguish subtle curvature from noise.
(3) The most policy-relevant reading of these findings is therefore not “GDP matters” per se, but what GDP represents and what it does not. GDP appears to operate as a resource base that is generally associated with higher perceived urban health; however, the large dispersion around the curve indicates that GDP is necessary but not sufficient. Some cities achieve relatively high satisfaction at modest GDP levels (suggesting strengths in governance, service delivery, or urban form), while others underperform relative to their GDP (suggesting that negative externalities or misallocation may dominate residents’ daily experience). This “overperformer/underperformer” pattern is precisely why a flexible functional-form assessment is valuable: it frames GDP as an enabling condition, while directing analytical and policy attention toward the factors that shape how economic capacity is translated into resident-perceived urban health.
4.2. Comparison with Prior Lliterature
Our findings contribute to, and partially reconcile, several established but sometimes divergent strands of research on the economy–well-being nexus.
First, the direction and overall monotonicity of the GDP–satisfaction relationship aligns with the dominant “positive gradient” view in macro-level well-being research(Oishi et al., 2022; Prati, 2024). Cross-country and long-run evidence generally indicates that higher average economic development is associated with higher average subjective well-being, and some influential re-analyses argue against a clear “satiation point” at the national level(Nicolás-Martínez et al., 2024b). Consistent with this perspective, our city-level results show a robust positive association between economic capacity and residents’ overall evaluation of their city, and the flexible RCS specification does not yield statistically detectable curvature relative to a linear benchmark(Killingsworth, 2021; Killingsworth et al., 2023).
Second, our results differ in emphasis from the “income satiation / diminishing returns” literature that is largely grounded in individual-level data. Studies distinguishing evaluative well-being from experienced affect have reported plateau-like patterns for some components of well-being (e.g., emotional well-being) beyond a threshold income, while other work identifies turning points and region-specific satiation levels(Killingsworth, 2021; Killingsworth et al., 2023). In contrast, we do not observe a statistically identifiable plateau within the GDP range covered by the 47-city sample. This discrepancy is not necessarily contradictory: (i) our outcome—city-level overall satisfaction—is closer to an evaluative judgment (how residents appraise their city as a whole) than to momentary affect, and evaluative measures are commonly found to scale more linearly with economic resources; (ii) we use city GDP (scale capacity) rather than household income, and the mapping from aggregate resources to individual experience depends on how effectively cities convert fiscal and economic capacity into public goods and everyday urban functioning(Xu et al., 2024).
Third, relative to urban quality-of-life (QoL) and “unhappy cities” scholarship, our evidence underscores the non-sufficiency of economic prosperity and highlights the role of urban externalities and governance capacity in explaining cross-city heterogeneity(Carlsen & Leknes, 2022). In the cities literature, QoL is often conceptualized as multidimensional, where economic performance is only one component among services, environment, safety, housing, mobility, and social cohesion; consequently, GDP per capita tends to be positively associated with life satisfaction, but it does not fully account for the spatial distribution of well-being(Carlsen & Leknes, 2022; Nicolás-Martínez et al., 2024b; Węziak-Białowolska, 2016). Related urban-economics arguments emphasize that large, prosperous cities can simultaneously generate disamenities (e.g., congestion, high costs, stressors) that offset gains in average well-being, producing substantial dispersion even at similar levels of development(Glaeser et al., 2016). Our results mirror this core insight: the scatter around the fitted GDP–satisfaction curve is substantial, and after adjustment the model fit improves (higher R²), implying that non-economic city attributes systematically structure perceived urban health beyond GDP alone(Sørensen, 2024).
Fourth, compared with China-focused evidence, our findings are consistent with the view that urban well-being in China is jointly shaped by structural urbanization conditions and city-level governance/service performance(Guo et al., 2025; Liao et al., 2022a). Research on life satisfaction in urbanizing China highlights the importance of city context (e.g., city size and institutional conditions) for residents’ evaluations(Chen et al., 2015). Complementary applied studies in the Chinese context suggest that resilience, management, and service provision are salient correlates of subjective well-being and satisfaction, reinforcing the interpretation that “conversion efficiency” from resources to lived experience matters(Liao et al., 2022b). Importantly, this resonates with the institutional design of China’s Urban Physical Examination (UPE) practice, which explicitly treats residents’ satisfaction as a core output alongside objective performance metrics and action lists—thereby embedding subjective evaluation into urban governance cycles(Liao et al., 2022b).
4.3. Policy Implications
Differentiated strategies by GDP range remain essential because constraints and marginal returns differ by development stage, even if the overall association is monotonic. For lower-GDP cities, the highest-yield interventions typically concentrate on basic-function reliability—maintenance of aging infrastructure, drainage and flood control, neighborhood safety and lighting, last-mile accessibility, and visible micro-environment upgrades that quickly reduce daily friction. In these contexts, resident satisfaction often responds strongly to improvements that stabilize core services and reduce recurrent disruptions, and the governance priority is to build credible, measurable delivery capacity rather than pursuing broad, capital-intensive expansion.
For middle-GDP cities, the policy frontier shifts from “coverage” to system integration and quality upgrading. Rapid spatial expansion, motorization, and rising expectations mean that residents increasingly evaluate the city through time-cost and convenience: commuting reliability, congestion management, facility accessibility, and the perceived fairness of service distribution. Here, simply increasing investment is rarely sufficient; what matters is coordination across land use–transport–public services–environment, to avoid fragmented projects that raise costs without improving lived experience. In practice, UPE results can be used to identify whether dissatisfaction clusters around jobs–housing balance, mobility, ecological livability, or safety resilience, and then to implement integrated packages rather than single-sector fixes.
For higher-GDP cities, the binding constraints tend to be externalities, cost pressures, and governance complexity. High GDP can coexist with persistent dissatisfaction if congestion, housing burdens, environmental exposure hotspots, or “fine-grained” neighborhood management deficits dominate residents’ daily experience. The implication is to prioritize externality control and governance modernization: demand management and multimodal transport reliability, affordability-oriented housing and rental governance, refined environmental exposure mitigation at the neighborhood scale, and responsive, data-driven urban management that closes the “last-mile governance gap.” In these cities, satisfaction gains are often achieved less by adding more infrastructure and more by improving operational performance and equity in service access.
Regional differentiation should be layered on top of GDP staging because “the same GDP” can imply different structural pressures. In coastal and high-density regions, satisfaction is often constrained by congestion, time-cost, high living expenses, and environmental exposure concentrated in dense neighborhoods; thus, policies should emphasize demand management, exposure hotspot governance, and equitable service reallocation. In inland or lower-density regions, accessibility coverage, basic public-service catchments, and infrastructure reliability frequently dominate perceived outcomes; thus, policies should emphasize service completion, maintenance regimes, and resilience of essential systems. For shrinking or slow-growth regions, the conversion-efficiency lens is particularly important: the priority is not expansion but right-sizing services, maintaining quality under fiscal constraints, and preventing “infrastructure decay” from eroding perceived urban health.
Finally, the results reinforce a governance shift: treat resident satisfaction not as a generic “soft indicator,” but as a performance metric that reveals where economic growth is not translating into lived benefits. The GDP–satisfaction curve can be institutionalized as a diagnostic tool within UPE practice—supporting differentiated renewal strategies, more accountable action lists, and a clearer focus on conversion efficiency. Over time, repeating the same analysis across survey waves would allow policymakers to evaluate whether cities are moving upward (raising satisfaction) and toward the benchmark (improving conversion), which is arguably the most actionable interpretation of “high-quality urban development” in a people-centred governance framework.
5. Conclusions
5.1. Summary
This study examined whether and how economic development (GDP) is associated with perceived urban health, proxied by city-level overall satisfaction aggregated from the 2024 nationwide Urban Physical Examination (UPE) resident survey. Using 47 Chinese cities, we first documented a moderate positive bivariate association between GDP and satisfaction, while also observing substantial cross-city dispersion—indicating that economic prosperity does not automatically translate into uniformly higher perceived urban health.
To move beyond a linear assumption, we employed restricted cubic splines (RCS) to flexibly assess the functional form of the GDP–satisfaction relationship. After log-transforming GDP to reduce skewness and leverage effects, the RCS estimates revealed a smooth, generally increasing association between log(GDP) and city-level satisfaction. However, nested-model tests comparing linear and spline specifications provided no statistical evidence of non-linearity in either the unadjusted model or the covariate-adjusted model. Together, these results suggest that, within the observed GDP range of the sampled cities, higher economic capacity is consistently associated with higher satisfaction, but the magnitude of perceived urban health is also strongly shaped by non-economic factors reflected in the dispersion around the fitted curve.
From a governance perspective, the findings support interpreting GDP as an enabling capacity rather than a sufficient condition. The observed heterogeneity implies that cities differ in their conversion efficiency—how effectively economic resources are translated into service quality, environmental conditions, mobility reliability, safety, and other everyday experiences that residents evaluate in the UPE system.
5.2. Future Work
Several limitations point to directions for future research and policy-oriented refinement.
First, the analysis is conducted at the city level; aggregating satisfaction to a single city mean inevitably masks within-city disparities. Future work should extend the framework to finer spatial units (e.g., districts or neighborhoods) to assess whether the economy–perceived urban health association differs across urban subareas and whether “conversion efficiency” varies within cities.
Second, the study uses a cross-sectional design. While the association between GDP and satisfaction is robust, causal interpretation remains limited. Longitudinal analyses using repeated UPE waves would allow testing whether changes in economic capacity and governance performance precede changes in resident satisfaction, and whether functional form evolves over time.
Third, although we controlled for key city characteristics and screened for collinearity, the current models cannot fully capture institutional and governance processes (e.g., budgeting rules, service delivery efficiency, responsiveness to complaints, and implementation capacity) that likely explain why similarly wealthy cities diverge in perceived outcomes. Future research could integrate administrative performance metrics, digital governance indicators, or objective service accessibility measures to better operationalize the “conversion” mechanism.
Fourth, the non-linearity tests may be constrained by the sample size and tail coverage at extreme GDP levels. With a broader set of cities, or pooled multi-year data, future work could more powerfully detect subtle curvature, potential thresholds, or heterogeneous functional forms across regions and city types (e.g., shrinking vs. fast-growing cities).
Finally, perceived urban health is inherently multidimensional. Future studies could move beyond overall satisfaction to model domain-specific perceptions (e.g., resilience, ecological livability, jobs–housing balance) and to examine whether economic development relates differently to each domain—thereby producing more targeted and actionable policy insights for UPE-guided urban renewal.