Submitted:
04 May 2026
Posted:
05 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- In what ways can semantic image segmentation be applied to capture and quantify perceptual qualities of pedestrianized streets?
- How do patterns of urban morphology correspond with pedestrian movement and spatial behavior in heritage areas?
- Which design strategies can strengthen walkability and cultural identity while improving the overall quality of historic streetscapes?
2. Research Background and Theoretical Framework
2.1. Streetscape in Historic Space
2.2. Perceptual Qualities and Assessment of Urban Streets
2.3. History of Research by Semantic Segmentation on Urban Streetscapes
3. Materials and Methods
3.1. Study Area
3.1.1. Historical and Urban Context
3.1.2. Morphological Characteristics of the Case Studies
3.2. Methodology
3.2.1. Selection of Perceptual Indices
| Indices | Formula | Description |
|---|---|---|
| Imageability | ![]() |
The sum indicates the total number of tree pixels in each image. Bn indicates the proportion of building pixels; TCn indicates the proportion of pixels of trees with columnar shape; TOn indicates the proportion of pixels of trees with oval shape; PDn indicates the proportion of pedestrian pixels; Wtn indicates the proportion of water pixels; |
|
Pixels associated with "Window", "Door" were incorporated into the merged class "merge_B_C_D_" , which was subsequently redefined for the purposes of the correlation computation (Table 4, heatmap). | ||
| Enclosure | ![]() |
The sum indicates the total number of tree pixels in each image: Bn indicates the proportion of building pixels, TCn indicates the proportion of pixels of trees with columnar shape; TOn indicates the proportion of pixels of trees with oval shape; Fn indicates the proportion of fences and balustrade on the wall pixels; PVn indicates the proportion of pavement pixels; SKn indicates the proportion of sky pixels. |
| Pixels associated with "Brick wall with fence", "Wall","Steps","Buildings wall", "Buildings with column","Metal Fence" were incorporated into the merged class "Wall_pixels",which was subsequently redefined for the purposes of the correlation computation (Table 4, heatmap). | ||
| Human Scale | ![]() |
Sn indicates the proportion of seats, symbols, windows pixels; Pln indicates the proportion of plant covers such as grass and flowers pixels; PDn indicates the proportion of pedestrian pixels. Fn indicates the proportion of fences and balustrade on the wall pixels. |
| Pixels associated with " Bench and Food stands", "Bulletin board", "Symbol and Sign" were incorporated into the merged class "merge_E_H_N" , which was subsequently redefined for the purposes of the correlation computation (Table 4, heatmap). | ||
| Walking Index | ![]() |
PDn indicates the proportion of pedestrian pixels, and PVn for pavement pixels |
| Greenness | ![]() |
TCn indicates the proportion of pixels of trees with columnar shape; TOn indicates the proportion of pixels of tTrees with oval shape; PALn indicates the proportion of palm trees pixels; Pln indicates the proportion of plant cover such as grass and flowers pixels |
Imageability
Enclosure
Human Scale
Greenness
Walkability Index
3.3. Technical Information
Performance Metrics
3.4. Measurement of Street Width as Physical Attribute

4. Results
4.1. Performance of the U-Net Framework
4.1.2. Class Merging and Optimization.
4.1.2. Hyper-Parameter Tuning


4.2. Descriptive Statistics of the Segmented Results (First-Person Pedestrian Views)
| Class / Element | Average Share (%) | Notes on Distribution |
|---|---|---|
| Buildings / Walls | ~35–40% | Highest share across all streets; walls substituted for buildings in Sepah South and Ostandari. |
| Sky | ~20–25% | Consistently visible in most panoramas, influencing enclosure. |
| Trees (Oval/Columnar) | ~15–18% | Key contributor to enclosure and greenness; palm trees unique to Amadegah. |
| Pavement (Sidewalks/Pathways) | ~10–12% | Unified class for consistency. |
| Windows / Doors | ~5–7% | More visible in Sepah (windows) and Naqsh-e Jahan (doors). |
| Water | <3% | Only observed in the Chahar Bagh middle pathway. |
| Street Furniture / Symbols | <2% | Scattered elements, often associated with human-scale features. |


4.3. Correlation Analysis of Streetscape Perception Indices and Elements
![]() |
4.4. Spatial Patterns of Aggregated Visual Perception Variables
4.5. Width of Street as a Morphological Attribute
5. Discussion
5.1. Interpretation of Morphological–Perceptual Relationships in Relation to Street Structure
5.1.1. Structurally Enclosed and Architecturally Defined Streets
5.1.2. Highly Walked but Weakly Enclosed Axial Spaces
- Naqsh-e Jahan Ax shows very low Enclosure (0.046) and very low Imageability (0.027), yet a high Walking Index (0.323).
- Naqsh-e Jahan Sidewalk exhibits moderate Enclosure (0.429) and low Imageability (0.239), but similarly elevated Walking Index (0.346).
5.1.3. Human Scale as a Supportive Rather Than Dominant Factor
5.1.4. Vegetation Structure and Greenness
5.1.5. Structural Differentiation of Street Types
5.1.6. Structural Interpretation
- Walkability (Walking Index) operates primarily as a socio-functional condition driven by accessibility and pedestrian aggregation.
- Imageability operates as a spatial–visual condition shaped by enclosure and structured vegetation.
- Enclosure functions as a structural mediator that enhances perceptual coherence but does not independently generate pedestrian flow.
- Human Scale acts as a perceptual regulator that improves comfort within active streets but does not correlate directly with width or movement intensity.
- Importantly, the highest pedestrian density (SepahL: 0.588) occurs in a segment combining moderate enclosure (0.686) with the highest Human Scale (0.412), suggesting that pedestrian vitality emerges when configurational accessibility aligns with perceptual proportionality—not from visual memorability alone.
5.1.7. Implications for Future Research
5.2. Contribution of the Study and Application of Pedestrian-Oriented Streetscapes
5.3. Triangulation with Previous Empirical Findings (Integrated Triangular Conceptual Model)
| Aspect | Previous Study | Current Study (This Manuscript) |
|---|---|---|
| Concept & Method | Depth map-based space syntax (integration & connectivity) combined with field observations of pedestrian activity in heritage streets. | Advances walkability analysis by integrating semantic segmentation data of pedestrian activity (walking, sitting). Pixel-level element classification allows pavement coverage to be linked with pedestrian density and street width. |
| Findings | Integration directly influenced pedestrian flow: streets with higher connectivity supported more movement, regardless of aesthetic quality. | Walkability index showed a positive correlation with buildings (r = 0.698), suggesting urban form influences density. Discrepancies appeared where urban furniture and detailed façades were visually absent, lowering perceived walkability. |
| Innovation | Social behavior understood cognitively through integration maps, without direct visual metrics. | Treating walkability index as density of pedestrian activity, segmentation results can predict high-walkability zones with strong similarity to syntax analysis maps. This provides a visual validation of syntactic predictions. |
| Aspect | Previous Study | Current Study (This Manuscript) |
|---|---|---|
| Method | Recognized tree cover, shading, and vegetation presence as contributors to comfort, walkability, and livability—particularly in Chahar Bagh. Field observations, questionnaires, and user perceptions described greenery’s presence and priority. | Extends greenery analysis through pixel-based segmentation (trees, grass, planters), enabling quantification of vegetation in streetscape images. Allows direct measurement of relationships between greenness and other indices such as imageability and enclosure. |
| Findings | Trees and vegetation enhanced comfort and attractiveness; greenness was valued for shading and functionality but could also disrupt imageability or aesthetics. Favoured by pedestrians but not consistently measured. Suggested integrating ecological design into heritage planning. | Tree alignment and species strongly influence aesthetics and functionality. Trees moderately correlate with enclosure and human scale depending on form and placement. Tree-rich pedestrian streets (e.g., Chahar Bagh) showed higher perceived quality, while in Sepah St., trees combined with walls improved enclosure. Positive correlation observed between fences and greenness (Ma, 2021). |
| Innovation | Greenness acknowledged as beneficial but lacked a quantifiable framework; its role in cognitive perception was suggested but not measured. | Provides a quantifiable, image-based greenness index, differentiating tree types and their contribution to enclosure and imageability. Offers new insight into vegetation’s functional role in shaping cultural perception, shading, and pedestrian wayfinding. |
5.4. Width Measurement Correlations
5.5. Design Strategies for Walkable Public Spaces in Heritage Sites
6. Practical Design Guidelines:
- Enhance shading and enclosure – Introduce continuous tree canopies, arcades, or colonnades that reduce sky openness and improve pedestrian comfort in hot climates.
- Strengthen human-scale features – Integrate benches, shop front windows, signage, and localized cultural motifs at eye level to reinforce street vibrancy and encourage interaction.
- Balance heritage preservation with modern mobility – Relocate conflicting modes (e.g., motorcycles, service vehicles) and create small rest areas, ensuring accessibility while preserving the cultural and visual identity of the street.
7. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Conflicts of Interest
References
- Morales-Flores, P.; Marmolejo-Duarte, C. Understanding the Relationship between Public Space Social Capital Expressions and Pedestrian-Oriented Design: A Case Study within the Superblock Action Plan in Barcelona. Cities 2025, 162, 105968. [Google Scholar] [CrossRef]
- Cervero, R. Public Transport and Sustainable Urbanism: Global Lessons. In Transit Oriented Development: Making It Happen; Curtis, C., Renne, J.L., Bertolini, L., Eds.; Ashgate: Aldershot, UK, 2009; pp. 37–50. [Google Scholar]
- Maniei, H.; Askarizad, R.; Pourzakarya, M.; Gruehn, D. The Influence of Urban Design Performance on Walkability in Cultural Heritage Sites of Isfahan, Iran. Land 2024, 13, 1523. [Google Scholar] [CrossRef]
- Carmona, M.; Gabrieli, T.; Hickman, R.; Laopoulou, T.; Livingstone, N. Street Appeal: The Value of Street Improvements. Prog. Plan. 2018, 126, 1–51. [Google Scholar] [CrossRef]
- Qi, Z.; Li, J.; Yang, X.; et al. How the Characteristics of Street Color Affect Visitor Emotional Experience. Comput. Urban Sci. 2025, 5, 7. [Google Scholar] [CrossRef]
- Harvey, C. Measuring Streetscape Design for Livability Using Spatial Data and Methods. Ph.D. Thesis, University of Vermont, Burlington, VT, USA, 2014.
- Park, K.; Tian, G.; Larsen, S.S. Street Life and the Built Environment in an Auto-Oriented US Region. Cities 2019, 88, 243–251. [Google Scholar] [CrossRef]
- De Vos, J.; Lättman, K.; van der Vlugt, A.L.; Welsch, J.; Otsuka, N. Determinants and Effects of Perceived Walkability: A Literature Review, Conceptual Model and Research Agenda. Transp. Rev. 2022, 43, 303–324. [Google Scholar] [CrossRef]
- Ewing, R.; Handy, S. Measuring the Unmeasurable: Urban Design Qualities Related to Walkability. J. Urban Des. 2009, 14, 65–84. [Google Scholar] [CrossRef]
- Gehl, J. Cities for People; Island Press: Washington, DC, USA, 2013. [Google Scholar]
- Elzeni, M.M.; ElMokadem, A.A.; Badawy, N.M. Impact of Urban Morphology on Pedestrians: A Review of Urban Approaches. Cities 2022, 129, 103840. [Google Scholar] [CrossRef]
- Ibrahim, S.; Younes, A.; Abdel-Razek, S.A. Impact of Neighborhood Urban Morphologies on Walkability Using Spatial Multi-Criteria Analysis. Urban Sci. 2024, 8, 70. [Google Scholar] [CrossRef]
- Huang, X.; Zeng, L.; Liang, H.; et al. Comprehensive Walkability Assessment of Urban Pedestrian Environments Using Big Data and Deep Learning Techniques. Sci. Rep. 2024, 14, 26993. [Google Scholar] [CrossRef]
- Song, L. The Street Space Planning and Design of Artificial Intelligence-Assisted Deep Learning Neural Network in the Internet of Things. Heliyon 2024, 10, e35031. [Google Scholar] [CrossRef]
- Ma, H.; Li, J.; Ye, X. Deep Learning Meets Urban Design: Assessing Streetscape Aesthetic and Design Quality through AI and Cluster Analysis. Cities 2025, 162, 105939. [Google Scholar] [CrossRef]
- Nagata, S.; Nakaya, T.; Hanibuchi, T.; Amagasa, S.; Kikuchi, H.; Inoue, S. Objective Scoring of Streetscape Walkability Related to Leisure Walking: Statistical Modelling Approach with Semantic Segmentation of Google Street View Images. Health Place 2020, 66, 102428. [Google Scholar] [CrossRef]
- Khotbehsara, E.M.; Yu, R.; Somasundaraswaran, K.; Askarizad, R.; Kolbe-Alexander, T. The Walkable Environment: A Systematic Review through the Lens of Space Syntax as an Integrated Approach. Smart Sustain. Built Environ. 2025. [Google Scholar] [CrossRef]
- Ewing, R.; Cervero, R. Travel and the Built Environment: A Meta-Analysis. J. Am. Plan. Assoc. 2010, 76, 265–294. [Google Scholar] [CrossRef]
- Hassan, G.F.; Rashed, R.; El Nagar, S.M. Regenerative Urban Heritage Model: Scoping Review of Paradigms’ Progression. Ain Shams Eng. J. 2022, 13, 101652. [Google Scholar] [CrossRef]
- Jacobs, J. The Death and Life of Great American Cities; Random House: New York, NY, USA, 1961. [Google Scholar]
- Li, H.; Ikebe, K.; Kinoshita, T.; Chen, J.; Su, D.; Xie, J. How Heritage Promotes Social Cohesion: An Urban Survey from Nara City, Japan. Cities 2024, 149, 104985. [Google Scholar] [CrossRef]
- Alnaim, M.M.; Albaqawy, G.; Bay, M.; Mesloub, A. The Impact of Generative Principles on the Traditional Islamic Built Environment: The Context of the Saudi Arabian Built Environment. Ain Shams Eng. J. 2023, 14, 101914. [Google Scholar] [CrossRef]
- Wu, C.; Peng, N.; Ma, X.; Li, S.; Rao, J. Assessing Multi-Scale Visual Appearance Characteristics of Neighborhoods Using Geographically Weighted Principal Component Analysis in Shenzhen, China. Comput. Environ. Urban Syst. 2022, 92, 101732. [Google Scholar] [CrossRef]
- Zube, E.H. Cross-Disciplinary and Intermode Agreement in the Description and Evaluation of Landscape Resources. Environ. Behav. 1974, 6, 68–69. [Google Scholar] [CrossRef]
- Kaplan, R.; Kaplan, S. The Experience of Nature: A Psychological Perspective; Cambridge University Press: Cambridge, UK, 1989. [Google Scholar]
- Ewing, R.; Brownson, R.C.; Clemente, O.; Winston, E.; Handy, S. Identifying and Measuring Urban Design Qualities Related to Walkability. J. Phys. Act. Health 2006, 3, S223–S240. [Google Scholar] [CrossRef]
- Westerholt, R.; Acedo, A.; Naranjo-Zolotov, M. Exploring Sense of Place in Relation to Urban Facilities—Evidence from Lisbon. Cities 2022, 127, 103750. [Google Scholar] [CrossRef]
- Burrage, H. Green Hubs, Social Inclusion and Community Engagement. Proc. Inst. Civ. Eng. Munic. Eng. 2011, 164, 167–174. [Google Scholar] [CrossRef]
- Ogawa, Y.; Oki, T.; Zhao, C.; Sekimoto, Y.; Shimizu, C. Evaluating the Subjective Perceptions of Streetscapes Using Street-View Images. Landsc. Urban Plan. 2024, 247, 105073. [Google Scholar] [CrossRef]
- Lynch, K. The Image of the City; MIT Press: Cambridge, MA, USA, 1960. [Google Scholar]
- Ameli, S.H.; Hamidi, S.; Garfinkel-Castro, A.; Ewing, R. Do Better Urban Design Qualities Lead to More Walking in Salt Lake City, Utah? J. Urban Des. 2015, 20, 393–410. [Google Scholar] [CrossRef]
- Yin, L.; Wang, Z. Measuring Visual Enclosure for Street Walkability Using Machine Learning Algorithms and Google Street View Imagery. Appl. Geogr. 2016, 76, 147–153. [Google Scholar] [CrossRef]
- Jeon, J.; Woo, A. Deep Learning Analysis of Street Panorama Images to Evaluate the Streetscape Walkability of Neighborhoods for Subsidized Families in Seoul, Korea. Landsc. Urban Plan. 2023, 230, 104631. [Google Scholar] [CrossRef]
- Hölscher, C.; Brösamle, M.; Vrachliotis, G. Challenges in Multilevel Wayfinding: A Case Study with the Space Syntax Technique. Environ. Plan. B 2012, 39, 63–82. [Google Scholar] [CrossRef]
- Fang, Y.-N.; Tian, J.; Namaiti, A.; Zhang, S.; Zeng, J.; Zhu, X. Visual Aesthetic Quality Assessment of the Streetscape from the Perspective of Landscape–Perception Coupling. Environ. Impact Assess. Rev. 2024, 106, 107535. [Google Scholar] [CrossRef]
- Wilber, D.N. Persian Gardens and Garden Pavilions, 2nd ed.; Dumbarton Oaks: Washington, DC, USA, 1979. [Google Scholar]
- Ewing, R.; Clemente, O. Measuring Urban Design: Metrics for Livable Places; Island Press: Washington, DC, USA, 2013. [Google Scholar]
- Helbich, M.; Yao, Y.; Liu, Y.; Zhang, J.; Liu, P.; Wang, R. Using Deep Learning to Examine Street View Green and Blue Spaces and Their Associations with Mental Health. Environ. Int. 2019, 126, 136–145. [Google Scholar] [CrossRef]
- Mehrinejad Khotbehsara, E.; Somasundaraswaran, K.; Kolbe-Alexander, T.; Yu, R. The Influence of Spatial Configuration on Pedestrian Movement Behaviour in Commercial Streets of Low-Density Cities. Ain Shams Eng. J. 2025. [Google Scholar] [CrossRef]
- Dubey, A.; Naik, N.; Parikh, D.; Raskar, R.; Hidalgo, C.A. Deep Learning the City: Quantifying Urban Perception at a Global Scale. In Proceedings of the European Conference on Computer Vision (ECCV), 2016.
- Fan, Z.; Feng, C.; Biljecki, F. Coverage and Bias of Street View Imagery in Mapping the Urban Environment. Comput. Environ. Urban Syst. 2025, 117, 102253. [Google Scholar] [CrossRef]
- Zhou, B.; He, S.; Cai, Y.; Wang, M.; Su, S. Social Inequalities in Neighborhood Visual Walkability: Using Street View Imagery and Deep Learning. Sustain. Cities Soc. 2019, 50, 101605. [Google Scholar] [CrossRef]
- Cantacuzino, S.; Bowne, K. Special Issue: Isfahan. Archit. Rev. 1976, 159, 255–321. [Google Scholar]
- Nasar, J.L. The Evaluative Image of the City; Sage Publications: Thousand Oaks, CA, USA, 1998. [Google Scholar]









| Indicators | Visual elements | Mean | Max | Min | S.D. | |
| Classes in Dataset | 1 | Sky | .04 | 0.341271 | 0 | .07 |
| 2 | Buildings (Bn) | .15 | 0.604509 | 0 | .13 | |
| 3 | Trees with columnar shape | .17 | 0.451022063 | 0 | .11 | |
| 4 | Trees with oval shape | .19 | 0.502243287 | 0.017691326 | .13 | |
| 5 | Palm trees | .007 | 0.214644 | 0 | .04 | |
| 6 | Plant cover | .05 | 0.187119872 | 0 | .03 | |
| 7 | Water | .013 | 0.251371126 | 0 | .04 | |
| 8 | Wall (Bn) | .002 | 0.712366595 | 0 | .06 | |
| 9 | Fence (Fn) | .01 | 0.249894023 | 0 | .02 | |
| 10 | Symbol (Sn) | .002 | 0.46962293 | 0 | .03 | |
| 11 | Sun shade (Sn) | .001 | 0.096984 | 0 | .02 | |
| 12 | Pavement (PVn) | .27 | 0.483494471 | 0.028383866 | .13 | |
| 13 | Person in Group (PDn) | .003 | 0.238764738 | 0 | .03 | |
| 14 | Person Standing (PDn) | .03 | 0.426627789 | 0 | .04 | |
| 15 | Person Sitting (PDn) | .003 | 0.091998322 | 0 | .01 | |
| 16 | Shops Shutter (Sn) | .0055 | 0.516099 | 0 | .03 | |
| 17 | Bench and Food stands (Sn) | .005 | 0.186044 | 0 | .03 | |
| 18 | Window (Sn) | .0008 | 0.387737 | 0 | .02 | |
| 19 | Door (Sn) | .007 | 0.15860 | 0 | .03 | |
| 20 | Motorbike | .006 | 0.131518784 | 0 | .03 | |
| 21 | Bike | .0005 | 0.204095 | 0 | .01 | |
| 22 | Car | .003 | 0.029565 | 0 | .03 |
| Aspect | Previous Study | Current Study (This Manuscript) |
|---|---|---|
| Concept & Method | Emphasized landmarks, architectural memorability, and street aesthetics as drivers of spatial imageability and walkability. Based on surveys, expert opinion, and field observation (subjective assessment). | Extends prior focus by quantifying visual components (buildings, trees, pedestrians) through semantic segmentation. Uses pixel-based ratios to construct an imageability index. Non-historic edges were down-weighted in formula. |
| Findings | Imageability linked to symbolic/memorable architecture and pedestrian density. Chahar Bagh benefited from shade and water, enhancing spatial appeal. Landmark buildings acted as termination/orientation points. | Imageability shows trade-offs with other elements; landmark buildings remain decisive (visual termination, contrast, orientation). Negative correlation with trees (r = –0.325) suggests their inclusion in the index requires contextual interpretation. |
| Innovation | Recognized weaker imageability in streets with plain façades (e.g., Chahar Bagh, Sepah) | Combines perception-based indices with pixel data, showing that façade richness, historic character, and functional trees enhance imageability in heritage-sensitive design. Future work can refine landmark detection in segmentation models. |
| Aspect | Previous Study | Current Study (This Manuscript) |
|---|---|---|
| Concept & Method | Discussed enclosure elements only implicitly—through shading, tree canopies, and spatial continuity. No formal definition or measurement of enclosure; pedestrian surveys could not directly capture it. | Pixel-based segmentation quantifies proportions of buildings, trees, fences, pavement, walls, and sky to compute a measurable enclosure index. |
| Findings | Key excerpts point to enclosure-related elements: buildings define edges; fragmented façades reduce continuity; trees add comfort but without measurement. | In heritage streets, enclosure tends to be low where façades are discontinuous. Tree morphology (oval vs. columnar canopies) strongly shapes enclosure, while greater sky visibility signals weaker enclosure. |
| Innovation | Lacked quantification—treated enclosure as theoretical. | Proposes a refined enclosure metric as an image-based index that integrates vegetation, vertical density, and sky exposure. Links enclosure with walkability and street width, highlighting pedestrian-friendly design needs. |
| Aspect | Previous Study | Current Study (This Manuscript) |
|---|---|---|
| Concept & Method | Based on visual analysis, pedestrian surveys, and expert questionnaires. Attention given to benches, cafés, shopfronts, and street furniture. | Semantic segmentation of street images, focusing on spatial proportions and presence of human-scaled elements (façade continuity, windows, seating, shading). |
| Findings | Human scale enhanced by commercial frontage, urban furniture, shade, and active edges (cafés, vans). Elements at eye level encouraged lingering. | Dataset lacked consistent visual capture of furniture, cafés, signage. Windows often undetected due to absence in many images. Human scale therefore depended strongly on spatial framing and façade continuity. |
| Innovation | Addressed design factors qualitatively but without quantification. | Among the first attempts to quantify human scale pixel-wise. Shows its dependence on enclosure and continuous building façades, confirming links between street wall definition and pedestrian comfort. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).





