Submitted:
22 December 2025
Posted:
24 December 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- To build on the Visual Impression in Architectural Space (VIAS) multi-modal framework by integrating NLP-based verbal analysis with established theories of VIAS.
- To examine how linguistic and cultural differences influence the articulation of visual attractiveness, comfort, and engagement in VIAS of temporary events.
- To develop and validate an adaptive sentiment-weighted keyword extraction algorithm that mitigates interviewer bias, adjusts for per-participant verbosity, and provides replicable, objective scoring for qualitative interview data.
1.1. Outline of the Study
2. Background and Related Work
2.1. Visual Impression in Architectural Space: The Designer-User Perception Gap
2.2. A Cross-Cultural Approach using Multi-Modal Framework
2.3. Data-Driven and NLP-Based Analysis of User Perceptions
2.4. Opportunities and Challenges of Multi-Modal Frameworks
3. Methodology
3.1. Setting of Temporary Event Space
3.2. Stage-1: Behavior and Perception Design
3.2.1. Stall Layout (SL)
3.2.2. Stall Visibility
3.2.3. Advertise Strategy (AS)
3.3. Stage-2: Multi-Modal Interview Data Collection
3.3.1. Phase 1: On-Site Observation and Initial Interviews (August 2024)
3.3.2. Phase 2: Post-Event Video-Based Interviews and Virtual Interview with native and non-native (November-December 2024)
3.3.3. Phase 3: Video-based and Virtual Environment Interviews with only native (August–October 2025)
3.4. Stage-3: NLP Backed Integrated Data Interpretation
3.4.1. Keyword Detection and Categorisation
3.4.2. Sentiment-Aware Weighting
3.5. Stage-4: Multi-Modal Interview Analysis Design
3.6. Design Interpretation Modality Effects
4. NLP-Driven Data Interpretation with Sentiment-Aware Weighting
4.1. Baseline Fixed-Parameter Algorithm
4.1.1. Sample Weight Calculation
4.1.2. Aggregated Results by Category
4.1.3. Key Observations and Identified Limitations
4.2. Adaptive Per-Participant Weighting
4.2.1. How the Adaptive System Works
4.2.2. Implementation Parameters
4.3. Comparative Analysis
4.3.1. Impact of Adaptive Weighting on Keyword Scores
4.3.2. Effect on Participant Balance
4.3.3. Top Weighted Keywords by Category
4.4. Sentiment Reclassification Procedures

| Characteristic | Onsite | Video | Virtual |
|---|---|---|---|
| Total participants | 11 | 8 | 8 |
| Average response length (words) | 12.4 | 38.7 | 42.3 |
| Questions per participant | 8–12 | 17 | 9–12 |
| Average interview duration (min) | 5.2 | 16.0 | 12.5 |
| Positive sentiment ratio | 0.82 | 0.64 | 0.71 |
| Negative sentiment ratio | 0.06 | 0.18 | 0.15 |
| Neutral sentiment ratio | 0.12 | 0.18 | 0.14 |
| Unique keywords extracted | 47 | 142 | 98 |
| Keywords per participant | 4.3 | 17.8 | 12.3 |
| Onsite | Video | Virtual | |||
|---|---|---|---|---|---|
| Keyword | Weight | Keyword | Weight | Keyword | Weight |
| easy to notice | 8.2 | children | 17.0 | visible shops | 6.8 |
| suitable | 7.8 | seating area | 10.4 | seating space | 6.5 |
| yes/positive | 7.5 | goldfish scooping | 11.7 | rest space | 5.9 |
| 5.4 | flower shop | 10.9 | appropriate | 5.7 | |
| parking | 4.1 | shaved ice | 12.5 | families | 5.3 |
| signage | 3.9 | more shops | 10.1 | local products | 4.8 |
| vegetables | 3.6 | safety | 8.3 | community | 4.6 |
| relax | 3.2 | atmosphere | 8.0 | entrance | 4.2 |
4.4.1. Modality-Specific Keyword Distributions

5. Language and Questionnaire Structure Effects on Verbal Spatial Evaluation
5.1. Linguistic and Structural Influences on Verbal Responses
5.1.1. Interview Protocol Differences
5.1.2. Dataset Characteristics
5.1.3. Enhanced Weighting Function
5.1.4. Measured Improvements
5.2. Questionnaire Structure and Turn-Taking Stability
6. Architectural Implications and Spatial Design Outcomes
6.1. From Controlled Variables to Validated Experience
6.2. Interpreting Language with Analytical Rigour
6.3. Accounting for Linguistic and Cultural Variation in Spatial Evaluation
6.4. Framework Integration: From Evidence to Design Decision
- Stage-1 documents designer-controlled variables.
- Stage-3 weights participant-generated priorities while correcting for verbosity and sentiment bias.
- Stage-4 validates priorities across modalities and cultures.
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| NLP | Natural Language Processing |
| EV | Event |
| P | Participant |
| SL | Stall Layout |
| PV | Product Visibility |
| DS | Display Strategy |
| GPT | Generative Pre-trained Transformer |
| MDPI | Multidisciplinary Digital Publishing Institute |
| DOAJ | Directory of Open Access Journals |
| TLA | Three-Letter Acronym |
| LD | Linear Dichroism |
Appendix A. Summary of On-Site Interviews
| Attribute | Description |
|---|---|
| Interview duration | Approximately 5 minutes per participant. |
| Participants | 11 native respondents (6 female, 5 male), ages 20–60. |
| Interview focus | Immediate spatial impressions, perceived comfort, child-friendliness, and stall visibility. |
| Common findings | Participants reported high approachability of stalls, moderate heat-comfort concerns, and limited child-play features. |
| Frequent improvement themes | Additional dining areas, shaded resting zones, and clearer signage orientation. |
| Data availability | Full transcripts and native originals are available upon request (contact: khang.ntr@riko.shimane-u.ac.jp). |
- “More dining space would make it easier to relax.”
- “Children enjoy seeing stalls but lack space to play.”
- “Instagram photos match reality, but signage could be clearer on site.”
Appendix B. English Translation of Japanese Interview Questions
| No. | Question |
|---|---|
| 1 | Can you clearly identify all of the shops? |
| 2 | Are the pop-up displays and other visual features easy to see? |
| 3 | When watching the video, was there any shop you wanted to visit? Why? |
| 4 | Can you imagine the season or time of day based on the information about the shops and locations? |
| 5 | Does this look like a place that would be enjoyable to visit with children? |
| 6 | Is there anything or any area that feels dangerous? Why? |
| 7 | Does this look like a place you could enjoy with all five senses? |
| 8 | Can you predict the general flow or movement route of visitors? |
| 9 | Which type of visitors do you think would enjoy this place the most? |
| 10 | What could make this place more attractive? |
| 11 | If you could add one more shop, what kind would you choose? |
| 12 | Would you like to participate in this event? If so, who would you like to go with? |
| 13 | What are the good and bad aspects of the shop lineup, and why? |
| 14 | If you were the organizer, what would you change and why? (not limited to shops) |
| 15 | If you were to rank the places or shops you would like to visit, what would the order be and why? (not limited to shops) |
| 16 | If you happened to pass by this event, would it be easy to join in? |
| 17 | Would it be easy to participate alone? |
Appendix C. Summary of Video and Virtual Interview Datasets
Appendix C.1. Video-Based Interview Dataset
| Attribute | Description |
|---|---|
| Recording content | Full-length event video showing spatial configuration, stalls, and visitor movement. |
| Interview duration | Average 16 minutes per participant. |
| Participants | 8 Japanese adults (4 male, 4 female), ages 20–55. |
| Focus themes | Layout legibility, product visibility, perceived safety, child-friendliness, and spatial comfort. |
| Output format | Transcribed utterances with sentiment scores and categorized keywords (Japanese). |
| Data availability | See Supplementary File (contact: khang.ntr@riko.shimane-u.ac.jp) for the original interview content. |
Appendix C.2. Virtual Environment Interview Dataset
| Attribute | Description |
|---|---|
| Environment type | 3D reconstruction of the original event space based on participant feedback. |
| Interview duration | Average 12.5 minutes per participant. |
| Participants | Same as the video-based cohort (8 individuals). |
| Evaluation focus | Validation of proposed design improvements and perceptual comparison with the original event. |
| Key features tested | Signage placement, stall arrangement, circulation routes, and family-area inclusivity. |
| Data availability | See Supplementary File (contact: khang.ntr@riko.shimane-u.ac.jp) for the original interview content. |
Appendix C.3. Sample Data Excerpt
| Mode | Excerpt (Translated) | Category | Sentiment () | Weight () |
|---|---|---|---|---|
| Video | “The stall arrangement looks easy to follow.” | Layout Legibility | 0.85 | 0.80 |
| Video | “The drink bar seems too crowded.” | Crowd Flow / Comfort | 0.35 | 0.25 |
| Virtual | “The added signboard makes it clearer for first-time visitors.” | Signage Visibility | 0.95 | 0.90 |
| Virtual | “The child area feels safer now.” | Child-Friendly Zone | 0.90 | 0.85 |
References
- Osgood, C.E.; Suci, G.J.; Tannenbaum, P.H. The Measurement of Meaning; University of Illinois Press: Urbana, IL, 1957. [Google Scholar]
- Bansal, G.; Chamola, V.; Hussain, A.; Guizani, M.; Niyato, D. Transforming Conversations with AI—A Comprehensive Study of ChatGPT. Cognitive Computation 2024, 16, 2487–2510. [Google Scholar] [CrossRef]
- OpenAI. ChatGPT-4, 2024. Accessed online.
- Chen, M.; Yuan, Q.; Yang, C.; Zhang, Y. Decoding Urban Mobility: Application of Natural Language Processing and Machine Learning to Activity Pattern Recognition, Prediction, and Temporal Transferability Examination. IEEE Transactions on Intelligent Transportation Systems 2024, 25, 7151–7173. [Google Scholar] [CrossRef]
- Cantamessa, M.; Montagna, F.; Altavilla, S.; Casagrande-Seretti, A. Data-driven design: the new challenges of digitalization on product design and development. Design Science 2020, 6, e27. [Google Scholar] [CrossRef]
- Nguyen-Tran, Y.K.; Majiid, A.; Mian, R.u.h. Data-Driven Spatial Analysis: A Multi-Stage Framework to Enhance Temporary Event Space Attractiveness. World 2025, 6, 54. [Google Scholar] [CrossRef]
- Lynch, K.M. The Image of the City; MIT Press: MA, 1960. [Google Scholar]
- Cullen, G. The Concise Townscape; Architectural Press, 1995. [Google Scholar]
- Kaplan, R.; Kaplan, S. The Experience of Nature: A Psychological Perspective; Cambridge University Press: New York, NY, 1989. [Google Scholar]
- Stamps, A.E. Use of static and dynamic media to simulate environments: A meta-analysis. Perceptual and Motor Skills 2010, 111, 355–364. [Google Scholar] [CrossRef] [PubMed]
- Nitschke, G. Ma: The Japanese Sense of Place. Architectural Design 1966, 36, 116–156. [Google Scholar]
- Bognar, B. The Japan Style; Kodansha International: Tokyo, 1985. [Google Scholar]
- Nisbett, R.E.; Miyamoto, Y. The influence of culture: Holistic versus analytic perception. Trends in Cognitive Sciences 2005, 9, 467–473. [Google Scholar] [CrossRef]
- Braun, V.; Clarke, V. Using thematic analysis in psychology. Qualitative Research in Psychology 2006, 3, 77–101. [Google Scholar] [CrossRef]
- Strauss, A.; Corbin, J. Basics of Qualitative Research: Techniques and Procedures for Developing Grounded Theory; Sage Publications: Thousand Oaks, CA, 1998. [Google Scholar]
- Hallgren, K.A. Computing inter-rater reliability for observational data: An overview and tutorial. Tutorials in Quantitative Methods for Psychology 2012, 8, 23–34. [Google Scholar] [CrossRef]
- Hillier, B.; Hanson, J. The Social Logic of Space; Cambridge University Press: Cambridge, 1984. [Google Scholar]
- Benedikt, M.L. To Take Hold of Space: Isovists and Isovist Fields. Environment and Planning B: Planning and Design 1979, 6, 47–65. [Google Scholar] [CrossRef]
- Dubey, A.; Naik, N.; Parikh, D.; Raskar, R.; Hidalgo, C.A. Deep Learning the City: Quantifying Urban Perception at a Global Scale. In Proceedings of the European Conference on Computer Vision (ECCV); Springer, 2016; pp. 196–212. [Google Scholar]
- Gibson, J.J. The Ecological Approach to Visual Perception; Houghton Mifflin, 1979. [Google Scholar]
- Heft, H. Environment, cognition, and culture: Reconsidering the cognitive map. Journal of Environmental Psychology 2013, 33, 14–25. [Google Scholar] [CrossRef]
- Alberti, M.; Pahl, S. People’s perception of urban green spaces: A multisensory approach. Urban Forestry & Urban Greening 2018, 36, 104–113. [Google Scholar]
- Warren, W.H. Perceiving the environment: Theoretical perspectives on the ecological approach. Annual Review of Psychology 2019, 70, 1–24. [Google Scholar]
- Li, X.; Zhang, C.; Li, W. A review of visual methods in environmental perception research. Journal of Environmental Psychology 2020, 68, 101389. [Google Scholar]
- Gao, T.; Zhang, S. Using videos to elicit spatial perception: Effects of camera movement on spatial understanding. Environment and Behavior 2021, 53, 763–788. [Google Scholar]
- Spanlang, B.; et al. How to build an embodied VR system for spatial behavior studies. Frontiers in Psychology 2014, 5, 879. [Google Scholar]
- Kuliga, S.F.; Thrash, T.; Dalton, R.C.; Hölscher, C. Virtual reality as an empirical research tool — Exploring user experience in a real building and a corresponding virtual model. Computers, Environment and Urban Systems 2015, 54, 363–375. [Google Scholar] [CrossRef]
- Carrozzino, M.; Bergamasco, M. Beyond virtual museums: Experiencing immersive virtual reality in real contexts. Journal of Cultural Heritage 2010, 11, 452–458. [Google Scholar] [CrossRef]
- Chen, H.; Suhr, A.; Misra, D.K.; Snavely, N.; Artzi, Y. TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in Visual Street Environments. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018; pp. 12530–12539. [Google Scholar]
- Singh, M.; Basu, A.; Mandal, M.K. Event Dynamics Based Temporal Registration. IEEE Transactions on Multimedia 2007, 9, 1004–1015. [Google Scholar] [CrossRef]
- Morgan, M.A. Festival Spaces and the Visitor Experience; 2007. [Google Scholar]
- Watson, S.; Studdert, D.M. Markets as sites for social interaction: Spaces of diversity; 2006. [Google Scholar]
- OpenAI. ChatGPT: Optimizing Language Models for Dialogue. 2023. Available online: https://openai.com/chatgpt Accessed: [2025-02-1=22].
- Anthropic. Claude: A Next-Generation AI Assistant. 2023. Available online: https://www.anthropic.com/claude Accessed: [2025-02-1=22].
- Google. Gemini: Multimodal AI Model by Google. 2023. Available online: https://blog.google/technology/ai/google-gemini-ai/ Accessed: [2025-02-1=22].
- DeepSeek. DeepSeek: Advanced AI for Natural Language Processing. 2023. Available online: https://www.deepseek.com Accessed: [2025-02-1=22].
- Microsoft. GitHub Copilot: Your AI Pair Programmer. 2023. Available online: https://github.com/features/copilot Accessed: [2025-02-1=22].
- Campos, R.; Mangaravite, V.; Pasquali, A.; Jorge, A.M.; Nunes, C.; Jatowt, A. YAKE! Keyword extraction from single documents using multiple local features. Information Sciences 2020, 509, 257–289. [Google Scholar] [CrossRef]
- Rose, S.; Engel, D.; Cramer, N.; Cowley, W. Automatic keyword extraction from individual documents. In Proceedings of the Text Mining: Applications and Theory; John Wiley & Sons, Ltd, 2010; pp. 1–20. [Google Scholar]
- Mihalcea, R.; Tarau, P. TextRank: Bringing order into text. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing; 2004; pp. 404–411. [Google Scholar]
- Karlgren, J.; Li, R.; Meyersson Milgrom, E.M. Text Mining for Processing Interview Data in Computational Social Science. arXiv 2020, arXiv:2011.14037. [Google Scholar] [CrossRef]
- Ushio, A.; Liberatore, F.; Camacho-Collados, J. Back to the Basics: A Quantitative Analysis of Statistical and Graph-Based Term Weighting Schemes for Keyword Extraction. arXiv 2021, arXiv:2104.08028. [Google Scholar] [CrossRef]
- Wang, X.; Zhang, L.; Klabjan, D. Keyword-based Topic Modeling and Keyword Selection. arXiv 2020, arXiv:2001.07866. [Google Scholar] [CrossRef]
- Salton, G.; Buckley, C. Term-weighting approaches in automatic text retrieval. Information Processing & Management 1988, 24, 513–523. [Google Scholar] [CrossRef]
- Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent Dirichlet Allocation. Journal of Machine Learning Research 2003, 3, 993–1022. [Google Scholar]
- Grootendorst, M. KeyBERT: Minimal keyword extraction with BERT. arXiv preprint arXiv:2010.11918 2020. Accessed: 2025-02-03.
- Liu, B. Sentiment Analysis and Opinion Mining; Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, 2012. [Google Scholar]
- Narahara, T. Kurashiki Viewer: Qualitative Evaluations of Architectural Spaces inside Virtual Reality. In Proceedings of the International Conference for The Association for Computer Aided Architectural Design Research in Asia (CAADRIA); 2022; Vol. 1, pp. 11–18. [Google Scholar]
- and, V.M. Look Closely and You Will See, Listen Carefully and You Will Hear: Urban Design and Social Interaction on Streets. Journal of Urban Design 2009, 14, 29–64. [CrossRef]
- Gehl, J.; Svarre, B. How to Study Public Life; Island Press: Washington, DC, 2013. [Google Scholar]
- nana, media. Imagine Coffee Morning Market] Held once a month! Handmade sweets and fresh vegetables lined up | Matsue City | na-na (Nana) | Tottori and Shimane gourmet, outing, and lifestyle magazine — na-na.media. 2023. Available online: https://na-na.media/imagine-coffee-morning/ (accessed on 09-02-2025).
- instagram. imagine coffee. 2025. Available online: https://www.instagram.com/imagine_coffee_/ (accessed on 15-12-2025).
- Rezaei, N.; Mirzaei, R.; Abbasi, R. A study on motivation differences among traditional festival visitors based on demographic characteristics, case study: Gol-Ghaltan festival, Iran. Journal of Convention & Event Tourism 2018, 19, 120–137. [Google Scholar] [CrossRef]
- Walters, T.; Insch, A. How community event narratives contribute to place branding. Journal of Place Management and Development 2018, 11, 130–144. [Google Scholar] [CrossRef]










| Q# | Category | Speaker | Mention # | ||||
|---|---|---|---|---|---|---|---|
| 1 | Information | Interviewer | 0 | – | – | – | 0.00 |
| 1 | Information | Participant | 1 | 1 | 1.00 | 1.0 | 1.00 |
| 2 | Information | Interviewer | 0 | – | – | – | 0.00 |
| 2 | Information | Participant | 1 | 2 | 0.75 | 1.0 | 0.75 |
| 5 | Activity | Interviewer | 0 | – | – | – | 0.00 |
| 5 | Activity | Participant | 1 | 1 | 1.00 | 1.0 | 1.00 |
| 6 | Activity | Interviewer | 0 | – | – | – | 0.00 |
| 6 | Activity | Participant | 1 | 2 | 0.75 | 0.5 | 0.38 |
| 6 | Activity | Interviewer | 0 | – | – | – | 0.00 |
| 6 | Activity | Participant | 1 | 3 | 0.50 | 0.0 | 0.00 |
| Participant 1 Raw Score (partial): | 3.13 | ||||||
| Keyword | Category | P1 | P2 | P5 | P7 | Mean | Change |
|---|---|---|---|---|---|---|---|
| Baseline Algorithm (fixed decay) | |||||||
| shaved ice | Information | 2.25 | 1.75 | 1.50 | 1.00 | 1.63 | – |
| children | Activity | 2.50 | 2.00 | 1.25 | 2.75 | 2.13 | – |
| flower shop | Information | 1.75 | 2.25 | 1.75 | 1.50 | 1.81 | – |
| seating area | Impression | 1.00 | 1.50 | 0.75 | 2.00 | 1.31 | – |
| Adaptive Algorithm (participant-specific decay + sentiment) | |||||||
| shaved ice | Information | 2.18 | 1.52 | 1.58 | 0.94 | 1.56 | -4.3% |
| children | Activity | 2.35 | 1.76 | 1.38 | 2.51 | 2.00 | -6.1% |
| flower shop | Information | 1.68 | 2.02 | 1.82 | 1.41 | 1.73 | -4.4% |
| seating area | Impression | 0.88 | 1.26 | 0.81 | 1.84 | 1.20 | -8.4% |
| Category | Keyword | Aggregate Weight |
|---|---|---|
| Information | shaved ice shop | 12.48 |
| flower shop | 10.92 | |
| hot snacks | 9.36 | |
| signage/displays | 7.80 | |
| summer atmosphere | 6.24 | |
| Activity | children/child-friendly | 17.00 |
| couples | 11.56 | |
| seating/benches | 10.40 | |
| five senses | 9.88 | |
| safety concerns | 8.32 | |
| Impression | goldfish scooping | 11.70 |
| more shops needed | 10.14 | |
| seating area | 9.60 | |
| food variety | 8.58 | |
| decorations | 7.02 | |
| NA (general) | casual visit | 8.84 |
| welcoming atmosphere | 8.32 | |
| easy to drop by | 7.28 | |
| local residents | 6.76 | |
| alone-friendly | 5.20 |
| Metric | EN–Video (P1) | EN–Virtual (P1) | JP–Video (P2) | JP–Virtual (P2) | Onsite (JP) |
|---|---|---|---|---|---|
| Participants (n) | 5 | 5 | 8 | 8 | 14 |
| Total turns | 410 | 230 | 274 | 160 | 269 |
| Avg. response length | 11.8 words | 13.5 words | 73.4 chars | 83.0 chars | 12.0 words |
| Avg. duration (min) | 9.2 | 11.0 | 16.0 | 12.5 | 5.2 |
| Positive sentiment ratio | 0.61 | 0.58 | 0.64 | 0.71 | 0.82 |
| Neutral sentiment ratio | 0.22 | 0.25 | 0.18 | 0.14 | 0.12 |
| Negative sentiment ratio | 0.17 | 0.17 | 0.18 | 0.15 | 0.06 |
| Unique keywords* | 112 | 97 | 142 | 98 | 47 |
| Keywords per participant | 15.3 | 12.8 | 17.8 | 12.3 | 4.3 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).