Submitted:
30 May 2025
Posted:
30 May 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Historical Value and Digital Transformation
1.2. From GANs to Diffusion: Technological Foundations from a Typological Perspective
1.3. Phenomenological Emulation via Deep Learning
1.4. Research Objectives and Contributions
2. Materials and Methods
2.1. Case-Study Selection: Krakow’s Eclectic Facades
2.2. Image Data Acquisition and Preprocessing
2.2.1. Initial Collection and Screening Criteria for Image Samples
2.2.2. Typology-Based Label Generation and Keyword Optimisation
2.3. Typological Transcoding Framework
2.3.1. Brief Introduction to Diffusion Models and LoRA Technology
Diffusion Models
Low-Rank Adaptation
2.3.2. LoRA Model Training Workflow and Key Parameter Regulation
2.3.3. The Guiding Role of Typological Theory in Training and Inference Processes
3. Results and Analysis
3.1. Influence of LoRA Model Parameters on Stylistic Generation Outcomes
LoRA Loss and Weight Tuning for Style Transfer
3.2. Comparison of Generated Facade Styles
3.2.1. Quantitative Metrics
3.2.2. Qualitative Evaluation by Expert Panel
4. Discussion
4.1. Interpreting Stylistic Learning in AI Models
4.1.1. Progressive Learning in LoRA Models
4.1.2. Correlation with Architectural Typological Theory
4.2. Methodological Efficacy and Value
4.2.1. Handling Complex, Data-Scarce Styles
4.2.2. The Role of Typological Guidance in Enhancing Generation Quality and Interpretability
4.2.3. Potential Enhancements to Traditional 'Stylistic Emulation' Practices
4.3. Limitations of the Study
4.3.1. Predominant Focus on 2D Facades and Significant Challenges in 3D Modeling
4.3.2. Limitations in AI's Understanding of Deep Structural Logic and Functional Organization
4.3.3. Deficiencies in the Profound Transcoding of Specific Regional Cultural Connotations
4.4. Future Prospects and Research Directions
4.4.1. Advancing Towards Intelligent Architectural Generation: Integrating Deeper Typological Knowledge
4.4.2. Extension to Three-Dimensional Architectural Modeling and Urban Design
4.4.3. Application Potential in Virtual Reality, Digital Cultural Heritage, and Related Fields
5. Conclusions
6. Patents
Author Contributions
Acknowledgments
Conflicts of Interest
References
- Petzet, M.; Ziesemer, J., eds. International Charters for Conservation and Restoration / Chartes …; Monuments & Sites 1; ICOMOS, Paris, France; Lipp GmbH: Munich, Germany, 2004; ISBN 3 - 87490 - 676 - 0.
- Jokilehto, J. A History of Architectural Conservation, 2nd ed.; Routledge: London, UK, 2017; ISBN 978 - 1 - 138 - 63999 - 7. [Google Scholar]
- Boccardi, G. Authenticity in the heritage context: A reflection beyond the Nara Document. The Historic Environment: Policy & Practice. 2019, 10, 4–18. [Google Scholar] [CrossRef]
- He, M.; Qi, J. Study on the theory of Rafael Moneo architectural typology. IOP Conference Series: Materials Science and Engineering. 2019, 592, 012105. [Google Scholar] [CrossRef]
- Plevoets, B.; Van Cleempoel, K. Adaptive Reuse of the Built Heritage: Concepts and Cases of an Emerging Discipline; Routledge: London, UK, 2019; 256 pp.; ISBN 978 - 1 - 138 - 06276 - 4. [Google Scholar]
- Plevoets, B.; Van Cleempoel, K. Adaptive reuse as a strategy towards conservation of cultural heritage: A literature review. WIT Transactions on The Built Environment. 2011, 118, 155–164. [Google Scholar] [CrossRef]
- Tang, Q.; Liang, H.; Li, J.; et al. Innovative design method for Lingnan region veranda architectural heritage (Qi - Lou) façades based on computer vision. Buildings 2025, 15, 368. [Google Scholar] [CrossRef]
- Yuan, F.; Xu, X.; Wang, Y. Toward the era of generative - AI - augmented design. Archit. J. (China) 2023, 659, 14–20. (In Chinese).
- Yang, J.; Tan, M.; Chen, X.; et al. Exploration of theories and technical mechanisms for smart city planning. J. Southeast Univ. (Nat. Sci. Ed.) 2024, 54, 1066–1079. (In Chinese) [Google Scholar]
- Csiszár, I. The method of types. IEEE Transactions on Information Theory 1998, 44, 2505–2523. [Google Scholar] [CrossRef]
- Rossi, A. The Architecture of the City; MIT Press: Cambridge, MA, USA, 1984. [Google Scholar]
- Viollet - le - Duc, E. - E. Dictionnaire raisonné de l’architecture française du XIe au XVIe siècle, Vol. 1; B. Bance: Paris, France, 1854. [Google Scholar]
- Bressani, M. Architecture and the Historical Imagination: Eugène - Emmanuel Viollet - le - Duc, 1814–1879; Routledge: London, UK, 2016. [Google Scholar]
- Bressani, M. Notes on Viollet - le - Duc’s philosophy of history: Dialectics and technology. The Journal of the Society of Architectural Historians. 1989, 48, 327–350. [Google Scholar] [CrossRef]
- Bonini, E. Building virtual cultural heritage environments: The embodied mind at the core of the learning processes. International journal of digital culture and electronic tourism. 2008, 1, 113–125. [Google Scholar] [CrossRef]
- Zhong, H.; Wang, L.; Zhang, H. The application of virtual reality technology in the digital preservation of cultural heritage. Computer Science and Information Systems. 2021, 18, 535–551. [Google Scholar] [CrossRef]
- Selmanović, E.; Rizvic, S.; Harvey, C.; et al. Improving accessibility to intangible cultural heritage preservation using virtual reality. Journal on Computing and Cultural Heritage (JOCCH). 2020, 13, 13. [Google Scholar] [CrossRef]
- Barceló, J.A.; Forte, M.; Sanders, D.H. (Eds.) Virtual Reality in Archaeology; BAR Int. Ser. 843; Archaeopress: Oxford, UK, 2000; ISBN 978 - 1841710471. [Google Scholar]
- Santana Quintero, M.; Georgopoulos, A.; Stylianidis, E.; Lerma García, J.L.; Remondino, F. CIPA’s mission: Digitally documenting cultural heritage. APT Bulletin: The Journal of Preservation Technology. 2017, 48, 51–54. [Google Scholar]
- Poyck, G. Procedural City Generation with Combined Architectures for Real - Time Visualization; Master’s Thesis, Clemson University, Clemson, SC, USA, May 2023. [Google Scholar]
- Goodfellow, I.; Pouget - Abadie, J.; Mirza, M.; et al. Generative adversarial nets. Advances in neural information processing systems. 2014, 27, 2672–2680. [Google Scholar]
- Sohl - Dickstein, J.; Weiss, E.; Maheswaranathan, N.; Ganguli, S. Deep unsupervised learning using nonequilibrium thermodynamics. International conference on machine learning. pmlr; Lille, France, 6 - 11 July 2015; pp. 2256 - 2265. arXiv:1503.03585.
- Hinton, G.E.; Salakhutdinov, R.R. Reducing the dimensionality of data with neural networks. Science 2006, 313, 504–507. [Google Scholar] [CrossRef] [PubMed]
- Hinton, G.E. A fast learning algorithm for deep belief nets. Neural computation. 2006, 18, 1527–1554. [Google Scholar] [CrossRef]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Advances in neural information processing systems 2012, 25, 1097–1105. [Google Scholar] [CrossRef]
- Gulrajani, I.; Ahmed, F.; Arjovsky, M.; et al. Improved training of Wasserstein GANs. Advances in neural information processing systems. 2017, 30, 5767–5777. [Google Scholar] [CrossRef]
- Brock, A.; Donahue, J.; Simonyan, K. Large - scale GAN training for high - fidelity natural image synthesis. arXiv:1809.11096 (preprint).
- Radford, A.; Metz, L.; Chintala, S. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv:1511.06434 (preprint).
- Karras, T.; Aila, T.; Laine, S.; Lehtinen, J. Progressive growing of GANs for improved quality, stability, and variation. arXiv:1710.10196 (preprint).
- Zhu, J.Y.; Park, T.; Isola, P.; Efros, A.A. Unpaired image - to - image translation using cycle - consistent adversarial networks. Proceedings of the IEEE international conference on computer vision; Venice, Italy, 22 - 29 Oct 2017; pp. 2223 - 2232.
- Bachl, M.; Ferreira, D.C. City - GAN: Learning architectural styles using a custom conditional GAN architecture. arXiv:1907.05280 (preprint).
- Dhariwal, P.; Nichol, A. Diffusion models beat GANs on image synthesis. Advances in neural information processing systems. 2021, 34, 8780–8794. [Google Scholar]
- Ho, J.; Jain, A.; Abbeel, P. Denoising diffusion probabilistic models. Advances in neural information processing systems. 2020, 33, 6840 - 6851.
- Rombach, R.; Blattmann, A.; Lorenz, D.; et al. High - resolution image synthesis with latent diffusion models. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition; New Orleans, LA, USA, 19 - 24 Jun 2022; pp. 10684 - 10695.
- Radford, A.; Kim, J.W.; Hallacy, C.; et al. Learning transferable visual models from natural language supervision. International conference on machine learning. PmLR, 18 - 24 Jul 2021; pp. 8748 - 8763.
- Zhang, L.; Rao, A.; Agrawala, M.; et al. Adding conditional control to text - to - image diffusion models. Proceedings of the IEEE/CVF international conference on computer vision; Paris, France, 2 - 6 Oct 2023; pp. 3836 - 3847.
- Hu, E.J.; Shen, Y.; Wallis, P.; et al. LoRA: Low - rank adaptation of large language models. In Proc. 10th ICLR; Kigali, Rwanda, 25 - 29 Apr 2022.
- Huang, W.; Zheng, H. Architectural drawings recognition and generation through machine learning. Proceedings of the 38th annual conference of the association for computer aided design in architecture; Mexico City, Mexico, 17 - 20 Oct 2018; pp. 616 - 625.
- Nauata, N.; Chang, K.H.; Cheng, C.Y.; et al. House - GAN: Relational generative adversarial networks for graph - constrained house layout generation. Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings; Glasgow, UK, 23 - 28 Aug 2020; pp. 162 - 177.
- Sun, C.; Zhou, Y.; Han, Y. Automatic generation of architecture façade for historical urban renovation using generative adversarial network. Building and Environment. 2022, 212, 108781. [Google Scholar] [CrossRef]
- Zhang, L.; Huang, Y.; Li, Z.; et al. CGAN - assisted renovation of the styles and features of street façades—A case study of the Wuyi area in Fujian, China. Sustainability 2022, 14, 16575. [Google Scholar] [CrossRef]
- Xu, S.; Zhang, J.; Li, Y. Knowledge - driven and diffusion model - based methods for generating historical building façades: A case study of traditional Minnan residences in China. Information 2024, 15, 344. [Google Scholar] [CrossRef]
- Zhang, J.; Huang, Y.; Li, Z.; et al. Development of a method for commercial style transfer of historical architectural façades based on stable diffusion models. Journal of Imaging 2024, 10, 165. [Google Scholar] [CrossRef] [PubMed]
- Xu, Z.; Wang, L. Reshaping classicism—An Abnormal Landscape of Paestum and the rise of Neoclassicism. World Architure. 2023, 3, 110–115. (in Chinese). [Google Scholar]
- Houlsby, N.; Giurgiu, A.; Jastrzebski, S.; et al. Parameter - efficient transfer learning for NLP. International conference on machine learning. PMLR; Long Beach, CA, USA, 9 - 15 Jun 2019; pp. 2790 - 2799.
- Dietterich, T.G. Overfitting and underfitting in machine learning. ACM computing surveys (CSUR)–1995 Workshop on Overfitting; Seattle, WA, USA, 5 - 9 Aug 1995; pp. 114 - 122.
- LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient - based learning applied to document recognition. Proceedings of the IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef]
- Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016; ISBN 978 - 0262035613. [Google Scholar]
- Heusel, M.; Ramsauer, H.; Unterthiner, T.; et al. GANs trained by a two time - scale update rule converge to a local Nash equilibrium. Advances in neural information processing systems. 2017, 30, 6626–6637. [Google Scholar]
- Zhang, R.; Isola, P.; Efros, A.A.; et al. The unreasonable effectiveness of deep features as a perceptual metric. Proceedings of the IEEE conference on computer vision and pattern recognition.; Salt Lake City, UT, USA, 18 - 23 Jun 2018; pp. 586 - 595.
- Hore, A.; Ziou, D. Image quality metrics: PSNR vs. SSIM. 2010 20th international conference on pattern recognition. IEEE; Istanbul, Turkey, 23 - 26 Aug 2010; pp. 2366 - 2369. [CrossRef]
- Akbar, P.N.G.; Edelenbos, J. Positioning place - making as a social process: A systematic literature review. Cogent Social Sciences. 2021, 7, 1905920. [Google Scholar] [CrossRef]
- Zhang, R.; Guo, Z.; Wei, Z.; et al. PointCLIP: Point cloud understanding by CLIP. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; New Orleans, LA, USA, 19-24 Jun 2022; pp. 5790 - 5800. [CrossRef]
- Wu, S.; et al. Direct3D: Scalable image - to - 3D generation via 3D latent diffusion transformer. arXiv:2405.14832 (preprint).
- Tochilkin, D.; et al. TriPoSR: Fast 3D object reconstruction from a single image. arXiv:2403.02151 (preprint).
- Xiang, J.; et al. Structured 3D latents for scalable and versatile 3D generation. arXiv:2412.01506 (preprint).
- Liu, J.; et al. A comprehensive survey on 3D content generation. arXiv:2402.01166 (preprint).






















| Model-train-type | pretrained-model | AE model | t5xxl model |
| flux-lora | flux1-dev.safetensors | ae.sft | t5xxl fp16.safetensors |
| clip-l | timestep sampling | model prediction type | Loss-type |
| Clip-l.safetensors | sigmoid | raw | I2 |
| resolution | save precision | Epochs | Batch Size |
| 1024,1024 | bf16 | 20 | 4 |
| GPU equipped | Learning Rate | unet Learning Rate | Text-encoder Learning Rate |
| NVIDIA RTX 4090 | 1e-4 | 5e-4 | 1e-5 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).