Submitted:
18 July 2025
Posted:
21 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Data Processing and Algorithm Design
2.1. Study Area and Data Source
2.2. Preprocessing
2.3. Algorithm
2.3.1. Auxiliary Branch Based on Mamba


2.3.2. Multiscale Attention Feature Fusion Module
- –
- extracts features related to the current semantic basic unit to be attended to
- –
- extracts features related to how other basic units are attended to
- –
- extracts the actual semantic content delivered Backpropagation adjusts the different mapping matrices. Calculate the similarity of each basic unit with and all units with , then weight the aggregated to obtain the attention weights:
2.3.3. Model Enhancement
- Loss Function
- 2.
- Optimizer
- 3.
- Predictive reprocessing
3. Feature Extraction Process and Implementation
3.1. Dataset Labeling

3.2. Parameter
3.3. Model Performance Evaluation
3.4. Realization of Results
| User\Reference Class | Background | Building | Sum |
|---|---|---|---|
| Background | 5038452 | 177017 | 5215469 |
| Building | 150224 | 578513 | 728737 |
| Sum | 5188676 | 755530 |
4. Discussion
4.1. Comparison of Base Algorithm Accuracy
4.2. Local Visualization Analysis

4.3. Ablation Experiments

5. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Liu, C.; Xu, M. Characteristics and Influencing Factors on the Hollowing of Traditional Villages—Taking 2645 Villages from the Chinese Traditional Village Catalogue (Batch 5) as an Example. Int. J. Environ. Res. Public. Health 2021, 18, 12759. [Google Scholar] [CrossRef] [PubMed]
- Smith, G. The Hollow State: Rural Governance in China. China Q. 2010, 203, 601–618. [Google Scholar] [CrossRef]
- Comprehensive Evaluation of Rural Courtyard Utilization Efficiency: A Case Study in Shandong Province, Eastern China | Journal of Mountain Science. Available online: https://link.springer.com/article/10.1007/s11629-019-5824-x (accessed on 16 July 2025).
- Characteristics and Influencing Factors on the Hollowing of Traditional Villages—Taking 2645 Villages from the Chinese Traditional Village Catalogue (Batch 5) as an Example. Available online: https://www.mdpi.com/1660-4601/18/23/12759 (accessed on 16 July 2025).
- Liu, Y.; Liu, Y.; Chen, Y.; Long, H. The Process and Driving Forces of Rural Hollowing in China under Rapid Urbanization. J. Geogr. Sci. 2010, 20, 876–888. [Google Scholar] [CrossRef]
- Sun, H.; Liu, Y.; Xu, K. Hollow Villages and Rural Restructuring in Major Rural Regions of China: A Case Study of Yucheng City, Shandong Province. Chin. Geogr. Sci. 2011, 21, 354–363. [Google Scholar] [CrossRef]
- Li, S. The Renewal of Hollow Village A Research and Proposal of Erhai Area in YunNan, China 2014.
- Sun, H.; Liu, Y.; Xu, K. Hollow Villages and Rural Restructuring in Major Rural Regions of China: A Case Study of Yucheng City, Shandong Province. Chin. Geogr. Sci. 2011, 21, 354–363. [Google Scholar] [CrossRef]
- Chen, Y.; Wang, Y.; Xiong, S.; Lu, X.; Zhu, X.X.; Mou, L. Integrating Detailed Features and Global Contexts for Semantic Segmentation in Ultra-High-Resolution Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2024. [Google Scholar]
- Zeng, Q.; Zhou, J.; Tao, J.; Chen, L.; Niu, X.; Zhang, Y. Multiscale Global Context Network for Semantic Segmentation of High-Resolution Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2024, 62, 1–13. [Google Scholar] [CrossRef]
- Wu, Z.; Li, J.; Wang, Y.; Hu, Z.; Molinier, M. Self-Attentive Generative Adversarial Network for Cloud Detection in High Resolution Remote Sensing Images. IEEE Geosci. Remote Sens. Lett. 2020, 17, 1792–1796. [Google Scholar] [CrossRef]
- Fan, R.; Wang, L.; Feng, R.; Zhu, Y. Attention Based Residual Network for High-Resolution Remote Sensing Imagery Scene Classification. In Proceedings of the IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium, Yokohama, Japan, July 2019; IEEE; pp. 1346–1349. [Google Scholar]
- Ji, H.; Li, X.; Wei, X.; Liu, W.; Zhang, L.; Wang, L. Mapping 10-m Resolution Rural Settlements Using Multi-Source Remote Sensing Datasets with the Google Earth Engine Platform. Remote Sens. 2020, 12, 2832. [Google Scholar] [CrossRef]
- Li Zheng. Research on Key Technology for Acquiring Building Information of Hollow Village Based on UAV High Resolution Imagery [D]. Southwest Jiaotong University,2018.in Chinese (accessed on 16 July 2025).
- Chiu, W.-T.; Lin, C.-H.; Jhu, C.-L.; Lin, C.; Chen, Y.-C.; Huang, M.-J.; Liu, W.-M. Semantic Segmentation of Lotus Leaves in UAV Aerial Images via U-Net and DeepLab-Based Networks. In Proceedings of the 2020 International Computer Symposium (ICS), Tainan, Taiwan, December 2020; IEEE; pp. 535–540. [Google Scholar]
- Qian, Z.; Cao, Y.; Shi, Z.; Qiu, L.; Shi, C. A Semantic Segmentation Method for Remote Sensing Images Based on Deeplab V3. In Proceedings of the 2021 2nd International Conference on Big Data & Artificial Intelligence & Software Engineering (ICBASE), Zhuhai, China, September 2021; IEEE; pp. 396–400. [Google Scholar]
- Zhang, R.; Zhang, Q.; Zhang, G. LSRFormer: Efficient Transformer Supply Convolutional Neural Networks With Global Information for Aerial Image Segmentation. IEEE Trans. Geosci. Remote Sens. 2024, 62, 1–13. [Google Scholar] [CrossRef]
- He, X.; Zhou, Y.; Zhao, J.; Zhang, D.; Yao, R.; Xue, Y. Swin Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–15. [Google Scholar] [CrossRef]
- Wu, H.; Huang, P.; Zhang, M.; Tang, W.; Yu, X. CMTFNet: CNN and Multiscale Transformer Fusion Network for Remote-Sensing Image Semantic Segmentation. IEEE Trans. Geosci. Remote Sens. 2023, 61, 1–12. [Google Scholar] [CrossRef]
- Wang, T.; Chen, J.; Liu, L.; Guo, L. A Review: How Deep Learning Technology Impacts the Evaluation of Traditional Village Landscapes. Buildings 2023, 13, 525. [Google Scholar] [CrossRef]
- Zhao, H.; Li, X.; Gu, Y.; Deng, W.; Huang, Y.; Zhou, S. Integrating Time-Series Nighttime Light Data With Static Remote Sensing and Village View Images for Hollow Villages Identification. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2025, 18, 9151–9165. [Google Scholar] [CrossRef]
- Automatic Classification of Rural Building Characteristics Using Deep Learning Methods on Oblique Photography | Building Simulation. Available online: https://link.springer.com/article/10.1007/s12273-021-0872-x (accessed on 28 June 2025).
- Wang, M.; Xu, W.; Cao, G.; Liu, T. Identification of Rural Courtyards’ Utilization Status Using Deep Learning and Machine Learning Methods on Unmanned Aerial Vehicle Images in North China. Build. Simul. 2024, 17, 799–818. [Google Scholar] [CrossRef]
- Huang, X.; Liu, H.; Zhang, L. Spatiotemporal Detection and Analysis of Urban Villages in Mega City Regions of China Using High-Resolution Remotely Sensed Imagery. IEEE Trans. Geosci. Remote Sens. 2015, 53, 3639–3657. [Google Scholar] [CrossRef]
- Zhan, Z.; Zhang, X.; Liu, Y.; Sun, X.; Pang, C.; Zhao, C. Vegetation Land Use/Land Cover Extraction From High-Resolution Satellite Images Based on Adaptive Context Inference. IEEE Access 2020, 8, 21036–21051. [Google Scholar] [CrossRef]
- A Novel Framework Based on Mask R-CNN and Histogram Thresholding for Scalable Segmentation of New and Old Rural Buildings. Available online: https://www.mdpi.com/2072-4292/13/6/1070 (accessed on 16 July 2025).
- Land Use Transitions and Their Dynamic Mechanism: The Case of the Huang-Huai-Hai Plain | Journal of Geographical Sciences. Available online: https://link.springer.com/article/10.1007/s11442-016-1283-2 (accessed on 16 July 2025).
- Ma, X.; Zhang, X.; Pun, M.-O. RS3 Mamba: Visual State Space Model for Remote Sensing Image Semantic Segmentation. IEEE Geosci. Remote Sens. Lett. 2024, 21, 1–5. [Google Scholar] [CrossRef]
- Chen, K.; Chen, B.; Liu, C.; Li, W.; Zou, Z.; Shi, Z. RSMamba: Remote Sensing Image Classification with State Space Model 2024.
- Shawky, O.A.; Hagag, A.; El-Dahshan, E.-S.A.; Ismail, M.A. Remote Sensing Image Scene Classification Using CNN-MLP with Data Augmentation. Optik 2020, 221, 165356. [Google Scholar] [CrossRef]
- Gated Recurrent Multiattention Network for VHR Remote Sensing Image Classification. Available online: https://ieeexplore.ieee.org/abstract/document/9495118 (accessed on 3 July 2025).
- Comparison between Adam, AdaMax and Adam W Optimizers to Implement a Weather Forecast Based on Neural Networks for the Andean City of Quito. Available online: https://ieeexplore.ieee.org/abstract/document/9590681 (accessed on 3 July 2025).





| Method | Building | Background | mF1 | mIoU |
|---|---|---|---|---|
| U-Net | 0.6316/0.4615 | 0.9046/0.8259 | 0.7680 | 0.6437 |
| ResNet | 0.7020/0.5558 | 0.8905/0.8888 | 0.7962 | 0.7223 |
| Transformer | 0.6605/0.5102 | 0.8869/0.8478 | 0.7737 | 0.6784 |
| RS3Mamba+(Ours) | 0.8131/0.7626 | 0.9772/0.9055 | 0.8815 | 0.7964 |
| Model Name | Building IoU | Background IoU | mF1 | mIoU | Kappa |
|---|---|---|---|---|---|
| Backbone | 0.626 | 0.7646 | 0.7124 | 0.6953 | 0.6851 |
| Backbone+ Mamba | 0.7251 | 0.8272 | 0.8692 | 0.7762 | 0.7739 |
| Backbone+ Gated -Attention |
0.6797 | 0.7912 | 0.8125 | 0.7355 | 0.7251 |
| RS3Mamba+(Ours) | 0.7626 | 0.9055 | 0.8815 | 0.7964 | 0.7889 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).