Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

An Improved Large Kernel-Based Remote Sensing Land Cover Segmentation Algorithm

Version 1 : Received: 15 May 2024 / Approved: 15 May 2024 / Online: 15 May 2024 (13:31:54 CEST)

How to cite: Liu, G.; Liu, C.; Wu, X.; Li, Y.; Zhang, X.; Xu, J. An Improved Large Kernel-Based Remote Sensing Land Cover Segmentation Algorithm. Preprints 2024, 2024051035. https://doi.org/10.20944/preprints202405.1035.v1 Liu, G.; Liu, C.; Wu, X.; Li, Y.; Zhang, X.; Xu, J. An Improved Large Kernel-Based Remote Sensing Land Cover Segmentation Algorithm. Preprints 2024, 2024051035. https://doi.org/10.20944/preprints202405.1035.v1

Abstract

Land cover segmentation, a fundamental task within the domain of remote sensing, boasts a broad spectrum of application potential. In this article, we focus on the land cover segmentation tasks and complete the following research work: Firstly, to address the issues of uneven distribution of foreground and background and significant differences in target scales in remote sensing images, we propose a decoder called MDCFD based on multi-dilation rate convolution fusion. The decoder utilizes dilated convolution to expand the receptive field, enhancing the model's ability to capture global features and thus improving the model's ability to distinguish between foreground and background. Meanwhile, we design a multi-dilation rate convolution fusion module (MDCFM), which fuses the outputs of different dilation rate convolution layers. Secondly, aiming at the problems of diverse scenes, significant differences between categories, and many background interferences in remote sensing images, we propose a hybrid attention module called LKSHAM based on large kernel convolution. This module combines spatial attention and channel attention mechanisms and combines the two attention modules in series. By introducing large kernel convolution, the model's ability to extract contextual information is improved. At the same time, we adopt a strategy of decomposing large kernel convolution into multiple depthwise convolutions to reduce computational complexity. The improved network models designed by this paper can achieve an improvement of over 1.1% in the mIoU metric on segmentation tasks on the Potsdam and Vaihingen datasets.

Keywords

Remote Sensing Images; Land Cover Segmentation; Dilated Convolution; Attention Mechanism; Large Kernel Convolution

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.