Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

A Multi-Modality Fusion and Gated Multi-Filter U-net for Water Area Segmentation of Remote Sensing

Version 1 : Received: 14 December 2023 / Approved: 15 December 2023 / Online: 15 December 2023 (11:40:07 CET)

A peer-reviewed article of this Preprint also exists.

Wang, R.; Zhang, C.; Chen, C.; Hao, H.; Li, W.; Jiao, L. A Multi-Modality Fusion and Gated Multi-Filter U-Net for Water Area Segmentation in Remote Sensing. Remote Sens. 2024, 16, 419. Wang, R.; Zhang, C.; Chen, C.; Hao, H.; Li, W.; Jiao, L. A Multi-Modality Fusion and Gated Multi-Filter U-Net for Water Area Segmentation in Remote Sensing. Remote Sens. 2024, 16, 419.

Abstract

Water area segmentation in remote sensing is of great importance for flood monitoring. To overcome some challenges in this task, we construct the Water Index and Polarization Information (WIPI) multi-modality dataset and propose a multi-Modality Fusion and Gated multi-Filter U-Net (MFGF-UNet) convolutional neural network. The WIPI dataset enhances the water information and reduces the data dimension. In particular, the Cloud-Free label provided in the database effectively mitigates the issue of scarcity of labeled samples Since a single form or uniform kernel size cannot handle the variety in size and shape of water bodies, we proposed the gated multi-filter inception (GMF-Inception) module in our MFGF-UNet. Moreover, we utilize the attention mechanism by introducing the Gated Channel Transform (GCT) skip connection and integrating GCT into GMF- Inception to further improve model performance. Extensive experiments on three benchmarks, including WIPI, Chengdu and GF2020 datasets demonstrate that our method achieves favorable performance with lower complexity and better robustness against six competing approaches. For example, on WIPI, Chengdu and GF2020 datasets, the proposed MFGF-UNet model achieves F1 of 0.9191, 0.7410, and 0.8421, respectively, which average F1 on three datasets is 0.0045 higher than the U-Net model and reduces 62% average GFLOPs. New WIPI dataset, code and trained models have been released on GitHub

Keywords

water area segmentation; multi-modality fusion; multi-filter inception; attention mechanism; remote sensing

Subject

Computer Science and Mathematics, Computer Vision and Graphics

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.