Submitted:
03 June 2024
Posted:
04 June 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Materials
2.2. Methods
2.2.1.
2.2.2. Water Extraction Network Based on Swin Transformer
2.2.3. Deformable Convolution
3. Results
3.1. Experimental Environment and Parameter Settings
3.2. Evaluation Metrics
3.3. Method Comparison
3.4. Analysis of Experimental Results
4. Discussion
5. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
Appendix A: Cloud Area

Appendix B: Mountainous Area

References
- Parajuli J, Fernandez-Beltran R, Kang J, et al. Attentional Dense Convolutional Neural Network for Water Body Extraction From Sentinel-2 Images. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2022, 15, 6804-6816. [CrossRef]
- Luo X, Hu Z, Liu L. Investigating the seasonal dynamics of surface water over the Qinghai–Tibet Plateau using Sentinel-1 imagery and a novel gated multiscale ConvNet. International Journal of Digital Earth, 2023, 16(1), 1372-1394. [CrossRef]
- Li H, Zech J, Ludwig C, et al. Automatic mapping of national surface water with OpenStreetMap and Sentinel-2 MSI data using deep learning. International Journal of Applied Earth Observation and Geoinformation, 2021, 104, 102571. [CrossRef]
- Jiang C, Zhang H, Wang C, et al. Water Surface Mapping from Sentinel-1 Imagery Based on Attention-UNet3+: A Case Study of Poyang Lake Region. Remote Sensing, 2022, 14(19),4708. [CrossRef]
- Zhao B, Sui H, Liu J. Siam-DWENet: Flood inundation detection for SAR imagery using a cross-task transfer siamese network. International Journal of Applied Earth Observation and Geoinformation, 2023, 116, 103132. [CrossRef]
- Tong X, Luo X, Liu S. An approach for flood monitoring by the combined use of Landsat 8 optical imagery and COSMO-SkyMed radar imagery. ISPRS Journal of Photogrammetry and Remote Sensing, 2018, 136, 144-153. [CrossRef]
- McFeeters S K. The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. International Journal of Remote Sensing, 1996, 17, 1425-1432. [CrossRef]
- Xu H. Modification of normalised difference water index (NDWI) to enhance open water features in remotely sensed imagery. International Journal of Remote Sensing, 2006, 27, 3025-3033. [CrossRef]
- Wang X, Xie S, Zhang X, et al. A robust Multi-Band Water Index (MBWI) for automated extraction of surface water from Landsat 8 OLI imagery. International Journal of Applied Earth Observation and Geoinformation, 2018, 68, 73-91. [CrossRef]
- Li L, Su H, Du Q, et al. A novel surface water index using local background information for long term and large-scale Landsat images. ISPRS Journal of Photogrammetry and Remote Sensing, 2021, 172, 59-78. [CrossRef]
- Cai Y, Shi Q, Liu X. Spatiotemporal Mapping of Surface Water Using Landsat Images and Spectral Mixture Analysis on Google Earth Engine. Journal of Remote Sensing, 2024, 4 ,117. [CrossRef]
- Sun Q, Li J. A method for extracting small water bodies based on DEM and remote sensing images. Scientific reports, 2024, 14, 760. [CrossRef]
- Yan X, Song J, Liu Y, et al. A Transformer-based method to reduce cloud shadow interference in automatic lake water surface extraction from Sentinel-2 imagery. Journal of Hydrology, 2023, 620, 129561. [CrossRef]
- Isikdogan F, Bovik A C, Passalacqua P. Surface Water Mapping by Deep Learning. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2017,10(11), 4909-4918. [CrossRef]
- Chen F, Chen X, de Voorde T, et al. Open water detection in urban environments using high spatial resolution remote sensing imagery. Remote Sensing of Environment, 2020, 242,111706. [CrossRef]
- Kang J, Guan H, Peng D, et al. Multi-scale context extractor network for water-body extraction from high-resolution optical remotely sensed images. International Journal of Applied Earth Observation and Geoinformation, 2021, 103, 102499. [CrossRef]
- Luo X, Tong X, Hu Z. An applicable and automatic method for earth surface water mapping based on multispectral images. International Journal of Applied Earth Observation and Geoinformation,2021, 103, 102472. [CrossRef]
- Li Z, Zhang X, Xiao P. Spectral index-driven FCN model training for water extraction from multispectral imagery. ISPRS Journal of Photogrammetry and Remote Sensing, 2022, 192, 344-360. [CrossRef]
- Zhang X, Li J, Hua Z. MRSE-Net: Multiscale Residuals and SE-Attention Network for Water Body Segmentation From Satellite Images. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing. 2022, 15, 5049-5064. [CrossRef]
- Yu Y, Huang L, Lu W, et al. WaterHRNet: A multibranch hierarchical attentive network for water body extraction with remote sensing images. International Journal of Applied Earth Observation and Geoinformation, 2022, 115, 103103. [CrossRef]
- Lyu X, Fang Y, Tong B, et al. Multiscale Normalization Attention Network for Water Body Extraction from Remote Sensing Imagery. Remote Sensing, 2020, 14, 4983. [CrossRef]
- Kang J, Guan H, Ma L, et al. WaterFormer: A coupled transformer and CNN network for waterbody detection in optical remotely-sensed imagery. ISPRS Journal of Photogrammetry and Remote Sensing, 2023, 206, 222-241. [CrossRef]
- Liu Z, Lin Y, Cao Y, et al. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. IEEE International Conference on Computer Vision, 2021, 9992-10002. [CrossRef]
- Dai J, Qi H, Xiong Y, et al. Deformable Convolutional Networks. IEEE International Conference on Computer Vision, 2017, 764-773.
- Seale C, Redfern T, Chatfield P, et al. Coastline detection in satellite imagery: A deep learning approach on new benchmark data. Remote Sensing of Environment, 2022, 278, 113044. [CrossRef]
- Ronneberger O, Fischer P, Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation. International Conference on Medical Image Computing and Computer-Assisted Intervention, 2015.
- Chen L, Zhu Y, Papandreou G, et al. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. European Conference on Computer Vision, 2018.
- Zhao H, Shi J, Qi X, et al. Pyramid Scene Parsing Network. IEEE Conference on Computer Vision and Pattern Recognition, 2017, 6230-6239.
- Pan H, Chen H, Hong Z, et al. A Novel Boundary Enhancement Network for Surface Water Mapping Based on Sentinel-2 MSI Data. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2023, 16, 9207-9222. [CrossRef]
- He K, Zhang X, Ren S, et al. Deep Residual Learning for Image Recognition. IEEE Conference on Computer Vision and Pattern Recognition, 2015, 770-778.













| Region | Characteristics | Challenges |
|---|---|---|
| Mountainous Area | Large undulating terrain, complex landforms, scattered water bodies, and high-altitude areas that may be covered by ice and snow. | Water bodies are often obstructed by mountains, resulting in incomplete extraction information; their scattered distribution leads to a small extraction scale range; there is also potential interference from ice and snow. |
| Cloudy Area | Under conditions of frequent clouds, rain, and fog, cloud cover areas are large and last for long durations. Clouds exhibit spectral characteristics similar to water bodies in some bands. | Cloud cover affects the transmission and reflection characteristics of remote sensing images, increasing the difficulty of water body extraction. Due to their spectral similarities, confusion between clouds and water bodies is prone to occur. |
| Method | Accuracy(%) | Precision(%) | Recall(%) | f1_score(%) |
|---|---|---|---|---|
| Ours | 97.89 | 94.98 | 90.05 | 92.33 |
| Unet | 90.79 | 95.24 | 72.17 | 77.03 |
| Resnet | 97.65 | 97.26 | 88.96 | 92.68 |
| Deeplabv3_plus | 97.27 | 93.80 | 86.53 | 89.22 |
| Deepwatermapv2 | 97.41 | 99.07 | 81.89 | 88.69 |
| Method | Ours | Unet | Resnet | Deeplabv3+ | Deepwatermapv2 |
|---|---|---|---|---|---|
| Mountainous Area | |||||
| Accuracy | 98.03% | 86.77% | 97.14% | 97.88% | 97.47% |
| Precision | 95.99% | 89.24% | 97.97% | 95.42% | 99.34% |
| Recall | 91.61% | 49.18% | 87.91% | 89.18% | 85.65% |
| f1_score | 93.52% | 55.56% | 92.32% | 91.08% | 91.46% |
| Cloud Area | |||||
| Accuracy | 98.30% | 90.93% | 98.49% | 97.64% | 97.14% |
| Precision | 94.97% | 93.66% | 97.12% | 93.81% | 98.61% |
| Recall | 92.22% | 63.05% | 92.12% | 89.27% | 84.85% |
| f1_score | 93.46% | 67.29% | 94.46% | 91.33% | 88.19% |
| Model | Swin Transform | Deformable Conv | Accuracy(%) | Precision(%) | Recall(%) | f1_score(%) |
|---|---|---|---|---|---|---|
| 1 | ✓ | ✓ | 97.89 | 94.98 | 90.05 | 92.33 |
| 2 | ✓ | 97.67 | 94.96 | 89.04 | 91.73 | |
| 3 | ✓ | 96.70 | 88.17 | 89.77 | 88.40 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).