Submitted:
08 July 2024
Posted:
09 July 2024
You are already at the latest version
Abstract
Keywords:
Introduction
2. Materials and Methods
Datasets
Gaofen Image Dataset (GID)
Water Land Dataset (WLD)
Water-Land Boundary Attention Network (WLBANet)
Loss Function
Implementation Details
3. Results
Results on GID
Results on WLD
4. Discussion
5. Conclusions
References
- Long, J.; Shelhamer, E. Long, J.; Shelhamer, E., and Darrell, T., 2015. Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431-3440.
- 2. Badrinarayanan, V., Kendall, A., & Cipolla, R., 2017. Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE transactions on pattern analysis and machine intelligence, 39(12), 2481-2495.
- Basit, A., Siddique, M. A., Bashir, S., Naseer, E., & Sarfraz, M. S. 2024. Deep Learning-Based Detection of Oil Spills in Pakistan’s Exclusive Economic Zone from January 2017 to December 2023. Remote Sensing, 16(13), 2432.
- Ronneberger, O., Fischer, P., & Brox, T., 2015. U-net: Convolutional networks for biomedical image segmentation. Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pp. 234-241. Springer International Publishing.
- 5. Zhao, H., Shi, J., Qi, X., Wang, X., & Jia, J., 2017. Pyramid scene parsing network. Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2881-2890.
- Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., & Hu, Q., 2020. ECA-Net: Efficient channel attention for deep convolutional neural networks. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 11534-11542.
- Kaiser, P., Wegner, J. D., Lucchi, A., Jaggi, M., Hofmann, T., & Schindler, K., 2017. Learning aerial image segmentation from online maps. IEEE Transactions on Geoscience and Remote Sensing, 55(11), 6054-6068.
- Tong, X. Y., Xia, G. S., Lu, Q., Shen, H., Li, S., You, S., & Zhang, L., 2020. Land-cover classification with high-resolution remote sensing images using transferable deep models. Remote Sensing of Environment, 237, 111322.
- He, K., Zhang, X., Ren, S., & Sun, J., 2016. Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770-778.
- Hoffman, J. P., Rahmes, T. F., Wimmers, A. J., & Feltz, W. F.,2023. The application of a convolutional neural network for the detection of contrails in satellite imagery. Remote Sensing, 15(11), 2854.
- Sobel, Irwin.,2014. An Isotropic 3x3 Image Gradient Operator. Presentation at Stanford A.I. Project 1968.
- Kingma, D. P., & Ba, J., 2014. Adam: A method for stochastic optimization. arXiv preprint, arXiv:1412.6980.
- Dice, L. R., 1945. Measures of the amount of ecologic association between species. Ecology, 26(3), 297-302.
- Li, X., Sun, X., Meng, Y., Liang, J., Wu, F., & Li, J., 2019. Dice loss for data-imbalanced NLP tasks. arXiv preprint arXiv:1911.02855.
- Marmanis, D., Schindler, K., Wegner, J. D., Galliani, S., Datcu, M., & Stilla, U.,2018. Classification with an edge: Improving semantic image segmentation with boundary detection. ISPRS Journal of Photogrammetry and Remote Sensing, 135, 158-172.
- Tasar, O., Tarabalka, Y., & Alliez, P.,2019. Incremental learning for semantic segmentation of large-scale remote sensing data. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 12(9), 3524-3537.
- Krizhevsky, A., Sutskever, I., & Hinton, G. E., 2017. Imagenet classification with deep convolutional neural networks. Communications of the ACM, 60(6), 84-90.







| Categories | Amount of pixels |
|---|---|
| Built-up(red) | 105471990 |
| Farmland(green) | 122294497 |
| Forest(bluegreen) | 18111326 |
| Meadow(yellow) | 190830 |
| Water(blue) | 302786845 |
| Categories | Amount of pixels |
|---|---|
| Land(red) Water(green)Bridge(yellow) Harbor(blue) Others(purple) |
87355878 119803284 9508132 105842 52045 |
| Environment | Configuration |
|---|---|
| GPU | GeForce RTX 2080 8GB |
| Memory | 32GB |
| Deep Learning Framework | TensorFlow/Keras |
| Programming Languages | python 3.9 |
| GPU Processing Framework | CUDA 11.7 |
| Parameter | Value |
|---|---|
| Batchsize | 4 |
| Optimizer | Adam |
| Initial Learning Rate | 0.0001 |
| Reduce Learning Rate | 0.3 * 3 |
| OA | mIoU |
|---|---|
| 78.64 | 75.32 |
| OA | mIoU |
|---|---|
| 90.65 | 80.37 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).