Submitted:
28 February 2025
Posted:
03 March 2025
You are already at the latest version
Abstract
It is important in agriculture the role of remote sensing applied to the classification of LandCover. Recently deep convolutional neural networks (CNN) have become increasingly and widely popular for their application to the study of monitoring and mapping of the land. In this work, we study existing semantic networks when applying to public datasets such as LandCover.ai. A comparison of fifteen neural networks is made and we find out that, in spite of they all have good performances, there are differences in the state of the outliers so we carry on a sistematical study of them. Our outcomes show that the most promising models achieve an accuracy of 99.11%, with a 71.5% of intersection over union (IoU) and 89.29% of recall, based on test set. We also conduct a study of the outliers dividing the misclassifications for tipology and find out that ANN, BiSeNetV2 and SETR-Naïve are the most effective models for handling the outliers. The dataset on which this research was carried out is publicly available at https://landcover.ai.linuxpolska.com/.

Keywords:
1. Introduction
2. Materials and Methods
2.1. Data Preparation
2.2. Semantic Segmentation Neural Networks
2.2.1. Asymmetrical Non-Local Neural Network for Semantic Segmentation (ANN) [24]
2.2.2. APCNet [25]
2.2.3. BiSeNetV2 [26]
2.2.4. CCNet [27]
2.2.5. DANet [28]
2.2.6. DeepLabV3+ [29]
2.2.7. FastFCN [30]
2.2.8. Fast-SCNN [31]
2.2.9. FCN [32]
2.2.10. GCNet [33]
2.2.11. ICNet [34]
2.2.12. ISANet [35]
2.2.13. OCRNet [36]
2.2.14. PSPNet [37]
2.2.15. UperNet [38]
2.3. Training
2.4. Study of the Outliers

3. Experiments
3.1. Experimental Setup
3.2. Results of Semantic Segmentation
3.3. Results of the Study of the Outliers
| Accuracy | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Neural network | Background | Buildings | Woodland | Water | Roads | Mean accuracy | ||||
| ANN | 0.9770 | 0.9990(1) | 0.9825 | 0.9974 | 0.9957 | 0.9903 | ||||
| APCNet | 0.9775(2) | 0.9990(1) | 0.9830(2) | 0.9978(1) | 0.9956 | 0.9906(2) | ||||
| BiSeNetV2 | 0.9732 | 0.9984 | 0.9800 | 0.9969 | 0.9952 | 0.9887 | ||||
| CCNet | 0.9770 | 0.9990(1) | 0.9825 | 0.9976(2) | 0.9956 | 0.9904 | ||||
| DANet | 0.9752 | 0.9989(2) | 0.9821 | 0.9963 | 0.9956 | 0.9896 | ||||
| DeepLabV3 | 0.9742 | 0.9987 | 0.9811 | 0.9974 | 0.9949 | 0.9896 | ||||
| FastFCN | 0.9739 | 0.9987 | 0.9814 | 0.9960 | 0.9953 | 0.9891 | ||||
| Fast-SCNN | 0.9686 | 0.9981 | 0.9770 | 0.9958 | 0.9942 | 0.9867 | ||||
| FCN | 0.9779(1) | 0.9990(1) | 0.9835(1) | 0.9977 | 0.9960(1) | 0.9908(1) | ||||
| GCNet | 0.9764 | 0.9990(1) | 0.9824 | 0.9981 | 0.9951 | 0.9902 | ||||
| ICNet | 0.9769 | 0.9989(2) | 0.9828 | 0.9973 | 0.9958 | 0.9904 | ||||
| ISANet | 0.9759 | 0.9990(1) | 0.9817 | 0.9972 | 0.9957 | 0.9899 | ||||
| OCRNet | 0.9745 | 0.9990(1) | 0.9815 | 0.9960 | 0.9955 | 0.9893 | ||||
| PSPNet | 0.9765 | 0.9989(2) | 0.9820 | 0.9978(1) | 0.9956 | 0.9902 | ||||
| UPerNet | 0.9758 | 0.9989(2) | 0.9821 | 0.9965 | 0.9959(2) | 0.9899 | ||||
| Intersection over union | |||||||
|---|---|---|---|---|---|---|---|
| Neural network | Background | Buildings | Woodland | Water | Roads | Mean IoU | |
| ANN | 0.8391(1) | 0.6702 | 0.6341 | 0.3382 | 0.5254 | 0.6689 | |
| APCNet | 0.8438 | 0.6746 | 0.6276 | 0.3741 | 0.5139 | 0.6711 | |
| BiSeNetV2 | 0.8267 | 0.5311 | 0.5804 | 0.2932 | 0.4566 | 0.6172 | |
| CCNet | 0.8389(2) | 0.6636 | 0.6474(2) | 0.4118 | 0.5196 | 0.7150(1) | |
| DANet | 0.8374 | 0.6502 | 0.6289 | 0.3385 | 0.4699 | 0.6496 | |
| DeepLabV3 | 0.8338 | 0.5642 | 0.5805 | 0.3136 | 0.4017 | 0.6132 | |
| FastFCN | 0.8325 | 0.6194 | 0.6210 | 0.3843 | 0.4822 | 0.6567 | |
| FCN | 0.8402 | 0.7086(2) | 0.6526(1) | 0.4467(2) | 0.5456(1) | 0.6967(2) | |
| Fast-SCNN | 0.8224 | 0.5165 | 0.5815 | 0.2861 | 0.4338 | 0.6082 | |
| GCNet | 0.8367 | 0.6594 | 0.6232 | 0.4931(1) | 0.5060 | 0.6782 | |
| ICNet | 0.8417 | 0.7085(1) | 0.6305 | 0.3429 | 0.4967 | 0.6654 | |
| ISANet | 0.8370 | 0.6660 | 0.6241 | 0.3380 | 0.5145 | 0.6550 | |
| OCRNet | 0.8355 | 0.6259 | 0.6312 | 0.3307 | 0.5034 | 0.6551 | |
| PSPNet | 0.8391(1) | 0.6168 | 0.6214 | 0.4278 | 0.5116 | 0.6673 | |
| UPerNet | 0.8321 | 0.6604 | 0.6265 | 0.3428 | 0.5299(2) | 0.6608 | |
| Recall | ||||||
|---|---|---|---|---|---|---|
| Neural network | Background | Buildings | Woodland | Water | Roads | Mean IoU |
| ANN | 0.9375 | 0.8414 | 0.8482 | 0.8278(1) | 0.7802 | 0.8815 |
| APCNet | 0.9389 | 0.8405 | 0.8506 | 0.8105 | 0.7798 | 0.8823 |
| BiSeNetV2 | 0.9333 | 0.8148 | 0.8400 | 0.8025 | 0.7472 | 0.8704 |
| CCNet | 0.9338 | 0.8483(2) | 0.8575 | 0.8141 | 0.7807 | 0.8828 |
| DANet | 0.9403 | 0.8276 | 0.8502 | 0.7998 | 0.7594 | 0.8776 |
| DeepLabV3 | 0.9443(2) | 0.7832 | 0.8001 | 0.8187 | 0.6769 | 0.8540 |
| FastFCN | 0.9356 | 0.8245 | 0.8297 | 0.7573 | 0.7447 | 0.8656 |
| Fast-SCNN | 0.9264 | 0.7986 | 0.8334 | 0.7759 | 0.7359 | 0.8628 |
| FCN | 0.9501(1) | 0.8202 | 0.8336 | 0.7902 | 0.7615 | 0.8772 |
| GCNet | 0.9382 | 0.8398 | 0.8523 | 0.7850 | 0.7771 | 0.8807 |
| ICNet | 0.9417 | 0.8452 | 0.8544 | 0.8100 | 0.7788 | 0.8843(1) |
| ISANet | 0.9356 | 0.8378 | 0.8570 | 0.8198(2) | 0.7872(1) | 0.8836 |
| OCRNet | 0.9340 | 0.8403 | 0.8628(1) | 0.8195 | 0.7847(2) | 0.8834 |
| PSPNet | 0.9364 | 0.8498(1) | 0.8609(2) | 0.8148 | 0.7808 | 0.8842(2) |
| UPerNet | 0.9371 | 0.8475 | 0.8517 | 0.8129 | 0.7684 | 0.8806 |

| ANN | APCNet | BiSeNetV2 | CCnet | DANet | DeepLabV3+ | FastFCN | Fast_SCNN | FCN | GCNet | ICNet | ISANet | OCRNet | PSPNet | UPerNet | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Mistakes of the network | ||||||||||||||||
| background | 9 | 8 | 11 | 7 | 3 | 10 | 11 | 11 | 11 | 15 | 9 | 12 | 15 | 5 | 6 | |
| buildings | 1 | 3 | 1 | 0 | 1 | 0 | 2 | 2 | 0 | 2 | 0 | 0 | 4 | 0 | 1 | |
| woodland | 19 | 16 | 12 | 8 | 7 | 11 | 14 | 14 | 7 | 17 | 7 | 9 | 13 | 4 | 7 | |
| water | 9 | 26 | 9 | 3 | 8 | 0 | 2 | 2 | 1 | 13 | 4 | 3 | 33 | 0 | 7 | |
| road | 8 | 23 | 16 | 13 | 10 | 20 | 19 | 19 | 7 | 25 | 7 | 9 | 22 | 12 | 7 | |
| Total mistakes of network | 46 | 76 | 49 | 31 | 29 | 41 | 48 | 48 | 26 | 72 | 27 | 33 | 87 | 21 | 28 | |
| Mistakes of the ground truth | ||||||||||||||||
| background | 17 | 7 | 3 | 13 | 7 | 11 | 14 | 14 | 9 | 15 | 9 | 14 | 17 | 16 | 11 | |
| buildings | 0 | 4 | 0 | 0 | 1 | 0 | 2 | 2 | 0 | 3 | 0 | 0 | 3 | 0 | 1 | |
| woodland | 10 | 53 | 28 | 22 | 17 | 23 | 27 | 27 | 17 | 55 | 16 | 20 | 60 | 15 | 16 | |
| water | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | |
| road | 14 | 22 | 16 | 11 | 13 | 10 | 13 | 13 | 10 | 27 | 13 | 12 | 23 | 11 | 8 | |
| Total mistakes of the groundtruth | 41 | 88 | 47 | 46 | 38 | 44 | 56 | 56 | 36 | 101 | 38 | 46 | 103 | 42 | 36 | |
| Ambiguous mistakes | ||||||||||||||||
| background | 29 | 35 | 44 | 30 | 47 | 41 | 53 | 53 | 33 | 24 | 33 | 35 | 32 | 16 | 39 | |
| buildings | 0 | 5 | 2 | 0 | 0 | 2 | 4 | 4 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | |
| woodland | 43 | 4 | 65 | 45 | 59 | 58 | 68 | 68 | 36 | 6 | 52 | 43 | 3 | 66 | 66 | |
| water | 2 | 1 | 7 | 4 | 7 | 6 | 3 | 3 | 3 | 3 | 6 | 5 | 2 | 0 | 4 | |
| road | 13 | 15 | 19 | 19 | 20 | 20 | 32 | 32 | 10 | 17 | 21 | 18 | 15 | 20 | 9 | |
| Total ambiguous mistakes | 87 | 60 | 137 | 98 | 133 | 127 | 160 | 160 | 83 | 50 | 112 | 101 | 53 | 102 | 118 | |
| Total | ||||||||||||||||
| background | 55 | 50 | 58 | 50 | 57 | 62 | 78 | 78 | 53 | 54 | 51 | 61 | 64 | 37 | 56 | |
| buildings | 1 | 12 | 3 | 0 | 2 | 2 | 8 | 8 | 1 | 5 | 0 | 0 | 8 | 0 | 2 | |
| woodland | 72 | 73 | 105 | 75 | 83 | 92 | 109 | 109 | 60 | 78 | 75 | 72 | 76 | 85 | 89 | |
| water | 11 | 29 | 16 | 7 | 15 | 6 | 5 | 5 | 4 | 17 | 10 | 8 | 35 | 0 | 11 | |
| road | 35 | 60 | 51 | 43 | 43 | 50 | 64 | 64 | 27 | 69 | 41 | 39 | 60 | 43 | 24 | |
| Total | 174 | 224 | 233 | 175 | 200 | 212 | 264 | 264 | 145 | 223 | 177 | 180 | 243 | 165 | 182 | |
3. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| MDPI | Multidisciplinary Digital Publishing Institute |
| DOAJ | Directory of open access journals |
| TLA | Three letter acronym |
| LD | Linear dichroism |
References
- Rong, C.; Fu, W. A Comprehensive Review of Land Use and Land Cover Change Based on Knowledge Graph and Bibliometric Analyses. Land 2023, 12, 1573Talukdar, S.; Singha, P.; Mahato, S.; Pal, S.; Liou, Y.A.; Rahman, A. Land-use land-cover classification by machine learning classifiers for satellite observations—A review. Remote Sens. 2023, 12, 1135.
- Luo, Z.W.; Yuan, Y.; Gou, R.; Li, X. Semantic segmentation of agricultural images. A survey. Information Processing in Agriculture 2023. [Google Scholar] [CrossRef]
- Jadhav, J.K.; Singh, R. Automatic semantic segmentation and classification of remote sensing data for agriculture. Mathematical Models in Engineering 2018, 4, 112–137. [Google Scholar] [CrossRef]
- Kussul, N.; Lavreniuk, M.; Skakun, S.; Shelestov, A. Deep learning classification of land cover and crop types using remote sensing data. IEEE Geoscience and Remote Sensing Letters 2017, 14, 778–782. [Google Scholar] [CrossRef]
- Dhanya, V.; Subeesh, A.; Kushwaha, N.; Vishwakarma, D.K.; Kumar, T.N.; Ritika, G.; Singh, A. Deep learning based computer vision approaches for smart agricultural applications. Artificial Intelligence in Agriculture 2022, 6, 211–229. [Google Scholar] [CrossRef]
- Attri, I.; Awasthi, L.K.; Sharma, T.P.; Rathee, P. A review of deep learning techniques used in agriculture. Ecological Informatics 2023, p. 102217.
- Boguszewski, A.; Batorski, D.; Ziemba-Jankowska, N.; Dziedzic, T.; Zambrzycka, A. LandCover. ai: Dataset for automatic mapping of buildings, woodlands, water and roads from aerial imagery. Proceedings of theence on Computer Vision and Pattern Recognition, 2021.
- Maggiori, E.; Tarabalka, Y.; Charpiat, G.; Alliez, P. Convolutional neural networks for large-scalere mote-sensing image classification. IEEE Transactions on geoscience and remote sensing 2016, 55, 645–657 1102. [Google Scholar] [CrossRef]
- Pauleit, S.; Duhme, F. Assessing the environmental performance of land cover types for urban planning. Landscape and urban planning 2000, 52, 1–20. [Google Scholar] [CrossRef]
- Zhou, W.; Huang, G.; Cadenasso, M.L. Does spatial configuration matter? Understanding the effects of land cover pattern on land surface temperature in urban landscapes. Landscape and urban planning 2011, 102, 54–63. [Google Scholar]
- Weir, N.; Lindenbaum, D.; Bastidas, A.; Etten, A.V.; McPherson, S.; Shermeyer, J.; Kumar, V.; Tang, H. Spacenet mvoi: A multi-view overhead imagery dataset. Proceedings of the ieee/cvf international conference on computer vision, 2019, pp.992-1001.
- Chiu, M.T.; Xu, X.; Wei, Y.; Huang, Z.; Schwing, A.G.; Brunner, R.; Khachatrian, H.; Karapetyan, H.; Dozier, I.; Rose, G. ; others. Agriculture-vision: A large aerial image database for agricultural pattern analysis. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 2828.2838.
- Zhao, X.; Yuan, Y.; Song, M.; Ding, Y.; Lin, F.; Liang, D.; Zhang, D. Use of unmanned aerial vehicle imagery and deep learning unet to extract rice lodging. Sensors 2019, 19, 3859. [Google Scholar] [CrossRef] [PubMed]
- Anand, T.; Sinha, S.; Mandal, M.; Chamola, V.; Yu, F.R. AgriSegNet: Deep aerial semantic segmentation framework for IoT-assisted precision agriculture. IEEE Sensors Journal 2021, 21, 17581–17590. [Google Scholar] [CrossRef]
- Liu, G.; Bai, L.; Zhao, M.; Zang, H.; Zheng, G. Segmentation of wheat farmland with improved U-Net on drone images. Journal of applied remote sensing 2022, 16, 034511–034511. [Google Scholar] [CrossRef]
- Mortensen, A.K.; Dyrmann, M.; Karstoft, H.; Jørgensen, R.N.; Gislum, R. Semantic segmentation of mixed crops using deep convolutional neural network. 2016.
- Wang, A.; Xu, Y.; Wei, X.; Cui, B. Semantic segmentation of crop and weed using an encoder-decoder network and image enhancement method under uncontrolled outdoor illumination. Ieee Access 2020, 8, 81724–81734. [Google Scholar] [CrossRef]
- Sahin, H.M.; Miftahushudur, T.; Grieve, B.; Yin, H. Segmentation of weeds and crops using multispectral imaging imaging and CFR- enhanced U-Net. Computers and Electronics in Agriculture 2023.
- Castillo-Navarro, J.; Le Saux, B.; Boulch, A.; Audebert, N.; Lefèvre, S. Semi-Supervised Semantic Segmentation in Earth Observation: The MiniFrance suite, dataset analysis and multi-task network study. Machine Learning 2022, 111, 3125–3160. [Google Scholar] [CrossRef]
- Chiu, M.T.; Xu, X.; Wei, Y.; Huang, Z.; Schwing, A.G.; Brunner, R.; Khachatrian, H.; Karapetyan, H.; Dozier, I.; Rose, G. ; others. Agriculture-vision: A large aerial image database for agricultural pattern analysis. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 2828-2838.
- Md Jelas, I. Deforestation detection using deep learning-based semantic segmentation techniques: a systematic review. Frontiers in Forests and Global Change 2024, 7, 1300060. [Google Scholar] [CrossRef]
- Xu, J., Chen, K., Lin, D.: MMSegmenation. Available online: https://github.com/openmmlab/mmsegmentation.
- Fernandes, A.A.; Koehler, M.; Konstantinou, N.; Pankin, P.; Paton, N.W.; Sakellariou, R. Data preparation: A technological perspective and review. SN Computer Science 2023, 4, 425. [Google Scholar] [CrossRef]
- Zhu, Z.; Xu, M.; Bai, S.; Huang, T.; Bai, X. Asymmetric non-local neural networks for semantic segmentation. Proceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 593–602.
- He, J.; Deng, Z.; Zhou, L.; Wang, Y.; Qiao, Y. Adaptive pyramid context network for semantic segmentation. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 7519–7528.
- Yu, C.; Gao, C.; Wang, J.; Yu, G.; Shen, C.; Sang, N. Bisenet v2: Bilateral network with guided aggregation for real-time semantic segmentation. International journal of computer vision 2021, 129, 3051–3068. [Google Scholar] [CrossRef]
- Huang, Z.; Wang, X.; Huang, L.; Huang, C.; Wei, Y.; Liu, W. Ccnet: Criss-cross attention for semantic segmentation. Proceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 603-612.
- Fu, J.; Liu, J.; Tian, H.; Li, Y.; Bao, Y.; Fang, Z.; Lu, H. Dual attention network for scene segmentation. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 3146–3154.
- Chen, L.C.; Zhu, Y.; Papandreou, G.; Schroff, F.; Adam, H. Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European conference on computer vision (ECCV), 2018, pp. 801–818.
- Wu, H.; Zhang, J.; Huang, K.; Liang, K.; Yu, Y. Fastfcn: Rethinking dilated convolution in the backbone for semantic segmentation. arXiv:1903.11816 2019.
- Poudel, R.P.; Liwicki, S.; Cipolla, R. Fast-scnn: Fast semantic segmentation network. arXiv:1902.04502 2019.
- Long, J.; Shelhamer, E.; Darrell, T. Fully convolutional networks for semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 3431–3440.
- Liu, J.; Zhou, W.; Cui, Y.; Yu, L.; Luo, T. GCNet: Grid-like context-aware network for RGB-thermal semantic segmentation. Neurocomputing 2022, 506, 60–67. [Google Scholar] [CrossRef]
- Zhao, H.; Qi, X.; Shen, X.; Shi, J.; Jia, J. Icnet for real-time semantic segmentation on high-resolution images. Proceedings of the European conference on computer vision (ECCV), 2018, pp. 405–420.
- L., Yuan Y., Guo J., Zang C., Chen X., Wang J. Interlaced Sparse Self-Attention for Semantic Segmentation. arXiv:1907.12273 2019.
- Yuan, Y.; Chen, X.; Wang, J. Object-contextual representations for semantic segmentation. Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part VI 16. Springer, 2020, pp. 173–190.
- Zhao, H.; Shi, J.; Qi, X.; Wang, X.; Jia, J. Pyramid scene parsing network. Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 2881–2890.
- Wang, R.; Jiang, H.; Li, Y. UPerNet with ConvNeXt for Semantic Segmentation. 2023 IEEE 3rd International Conference on Electronic Technology, Communication and Information (ICETCI). IEEE, 2023, pp. 764–769.
- Kotsiantis, S.; Kanellopoulos, D.; Pintelas, P.; others. Handling imbalanced datasets: A review. GESTS international transactions on computer science and engineering 2006, 30, 25–36.
- Werner de Vargas, V.; Schneider Aranda, J.A.; dos Santos Costa, R.; da Silva Pereira, P.R.; Victória Barbosa, J.L. Imbalanced data preprocessing techniques for machine learning: a systematic mapping study. Knowledge and Information Systems 2023, 65, 31–57. [Google Scholar] [CrossRef] [PubMed]
- Johnson, J.M.; Khoshgoftaar, T.M. Survey on deep learning with class imbalance. Journal of big data 2019, 6, 1–54. [Google Scholar] [CrossRef]








Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).