Submitted:
13 December 2025
Posted:
16 December 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- We present PlantGraphNet, a scalable hybrid GNN-CNN architecture that fuses graph-level relational reasoning with global visual feature extraction.
- We implement a distributed training pipeline using DDP to handle large dataset inputs efficiently and accelerate GraphPlantNet model training.
- We demonstrate that GraphPlantNet achieves state-of-the-art performance, outperforming several standard CNN architectures in accuracy and robustness in heathland plant species classification.
2. Materials and methods
2.1. Dataset
2.1.1. Study site and Data Acquisition


2.1.2. Preprocessing and Data Split
2.2. Methods
2.2.1. Graph Dataset Construction from Images
| Algorithm 1 Keypoint-based Graph Construction Method |
![]() |
2.2.2. Hybrid GNN-CNN Architecture
Graph Representation Learning
Image Representation Learning
Multimodal Fusion and Classification
2.2.3. Distributed Data-Parallel Training Framework
3. Model Evaluation and Results
3.1. Baseline
- AlexNet [15]: Convolutional neural networks (CNNs) consisting of sequential convolutional and fully connected layers. For the classification task, the original classifier heads are extended with additional non-linear transformations and regularisation techniques such as dropout.
- GoogLeNet [17]: A CNN based on Inception modules, which combine multiple convolutional filter sizes in parallel to capture multi-scale features within the same layer.
- YOLOv8 [46]: An object detection architecture characterized by an anchor-free design, distinct detection heads, and improved backbone efficiency. It has been adapted for classification through the substitution of detection-specific elements with a global pooling and classification head.
3.2. Settings
3.3. Evaluation Metrics.
-
Precision (Positive Predictive Value), defined for class c as:where is the number of true positives and the number of false positives for class c.
-
Recall (Sensitivity or True Positive Rate), defined as:Where is the number of false negatives for class c.
- F1-Score, the harmonic mean of precision and recall:
-
Accuracy, the overall proportion of correctly classified instances, defined as:here N is the total number of samples, the summation adds 1 for each correctly predicted label ; otherwise, it adds 0.
3.4. Results


3.5. Ablation Study
3.5.1. Influence of Graph Construction Method
- Region Adjacency Graph (RAG): Constructs nodes from superpixel segments using SLIC, with edges formed between adjacent regions and edge weights based on mean colour dissimilarity.
- Superpixel Graph: Similar to RAG, but with adjacency derived explicitly from boundary connectivity rather than a region adjacency graph, yielding a finer representation of local transitions.
- Grid Graph: A dense pixel-level graph with raw RGB features and a 4-connected topology, preserving spatial locality but lacking abstraction.
3.5.2. Influence of the Hybrid Model
3.5.3. Influence of Distributed Data-Parallel Training
4. Limitations
5. Conclusions
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
References
- Field, C.D.; Evans, C.D.; Dise, N.B.; Hall, J.R.; Caporn, S.J. Long-term nitrogen deposition increases heathland carbon sequestration. Science of the Total Environment 2017, 592, 426–435. [CrossRef]
- Acreman, M.; Holden, J. How wetlands affect floods. Wetlands 2013, 33, 773–786. [CrossRef]
- Stanton, D.; Dyring, M.; Proud, R.; Williams, P. Wildfire Impact on Coastal Groundwater-Dependent Heathland: Structural and Floristic Responses and Management. Austral Ecology 2025, 50, e70044. [CrossRef]
- Damgaard, C. Ecosystem dynamics in wet heathlands: spatial and temporal effects of environmental drivers on the vegetation. Rangeland Ecology & Management 2025, 100, 47–55. [CrossRef]
- Agboola, P.O.; Bekun, F.V.; Agozie, D.Q.; Gyamfi, B.A. Environmental sustainability and ecological balance dilemma: accounting for the role of institutional quality. Environmental Science and Pollution Research 2022, 29, 74554–74568. [CrossRef]
- Aszalós, R.; Thom, D.; Aakala, T.; Angelstam, P.; Brūmelis, G.; Gálhidy, L.; Gratzer, G.; Hlásny, T.; Katzensteiner, K.; Kovács, B.; et al. Natural disturbance regimes as a guide for sustainable forest management in Europe. Ecological Applications 2022, 32, e2596.
- Damgaard, C. Observed vegetation changes in Danish dry heathlands since 2004. Flora 2025, 327, 152728. [CrossRef]
- Rittl, T.F.; Farsund, P.G.; Pommeresche, R.; Sørheim, K.M.; Wibe, A.; Hellekås, J.; Velle, L.G. Traditional Norwegian farming practices drive biodiversity–A case study from coastal heathlands. Agriculture, Ecosystems & Environment 2025, 388, 109662. [CrossRef]
- Damgaard, C.F.; Irvine, K.M. Using the beta distribution to analyse plant cover data. Journal of Ecology 2019, 107, 2747–2759. [CrossRef]
- Kollert, A.; Bremer, M.; Löw, M.; Rutzinger, M. Exploring the potential of land surface phenology and seasonal cloud free composites of one year of Sentinel-2 imagery for tree species mapping in a mountainous region. International Journal of Applied Earth Observation and Geoinformation 2021, 94, 102208. [CrossRef]
- Dyrmann, M.; Karstoft, H.; Midtiby, H.S. Plant species classification using deep convolutional neural network. Biosystems engineering 2016, 151, 72–80. [CrossRef]
- Chen, S.; Tan, X.; Wang, B.; Hu, X. Reverse attention for salient object detection. In Proceedings of the Proceedings of the European conference on computer vision (ECCV), 2018, pp. 234–250.
- Cai, L.; Kreft, H.; Taylor, A.; Denelle, P.; Schrader, J.; Essl, F.; van Kleunen, M.; Pergl, J.; Pyšek, P.; Stein, A.; et al. Global models and predictions of plant diversity based on advanced machine learning techniques. New Phytologist 2023, 237, 1432–1445. [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778.
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Communications of the ACM 2017, 60, 84–90.
- Dong, K.; Zhou, C.; Ruan, Y.; Li, Y. MobileNetV2 model for image classification. In Proceedings of the 2020 2nd International Conference on Information Technology and Computer Application (ITCA). IEEE, 2020, pp. 476–480.
- Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 1–9. [CrossRef]
- Barbedo, J.G.A. Plant disease identification from individual lesions and spots using deep learning. Biosystems engineering 2019, 180, 96–107. [CrossRef]
- Ge, X.; Wu, S.; Wen, W.; Shen, F.; Xiao, P.; Lu, X.; Liu, H.; Zhang, M.; Guo, X. LettuceP3D: A tool for analysing 3D phenotypes of individual lettuce plants. Biosystems Engineering 2025, 251, 73–88. [CrossRef]
- Espitalier, V.; Lombardo, J.C.; Goëau, H.; Botella, C.; Høye, T.T.; Dyrmann, M.; Bonnet, P.; Joly, A. Adapting a global plant identification model to detect invasive alien plant species in high-resolution road side images. Ecological Informatics 2025, p. 103129. [CrossRef]
- Wagner, F.H.; Sanchez, A.; Tarabalka, Y.; Lotte, R.G.; Ferreira, M.P.; Aidar, M.P.; Gloor, E.; Phillips, O.L.; Aragao, L.E. Using the U-net convolutional network to map forest types and disturbance in the Atlantic rainforest with very high resolution images. Remote Sensing in Ecology and Conservation 2019, 5, 360–375. [CrossRef]
- Wagner, F.H.; Sanchez, A.; Aidar, M.P.; Rochelle, A.L.; Tarabalka, Y.; Fonseca, M.G.; Phillips, O.L.; Gloor, E.; Aragao, L.E. Mapping Atlantic rainforest degradation and regeneration history with indicator species using convolutional network. PLoS One 2020, 15, e0229448. [CrossRef]
- Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the Medical image computing and computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5-9, 2015, proceedings, part III 18. Springer, 2015, pp. 234–241.
- Guo, Y.; Li, Z.; Chen, E.; Zhang, X.; Zhao, L.; Xu, E.; Hou, Y.; Sun, R. An end-to-end deep fusion model for mapping forests at tree species levels with high spatial resolution satellite imagery. Remote Sensing 2020, 12, 3324. [CrossRef]
- Guo, Y.; Li, Z.; Chen, E.; Zhang, X.; Zhao, L.; Xu, E.; Hou, Y.; Liu, L. A deep fusion unet for mapping forests at tree species levels with multi-temporal high spatial resolution satellite imagery. Remote Sensing 2021, 13, 3613. [CrossRef]
- Cho, M.A.; Malahlela, O.; Ramoelo, A. Assessing the utility WorldView-2 imagery for tree species mapping in South African subtropical humid forest and the conservation implications: Dukuduku forest patch as case study. International Journal of Applied Earth Observation and Geoinformation 2015, 38, 349–357. [CrossRef]
- Hościło, A.; Lewandowska, A. Mapping forest type and tree species on a regional scale using multi-temporal Sentinel-2 data. Remote Sensing 2019, 11, 929.
- Vermeer, M.; Völgyes, D.; Sørensen, T.K.; Miller, H.; Fantin, D. Semi-and weak-supervised learning for Norwegian tree species detection. In Proceedings of the Proceedings of the Northern Lights Deep Learning Workshop, 2023, Vol. 4, pp. 1–7. [CrossRef]
- Bueso-Bello, J.L.; Dal Molin, R.; Carcereri, D.; Posovszky, P.; González, C.; Martone, M.; Rizzoli, P. Monitoring Forest Degradation in the Amazon Basin with Tandem-X High-Resolution Images and Deep Learning Techniques. In Proceedings of the IGARSS 2023-2023 IEEE International Geoscience and Remote Sensing Symposium. IEEE, 2023, pp. 1473–1476.
- Zhang, S.; Zhao, W.; Guan, Z.; Peng, X.; Peng, J. Keypoint-graph-driven learning framework for object pose estimation. In Proceedings of the Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 1065–1073.
- Han, K.; Wang, Y.; Guo, J.; Tang, Y.; Wu, E. Vision gnn: An image is worth graph of nodes. Advances in neural information processing systems 2022, 35, 8291–8303.
- Diao, Q.; Dai, Y.; Wang, J.; Feng, X.; Pan, F.; Zhang, C. Spatial-pooling-based graph attention U-Net for hyperspectral image classification. Remote Sensing 2024, 16, 937. [CrossRef]
- Reddy, K.D.; Ashwini, P.; Raju, S.S.H.; Sonia, M.; Ghimiray, D. Enhancing Early Plant Disease Detection with the Advanced GNN-based Model Leveraging Hyperspectral Imaging. In Proceedings of the 2025 IEEE International Conference on Computer, Electronics, Electrical Engineering & their Applications (IC2E3). IEEE, 2025, pp. 1–6.
- Maruthai, S.; Selvanarayanan, R.; Thanarajan, T.; Rajendran, S. Hybrid vision GNNs based early detection and protection against pest diseases in coffee plants. Scientific reports 2025, 15, 11778. [CrossRef]
- Verma, A.; Ranga, V.; Vishwakarma, D.K. A novel approach for forecasting PM2. 5 pollution in Delhi using CATALYST. Environmental Monitoring and Assessment 2023, 195, 1457. [CrossRef]
- Baskar, G.; Parameswaran, A.; Sathyanathan, R. Optimizing papermaking wastewater treatment by predicting effluent quality with node-level capsule graph neural networks. Environmental Monitoring and Assessment 2025, 197, 176. [CrossRef]
- Zheng, Y.; Lin, Y.; Zhao, L.; Wu, T.; Jin, D.; Li, Y. Spatial planning of urban communities via deep reinforcement learning. Nature Computational Science 2023, 3, 748–762. [CrossRef]
- Mostafazadeh, R.; Alaei, N.; Mirchooli, F.; Hussain, S. Changes in urban green space configuration and connectivity using spatial graph-based metrics in Ardabil developing city, Iran. Environmental Monitoring and Assessment 2024, 196, 778. [CrossRef]
- Scarselli, F.; Gori, M.; Tsoi, A.C.; Hagenbuchner, M.; Monfardini, G. The graph neural network model. IEEE transactions on neural networks 2008, 20, 61–80.
- Oloulade, B.M.; Gao, J.; Chen, J.; Lyu, T.; Al-Sabri, R. Graph neural architecture search: A survey. Tsinghua Science and Technology 2021, 27, 692–708. [CrossRef]
- Mulugeta, A.K.; Sharma, D.P.; Mesfin, A.H. Deep learning for medicinal plant species classification and recognition: a systematic review. Frontiers in Plant Science 2024, 14, 1286088. [CrossRef]
- Zhong, L.; Dai, Z.; Fang, P.; Cao, Y.; Wang, L. A review: Tree species classification based on remote sensing data and classic deep learning-based methods. Forests 2024, 15, 852. [CrossRef]
- Liu, S.; Deng, W. Very deep convolutional neural network based image classification using small training sample size. In Proceedings of the 2015 3rd IAPR Asian conference on pattern recognition (ACPR). IEEE, 2015, pp. 730–734. [CrossRef]
- Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 2019, 1556.
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition, 2015, [arXiv:cs.CV/1512.03385].
- Vaghela, R.; Vaishnani, D.; Srinivasu, P.N.; Popat, Y.; Sarda, J.; Woźniak, M.; Ijaz, M.F. Land cover classification for identifying the agriculture fields using versions of yolo v8. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2025.
- Low, D.G. Distinctive image features from scale-invariant keypoints. Journal of Computer Vision 2004, 60, 91–110.
- Qadir, I.; Iqbal, M.A.; Ashraf, S.; Akram, S. A fusion of CNN And SIFT For multicultural facial expression recognition. Multimedia Tools and Applications 2025, pp. 1–19. [CrossRef]
- Jihad, S.M.; Alsaud, A.A.; Almukhtar, F.H.; Kareem, S.; Yousif, R.Z. Investigating feature extraction by SIFT methods for prostate cancer early detection. Egyptian Informatics Journal 2025, 29, 100607. [CrossRef]
- Cai, T.; Luo, S.; Xu, K.; He, D.; Liu, T.; Wang, L. GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training. In Proceedings of the Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event; Meila, M.; Zhang, T., Eds. PMLR, 2021, Vol. 139, Proceedings of Machine Learning Research, pp. 1204–1215.
- Li, S.; Zhao, Y.; Varma, R.; Salpekar, O.; Noordhuis, P.; Li, T.; Paszke, A.; Smith, J.; Vaughan, B.; Damania, P.; et al. Pytorch distributed: Experiences on accelerating data parallel training. arXiv preprint arXiv:2006.15704 2020.






| Model | Accuracy | Precision | Recall | F1-Score |
|---|---|---|---|---|
| VGG16 | 0.9560 | 0.9583 | 0.9560 | 0.9533 |
| VGG19 | 0.9388 | 0.9419 | 0.9388 | 0.9344 |
| ResNet50 | 0.9786 | 0.9788 | 0.9786 | 0.9785 |
| ResNet101 | 0.9609 | 0.9631 | 0.9609 | 0.9570 |
| MobileNetV2 | 0.9589 | 0.9608 | 0.9589 | 0.9576 |
| GoogleNet | 0.9556 | 0.9556 | 0.9556 | 0.9553 |
| AlexNet | 0.9498 | 0.9608 | 0.9576 | 0.9576 |
| YOLOv8n | 0.9895 | 0.9896 | 0.9895 | 0.9894 |
| PlantGraphNet | 0.9897 | 0.9898 | 0.9897 | 0.9897 |
| Class Name | Precision | Recall | F1-Score |
|---|---|---|---|
| amm | 0.9960 | 1 | 0.9980 |
| calluna | 0.9886 | 0.9914 | 0.9900 |
| empetrum | 1 | 0.9863 | 0.9930 |
| grass | 1 | 0.9985 | 0.9992 |
| lichens | 1 | 0.9285 | 0.9629 |
| myrica | 0.9728 | 0.9748 | 0.9738 |
| nonveg | 1 | 1 | 1 |
| rosa rugosa | 0.9583 | 1 | 0.9787 |
| salix | 0.9838 | 0.9777 | 0.9808 |
| trees | 0.9556 | 0.9584 | 0.9570 |
| Weighted Avg | 0.9898 | 0.9897 | 0.9897 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
