Preprint Article, Version 1. Preserved in Portico. This version is not peer-reviewed.

Plant Identification Using Convolution Neural Network and Vision Transformer-Based Models

Version 1 : Received: 16 August 2023 / Approved: 17 August 2023 / Online: 18 August 2023 (08:28:28 CEST)

How to cite: Singh, V.; Rees, M.; Hampton, S.; Annadurai, S. Plant Identification Using Convolution Neural Network and Vision Transformer-Based Models. Preprints 2023, 2023081330. https://doi.org/10.20944/preprints202308.1330.v1

Abstract

Plant identification is a challenging task that aims to determine the family, genus, and species of a plant from its morphological features. Automated deep learning-based computer vision algorithms are widely used for identifying plants and can help users narrow down the possibilities. However, numerous morphological similarities between and within species make classification difficult. In this paper, we tested custom convolutional neural network (CNN) and vision transformer (ViT) based models, implemented in the PyTorch framework, for classifying plants. We used large datasets of 88K and 16K images to classify plants at the genus and species levels, respectively. Our results show that, at the genus level, ViT models outperform the CNN-based models ResNet50 and ResNet-RS-420, as well as other state-of-the-art CNN-based models suggested in previous studies on a similar dataset. The ViT model achieved a top accuracy of 83.3% at the genus level. ViT models also outperform ResNet50 and ResNet-RS-420 at the species level, with a top accuracy of 92.5%. We show that the correct set of augmentation techniques plays an important role in classification success.
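
To make the reported metric concrete, the following is a minimal sketch of how a classification accuracy such as the figures quoted above could be computed, assuming accuracy means the fraction of images whose highest-scoring class matches the true label. It is not the authors' code: the function name `top1_accuracy` and the toy scores are hypothetical, and plain Python is used instead of PyTorch so that the example stays self-contained.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the fraction of samples whose highest-scoring class equals the label.

    `scores[i][c]` is the (hypothetical) model score for class `c` on sample `i`.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score; ties resolve to the lowest class index.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return correct / len(labels)


# Toy data (illustrative only, not taken from the paper): 4 images, 3 classes.
toy_scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.2, 0.5, 0.3],  # predicted class 1
]
toy_labels = [1, 0, 1, 1]
toy_accuracy = top1_accuracy(toy_scores, toy_labels)
```

In a PyTorch evaluation loop the same quantity would typically be accumulated batch by batch over the held-out set; the per-sample comparison is the same.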

Keywords

Plant recognition; Image Processing; Convolution neural network; Vision transformer; Classification

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning
