ARTICLE | doi:10.20944/preprints202207.0280.v1
Subject: Engineering, General Engineering Keywords: Hyperspectral Technology; Non-destructive Testing; Soybean; Machine Learning; Support Vector Machine; Extreme Gradient Boosting; Tree-structured Parzen Estimator
Online: 19 July 2022 (07:12:32 CEST)
Soybean with insignificant differences in appearance have large differences in their internal physical and chemical components, therefore follow-up storage, transportation and processing require targeted differential treatment. A fast and effective machine learning method based on hyperspectral data of soybean for pattern recognition of categories is designed as a non-destructive testing method in this paper. A hyperspectral-image dataset with 2299 soybean seeds in 4 categories is collected; Ten features is selected by extreme gradient boosting algorithm from 203 hyperspectral bands in range 400 to 1000 nm; A Gaussian radial basis kernel function support vector machine with optimization by the Tree-structured Parzen Estimator algorithm is built as TPE-RBF-SVM model for pattern recognition of soybean categories. The metrics of TPE-RBF-SVM are significantly improved compared with other machine learning algorithms. The accuracy is 0.9165 in the independent test dataset which is 9.786% higher for vanilla RBF-SVM model and 10.02% higher than the extreme gradient boosting model.
ARTICLE | doi:10.20944/preprints202209.0239.v1
Subject: Engineering, General Engineering Keywords: Hyperspectral Technology; Non-destructive Testing; Black Soil; Ensemble learning; Support Vector Machine
Online: 16 September 2022 (07:40:27 CEST)
For the soil in different regions, the nutrient fertility contained in it is different, and the detection and zoning management of soil nutrients before tillage every year can improve grain yield. In this paper, an integrated learning strategy model based on black soil hyperspectral data is designed for rapid classification of organic matter content classification of black soil. Soil hyperspectral image dataset of Xiangyang Experimental Base was collected; by changing the internal structure of the stacking model, an LSVM-stacking model with (MLP, SVC, DTree, XGBl, kNN) five classifiers as the L1 layer was built, and the simulated annealing algorithm was used for hyperparameter optimization. Compared to other stacking models, the LSVM-stacking metrics are significantly improved. The accuracy rate of hyperparameter optimization is improved by 38.6515%, the accuracy rate of the independent test data set is 0.9488, and the comparison of individual learners can improve the recognition classification accuracy of label"1" to 1.0.