4. Results
The analytical processing was conducted on 71 valid observations and 40 original columns. After excluding the three items used to construct the target variable, the modeling system retained 37 numerical predictors associated with participant profile, perceived benefits, barriers, institutional conditions, financial factors, environmental criteria, and organizational capabilities. The main dependent variable was constructed as the average of the INT1, INT2, and INT3 items, generating the continuous Target_Intention_Score indicator. Additionally, a binary variable called Target_High_Intention was defined using a threshold value of 4.0 on the Likert scale, which made it possible to differentiate cases with high adoption or investment intention from cases with low or moderate intention. According to
Table 1, the final analytical configuration presents an almost balanced distribution between classes, with 36 cases of low or moderate intention and 35 cases of high intention, a methodologically favorable condition for training supervised classifiers because it reduces the risk of extreme bias toward a dominant class.
The empirical distribution of the target variable confirms that intention toward the adoption of biogas-solar microgrids does not behave as a marginal or exceptional variable within the sample, but rather as an outcome with substantive presence in almost half of the cases. According to
Figure 1, the intention score is concentrated around medium-high values, although without collapsing completely at the upper end of the scale. This pattern is relevant because it indicates that the phenomenon should not be interpreted only as general acceptance of the circular bioeconomy, but as a heterogeneous disposition modulated by organizational, environmental, financial, and institutional conditions. In modeling terms, this variability is especially important because it allows algorithms to learn decision boundaries and response gradients instead of operating on a database saturated by homogeneous responses.
The descriptive profile of the predictor blocks shows a consistently favorable valuation structure toward environmental components and perceived benefits, although with greater caution regarding barriers and some financial factors. According to
Table 2, the environmental block presents the highest aggregate mean among the observed constructs, with an average of 4.092, followed by perceived benefits with 4.028 and organizational capabilities with 3.984. These values suggest that environmental legitimacy and the perception of strategic usefulness of biogas-solar microgrids constitute mature dimensions within respondents’ imaginaries. However, the barriers block registers the lowest mean, with 3.387, indicating that obstacles do not disappear in the presence of a positive valuation of the technology; rather, they remain a structural dimension that may moderate the conversion of favorable attitudes into effective adoption intention.
The item-level analysis confirms that environmental valuation does not operate as a peripheral element, but rather as one of the model’s highest-intensity cores. The ENV1, ENV8, ENV3, ENV6, and ENV2 indicators are among the variables with the highest means, all above 4.09 on the scale, which evidences that the adoption of biogas-solar microgrids is strongly associated with an expectation of environmental contribution and operational sustainability. In contrast, BAR2 and BAR1 register the lowest averages, with means of 3.366 and 3.408, respectively. This gap suggests a central tension in the results: the field of application perceives high environmental and strategic benefits, but adoption still faces technical, economic, institutional, or implementation barriers that prevent an automatic transition from conceptual acceptance to the investment decision.
The correlation matrix between predictors and the continuous intention score reveals that organizational capabilities explain a substantive part of the observed variability in adoption intention. According to
Table 3, ORG4 shows the highest correlation with intention, with a coefficient of 0.680, followed by ORG5 with 0.657, ORG2 with 0.597, ENV3 with 0.590, ORG6 with 0.575, and ORG3 with 0.565. This result is technically relevant because it indicates that intention does not depend solely on environmental perception or technological attractiveness, but on the internal capacity of organizations to translate that opportunity into a viable decision.
Figure 2 complements this reading by showing that the main predictors are not randomly distributed, but rather form a correlational core in which organizational and environmental variables appear as closely articulated dimensions.
For the prediction of high adoption intention, eight supervised algorithms were compared: logistic regression with L2 regularization, Elastic Net logistic regression, radial-basis-function support vector machines, Extra Trees, Random Forest, Gradient Boosting, HistGradientBoosting, and histogram-based XGBoost. The comparison was performed through stratified cross-validation and hold-out evaluation, using a broad set of metrics that included accuracy, balanced accuracy, precision, recall, specificity, F1-score, ROC-AUC, average precision, Matthews correlation coefficient, Brier score, and log loss. According to
Table 4, the best overall model was ExtraTrees, selected for the highest combined performance in ROC-AUC, F1-score, balanced accuracy, and average precision. In cross-validation, ExtraTrees reached a mean accuracy of 0.723, balanced accuracy of 0.727, F1-score of 0.744, ROC-AUC of 0.819, and average precision of 0.862. In the test set, the same model reached accuracy of 0.833, balanced accuracy of 0.833, precision of 0.875, recall of 0.778, specificity of 0.889, F1-score of 0.824, ROC-AUC of 0.889, and average precision of 0.879.
The visual comparison of classifiers reinforces the selection of ExtraTrees as the model with the best balance between discrimination and stability. According to
Figure 3, tree-ensemble models dominate overall predictive performance, especially ExtraTrees and Random Forest, whereas linear models show a more limited capacity to capture complex relationships among predictors. This difference is methodologically coherent with the nature of the problem, since the adoption intention of biogas-solar microgrids probably responds to nonlinear interactions among organizational capabilities, financial constraints, environmental legitimacy, and institutional conditions. The fact that ExtraTrees outperforms linear models and other boosting algorithms suggests that the decision structure benefits from flexible partitions and randomized tree aggregation, especially in a small sample with multiple ordinal predictors.
The ROC curve of the best classifier confirms high discriminant capacity to distinguish between organizations with high intention and those with low or moderate intention. According to
Figure 4, the area under the curve in the test set was 0.889, which indicates that the model assigns a higher risk or probability score to positive cases in a proportion considerably greater than expected by chance. This result is particularly important because the objective of the model is not only to correctly classify observed cases but also to build an analytical tool capable of prioritizing organizational profiles with a higher probability of moving toward adoption or investment in circular bioenergy solutions. From an applied perspective, a ROC-AUC of 0.889 suggests solid capacity to rank cases according to propensity, which may be useful in policy-targeting scenarios, pilot-organization selection, or the design of differentiated incentives.
The precision-recall analysis adds a more demanding reading of the model’s capacity to recover high-intention cases without inflating false positives. According to
Figure 5, ExtraTrees achieved an average precision of 0.879 in testing, a performance consistent with the observed balance between precision and recall. In substantive terms, this means that the model not only distinguishes reasonably well between classes but also maintains a high capacity to identify relevant positive cases when attention focuses on the class of greatest practical interest: organizations with high willingness to adopt. This aspect is critical for energy-transition and circular-bioeconomy studies because classification errors do not have the same interpretive cost. Classifying an organization as highly willing when it still lacks sufficient conditions may lead to inefficient interventions; conversely, omitting organizations with high intention may limit the identification of early actors for technological scaling processes.
The confusion matrix makes it possible to observe the concrete distribution of correct and incorrect classifications produced by the selected classifier. According to
Figure 6, ExtraTrees correctly classified 8 negative cases and 7 positive cases, with only 1 false positive and 2 false negatives. This configuration explains the observed balance between specificity of 0.889 and recall of 0.778. The result suggests that the model is slightly more conservative in declaring high-intention cases than in identifying low- or moderate-intention cases, which may be a desirable property when the objective is to avoid overestimating system readiness to adopt biogas-solar microgrids. However, the two false negatives also indicate that there are profiles with high intention that the model does not capture, possibly because they combine favorable signals in some blocks with constraints or atypical patterns in others. Consequently, the model should be understood as a decision-support system, not as a substitute for individual technical evaluation.
Calibration assessment shows an additional component of predictive quality, since it is not sufficient to rank cases correctly; estimated probabilities must also correspond reasonably with the observed frequency of the event. According to
Figure 7, the calibration curve makes it possible to assess whether the probabilities generated by ExtraTrees tend to underestimate or overestimate high adoption intention. Given the reduced size of the test set, this reading should be considered exploratory, but it remains methodologically valuable because it incorporates a probabilistic dimension that goes beyond dichotomous accuracy. For the technical study, this result strengthens the presentation of the model by demonstrating that the evaluation is not limited to conventional classification metrics but incorporates discriminant performance, positive-case recovery capacity, error balance, and probabilistic quality.
The regression route was used as a complementary analysis to estimate the continuous intention score, not only its binary version. In this case, RidgeCV, ElasticNetCV, radial-basis-function SVR, ExtraTreesRegressor, RandomForestRegressor, GradientBoostingRegressor, HistGradientBoostingRegressor, and XGBoost_Regressor_Hist were compared. The cross-validation comparison shows that the best performance corresponded to RandomForestRegressor, with a mean RMSE of 0.453, MAE of 0.358, mean R² of 0.453, explained variance of 0.534, and MAPE of 0.100. ExtraTreesRegressor showed practically equivalent performance, with RMSE of 0.454, MAE of 0.348, R² of 0.455, and explained variance of 0.532. According to
Table 5, ensemble models again outperform most linear or boosting alternatives, confirming that the empirical structure of the phenomenon presents relationships among predictors that are not strictly additive.
The graphical comparison of regression models confirms the superiority of Random Forest- and Extra Trees-type ensembles. According to
Figure 8, the distance between the leading models and the remaining algorithms is not extreme, but it is consistent across error metrics. This suggests that the intention score can be approximated with a reasonable level of precision, although continuous prediction is naturally more demanding than binary classification. From the substantive standpoint, this result indicates that adoption intention should not be understood only as a dichotomous category but also as a gradient of readiness or disposition. Nevertheless, the regression-test results sheet recorded a compatibility error in the RMSE calculation with the squared argument; therefore, the interpretation of regression is supported mainly by cross-validation, which is the most stable comparative evidence available in the exported file.
The relationship between observed and predicted values of the best regressor makes it possible to visually evaluate the model’s capacity to approximate the continuous intention score. According to
Figure 9, the expected pattern in a useful model is the concentration of points around the reference diagonal, which would indicate correspondence between observed and estimated intention. In this case, the cross-validation evidence suggests a moderate explanatory capacity, with R² close to 0.45; therefore, the point cloud should be interpreted as a reasonable but imperfect predictive approximation. In applied research terms, this result is useful because it supports the assertion that organizational, environmental, financial, and institutional predictors capture a significant portion of intention, although there are still unobserved components that may be associated with technological, regulatory, cultural, or actual biomass- and capital-availability factors.
The residual distribution complements the evaluation of the regression model by showing whether errors are concentrated around zero or whether relevant asymmetries are present. According to
Figure 10, the technical reading should focus on the shape of the distribution and the presence of tails or systematic deviations. A reasonably centered residual distribution suggests the absence of severe global bias, whereas pronounced tails would indicate atypical cases in which the model underestimates or overestimates intention. Because the dependent variable is constructed from an aggregated Likert scale, small deviations may have important substantive interpretation, especially when working with technological-adoption decisions. Therefore, residual evaluation should not be treated as a secondary formal requirement, but rather as evidence of model stability in the face of organizational heterogeneity.
The plot of residuals against predicted values allows examination of possible heteroscedasticity patterns or systematic errors across the estimated range of intention. According to
Figure 11, the absence of a clearly curved structure or funnel-shaped pattern would be consistent with acceptable predictive behavior. If errors intensify at the upper or lower extremes, this would suggest that the model predicts intermediate cases better than extreme adoption profiles. This reading is important because, in studies of microgrids and circular transition, extreme cases are often the most relevant for public policy and strategic management: those with very high intention may act as early adopters, whereas those with low intention may represent segments where critical financing, information, or organizational-capability barriers are concentrated.
Variable-importance analysis adds a layer of explainability to the classification model. According to
Table 6, ORG4 was the most important predictor according to both permutation importance and the model’s native importance, with values of 0.049 and 0.091, respectively. In addition, ORG6, ENV6, ENV1, ENV2, ORG5, ENV5, ORG1, ORG3, FIN7, and ENV3 appear among the predictors with the greatest contribution by permutation. In technical terms, the convergence among bivariate correlation, permutation importance, and native importance reinforces the interpretive robustness of the organizational and environmental variables. The result should not be read as causality, but rather as evidence that the prediction of high intention depends especially on internal capabilities and environmental criteria that act as readiness signals for translating the circular bioeconomy into a concrete technological decision.
Figure 12 visualizes the hierarchy of predictors from a model-perturbation perspective: a variable is more important when its randomization deteriorates predictive performance to a greater extent. According to
Figure 12, the predominance of ORG and ENV variables confirms that the model is not simply capturing general positive responses, but rather a pattern of organizational readiness and environmental orientation. This distinction is fundamental to the study’s contribution because it places the adoption of biogas-solar microgrids in a space that is more complex than the mere perception of benefits. The results suggest that organizations with higher intention not only value sustainability but also present internal capacity signals to operate, absorb, or manage the transition toward circular energy schemes.
Finally, the unsupervised analysis explored the existence of latent profiles within the sample. Solutions with two, three, and four clusters were evaluated using the silhouette coefficient. According to
Table 7, the two-cluster solution obtained the best silhouette value, with 0.208, above the three-cluster solution, with 0.123, and the four-cluster solution, with 0.097. Although the coefficient does not indicate strong separation, it does suggest that the most interpretable structure of the data is organized into two general profiles. This segmentation is consistent with the classificatory objective of the study because it differentiates a group with lower readiness and intention from another group with greater alignment toward the adoption of biogas-solar microgrids.
The cluster profile confirms that the segmentation differentiates two substantively relevant groups. According to
Table 8, cluster 0 presents lower means across all aggregate blocks, with an intention score of 3.468 and a proportion of high intention of 0.324. In contrast, cluster 1 shows higher means in benefits, institutional conditions, financial factors, environmental criteria, and organizational capabilities, with an intention score of 4.216 and a proportion of high intention of 0.676. This difference suggests that high adoption intention does not emerge from a single isolated factor, but from a systemic configuration in which perceived benefits, environmental legitimacy, institutional support, financial feasibility, and organizational capabilities mutually reinforce one another. Consequently, the adoption of biogas-solar microgrids should be interpreted as a phenomenon of sociotechnical and organizational maturity, not merely as a technological preference.
The PCA representation of the clusters offers a visual reading of the latent structure of the sample. According to
Figure 13, the separation between groups should not be understood as a rigid boundary, but as a gradual differentiation of profiles in a reduced principal-component space. This evidence is coherent with the supervised results: adoption intention presents sufficient structure to be modeled, but preserves internal heterogeneity. Partial overlap between profiles may indicate that some organizations share similar environmental perceptions or benefits while differing in internal capabilities, financial conditions, or institutional maturity. Therefore, cluster analysis does not replace the predictive model, but complements it by showing that the transition toward biogas-solar microgrids can be grouped into differentiated strategic segments.
The results show that the intention to adopt biogas-solar microgrids within a circular bioeconomy approach can be modeled with technically solid performance through machine-learning algorithms, especially tree ensembles. The superiority of ExtraTrees in classification and RandomForestRegressor in regression suggests that the phenomenon has a nonlinear structure, in which organizational and environmental variables operate as central predictive determinants. The combined evidence from correlations, permutation importance, ROC-AUC performance, average precision, confusion matrix, calibration, residual analysis, and PCA-cluster segmentation supports the argument that high intention does not depend solely on recognizing environmental benefits, but on a broader configuration of organizational capabilities, perceived feasibility, and strategic alignment with sustainability criteria. This result strengthens the technical contribution of the study because it transforms a Likert-type perceptual matrix into an interpretable predictive system capable of identifying profiles with greater propensity for adoption, prioritizing critical variables, and providing quantitative evidence for designing implementation strategies for circular microgrids based on biogas and solar energy.