4. Results
Step 1. Development of a dataset based on the prepared measurement scale
Table 2 presents a custom-developed measurement scale regarding the occurrence of employee voluntary turnover intentions (after the whitening process of grey numbers).
Additionally, for machine learning purposes in particular, the survey questionnaire included a question asking whether the respondent demonstrates an intention to leave their job voluntarily. The survey was conducted between August 1 and September 30, 2024. The sample included in the present study comprised 854 individuals.
Step 2. Construction of a structural model in which the latent variable is the occurrence of employee voluntary turnover intention
The developed SEM model consisted of two components:
Measurement model – the latent factor is voluntary turnover intention, onto which all 27 items are loaded. This model tests whether all 27 items can be reduced to a single component (turnover intention),
Structural model – this model tests the regression relationship between the 27 items and the label, which is the occurrence of turnover intention. The label is the dependent variable in this model, and all 27 items are predictors.
The key parameters of the measurement and structural models are presented in
Table 3.
In the measurement model, all loadings are statistically significant (p < 0.001) and generally high (> 0.8), which confirms that each indicator effectively reflects the latent construct. In the structural model, we examine the influence of this construct on the label (turnover intention). The negative coefficient (Estimate = –0.332, p < 0.001) indicates that a higher level of the latent construct is associated with a lower probability of turnover intention. Both variances are significant, suggesting meaningful variability in both the construct and the intention to leave. The SEM model fit indices are presented in
Table 4.
The overall model fit can be considered good despite the statistically significant Chi² test (p < .001)—a typical result for large samples. The key RMSEA index of 0.073 falls below the 0.08 threshold, indicating an acceptable approximation error. The CFI = 0.878 and TLI = 0.868, though slightly below the conventional 0.90 cutoff, still suggest satisfactory model fit. Additionally, GFI = 0.856, AGFI = 0.844, and NFI = 0.856 confirm that the model structure adequately reflects the data. The AIC and BIC values can be used for comparison with alternative models, but in themselves, they raise no concerns.
Step 3. Selection of the Best Machine Learning Algorithm for Predicting the Occurrence of Voluntary Employee Turnover
Following the methodology outlined in the previous section, a training process was carried out using the following algorithms: naive Bayes, linear and nonlinear support vector machines, decision trees, k-nearest neighbors, and logistic regression. Cross-validation was used in the analysis.
Table 5 presents the training process results for all algorithms, along with the standard deviations of the accuracy metric.
Based on the obtained results, it can be concluded that the analysed models perform well in predicting the occurrence of voluntary employee turnover intentions. Each of the analysed models demonstrates over 80% accuracy. For further research, the nonlinear support vector machine algorithm was the most effective of the analysed algorithms.
Step 4. Simulations of the impact of factor removal on the SEM model and on the effectiveness of the machine learning model
At the beginning of this step, simulations of the SEM model were conducted by successively excluding individual items from the scale. The results of the SEM model fit, measured by changes in the RMSEA index following successive reductions, are presented in
Table 6.
In the next step, the impact of removing successive variables on the accuracy metric of the best-performing model – the nonlinear support vector machine – was verified. The results of these simulations are presented in
Table 7.
Step 5. Improvement of the psychometric scale based on the conducted SEM-ML simulations
In the next step, those factors were identified whose potential removal neither worsens the fit of the SEM model (i.e., leads to a decrease in the RMSEA index or maintains it at the same level) nor reduces the predictive performance of the machine learning model (measured by the average accuracy value in the cross-validation method).
It was found that out of the 27 analysed items in the scale measuring turnover intention, three indicators meet the exclusion criteria. These factors are presented in
Table 8.
The table identifies three items (X₉, X₄, X₁₈) whose exclusion from the "voluntary turnover intentions" scale does not deteriorate either the measurement validity assessed by SEM (ΔRMSEA ≥ 0) or the predictive power of the best ML model (Δaccuracy ≤ 0). The removal of X₉ and X₁₈ even leads to a slight reduction in RMSEA without any change in accuracy, while the removal of X₄ results in the most considerable reduction in RMSEA (–0.00506) with a negligible impact on classification performance (–0.00001), making these items natural candidates for elimination. As a result, a shorter and more structurally compact scale is obtained, while still maintaining satisfactory SEM fit indices and high predictive power.
In the final step, a SEM model was constructed, and the accuracy metric was calculated for the measurement scale after removing factors X₉, X₄, and X₁₈.
Table 9 presents the fit indices of the new, simplified SEM model for voluntary employee turnover intention.
The results presented in the table for the simplified SEM model (after removing X₉, X₄, and X₁₈) show a clear improvement in all key fit indices compared to the initial model. RMSEA decreased from 0.073 to 0.065, indicating a significant enhancement in model quality. At the same time, CFI increased from 0.878 to 0.911, and TLI from 0.868 to 0.903 – both now exceed the commonly accepted threshold of 0.90, signaling a strong representation of the theoretical structure. GFI (0.856 → 0.890), AGFI (0.844 → 0.880), and NFI (0.856 → 0.890) also improved by more than 0.03 points, confirming the overall better quality of the model. Lower values of the information criteria AIC (from 107.4 to 97.0) and BIC (from 373.4 to 334.5) indicate that a more economical model was obtained with fewer parameters, offering a better balance between parsimony and accuracy.
The new machine learning model (nonlinear support vector machine), without variables X₉, X₄, and X₁₈, achieved an average accuracy metric (calculated via cross-validation) of 0.8630 with a standard deviation of 0.017565.
The predictive performance of the selected ML classifier (nonlinear SVM) for the shortened scale also appears promising – the mean accuracy increased from 0.862 to 0.863, and the standard deviation remained at a similar level. Although the accuracy gain is modest, it demonstrates that eliminating the three variables improved SEM fit without any loss, and even with a slight enhancement of the ML model's predictive power. These results confirm that the applied method for item selection achieves its intended trade-off: it yields a more concise and theoretically coherent measurement tool while maintaining (and even slightly improving) its practical utility in classifying turnover intentions.