3. Results
This study examined the effects of the AI-FIT digital game-based physical activity program on the health-related physical fitness of elementary school students by analyzing pre- and post-intervention changes in both the experimental and control groups. The results indicated that the experimental group showed significant improvements in multiple fitness indicators, including body mass index (BMI), flexibility, muscular endurance, and overall PAPS grade. In contrast, the control group demonstrated little to no improvement in most variables and, in some cases, even a decline in performance. The key findings are summarized as follows.
3.1. Changes in Body Mass Index (BMI)
Analysis of BMI changes revealed a statistically significant improvement in the experimental group. The mean BMI decreased from 24.46 (±5.06) kg/m² at pre-test to 23.69 (±5.21) kg/m² at post-test (t = 3.13, p = .006), with a mean change of –0.77 (±1.08). The effect size (Cohen’s d) was 0.72, indicating a moderate-to-large effect.
In contrast, the control group showed no statistically significant change in BMI, with pre- and post-test means of 23.32 (±5.69) kg/m² and 23.18 (±5.90) kg/m², respectively (t = 0.71, p = .487). The effect size was –0.17, considered negligible.
These results suggest that the AI-FIT program had a positive effect on improving body composition among elementary school students.
Table 3.
Pre–post comparison of BMI changes between the experimental and control groups following participation in the AI-FIT program.
Table 3.
Pre–post comparison of BMI changes between the experimental and control groups following participation in the AI-FIT program.
| Variable |
Group |
Pre_Mean_Exp (SD) |
Post_Mean_Exp (SD) |
Δ(Post–Pre)
|
t |
p-value |
Cohen's d |
| BMI (kg/m²) |
Experimental |
24.46 ± 5.06 |
23.69 ± 5.21 |
-0.77 ± 1.08 |
3.13 |
0.006** |
-0.72 |
| Control |
23.32 ± 5.69 |
23.18 ± 5.90 |
-0.14 ± 0.86 |
0.71 |
0.487 |
-0.17 |
Figure 2.
Pre-post changes in BMI by group.
Figure 2.
Pre-post changes in BMI by group.
In the gender-specific subgroup analysis within the experimental group, male students showed a statistically significant reduction in BMI, decreasing from a pre-test mean of 24.90 kg/m² (±5.30) to a post-test mean of 23.78 kg/m² (±5.01) (p = .003). Female students also exhibited a decrease in BMI, from 24.50 kg/m² (±5.40) to 24.06 kg/m² (±5.33), but the change did not reach statistical significance (p = .056). These findings suggest that the AI-FIT program had a more pronounced effect on improving body composition in male students.
Table 4.
Pre–post comparison of BMI between male and female subgroups within the experimental group.
Table 4.
Pre–post comparison of BMI between male and female subgroups within the experimental group.
| Variable |
Boy |
p-value |
Girl |
p-value |
| Pre_Mean_Boy (SD) |
Post_Mean_Boy (SD) |
Pre_Mean_Girl (SD) |
Post_Mean_Girl (SD) |
| BMI (kg/m²) |
24.9 ± 5.3 |
23.78 ± 5.01 |
.003 ** |
24.5 ± 5.4 |
24.06 ± 5.33 |
.056 |
Figure 3.
Pre-post changes in BMI by gender (experimental group).
Figure 3.
Pre-post changes in BMI by gender (experimental group).
3.2. Changes in Flexibility
Flexibility was assessed using the sit-and-reach test. In the experimental group, the mean flexibility score significantly increased from 6.10 cm (±6.33) at pre-test to 8.65 cm (±5.89) at post-test (t = –4.03, p = .001), with a mean change of +2.55 cm (±2.83). The effect size (Cohen’s d) was 0.90, indicating a large effect.
In contrast, the control group showed a slight decrease in flexibility from 7.95 cm (±7.83) to 7.13 cm (±8.82), which was not statistically significant (t = 0.70, p = .491). The effect size was –0.16, indicating a negligible effect.
These results suggest that digital game-based physical activity can be particularly effective in improving flexibility among children.
Table 5.
Pre–post comparison of flexibility changes between the experimental and control groups following participation in the AI-FIT program.
Table 5.
Pre–post comparison of flexibility changes between the experimental and control groups following participation in the AI-FIT program.
| Variable |
Group |
Pre_Mean_Exp (SD) |
Post_Mean_Exp (SD) |
Δ(Post–Pre)
|
t |
p-value |
Cohen's d |
| Flexibility (cm) |
Experimental |
6.10 ± 6.33 |
8.65 ± 5.89 |
+2.55 ± 2.83 |
-4.03 |
0.001*** |
0.90 |
| Control |
7.95 ± 7.83 |
7.13 ± 8.82 |
-0.82 ± 5.23 |
0.70 |
0.491 |
-0.16 |
Figure 4.
Pre-post changes in flexibility by gender (experimental group).
Figure 4.
Pre-post changes in flexibility by gender (experimental group).
In the gender-specific subgroup analysis within the experimental group, both male and female students demonstrated statistically significant improvements in flexibility. Male students improved from a pre-test mean of 8.00 cm (±5.21) to a post-test mean of 10.67 cm (±4.97) (p = .019), while female students improved from 7.91 cm (±5.81) to 10.36 cm (±6.38) (p = .003). These findings suggest that the AI-FIT program consistently had a positive effect on enhancing flexibility in children, regardless of gender.
Table 6.
Pre–post comparison of flexibility between male and female subgroups within the experimental group.
Table 6.
Pre–post comparison of flexibility between male and female subgroups within the experimental group.
| Variable |
Boy |
p-value |
Girl |
p-value |
| Pre_Mean_Boy (SD) |
Post_Mean_Boy (SD) |
Pre_Mean_Girl (SD) |
Post_Mean_Girl (SD) |
| Flexibility (cm) |
8.00 ± 5.21 |
10.67 ± 4.97 |
.019 * |
7.91 ± 5.81 |
10.36 ± 6.38 |
.003 ** |
Figure 5.
Pre-post changes in flexibility by group.
Figure 5.
Pre-post changes in flexibility by group.
3.3. Changes in Muscular Endurance
Muscular endurance was assessed by the number of sit-ups performed within one minute. In the experimental group, the mean number of sit-ups significantly increased from 22.15 (±11.95) at pre-test to 38.00 (±26.49) at post-test (t = –3.58, p = .002), with a mean change of +15.85 (±19.78). The effect size (Cohen’s d) was 0.80, indicating a large effect.
In contrast, the control group showed an increase from 46.32 (±36.03) to 53.84 (±60.81), but the difference was not statistically significant (t = –0.54, p = .597), and the effect size was only 0.12, indicating a negligible effect.
These findings suggest that the AI-FIT program had a positive effect on improving muscular endurance in elementary school students.
Table 7.
Pre–post comparison of muscular endurance changes between the experimental and control groups following participation in the AI-FIT program.
Table 7.
Pre–post comparison of muscular endurance changes between the experimental and control groups following participation in the AI-FIT program.
| Variable |
Group |
Pre_Mean_Exp (SD) |
Post_Mean_Exp (SD) |
Δ(Post–Pre)
|
t |
p-value |
Cohen's d |
| muscular endurance(rep) |
Experimental |
22.15 ± 11.95 |
38.00 ± 26.49 |
+15.85 ± 19.78 |
-3.58 |
0.002** |
0.80 |
| Control |
46.32 ± 36.03 |
53.84 ± 60.81 |
+7.53 ± 60.98 |
-0.54 |
0.597 |
0.12 |
Figure 6.
Pre-post changes in muscle endurance by group.
Figure 6.
Pre-post changes in muscle endurance by group.
In the gender-specific subgroup analysis within the experimental group, changes in muscular endurance were not statistically significant for either male or female students. Male students showed a decrease from a pre-test mean of 35.11 repetitions (±16.62) to a post-test mean of 30.11 repetitions (±15.35), which was not statistically significant (p = .083). Female students also demonstrated a decline, from 30.36 repetitions (±13.32) to 26.36 repetitions (±13.63), with the result approaching but not reaching statistical significance (p = .058).
These findings suggest that while muscular endurance significantly improved at the overall experimental group level, statistically significant changes were not observed within gender-specific subgroups.
Table 8.
Pre–post comparison of muscular endurance between male and female subgroups within the experimental group.
Table 8.
Pre–post comparison of muscular endurance between male and female subgroups within the experimental group.
| Variable |
Boy |
p-value |
Girl |
p-value |
| Pre_Mean_Boy (SD) |
Post_Mean_Boy (SD) |
Pre_Mean_Girl (SD) |
Post_Mean_Girl (SD) |
| muscular endurance |
35.11 ± 16.62 |
30.11 ± 15.35 |
.083 |
30.36 ± 13.32 |
26.36 ± 13.63 |
.058 |
Figure 7.
Pre-post changes in muscle endurance by gender (experimental group).
Figure 7.
Pre-post changes in muscle endurance by gender (experimental group).
3.4. Changes in Overall Physical Fitness (PAPS Grade)
Although the PAPS grade is an ordinal scale ranging from 1 to 5, it was treated as an interval scale in this study to allow for the calculation of means and standard deviations. The PAPS (Physical Activity Promotion System) consists of five levels, where Grade 1 indicates excellent physical fitness and Grade 5 indicates poor fitness. Thus, a decrease in the PAPS grade represents an improvement in overall health-related fitness.
In the experimental group, the mean PAPS grade significantly improved from 2.90 (±0.64) at pre-test to 2.75 (±0.53) at post-test (t = 3.11, p = .006), with an effect size (Cohen’s d) of 0.69, indicating a moderate-to-large effect. In contrast, the control group showed no significant change, with the mean grade slightly increasing from 2.80 (±0.65) to 2.85 (±0.70) (t = –0.41, p = .684), and a negligible effect size of –0.07.
Additionally, a nonparametric Wilcoxon signed-rank test was conducted to further validate the findings. The results revealed a statistically significant improvement in the experimental group (Z = –3.720, p < .001), reinforcing the reliability of the observed enhancement in overall physical fitness as measured by the PAPS grade.
Table 9.
Pre–post comparison of overall physical fitness (PAPS grade) between the experimental and control groups following participation in the AI-FIT program.
Table 9.
Pre–post comparison of overall physical fitness (PAPS grade) between the experimental and control groups following participation in the AI-FIT program.
| Variable |
Group |
Pre_Mean_Exp (SD) |
Post_Mean_Exp (SD) |
Δ(Post–Pre)
|
t |
p-value |
Cohen's d |
| PAPS Score |
Experimental |
3.55 ± 0.60 |
2.75 ± 0.91 |
-0.80 ± 1.15 |
3.11 |
0.006** |
-0.69 |
| Control |
2.89 ± 0.74 |
2.47 ± 1.17 |
-0.42 ± 0.77 |
2.39 |
0.028* |
-0.55 |
Figure 8.
Pre-post changes in PAPS grade by group.
Figure 8.
Pre-post changes in PAPS grade by group.
In the gender-specific subgroup analysis within the experimental group, both male and female students demonstrated statistically significant improvements in PAPS grade. Male students improved from a pre-test mean of 3.00 (±0.87) to a post-test mean of 2.67 (±0.87) (p = .013), while female students showed an even greater improvement, from 2.82 (±0.70) to 2.73 (±0.74) (p = .001). These results suggest that the AI-FIT program had a positive effect on enhancing overall physical fitness in children, regardless of gender.
Table 10.
Pre–post comparison of overall physical fitness (PAPS grade) between male and female subgroups within the experimental group.
Table 10.
Pre–post comparison of overall physical fitness (PAPS grade) between male and female subgroups within the experimental group.
| Variable |
Boy |
p-value |
Girl |
p-value |
| Pre_Mean_Boy (SD) |
Post_Mean_Boy (SD) |
Pre_Mean_Girl (SD) |
Post_Mean_Girl (SD) |
| PAPS Score |
3.00 ± 0.87 |
3.67 ± 0.87 |
.013 * |
2.82 ± 0.70 |
3.73 ± 0.74 |
.001 *** |
Figure 9.
Pre-post changes in PAPS grade by gender (experimental group).
Figure 9.
Pre-post changes in PAPS grade by gender (experimental group).
3.5. Changes in Cardiorespiratory Endurance
Cardiorespiratory endurance was assessed using a standardized shuttle run test (Physical Endurance Index, PEI). In the experimental group, the mean score slightly decreased from 65.77 (±21.98) at pre-test to 62.95 (±18.88) at post-test, but this change was not statistically significant (t = 0.52, p = .607). The mean change was –2.82 (±23.54), and the effect size (Cohen’s d) was –0.12, indicating a negligible effect.
In contrast, the control group showed a statistically significant decrease, with the mean score dropping from 77.54 (±17.28) to 68.50 (±17.22) (t = 3.36, p = .002). The mean change was –9.04 (±17.20), and the effect size was –0.56, indicating a moderate decline.
These findings suggest that while the AI-FIT program did not lead to significant improvements in cardiorespiratory endurance, it may have played a role in mitigating the decline observed in the control group.
Table 11.
Pre–post comparison of cardiorespiratory endurance between the experimental and control groups following participation in the AI-FIT program.
Table 11.
Pre–post comparison of cardiorespiratory endurance between the experimental and control groups following participation in the AI-FIT program.
| Variable |
Group |
Pre_Mean_Exp (SD) |
Post_Mean_Exp (SD) |
Δ(Post–Pre)
|
t |
p-value |
Cohen's d |
| Endurance (PEI) |
Experimental |
65.77 ± 21.98 |
62.95 ± 18.88 |
-2.82 ± 23.54 |
0.52 |
0.607 |
-0.12 |
| Control |
77.54 ± 17.28 |
68.54 ± 36.03 |
-9.00 ± 30.50 |
1.22 |
0.241 |
-0.30 |
Figure 10.
Pre-post changes in cardiorespiratory endurance by group.
Figure 10.
Pre-post changes in cardiorespiratory endurance by group.
In the gender-specific subgroup analysis within the experimental group, male students showed a slight, non-significant increase in cardiorespiratory endurance, from a pre-test mean of 30.33 repetitions (±11.42) to a post-test mean of 31.11 repetitions (±13.18) (p = .683). Female students also demonstrated an upward trend, increasing from 28.45 repetitions (±15.13) to 33.00 repetitions (±13.98), but the change did not reach statistical significance (p = .091). These results indicate that no clear improvements in cardiorespiratory endurance were observed in either gender subgroup.
Table 12.
Pre–post comparison of cardiorespiratory endurance between male and female subgroups within the experimental group.
Table 12.
Pre–post comparison of cardiorespiratory endurance between male and female subgroups within the experimental group.
| Variable |
Boy |
p-value |
Girl |
p-value |
| Pre_Mean_Boy (SD) |
Post_Mean_Boy (SD) |
Pre_Mean_Girl (SD) |
Post_Mean_Girl (SD) |
| Endurance (count) |
30.33 ± 11.42 |
31.11 ± 13.18 |
.683 |
28.45 ± 15.13 |
33.00 ± 13.98 |
.091 |
Figure 11.
Pre-post changes in cardiorespiratory endurance by gender (experimental group).
Figure 11.
Pre-post changes in cardiorespiratory endurance by gender (experimental group).
3.6. Changes in Power
Power was evaluated using the standing long jump test. In the experimental group, the mean jump distance slightly decreased from 124.60 cm (±25.30) at pre-test to 123.80 cm (±47.08) at post-test, but this change was not statistically significant (t = 0.07, p = .941). The mean change was –0.80 cm (±47.76), and the effect size (Cohen’s d) was –0.02, indicating virtually no effect.
Similarly, the control group showed a decline from a pre-test mean of 118.85 cm (±34.10) to a post-test mean of 108.25 cm (±51.06), which also was not statistically significant (t = 0.72, p = .480). The mean change was –10.60 cm (±65.73), and the effect size was –0.16, suggesting only a minimal effect.
These findings suggest that the digital game-based physical activity program used in this study may not have been sufficient to produce measurable improvements in power within a short-term intervention period.
Table 13.
Pre–post comparison of power between the experimental and control groups following participation in the AI-FIT program.
Table 13.
Pre–post comparison of power between the experimental and control groups following participation in the AI-FIT program.
| Variable |
Group |
Pre_Mean_Exp (SD) |
Post_Mean_Exp (SD) |
Δ(Post–Pre)
|
t |
p-value |
Cohen's d |
| Power (cm) |
Experimental |
124.60 ± 25.30 |
123.80 ± 47.08 |
-0.80 ± 47.76 |
0.07 |
0.941 |
-0.02 |
| Control |
118.85 ± 34.10 |
108.25 ± 51.06 |
-10.60 ± 65.73 |
0.72 |
0.480 |
-0.16 |
Figure 12.
Pre-post changes in power by group.
Figure 12.
Pre-post changes in power by group.
In the gender-specific subgroup analysis within the experimental group, male students showed virtually no change in power, with a pre-test mean of 152.78 cm (±30.15) and a post-test mean of 152.83 cm (±31.02) (p = .999). Female students showed a slight decline, from 142.18 cm (±25.80) to 140.00 cm (±21.48), but the change was not statistically significant (p = .414). These findings indicate that the program did not have a notable effect on improving power in either gender subgroup.
Table 14.
Pre–post comparison of power between male and female subgroups within the experimental group.
Table 14.
Pre–post comparison of power between male and female subgroups within the experimental group.
| Variable |
Boy |
p-value |
Girl |
p-value |
| Pre_Mean_Boy (SD) |
Post_Mean_Boy (SD) |
Pre_Mean_Girl (SD) |
Post_Mean_Girl (SD) |
| Power (cm) |
152.78 ± 30.15 |
152.83 ± 31.02 |
.999 |
142.18 ± 25.80 |
140.00 ± 21.48 |
.414 |
Figure 13.
Pre-post changes in power by gender (experimental group).
Figure 13.
Pre-post changes in power by gender (experimental group).