Submitted:
16 March 2026
Posted:
17 March 2026
Read the latest preprint version here
Abstract

Keywords:
1. Introduction
1.1. Background of the Study
1.2. Problem Statement
1.3. Research Objectives
1.4. Research Questions
1.5. Significance of the Study
2. Literature Review
2.1. Learning Analytics in Education
2.2. Predictive Models in Education
2.3. Online Learning Environments
2.4. Key Variables Affecting Student Performance
2.5. Research Gap
3. Conceptual Framework
4. Methodology
4.1. Research Design
4.2. Data Source
4.3. Data Collection
| Data Type | Source | Derived Variables | Measurement Level |
|---|---|---|---|
| Student interaction logs | LMS activity reports | Login frequency, session duration, activity timestamps, resource access patterns | Ratio (counts, time) |
| Forum participation data | Discussion forum module | Number of posts, number of replies, threads viewed, post length | Ratio (counts) |
| Learning resource engagement | Content delivery logs | Video views, video completion rate, resource access frequency, time on task | Ratio (counts, percentage) |
| Assessment data | Gradebook module | Assignment scores, quiz scores, final grades, submission timeliness | Interval/ratio (percentages, points) |
| Course completion data | Registrar records | Final grade, pass/fail status, course withdrawal indicator | Nominal/ordinal |
| Course metadata | Course syllabi, LMS structure | Module count, assessment types, resource types, discussion requirements | Nominal |
4.4. Data Analysis Techniques
| Algorithm | Type | Strengths | Limitations | Primary Application |
|---|---|---|---|---|
| Logistic Regression | Classification | Highly interpretable, computationally efficient, provides probability estimates | Assumes linear relationships, may underperform with complex interactions | Binary pass/fail prediction |
| Decision Trees | Classification/Regression | Transparent, handles nonlinear relationships, no distributional assumptions | Prone to overfitting, unstable to data variations | Identifying decision rules for at-risk students |
| Random Forest | Ensemble | High accuracy, robust to overfitting, provides variable importance | Less interpretable, computationally intensive | Accurate prediction for early warning systems |
| Support Vector Machines | Classification | Effective in high-dimensional spaces, handles nonlinear relationships | Computationally intensive, difficult to interpret | Complex classification tasks with many predictors |
5. Results and Findings
Sample Characteristics and Descriptive Statistics
Model Prediction Accuracy
| Model | Accuracy | Precision | Recall | F1-Score | AUC-ROC |
|---|---|---|---|---|---|
| Logistic Regression | 0.782 | 0.754 | 0.721 | 0.737 | 0.814 |
| Decision Tree | 0.803 | 0.776 | 0.768 | 0.772 | 0.831 |
| Random Forest | 0.841 | 0.823 | 0.809 | 0.816 | 0.879 |
| Support Vector Machine | 0.827 | 0.809 | 0.791 | 0.800 | 0.858 |
| Gradient Boosting | 0.838 | 0.818 | 0.802 | 0.810 | 0.872 |
Key Predictors of Student Performance
| Rank | Predictor Variable | Mean Decrease in Accuracy | Theoretical Construct |
|---|---|---|---|
| 1 | Assignment submission rate | 0.142 | Goal attainment, time management |
| 2 | Login frequency (weekly average) | 0.118 | Sustained engagement, motivation |
| 3 | Video lecture completion rate | 0.097 | Content engagement, cognitive processing |
| 4 | Assignment submission timeliness | 0.089 | Planning, self-regulation |
| 5 | Forum posting frequency | 0.076 | Social engagement, help-seeking |
| 6 | Learning resource access diversity | 0.068 | Resource utilization, strategic learning |
| 7 | Session duration (average) | 0.054 | Time investment, depth of engagement |
| 8 | Quiz attempt frequency | 0.047 | Formative assessment engagement |
| 9 | Forum reading activity | 0.039 | Vicarious learning, social presence |
| 10 | Login regularity (variance) | 0.031 | Consistency, routine establishment |
Visualization of Learning Behavior Patterns
| Engagement Indicator | Top Quartile (n=87) | Bottom Quartile (n=86) | Effect Size (Cohen’s d) |
|---|---|---|---|
| Weekly login consistency (coefficient of variation) | 0.28 | 0.67 | 1.84 |
| Assignment submission rate | 0.98 | 0.71 | 1.62 |
| Average submission earliness (days before deadline) | 3.4 | 0.8 | 1.41 |
| Video lecture completion rate | 0.89 | 0.52 | 1.38 |
| Forum posts per week | 1.8 | 0.3 | 1.23 |
| Resource types accessed (count) | 6.2 | 3.1 | 1.19 |
| Peak engagement timing | Distributed | Deadline-concentrated | N/A |
6. Discussion
Comparison with Previous Research
Implications for Teachers
Implications for Online Course Design
7. Implications
Practical Implications
Early Warning Systems for Struggling Students
Improved Online Teaching Strategies
Policy Implications
Institutional Adoption of Learning Analytics
8. Limitations
Data Limited to One Institution
Limited Variables
9. Future Research Directions
Cross-Institution Datasets
AI-Based Adaptive Learning Systems
10. Conclusion
The Importance of Learning Analytics
Key Predictors of Student Performance
Potential Benefits for Online Education
References
- Alalawi, K.; Athauda, R.; Chiong, R. An extended learning analytics framework integrating machine learning and pedagogical approaches for student performance prediction and intervention. International Journal of Artificial Intelligence in Education 2024, 34. [Google Scholar] [CrossRef]
- Alhothali, A.; Albsisi, M.; Assalahi, H.; Aldosemani, T. Predicting student outcomes in online courses using machine learning techniques: A review. Sustainability 2022, 14(10), 6199. [Google Scholar] [CrossRef]
- Anthonysamy, L.; Koo, A. C.; Hew, S. H. Self-regulated learning strategies in higher education: Fostering digital literacy for sustainable lifelong learning. Education and Information Technologies 2020, 25(4), 2393–2414. [Google Scholar] [CrossRef]
- Bashiru, S.; Malgwi, Y. M. A machine learning-based early warning system for identifying at-risk university students in Nigerian higher education. MJSE 2026, 1–15. Available online: http://oer.tsuniversity.edu.ng/index.php/mjse/article/view/1759.
- Broadbent, J.; Poon, W. L. Self-regulated learning strategies in online learning environments: A systematic review. The Internet and Higher Education 2023, 57, 100890. [Google Scholar] [CrossRef]
- Clarin, A. S.; Baluyos, E. L. Challenges encountered in the implementation of online distance learning. EduLine: Journal of Education and Learning Innovation 2022, 2(1), 33–46. [Google Scholar] [CrossRef]
- Hubbard, K.; Amponsah, S. Feature engineering on LMS data to optimize student performance prediction. arXiv 2025, arXiv:2504.02916. [Google Scholar] [CrossRef]
- Jo, I. H.; Park, Y.; Lee, H. Learning analytics in higher education: A review of the literature from 2012 to 2022. Educational Technology Research and Development 2022, 70(3), 987–1015. [Google Scholar] [CrossRef]
- Liu, M.; Yu, D. Towards intelligent E-learning systems. Education and Information Technologies 2023, 28(7), 7845–7876. [Google Scholar] [CrossRef] [PubMed]
- Moreno-Marcos, P. M.; et al. Integration of multiple sources to anticipate student performance using learning analytics. In 2025 13th International Conference on Education Technology and Computers; 2025, IEEE; Available online: https://ieeexplore.ieee.org/document/11194780.
- Qiu, F.; et al. Predicting students’ performance in e-learning using learning process and behaviour data. Scientific Reports 2022, 12(1), 453. [Google Scholar] [CrossRef] [PubMed]
- Stratview Research. E-learning market trend, share & forecast 2022-2026 . 2026. Available online: https://www.stratviewresearch.com/2748/e-learning-market.html.
- Wong, J.; Baars, M.; Davis, D.; Van Der Zee, T.; Houben, G. J.; Paas, F. Supporting self-regulated learning in online learning environments and MOOCs: A systematic review. International Journal of Human-Computer Interaction 2021, 37(4), 309–327. [Google Scholar] [CrossRef]
- Wu, Y. Academic performance in the digital age: Rethinking student success through digital learning ecosystems. In Academic performance - Student success in the transformative digital age; Hermann, J. R., Ed.; IntechOpen, 2026. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).