Submitted:
24 May 2026
Posted:
25 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Why LLMs Change the Problem: Fluency, Delegation, and the Governance of Judgment
2.1. Fluency as Epistemic Risk
2.2. Delegation Without Visibility
2.3. From Outputs to Consequences
2.4. Why Stewardship Becomes Unavoidable
3. Generative AI and Learning Analytics
3.1. Areas of Robust Technical Performance
3.2. Limited Pedagogical and Institutional Effects
3.3. Inflation of Interpretive and Pedagogical Claims
3.4. Implications for the Present Argument
3.5. Generative AI and the Transformation of Data Science Work
4. Stewardship as a Paradigm for Educational Data Science
4.1. Stewardship as the Governance of Judgment
4.2. Core Commitments of a Stewardship Paradigm
4.3. Operationalizing Stewardship
4.4. Implications for Design, Practice and the Field
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| EDM | Educational Data Mining |
| AIED | Artificial Intelligence in Education |
| GenAI | Generative AI |
| LLM | Large Language Model |
| LA | Learning Analytics |
References
- Design Recommendations for Intelligent Tutoring Systems. In - Intelligent Tutoring Systems with Generative AI; Sinatra, A.M., Rus, V., Lawton, P., Graesser, A.C., Eds.; US Army Combat Capabilities Development Command - Soldier Center: Orlando, FL, USA, 2025; Volume 12. [Google Scholar]
- Skinner, B.F. Teaching machines. Science 1958, 128, 969–977. [Google Scholar] [CrossRef]
- Khosravi, H.; Viberg, O.; Kovanović, V.; Ferguson, R. Generative AI and learning analytics. J. Learn. Anal. 2023, 10, 1–6. [Google Scholar] [CrossRef]
- Misiejuk, K.; López-Pernas, S.; Kaliisa, R.; Saqr, M. Mapping the landscape of generative artificial intelligence in learning analytics: A systematic literature review. J. Learn. Anal. 2025, 12, 12–31. [Google Scholar] [CrossRef]
- Yan, L.; Martinez-Maldonado, R.; Gašević, D. Generative Artificial Intelligence in Learning Analytics: Contextualising Opportunities and Challenges through the Learning Analytics Cycle. In Proceedings of the 14th Learning Analytics and Knowledge Conference, 2024; pp. 101–111. [Google Scholar] [CrossRef]
- Baker, R.S.; Inventado, P.S. Educational data mining and learning analytics. In Learning Analytics: From Research to Practice; Larusson, J.A., White, B., Eds.; Springer: New York, NY, USA, 2014; pp. 61–75. [Google Scholar] [CrossRef]
- Baker, R.S.; Siemens, G. Learning analytics and educational data mining. In The Cambridge Handbook of the Learning Sciences, 3rd ed.; Sawyer, R.K., Ed.; Cambridge University Press: Cambridge, UK, 2022; pp. 259–278. [Google Scholar] [CrossRef]
- Greller, W.; Drachsler, H. Translating learning into numbers: A generic framework for learning analytics. Educ. Technol. Soc. 2012, 15, 42–57. [Google Scholar]
- Clow, D. The learning analytics cycle: Closing the loop effectively. In Proceedings of the 2nd International Conference on Learning Analytics and Knowledge; ACM: New York, NY, USA, 2012; pp. 134–138. [Google Scholar] [CrossRef]
- Wise, A.F. Designing pedagogical interventions to support student use of learning analytics. In Proceedings of the 4th International Conference on Learning Analytics and Knowledge; ACM: New York, NY, USA, 2014; pp. 203–211. [Google Scholar]
- Azad, A.K.M.; Goodell, J.; Kessler, A.; Craig, S.D.; Saliah-Hassane, H. Learning Engineering - A System Design Approach for Engineering Education. In In Proceedings of the 2025 ASEE Annual Conference & Exposition, Montreal, QC, Canada, June 2025. [Google Scholar] [CrossRef]
- Baker, R.S.; Boser, U.; Snow, E. Learning engineering: A view on where the field is at, where it is going, and the research needed. Technol. Mind Behav. 2022, 3. [Google Scholar] [CrossRef]
- Learning Engineering Toolkit: Evidence-Based Practices from the Learning Sciences, Instructional Design, and Beyond; Goodell, J., Kolodner, J.L., Eds.; Routledge: London, UK, 2023. [Google Scholar] [CrossRef]
- Pargman, T.C.; McGrath, C.; Viberg, O.; Knight, S. New vistas on responsible learning analytics: A data feminist perspective. J. Learn. Anal. 2023, 10, 133–148. [Google Scholar] [CrossRef]
- Prinsloo, P.; Slade, S. An elephant in the learning analytics room: The obligation to act. In Proceedings of the Seventh International Learning Analytics & Knowledge Conference, 2017; pp. 46–55. [Google Scholar] [CrossRef]
- Amershi, S.; Weld, D.; Vorvoreanu, M.; Fourney, A.; Nushi, B.; Collisson, P.; et al. Guidelines for human-AI interaction. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019. [Google Scholar] [CrossRef]
- Vasconcelos, H.; Jörke, M.; Grunde-McLaughlin, M.; Gerstenberg, T.; Bernstein, M.S.; Krishna, R. Explanations can reduce overreliance on AI systems during decision-making. Proc. ACM Hum.-Comput. Interact. 2023, 7, 1–38. [Google Scholar] [CrossRef]
- Kaliisa, R.; Misiejuk, K.; López-Pernas, S.; Khalil, M.; Saqr, M. Have learning analytics dashboards lived up to the hype? A systematic review of 38 empirical studies. In Proceedings of the 14th Learning Analytics and Knowledge Conference, 2024; pp. 716–726. [Google Scholar] [CrossRef]
- Lekan, K.; Pardos, Z.A. AI-augmented advising: A comparative study of GPT-4 and advisor-based major recommendations. J. Learn. Anal. 2025, 12, 110–128. [Google Scholar] [CrossRef]
- Yan, L.; Zhao, L.; Echeverria, V.; Jin, Y.; Alfredo, R.; Li, X.; et al. VizChat: Enhancing learning analytics dashboards with contextualised explanations using multimodal generative AI chatbots. In Artificial Intelligence in Education; Springer Nature Switzerland: Cham, Switzerland, 2024; pp. 180–193. [Google Scholar] [CrossRef]
- Madsen, A.; Chandar, S.; Reddy, S. Are self-explanations from large language models faithful? Findings of the Association for Computational Linguistics: ACL 2024. 2024, pp. 295–337. Available online: https://aclanthology.org/2024.findings-acl.19/.
- Parcalabescu, L.; Frank, A. On measuring faithfulness or self-consistency of natural language explanations. Proc. 62nd Annu. Meet. Assoc. Comput. Linguist. 2024, Volume 1, 6048–6089. [Google Scholar] [CrossRef]
- Khosravi, H.; Shibani, A.; Jovanovic, J.; Pardos, Z.A.; Yan, L. Generative AI and learning analytics: Pushing boundaries, preserving principles. J. Learn. Anal. 2025, 12, 1–11. [Google Scholar] [CrossRef]
- Long, Y.; Luo, H.; Zhang, Y. Evaluating large language models in analysing classroom dialogue. npj Sci. Learn. 2024, 9, 60. [Google Scholar] [CrossRef] [PubMed]
- Ochoa, X.; Huang, X.; Shao, Y. Exploring the potential of generative AI to support non-experts in learning analytics practice. J. Learn. Anal. 2025, 12, 65–90. [Google Scholar] [CrossRef]
- Bender, E.M.; Koller, A. Climbing towards NLU: On meaning, form, and understanding in the age of data. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020; pp. 5185–5198. Available online: https://aclanthology.org/2020.acl-main.463/.
- Bender, E.M.; Gebru, T.; McMillan-Major, A.; Shmitchell, S. On the dangers of stochastic parrots: Can language models be too big? In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, 2021; pp. 610–623. [Google Scholar] [CrossRef]
- Si, C.; Goyal, N.; Wu, T.; Zhao, C.; Feng, S.; Daumé, H., III; Boyd-Graber, J. Large Language Models Help Humans Verify Truthfulness - Except When They Are Convincingly Wrong. Proc. NAACL 2024, 2024, 1459–1474. [Google Scholar] [CrossRef]
- Zhang, T.; Zhang, M.; Low, W.Y.; Yang, X.J.; Li, B.A. Conversational explanations: Discussing explainable AI with non-AI experts. In Proceedings of the 30th International Conference on Intelligent User Interfaces, 2025; pp. 409–424. [Google Scholar] [CrossRef]
- Schoeffer, J.; De-Arteaga, M.; Kuehl, N. Explanations, fairness, and appropriate reliance in human-AI decision-making. In Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems, 2024. [Google Scholar] [CrossRef]
- Buçinca, Z.; Malaya, M.B.; Gajos, K.Z. To trust or to think: Cognitive forcing functions can reduce overreliance on AI in AI-assisted decision-making. Proc. ACM Hum.-Comput. Interact. 2021, 5, 1–21. [Google Scholar] [CrossRef]
- Klingbeil, A.; Grützner, C.; Schreck, P. Trust and reliance on AI - An experimental study on the extent and costs of overreliance on AI. Comput. Hum. Behav. 2024, 160, 108352. [Google Scholar] [CrossRef]
- Salvi, F.; Horta Ribeiro, M.; Gallotti, R.; West, R. On the conversational persuasiveness of GPT-4. Nat. Hum. Behav. 2025, 9, 1645–1653. [Google Scholar] [CrossRef] [PubMed]
- Liu, X.; Zambrano, A.F.; Baker, R.S.; Barany, A.; Ocumpaugh, J.; Zhang, J.; Pankiewicz, M.; Nasiar, N.; Wei, Z. Qualitative coding with GPT-4: Where it works better. J. Learn. Anal. 2025, 12, 169–185. [Google Scholar] [CrossRef]
- Chaleshtori, F.H.; Ghosal, A.; Gill, A.; Bambroo, P.; Marasović, A. On evaluating explanation utility for human-AI decision making in NLP. Find. Assoc. Comput. Linguist. EMNLP 2024, 7456–7504. [Google Scholar] [CrossRef]
- Holstein, K.; McLaren, B.M.; Aleven, V. Co-designing a real-time classroom orchestration tool to support teacher-AI complementarity. J. Learn. Anal. 2019, 6, 27–52. [Google Scholar] [CrossRef]
- Kasepalu, R.; Prieto, L.P.; Ley, T.; Chejara, P. Teacher artificial intelligence-supported pedagogical actions in collaborative learning coregulation: A wizard-of-oz study. Front. Educ. 2022, 7, 736194. [Google Scholar] [CrossRef]
- Olsen, J.K.; Rummel, N.; Aleven, V. Designing for the co-orchestration of social transitions between individual, small-group and whole-class learning in the classroom. Int. J. Artif. Intell. Educ. 2021, 31, 24–56. [Google Scholar] [CrossRef]
- Whitehead, R.; Nguyen, A.; Järvelä, S. Utilizing multimodal large language models for video analysis of posture in studying collaborative learning: A case study. J. Learn. Anal. 2025, 12, 186–200. [Google Scholar] [CrossRef]
- Sellberg, C.; Sharma, A. Toward multimodal learning analytics in simulation-based collaborative learning: A design ethnography of maritime training. Int. J. Comput.-Support. Collab. Learn. 2025, 20, 201–221. [Google Scholar] [CrossRef]
- Zhou, Q.; Suraworachet, W.; Cukurova, M. Detecting non-verbal speech and gaze behaviours with multimodal data and computer vision to interpret effective collaborative learning interactions. Educ. Inf. Technol. 2024, 29, 1071–1098. [Google Scholar] [CrossRef]
- Kizilcec, R.F. To advance AI use in education, focus on understanding educators. Int. J. Artif. Intell. Educ. 2024, 34, 12–19. [Google Scholar] [CrossRef] [PubMed]
- Lai, V.; Zhang, Y.; Chen, C.; Liao, Q.V.; Tan, C. Selective explanations: Leveraging human input to align explainable AI. Proc. ACM Hum.-Comput. Interact. 2023, 7, 1–35. [Google Scholar] [CrossRef]
- Baker, R.S.; Hawn, A. Algorithmic bias in education. Int. J. Artif. Intell. Educ. 2022, 32, 1052–1092. [Google Scholar] [CrossRef]
- Green, B.; Chen, Y. The principles and limits of algorithm-in-the-loop decision making. Proc. ACM Hum.-Comput. Interact. 2019, 3, 1–24. [Google Scholar] [CrossRef]
- Selbst, A.D.; Boyd, D.; Friedler, S.A.; Venkatasubramanian, S.; Vertesi, J. Fairness and abstraction in sociotechnical systems. In Proceedings of the Conference on Fairness, Accountability, and Transparency, 2019. [Google Scholar] [CrossRef]
- de la Iglesia, D.H.; Thomas, M.B.; Fuentes, C. AI-assisted qualitative analysis at scale: Opportunities and constraints for text-rich research. In Quality & Quantity; 2025. [Google Scholar] [CrossRef]
- Morris, W.; Holmes, L.; Choi, J.S.; Crossley, S. Automated scoring of constructed response items in math assessment using large language models. Int. J. Artif. Intell. Educ. 2025, 35, 559–586. [Google Scholar] [CrossRef]
- Acosta, H.; Lee, S.; Bae, H.; Feng, C.; Rowe, J.; Glazewski, K.; et al. Recognizing multi-party epistemic dialogue acts during collaborative game-based learning using large language models. Int. J. Artif. Intell. Educ. 2025, 35, 677–701. [Google Scholar] [CrossRef]
- Guerrero-Sosa, J.D.; Romero, F.P.; Menéndez-Domínguez, V.H.; Serrano-Guerrero, J.; Montoro-Montarroso, A.; Olivas, J.A. A comprehensive review of multimodal analysis in education. Appl. Sci. 2025, 15, 5896. [Google Scholar] [CrossRef]
- Schneider, B.; Worsley, M.; Martinez-Maldonado, R. Gesture and gaze: Multimodal data in dyadic interactions. In International Handbook of Computer-Supported Collaborative Learning; Springer International Publishing: Cham, Switzerland, 2021; pp. 625–641. [Google Scholar] [CrossRef]
- Alfredo, R.; Echeverria, V.; Jin, Y.; Yan, L.; Swiecki, Z.; Gašević, D.; Martinez-Maldonado, R. Human-centred learning analytics and AI in education: A systematic literature review. Comput. Educ. Artif. Intell. 2024, 6, 100215. [Google Scholar] [CrossRef]
- Lee, H.Y.; Chen, P.H.; Wang, W.S.; Huang, Y.M.; Wu, T.T. Empowering ChatGPT with guidance mechanism in blended learning: Effect of self-regulated learning, higher-order thinking skills, and knowledge construction. Int. J. Educ. Technol. High. Educ. 2024, 21, 16. [Google Scholar] [CrossRef]
- Wang, J.; Fan, W. The effect of ChatGPT on students’ learning performance, learning perception, and higher-order thinking: Insights from a meta-analysis. Humanit. Soc. Sci. Commun. 2025, 12, 1–21. [Google Scholar] [CrossRef]
- Curran, N.; de Leeuw, S.; Malyuga, E. AI and native speakerism: The intersections of technology, language assessment, and linguistic objectivity. Lang. Assess. Q. 2025. [Google Scholar] [CrossRef]
- Chen, G.H.; Chen, S.; Liu, Z.; Jiang, F.; Wang, B. Humans or LLMs as the judge? A study on judgement bias. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024; pp. 8301–8327. Available online: https://aclanthology.org/2024.emnlp-main.474/.
- Gu, J.; Chen, H.; Feng, Y.; Chen, J.; Li, M.; Wu, Y. A survey on LLM-as-a-judge. arXiv 2024, arXiv:2411.15594. [Google Scholar] [CrossRef]
- Tan, S.; Zhuang, S.; Montgomery, K.; Tang, W.Y.; Cuadron, A.; Wang, C.; et al. Judgebench: A benchmark for evaluating LLM-based judges. arXiv 2024, arXiv:2410.12784. [Google Scholar] [CrossRef]
- Wataoka, K.; Takahashi, T.; Ri, R. Self-preference bias in LLM-as-a-judge. arXiv 2024, arXiv:2410.21819. [Google Scholar] [CrossRef]
- Sheng, Y.; Wang, C.; Chen, X. Effect of GenAI dependency on university students’ academic achievement: False self-efficacy and the moderating role of perceived teacher caring. Behav. Sci. 2025, 15, 1348. [Google Scholar] [CrossRef]
- Fitsilis, P.; Damasiotis, V.; Dervenis, C.; Kyriatzis, V.; Tsoutsa, P. Effective data stewardship in higher education: Skills, competences, and the emerging role of open data stewards. arXiv 2024, arXiv:2410.20361. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).