Submitted:
28 April 2025
Posted:
29 April 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Personalized Learning.
1.2. Evaluating Text Differences Using NLP
1.3. Linguistic Features and Text Readability
2. Experiment 1: LLM-Text Personalization
3. Materials and Methods
3.1. LLM Selection and Implementation Details
3.2. Text Corpus
3.3. Reader Profile
3.4. Natural Language Processing
3.5. Procedure
4. Results
4.1. Main Effect of Reader Profile on Variations in Linguistic Features of Modified Texts.
4.2. Main Effect of LLMs
5. Experiment 1 Discussion
6. Experiment 2: Prompt Refinements
Prompt Improvements
7. Materials and Methods
7.1. LLM Selection
7.2. Text Corpus
7.3. Reader Profile
7.4. Procedure
8. Results
8.1. Academic Writing

8.2. Conceptual Density and Cohesion

8.3. Syntactic and Lexical Complexity


9. Experiment 2 Discussion
10. General Discussion
Limitations & Future Direction
11. Conclusion
Funding
Abbreviations
| PK | Prior knowledge |
| RS | Reading skills |
| GenAI | Generative AI |
| LLM | Large Language Model |
Appendix A. LLM Descriptions
- Version Used: Claude 3.5
- Date Accessed: August 31, 2024
- Accessed via Poe.com web deployment, default configurations were used
- Training Size: Claude is trained on a large-scale, diverse dataset derived from a broad range of online and curated sources. The exact size of the training data remains proprietary.
- Number of Parameters: The exact number of parameters for Claude 3.5 is not disclosed by Anthropic, but it is estimated to be between 70–100 billion parameters.
- Version Used: Llama 3.1
- Date Accessed: August 31, 2024
- Accessed via Poe.com web deployment, default configurations were used
- Llama 3.1 was trained on 2 trillion tokens sourced from publicly available datasets, including books, websites, and other digital content.
- Number of Parameters: Llama 3.1 consists of 70 billion parameters.
- Version Used: Gemini Pro 1.5
- Date Accessed: August 31, 2024
- Accessed via Poe.com web deployment, default configurations were used
- Training Size: Gemini is trained on 1.5 trillion tokens, sourced from a wide variety of publicly available and curated data, including text from books, websites, and other large corpora.
- Number of Parameters: Gemini 1.0 operates with 100 billion parameters.
- Version Used: GPT-4o
- Date Accessed: August 31, 2024
- Accessed via Poe.com web deployment, default configurations were used
- Training Size: GPT-4 was trained on an estimated 1.8 trillion tokens from diverse sources, including books, web pages, academic papers, and large text corpora.
- Number of Parameters: The exact number of parameters for GPT-4 is not publicly disclosed, but in the range of 175 billion parameters.
Appendix B. Single-Shot Prompt Experiment 1
- Analyze the input text and determine its reading level (e.g., Flesch-Kincaid Grade Level), linguistic complexity (e.g., sentence length, vocabulary), and the assumed background knowledge required for comprehension.
- Analyze the reader profile and identify key information: Age, Reading Level (e.g., beginner, intermediate, advanced), Prior Knowledge (specific knowledge related to the text's topic), Reading Goals (e.g., learn new concepts, enjoyment, research, pass an exam), Interests (what topics or themes are motivating for the reader?), Accessibility Needs (specify any learning disabilities or preferences that require text adaptations, dyslexia, visual impairments).
- Reorganize information, modify the syntax, vocabulary, and tone to tailor to the reader's characteristics.
- If the reader has less knowledge about the topic, then provide sufficient background knowledge or relatable examples and analogies to support comprehension and engagement. If the reader has strong background knowledge and high reading skill, then increase depth of information and avoid overly explaining details.
- [Insert Reader 1 Description]
- [Insert Text]
Appendix C. Augmented Prompt Experiment 2

References
- Abbes, F.; Bennani, S.; Maalel, A. Generative AI and gamification for personalized learning: Literature review and future challenges. SN Comput. Sci. 2024, 5, 1–12. [Google Scholar] [CrossRef]
- Martínez, P.; Ramos, A.; Moreno, L. Exploring large language models to generate easy-to-read content. Front. Comput. Sci. 2024, 6, 1394705. [Google Scholar] [CrossRef]
- Major, L.; Francis, G.A.; Tsapali, M. The effectiveness of technology-supported personalized learning in low- and middle-income countries: A meta-analysis. Br. J. Educ. Technol. 2021, 52, 1935–1964. [Google Scholar] [CrossRef]
- McNamara, D.S.; Levinstein, I.B.; Boonthum, C. : iSTART: Interactive strategy training for active reading and thinking. Behav. Res. Methods Instrum. Comput. 2004, 36, 222–233. [Google Scholar] [CrossRef]
- FitzGerald, E.; Jones, A.; Kucirkova, N.; Scanlon, E. : A literature synthesis of personalised technology-enhanced learning: What works and why. Res. Learn. Technol. 2018, 26, 1–13. [Google Scholar] [CrossRef]
- Leong, J.; Pataranutaporn, P.; Danry, V.; Perteneder, F.; Mao, Y.; Maes, P.: Putting things into context: Generative AI-enabled context personalization for vocabulary learning improves learning motivation. In Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024, 1–15.
- McNamara, D.S. : Chasing theory with technology: A quest to understand understanding. Discourse Process. 2021, 58, 422–448. [Google Scholar] [CrossRef]
- Crossley, S.A., Skalicky, S., Dascalu, M., McNamara, D., Kyle, K. Predicting text comprehension, processing, and familiarity in adult readers: New approaches to readability formulas. Discourse Processes 2017, 54(1), 1–20.
- Crossley, S.A., Skalicky, S., McNamara, D.S., Paquot, M. Large-scale crowdsourcing of human judgments of text difficulty: A methodology for collecting reliable data. Behavior Research Methods 2023, 1–20.
- McNamara, D.S. , Louwerse, M.M., McCarthy, P.M., Graesser, A.C. Coh-Metrix: Capturing linguistic features of cohesion. Discourse Processes 2010, 47(4), 292–330. [Google Scholar] [CrossRef]
- McNamara, D.S. Reading both high-coherence and low-coherence texts: Effects of text sequence and prior knowledge. Discourse Processes 2001, 31(1), 63–88. [Google Scholar] [CrossRef]
- Natriello, G. The adaptive learning landscape. Teachers College Record 2017, 119(3), 1–22. [Google Scholar] [CrossRef]
- du Boulay, B., Poulovassilis, A., Holmes, W., Mavrikis, M. What does the research say about how artificial intelligence and big data can close the achievement gap? In R. Luckin (Ed.) Enhancing Learning and Teaching with Technology: What the Research Says 2018, pp. 256–285.
- Kucirkova, N. Personalised learning with digital technologies at home and school: Where is children's agency. In Mobile Technologies in Children's Language and Literacy: Innovative Pedagogy in Preschool and Primary Education; Oakley, G., Ed.; Emerald Publishing Limited: Bingley, UK, 2018; pp. 133–153. [Google Scholar]
- Mesmer, H.A.E. Tools for Matching Readers to Texts: Research-Based Practices, 1st ed.; Guilford Press: New York, NY, USA, 2008; pp. 1–234. [Google Scholar]
- Lennon, C.; Burdick, H. The Lexile Framework as an approach for reading measurement and success. Available online: https://metametricsinc.com/wp-content/uploads/2017/07/Lexile-Framework-for-Reading.pdf (accessed on 22 April 2025).
- Stenner, A.J.; Burdick, H.; Sanford, E.E.; Burdick, D.S. How accurate are Lexile text measures. J. Appl. Meas. 2007, 8, 307–322. [Google Scholar]
- Pesovski, T.; Dorr, M.; Thoms, B. AI-generated personalized learning materials: Improving engagement and comprehension in learning management systems. In Proceedings of the International Conference on Educational Technology, Virtual, Switzerland, 12 March 2024; pp. 112–124.
- Dunn, T.J.; Kennedy, M. Technology-enhanced learning in higher education: Motivators and demotivators of student engagement. Comput. Educ. 2019, 129, 13–22. [Google Scholar]
- Sonia, A.N.; Magliano, J.P.; McCarthy, K.S.; Creer, S.D.; McNamara, D.S.; Allen, L.K. Integration in multiple-document comprehension: A natural language processing approach. Discourse Process. 2022, 59, 417–438. [Google Scholar] [CrossRef]
- Allen, L.K.; Creer, S.C.; Öncel, P. Natural language processing as a tool for learning analytics—Towards a multi-dimensional view of the learning process. In Proceedings of the Artificial Intelligence in Education Conference (AIED 2022), Durham, UK, 27–31 July 2022. [Google Scholar]
- Crossley, S.A.; Allen, L.K.; Snow, E.L.; McNamara, D.S. Incorporating learning characteristics into automatic essay scoring models: What individual differences and linguistic features tell us about writing quality. J. Educ. Data Min. 2016, 8, 1–19. [Google Scholar]
- Crossley, S.A.; Kim, M.; Allen, L.; McNamara, D.S. Automated summarization evaluation (ASE) using natural language processing tools. In Proceedings of the 20th International Conference on Artificial Intelligence in Education (AIED 2019), Chicago, IL, USA, 25–29 June 2019; pp. 84–95. [Google Scholar]
- Allen, L.K.; Graesser, A.C.; McNamara, D.S. Automated analyses of natural language in psychological research. Trends Psychol. Sci. 2023. submitted. [Google Scholar]
- McNamara, D.S.; Graesser, A.C.; McCarthy, P.M.; Cai, Z. Automated Evaluation of Text and Discourse with Coh-Metrix, 1st ed.; Cambridge University Press: Cambridge, UK, 2014; pp. 1–312. [Google Scholar]
- O’Reilly, T.; McNamara, D.S. Reversing the reverse cohesion effect: Good texts can be better for strategic, high-knowledge readers. Discourse Process. 2007, 43, 121–152. [Google Scholar] [CrossRef]
- Nagy, W.; Townsend, D. Words as tools: Learning academic vocabulary as language acquisition. Read. Res. Q. 2012, 47, 91–108. [Google Scholar] [CrossRef]
- Snow, C.E. Academic language and the challenge of reading for learning about science. Science 2010, 328, 450–452. [Google Scholar] [CrossRef]
- Frantz, R.S.; Starr, L.E.; Bailey, A.L. Syntactic complexity as an aspect of text complexity. Educ. Res. 2015, 44, 387–393. [Google Scholar] [CrossRef]
- Magliano, J.P.; Millis, K.K. Assessing reading skill with a think-aloud procedure. Cogn. Instr. 2004, 21, 251–283. [Google Scholar] [CrossRef]
- Arner, T.; McCarthy, K.S.; McNamara, D.S. iSTART StairStepper—Using comprehension strategy training to game the test. Computers 2021, 10, 48. [Google Scholar] [CrossRef]
- Perret, C.A.; Johnson, A.M.; McCarthy, K.S.; Guerrero, T.A.; McNamara, D.S. StairStepper: An adaptive remedial iSTART module. In Proceedings of the 18th International Conference on Artificial Intelligence in Education (AIED 2017), Wuhan, China, 28 June–2 July 2017; pp. 557–560. [Google Scholar]
- Li, T.; Creer, S.D.; Arner, T.; Roscoe, R.D.; Allen, L.K.; McNamara, D.S. Participatory design of a writing analytics tool: Teachers’ needs and design solutions. In Companion Proceedings of the 12th International Conference on Learning Analytics & Knowledge (LAK22).
- Öncel, P.; Flynn, L.E.; Sonia, A.N.; Barker, K.E.; Lindsay, G.C.; McClure, C.M.; McNamara, D.S.; Allen, L.K. Automatic student writing evaluation: Investigating the impact of individual differences on source-based writing. In Proceedings of the 11th International Learning Analytics and Knowledge Conference (LAK21), CA, USA, 2021, ACM: Irvine; pp. 260–270.
- Potter, A.; Shortt, M.; Goldshtein, M.; Roscoe, R.D. Assessing academic language in tenth-grade essays using natural language processing. Assess. Writ. in press, pp. 1–13.
- Beck, I.L.; McKeown, M.G.; Sinatra, G.M.; Loxterman, J.A. Revising social studies text from a text-processing perspective: Evidence of improved comprehensibility. Read. Res. Q. 1991, 26, 251–276. [Google Scholar] [CrossRef]
- Ozuru, Y.; Dempsey, K.; McNamara, D.S. Prior knowledge, reading skill, and text cohesion in the comprehension of science texts. Learn. Instr. 2009, 19, 228–242. [Google Scholar] [CrossRef]
- Brown, T.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; et al. Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 2020, 33, 1877–1901. [Google Scholar]
- Srivastava, A.; Rastogi, A.; Rao, A.; Shoeb, A.A.M.; Abid, A.; Fisch, A.; Wang, G. Beyond the imitation game: Quantifying and extrapolating the capabilities of language models. arXiv 2022, arXiv:2206.04615. [Google Scholar]
- Sahoo, P.; Singh, A.K.; Saha, S.; Jain, V.; Mondal, S.; Chadha, A. A systematic survey of prompt engineering in large language models: Techniques and applications. arXiv 2024, arXiv:2402.07927. [Google Scholar]
- Bisk, Y.; Zellers, R.; Bras, R.L.; Gao, J.; Choi, Y. Experience grounds language. arXiv 2020, arXiv:2004.10151. [Google Scholar]
- Wei, J.; Wang, X.; Schuurmans, D.; Bosma, M.; Xia, F.; Chi, E.; Zhou, D. Chain-of-thought prompting elicits reasoning in large language models. Adv. Neural Inf. Process. Syst. 2022, 35, 24824–24837. [Google Scholar]
- Unsworth, N.; Engle, R.W. The nature of individual differences in working memory capacity: Active maintenance in primary memory and controlled search from secondary memory. Psychol. Rev. 2007, 114, 104–132. [Google Scholar] [CrossRef]
- Eccles, J.S.; Wigfield, A. Motivational beliefs, values, and goals. Annu. Rev. Psychol. 2002, 53, 109–132. [Google Scholar] [CrossRef]
- Fletcher, J.M.; Lyon, G.R.; Fuchs, L.S.; Barnes, M.A. Learning Disabilities: From Identification to Intervention, 2nd ed.; Guilford Press: New York, NY, USA, 2018; pp. 1–350. [Google Scholar]
- Ladson-Billings, G. Culturally relevant pedagogy 2.0: Aka the remix. Harv. Educ. Rev. 2014, 84, 74–84. [Google Scholar] [CrossRef]
- Radford, A.; Wu, J.; Child, R.; Luan, D.; Amodei, D.; Sutskever, I.; Language models are unsupervised multitask learners. OpenAI Blog 2019. Available online: https://openai.com/blog/better-language-models (accessed on 22 April 2025).
- Dornburg, A.; Davin, K. To what extent is ChatGPT useful for language teacher lesson plan creation? arXiv 2024, arXiv:2407.09974. [Google Scholar]
| Features | Metrics and Descriptions |
|---|---|
| Overall Readability | Flesch-Kincaid Grade Level (FKGL): Indicates text difficulty based on sentence length and word length Academic writing: The extent to which the texts include domain-specific words and sophisticated sentence structures, commonly found in academic writing texts Development of ideas: The extent to which ideas and concepts are developed and elaborated throughout a text. |
| Conceptual Density and Cohesion | Noun to verb ratio: Text with a high noun-to-verb ratio results in dense information and complex sentences that require greater cognitive effort to process Sentence cohesion: The extent to which the text contains connectives, and cohesion cues (e.g., repeating ideas, concepts). |
| Syntax Complexity | Sentence length: Longer sentences often have more clauses and complex structure Language variety: Indicates the extent to which text varies in the language used (sentence structures, wordings) |
| Lexical Complexity | Sophisticated wording: Lower measures indicate the vocabulary familiar and common, whereas higher measures indicate more advanced words. Academic frequency: Indicates the extent of sophisticated vocabulary are used, which are also common in academic texts |
| Domain | Text Title | Word Count | FKGL1 | |
|---|---|---|---|---|
| Biology | Bacteria | 468 | 12.10 | |
| Biology | The Cells | 426 | 11.61 | |
| Chemistry | Chemistry of Life | 436 | 12.71 | |
| Biology | Genetic Equilibrium | 441 | 12.61 | |
| Biology | Food Webs | 492 | 12.06 | |
| Biology | Patterns of evolution | 341 | 15.09 | |
| Biology | Causes and Effects of Mutations | 318 | 11.35 | |
| Physics | What are Gravitational Waves? | 359 | 16.51 | |
| Biochemistry | Photosynthesis | 427 | 11.44 | |
| Biology | Microbes | 407 | 14.38 | |
![]() |
![]() |
![]() |
![]() |
| Descriptions | |
|---|---|
| Reader 1 (High RS, High PK) | Age: 25 Educational level: Senior Major: Chemistry (Pre-med) ACT English composite score: 32/36 (performance is in the 96th percentile) ACT Reading composite score: 32/36 (performance is in the 96th percentile) ACT Math composite score: 28/36 (performance is in the 89th percentile) ACT Science composite score: 30/36 (performance is in the 94th percentile) Science background: Completed 8 required biology, physics and chemistry college-level courses (comprehensive academic background in the sciences, covering advanced topics in biology, chemistry, and physics, well-prepared for higher-level scientific learning and analysis) Reading goal: Understand scientific concepts and principles |
| Reader 2 (High RS, Low PK) | Age: 20 Educational level: Sophomore Major: Psychology ACT English composite score: 32/36 (performance is in the 96th percentile) ACT Reading composite score: 31/36 (performance is in the 94th percentile) ACT Math composite score: 18/36 (performance is in the 42th percentile) ACT Science composite score: 19/36 (performance is in the 46th percentile) Science background: Completed 1 high-school level chemistry course (no advanced science course). Limited exposure and understanding of scientific concepts Interests/ Favorite subjects: arts, literature Reading goal: Understand scientific concepts and principles |
| Reader 3 (Low RS, High PK) | Age: 20 Educational level: Sophomore Major: Health Science ACT English composite score: 19/36 (performance is in the 44th percentile) ACT Reading composite score: 20/36 (performance is in the 47th percentile) ACT Math composite score: 32/36 (performance is in the 97th percentile) ACT Science composite score: 30/36 (performance is in the 94th percentile) Science background: Completed 1 physics, 1 astronomy and 2 college level biology courses (substantial prior knowledge in science, having completed multiple college-level courses across several disciplines, strong foundation in scientific principles and concepts) Reading goal: Understand scientific concepts Reading disability: Dyslexia |
| Reader 4 (Low RS, Low PK) | Age: 18 Educational level: Freshman Major: Marketing ACT English composite score: 17/36 (performance is in the 33rd percentile) ACT Reading composite score: 18/36 (performance is in the 36th percentile) ACT Math composite score: 19/36 (performance is in the 48th percentile) ACT Science composite score: 17/36 (performance is in the 34th percentile) Science background: Completed 1 high-school level biology course (no advanced science course). Limited exposure and understanding of scientific concepts Reading goal: Understand scientific concepts |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).



