Submitted:
13 April 2024
Posted:
16 April 2024
Read the latest preprint version here
Abstract
Keywords:
Introduction
Literature Review
Types of Psycholinguistic Stimuli
The Challenge of Designing Psycholinguistic Stimuli
AI and Psycholinguistic Stimuli Development
The Acceptability of AI Linguistic Production
The Validity of AI Linguistic Production
The Present Study
Experiment 1: Anticipating L2 Semantic Information
Methods
Procedure and Materials
AI-Speech Acceptability Task
Participants
Materials
Procedures
Statistical Analysis
AI-Informed VWP Experiment
Participants
Materials
Procedures
Statistical Analysis
Results
AI-Speech Acceptability Task
AI-Informed VWP Experiment
Comprehension Task
Statistical Modeling
Discussion
Experiment 2: Anticipating L1 and L2 Grammatical Number Information
Methods
Procedure and Materials
AI-Speech Acceptability Task
Participants
Materials
Procedures
Statistical Analysis
AI-Informed VWP Experiment
Participants
Materials
Procedures
Statistical Analysis
Results
AI-Speech Acceptability Task
AI-Informed VWP Experiment
Comprehension Task
Statistical Modeling
Discussion
Experiment 3: Priming Coordination in Comprehension
Procedure and Materials
AI-Stimuli Acceptability Task
Participants
Materials
Procedures
Statistical Analysis
AI-Informed SPR Task
Participants
Materials
Procedures
Statistical Analysis
Results
AI-Stimuli Acceptability Task
AI-Informed SPR Task
Comprehension Task
Statistical Modeling
Discussion
General Discussion
Acceptability of AI-Produced Stimuli
Validity of AI-Produced Stimuli
Implications for Psycholinguistic Research
Limitations and Future Directions
Conclusions
Statement Concerning Research Involving Human Participants
Data Availability Statement
Conflict of Interest
References
- Altmann, G.T.; Kamide, Y. Incremental interpretation at verbs: restricting the domain of subsequent reference. Cognition 1999, 73, 247–264. [Google Scholar] [CrossRef] [PubMed]
- Anthony, L. (2023). Corpus AI: Integrating Large Language Models (LLMs) into a Corpus Analysis Toolkit. The 49th Annual Convention of the Japan Association for English Corpus Studies (JAECS).
- Anwyl-Irvine, A.L.; Massonnié, J.; Flitton, A.; Kirkham, N.; Evershed, J.K. Gorilla in our midst: An online behavioral experiment builder. Behav. Res. Methods 2019, 52, 388–407. [Google Scholar] [CrossRef] [PubMed]
- Arik, S. O. , Chen, J., Peng, K., Ping, W., & Zhou, Y. (2018). Neural Voice Cloning with a Few Samples.
- Bae, H. (2024). Chatgpt as a Research Assistant in Experimental Linguistics. SSRN. [CrossRef]
- Barca, L.; Burani, C.; Arduino, L.S. Word naming times and psycholinguistic norms for Italian nouns. Behav. Res. Methods, Instruments, Comput. 2002, 34, 424–434. [Google Scholar] [CrossRef] [PubMed]
- Barr, D.J.; Levy, R.; Scheepers, C.; Tily, H.J. Random effects structure for confirmatory hypothesis testing: Keep it maximal. J. Mem. Lang. 2013, 68, 255–278. [Google Scholar] [CrossRef] [PubMed]
- Bates, D.; Mächler, M.; Bolker, B.; Walker, S. Fitting Linear Mixed-Effects Models Using lme4. J. Stat. Softw. 2015, 67, 48. [Google Scholar] [CrossRef]
- Blasi, D.E.; Henrich, J.; Adamou, E.; Kemmerer, D.; Majid, A. Over-reliance on English hinders cognitive science. Trends Cogn. Sci. 2022, 26, 1153–1170. [Google Scholar] [CrossRef] [PubMed]
- Bosch, J. E. , Chailleux, M., Yee, J. E., Guasti, M. T., Arosio, F., & Ayoun, D. (2022). Prediction on the basis of gender and number in Mandarin-Italian bilingual children. In D. Ayoun (Ed.), Studies in Bilingualism (pp. 243–271). John Benjamins.
- Brothers, T.; Swaab, T.Y.; Traxler, M.J. Goals and strategies influence lexical prediction during sentence comprehension. J. Mem. Lang. 2017, 93, 203–216. [Google Scholar] [CrossRef]
- Bürkner, P.-C. brms: An R Package for Bayesian Multilevel Models Using Stan. J. Stat. Softw. 2017, 80, 1–28. [Google Scholar] [CrossRef]
- Carroll, R. , Patarroyo, A., & Hopp, H. (2024). Predictive L2 sentence processing in noise: Differential effects across linguistic cues. 15th Speech in Noise Workshop. [CrossRef]
- Casal, J.E.; Kessler, M. Can linguists distinguish between ChatGPT/AI and human writing?: A study of research ethics and academic publishing. Res. Methods Appl. Linguistics 2023, 2. [Google Scholar] [CrossRef]
- Chang, Y.-N.; Hsu, C.-H.; Tsai, J.-L.; Chen, C.-L.; Lee, C.-Y. A psycholinguistic database for traditional Chinese character naming. Behav. Res. Methods 2015, 48, 112–122. [Google Scholar] [CrossRef]
- Cheng, X.; Schafer, G.; Riddell, P.M. Immediate Auditory Repetition of Words and Nonwords: An ERP Study of Lexical and Sublexical Processing. PLOS ONE 2014, 9, e91988. [Google Scholar] [CrossRef] [PubMed]
- Chun, E. (2018). The role of prediction in adaptation: An evaluation of error-based learning accounts. [Unpublished doctoral dissertation, University of Florida].
- Chun, E. , Chen, S., Liu, S., & Chan, A. (2021). Influence of syntactic complexity on second language prediction. In E. Kaan & T. Grüter (Eds.), Prediction in Second Language Processing and Learning (pp. 70–89). John Benjamins. [CrossRef]
- Clark, M. (2024). lazerhawk: Miscellaneous functions mostly inspired by synthwave (Version 0.3.0). [R package]. https://github.com/m-clark/lazerhawk.
- Contemori, C. Changing comprehenders’ pronoun interpretations: Immediate and cumulative priming at the discourse level in L2 and native speakers of English. Second. Lang. Res. 2019, 37, 573–586. [Google Scholar] [CrossRef]
- Corps, R.E.; Brooke, C.; Pickering, M.J. Prediction involves two stages: Evidence from visual-world eye-tracking. J. Mem. Lang. 2022, 122. [Google Scholar] [CrossRef]
- Curry, N.; Baker, P.; Brookes, G. Generative AI for corpus approaches to discourse studies: A critical evaluation of ChatGPT. Appl. Corpus Linguistics 2024, 4. [Google Scholar] [CrossRef]
- Dubey, A. , Sturt, P., & Keller, F. (2005). Parallelism in coordination as an instance of syntactic priming: Evidence from corpus-based modeling. Proceedings of the Human Language Technology Conference Conference on Empirical Methods in Natural Language, 827–834.
- Dwivedi, Y.K.; Kshetri, N.; Hughes, L.; Slade, E.L.; Jeyaraj, A.; Kar, A.K.; Baabdullah, A.M.; Koohang, A.; Raghavan, V.; Ahuja, M.; et al. Opinion Paper: “So what if ChatGPT wrote it?” Multidisciplinary perspectives on opportunities, challenges and implications of generative conversational AI for research, practice and policy. Int. J. Inf. Manag. 2023, 71. [Google Scholar] [CrossRef]
- Flett, S.; Branigan, H.P.; Pickering, M.J. Are non-native structural preferences affected by native language preferences? Biling. Lang. Cogn. 2012, 16, 751–760. [Google Scholar] [CrossRef]
- Gao, C.A.; Howard, F.M.; Markov, N.S.; Dyer, E.C.; Ramesh, S.; Luo, Y.; Pearson, A.T. Comparing scientific abstracts generated by ChatGPT to real abstracts with detectors and blinded human reviewers. npj Digit. Med. 2023, 6, 1–5. [Google Scholar] [CrossRef]
- Gao, C.; Shinkareva, S.V.; Desai, R.H. SCOPE: The South Carolina psycholinguistic metabase. Behav. Res. Methods 2022, 55, 2853–2884. [Google Scholar] [CrossRef]
- Rodriguez, G.G.; Norcliffe, E.; Brown, P.; Huettig, F.; Levinson, S.C. Anticipatory Processing in a Verb-Initial Mayan Language: Eye-Tracking Evidence During Sentence Comprehension in Tseltal. Cogn. Sci. 2023, 47, e13292. [Google Scholar] [CrossRef] [PubMed]
- Goldberg, Y. (2019). Assessing BERT’s syntactic abilities. ArXiv. https://doi.org/1901.05287.
- Henrich, J. , Heine, S. ( 466(7302), 26–26.
- Herrmann, B. The perception of artificial-intelligence (AI) based synthesized speech in younger and older adults. Int. J. Speech Technol. 2023, 26, 395–415. [Google Scholar] [CrossRef]
- Heyman, T.; Heyman, G. The impact of ChatGPT on human data collection: A case study involving typicality norming data. Behav. Res. Methods 2023, 1–8. [Google Scholar] [CrossRef] [PubMed]
- Hu, J.; Levy, R. Prompting is not a substitute for probability measurements in large language models. ArXiv. [CrossRef]
- Huettig, F.; Guerra, E. Effects of speech rate, preview time of visual context, and participant instructions reveal strong limits on prediction in language processing. Brain Res. 2018, 1706, 196–208. [Google Scholar] [CrossRef] [PubMed]
- Huettig, F.; Rommers, J.; Meyer, A.S. Using the visual world paradigm to study language processing: A review and critical evaluation. Acta Psychol. 2011, 137, 151–171. [Google Scholar] [CrossRef] [PubMed]
- Ito, A.; Corley, M.; Pickering, M.J. A cognitive load delays predictive eye movements similarly during L1 and L2 comprehension. Biling. Lang. Cogn. 2017, 21, 251–264. [Google Scholar] [CrossRef]
- Ito, A.; Knoeferle, P. Analysing data from the psycholinguistic visual-world paradigm: Comparison of different analysis methods. Behav. Res. Methods 2022, 55, 3461–3493. [Google Scholar] [CrossRef] [PubMed]
- Ito, A.; Nguyen, H.T.T.; Knoeferle, P. German-dominant Vietnamese heritage speakers use semantic constraints of German for anticipation during comprehension in Vietnamese. Biling. Lang. Cogn. 2023, 1–18. [Google Scholar] [CrossRef]
- Jegerski, J.; VanPatten, B. Research Methods in Second Language Psycholinguistics; Routledge.
- Karaca, F.; Brouwer, S.; Unsworth, S.; Huettig, F. Morphosyntactic predictive processing in adult heritage speakers: effects of cue availability and spoken and written language experience. Lang. Cogn. Neurosci. 2023, 39, 118–135. [Google Scholar] [CrossRef]
- Koch, E.M.; Bulté, B.; Housen, A.; Godfroid, A. Using verb morphology to predict subject number in L1 and L2 sentence processing: A visual-world eye-tracking experiment. J. Eur. Second. Lang. Assoc. 2021, 5, 115–132. [Google Scholar] [CrossRef]
- Koch, E.M.; Bulté, B.; Housen, A.; Godfroid, A. The predictive processing of number information in subregular verb morphology in a first and second language. Appl. Psycholinguist. 2023, 44, 750–783. [Google Scholar] [CrossRef]
- Kuteeva, M.; Andersson, M. Diversity and Standards in Writing for Publication in the Age of AI—Between a Rock and a Hard Place. Appl. Linguistics 2024. [Google Scholar] [CrossRef]
- Lau, J.H.; Clark, A.; Lappin, S. Grammaticality, Acceptability, and Probability: A Probabilistic View of Linguistic Knowledge. Cogn. Sci. 2016, 41, 1202–1241. [Google Scholar] [CrossRef] [PubMed]
- Lee, G.; Kim, H.-Y. Human vs. AI: The battle for authenticity in fashion design and consumer response. J. Retail. Consum. Serv. 2024, 77. [Google Scholar] [CrossRef]
- Lemoine, N.P. Moving beyond noninformative priors: why and how to choose weakly informative priors in Bayesian analyses. Oikos 2019, 128, 912–928. [Google Scholar] [CrossRef]
- Lewis, J. R. (2018). Investigating MOS-X ratings of synthetic and human voices. Voice Interaction Design, 2(1).
- Li, H.; Moon, J.T.; Purkayastha, S.; Celi, L.A.; Trivedi, H.; Gichoya, J.W. Ethics of large language models in medicine and medical research. Lancet Digit. Heal. 2023, 5, e333–e335. [Google Scholar] [CrossRef] [PubMed]
- Lin, P. ChatGPT: Friend or foe (to corpus linguists)? Appl. Corpus Linguistics 2023, 3. [Google Scholar] [CrossRef]
- Lüdecke, D.; Ben-Shachar, M.S.; Patil, I.; Waggoner, P.; Makowski, D. performance: An R Package for Assessment, Comparison and Testing of Statistical Models. J. Open Source Softw. 2021, 6, 3139. [Google Scholar] [CrossRef]
- Marsden, E.; Thompson, S.; Plonsky, L. A methodological synthesis of self-paced reading in second language research. Appl. Psycholinguist. 2018, 39, 861–904. [Google Scholar] [CrossRef]
- Marvin, R.; Linzen, T. Targeted Syntactic Evaluation of Language Models. ArXiv.
- McDonough, K.; Trofimovich, P. Using Priming Methods in Second Language Research; Routledge.
- Megahed, F.M.; Chen, Y.-J.; Ferris, J.A.; Knoth, S.; Jones-Farmer, L.A. How generative AI models such as ChatGPT can be (mis)used in SPC practice, education, and research? An exploratory study. Qual. Eng. 2023, 36, 287–315. [Google Scholar] [CrossRef]
- Petersen, O.H. Inequality of Research Funding between Different Countries and Regions is a Serious Problem for Global Science. Function 2021, 2, zqab060. [Google Scholar] [CrossRef]
- Pickering, M.J.; Gambi, C. Predicting while comprehending language: A theory and review. Psychol. Bull. 2018, 144, 1002–1044. [Google Scholar] [CrossRef] [PubMed]
- Plonsky, L. Sampling and Generalizability in Lx Research: A Second-Order Synthesis. Languages 2023, 8, 75. [Google Scholar] [CrossRef]
- Prystauka, Y.; Altmann, G.T.M.; Rothman, J. Online eye tracking and real-time sentence processing: On opportunities and efficacy for capturing psycholinguistic effects of different magnitudes and diversity. Behav. Res. Methods 2023, 1–19. [Google Scholar] [CrossRef] [PubMed]
- Rodd, J.M.; Cai, Z.G.; Betts, H.N.; Hanby, B.; Hutchinson, C.; Adler, A. The impact of recent and long-term experience on access to word meanings: Evidence from large-scale internet-based experiments. J. Mem. Lang. 2016, 87, 16–37. [Google Scholar] [CrossRef]
- Saslow, M.G. Latency for Saccadic Eye Movement*. J. Opt. Soc. Am. 1967, 57, 1030–1033. [Google Scholar] [CrossRef] [PubMed]
- Schlenter, J. (2019). Predictive language processing in late bilinguals: Evidence from visual-world eye-tracking. Doctoral dissertation. Universität Potsdam.
- Scott, G.G.; Keitel, A.; Becirspahic, M.; Yao, B.; Sereno, S.C. The Glasgow Norms: Ratings of 5,500 words on nine scales. Behav. Res. Methods 2018, 51, 1258–1270. [Google Scholar] [CrossRef] [PubMed]
- Shin, J.; Christianson, K. Structural Priming and Second Language Learning. Lang. Learn. 2011, 62, 931–964. [Google Scholar] [CrossRef]
- Soares, A.P.; Costa, A.S.; Machado, J.; Comesaña, M.; Oliveira, H.M. The Minho Word Pool: Norms for imageability, concreteness, and subjective frequency for 3,800 Portuguese words. Behav. Res. Methods 2016, 49, 1065–1081. [Google Scholar] [CrossRef] [PubMed]
- Sturt, P.; Keller, F.; Dubey, A. Syntactic priming in comprehension: Parallelism effects with and without coordination. J. Mem. Lang. 2010, 62, 333–351. [Google Scholar] [CrossRef]
- Tavakol, M.; Dennick, R. Making sense of Cronbach's alpha. Int. J. Med. Educ. 2011, 2, 53–55. [Google Scholar] [CrossRef]
- Taylor, J.E.; Beith, A.; Sereno, S.C. LexOPS: An R package and user interface for the controlled generation of word stimuli. Behav. Res. Methods 2020, 52, 2372–2382. [Google Scholar] [CrossRef] [PubMed]
- Trott, S. Can large language models help augment English psycholinguistic datasets? Behav. Res. Methods 2024, 1–19. [Google Scholar] [CrossRef]
- Uchida, S. Using early LLMs for corpus linguistics: Examining ChatGPT's potential and limitations. Appl. Corpus Linguistics 2024, 4. [Google Scholar] [CrossRef]
- Verma, A.; Sikarwar, V.; Yadav, H.; Jaganathan, R.; Kumar, P. Shabd: A psycholinguistic database for Hindi. Behav. Res. Methods 2021, 54, 830–844. [Google Scholar] [CrossRef] [PubMed]
- Võ, M.L.H.; Conrad, M.; Kuchinke, L.; Urton, K.; Hofmann, M.J.; Jacobs, A.M. The Berlin Affective Word List Reloaded (BAWL-R). Behav. Res. Methods 2009, 41, 534–538. [Google Scholar] [CrossRef] [PubMed]
- Voeten, C. C. (2023). permutes: Permutation Tests for Time Series Data (R package version 2.8).
- Wei, H.; Boland, J.E.; Zhang, C.; Yang, A.; Yuan, F. Lexically Independent Structural Priming in Second Language Online Sentence Comprehension. Lang. Learn. 2023. [Google Scholar] [CrossRef]
- Winter, B. Statistics for Linguists: An Introduction Using R; Routledge.
- Zappavigna, M. Hack your corpus analysis: How AI can assist corpus linguists deal with messy social media data. Appl. Corpus Linguistics 2023, 3. [Google Scholar] [CrossRef]






| Exp | Original/related study | Examined effect | Target language |
|---|---|---|---|
| 1 | Altmann and Kamide (1991) | Semantic prediction | English |
| 2 | Koch et al. (2023) | Morphosyntactic prediction | Arabic |
| 3 | Wei et al. (2023) | Syntactic priming | English |
| Condition | Mean (SD) | Median | Range |
|---|---|---|---|
| Human native speaker | 65.81 (13.06) | 66.25 | 37-95 |
| AI | 69.31 (15.17) | 68.28 | 39-99 |
| Condition | Mean (SD) | Median | Range |
|---|---|---|---|
| Human native speaker | 81.26 (15.13) | 81.25 | 25-96 |
| AI | 68.61 (15.62) | 65.00 | 37-100 |
| NP1 | NP2 | n | RT | SD | SE |
|---|---|---|---|---|---|
| AdjP | AdjP | 248 | 982.41 | 594.11 | 37.73 |
| AdjP | RC | 248 | 1017.86 | 572.02 | 36.32 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).