Method
Spanish 331, which is taught completely in Spanish, develops language skills within cultural contexts that reflect the research interests of the instructor, as the catalog description of the course explains (University of Tennessee Undergraduate Catalog, 2025):
Introduction to the fundamental historical, political, and demographic developments that led to the creation, geographic distribution, and distinctive character of Hispanic cultures with attention to those qualities that distinguish Hispanic culture from other cultures, as well as to ethnic and linguistic components of the Hispanic world in the present day.
The geographical focus of Spanish 331 varies from semester to semester and can center on Spain or Spanish America. Spanish 331 is offered in two formats, in-person and asynchronous. The present study will involve an asynchronous offering of Spanish 331 delivered during a 16-week semester in the spring of 2025, which focused on the culture of Spain from prehistory until the present day.
One invariant in Spanish 331 is an emphasis on developing comprehension, writing, and oral skills in Spanish. Comprehension is a skill that develops through attentive reading, an activity in which all students can participate by completing the reading assignments. Similarly, writing skills develop through the completion of assignments that reinforce vocabulary and themes discussed in the readings.
At the same time, the development of students’ oral skills in Spanish 331 presents a pedagogical challenge to in-person instructors, whose ability to engage students consistently is limited by time constraints, and to asynchronous instructors, who do not engage with students orally. Indeed, research by scholars (Jdetawy, 2011; Rabab’ah, 2003) reveals that the development of language skills is hindered by sporadic rather than regular opportunities to speak a second language. Other scholars (Zrekat et al., 2016; Zrekat & Al-Sohbani, 2022) have demonstrated that students frequently believe they do not receive sufficient opportunities to practice speaking the second language in SLA courses.
Comments made by students in their course evaluations of asynchronous Spanish 331 indicate that they seek more opportunities to speak Spanish. These comments are recorded and housed by the University of Tennessee’s Office of Institutional Effectiveness (
https://utk.campuslabs.com/faculty/#/). One of the most poignant comments was made by a student evaluating asynchronous Spanish 331 in the fall of 2019:
I do not think that the class should be online. In my opinion, the class is not just learning about the history of Spain, it is also about practicing your Spanish listening and speaking skills. If this was not a component of the course then it would not be required for a Spanish minor. I do not think it is possible to effectively practice and continue to learn a language through a hybrid or online class.
A comment by a student after the spring of 2025 reveals a similar desire for oral practice in asynchronous Spanish 331: “This is an online class, but I feel like, even so, there were no chances to practice conversational or speaking skills. 100% of this work is reading, writing, and listening, so I had to hire a tutor to practice speaking for my study abroad trip to Costa Rica.” At the core of these comments lies a measure of discontent over insufficient interaction with the instructor, which aligns with a widely accepted pedagogical theory advanced by Long (1985), according to which interaction is paramount in promoting SLA.
Research (Pan & Steed, 2016; Peterson, 2005) has indicated the potential effectiveness of avatar chatbots in improving SLA by facilitating interaction. Scholars (Hyde et al., 2015; Schrader, 2019; Steptoe et al., 2010) have also found that this effectiveness is enhanced by incorporating animation techniques into the design of an avatar chatbot, and by linking it to a learning context. The learning context in the present study is a unit in asynchronous Spanish 331 on the Spanish Civil War (1936-1939) and the dictatorship of Francisco Franco (1939-1975), which requires viewing Guillermo del Toro’s 2006 feature-length film,
El laberinto del Fauno (Pan’s Labyrinth). Slabot was designed to complement the mythical post-Civil War world in
El laberinto del Fauno, which is centered around an abandoned labyrinth and a mysterious creature, “Fauno.” For a discussion of the Fauno character, see:
https://screenrant.com/pans-labyrinth-movie-faun-creature-explained/#:~:text=The%20Faun%20Creature%20from%20Pan's,traits%20mixed%20together%20more%20cohesively. Physically, Fauno is a visually striking character that combines human and animal characteristics, as can be seen in
Figure 1:
To increase its appeal to students, Slabot was conceived at the University of Tennessee as a university-age avatar by the DLI collaborators, including Dr. Gregory B. Kaplan (Professor of Spanish) as faculty leader, and a technical production team lead by Dr. Jason Johnston (Executive Director of Online Learning), and comprised of Naomi Breeden (Executive Director of Solutions), Michael Eilers (Graphic Designer), Chris Emberton (Assistant Director of Interactive Learning Technologies), Jonathan Fuqua (Web Application Developer), and Dani Powers (Graphic Designer). An initial attempt to create the physical form of Slabot was performed with Creatify (
https://app.creatify.ai/home), which is similar to platforms such as ManyChat (
http://manychat.com), and Tars (
https://hellotars.com/). The process required entering text describing the anticipated physical features of Slabot, as well as descriptions of Slabot’s clothing and the background that would appear:
Physical features: A visually striking non-binary character that combines human and animal characteristics. Small horns on the head. Light green skin. Human torso. Goat legs and hooves. Narrow eyes slanted inward, with a hint of playfulness. A cheerful, non-threatening, and studious look on its face. White, human teeth. A face that is half human and half goat, a tall and lean body, and a deep, rough voice.
Clothing: A dark green cloak that covers the whole body.
Background: The entrance to a labyrinth made out of thick hedges.
The avatar produced by Creatify is seen in
Figure 2.
The physical appearance of the avatar in
Figure 2 was ultimately seen as non-user-friendly, and a second attempt at creating Slabot was performed. As seen in
Figure 3 and
Figure 4, Slabot was endowed with congenial physical features. To achieve the physical form of Slabot,
Adobe Character Animator was used during a process described by Michael Eilers, the University of Tennessee graphic designer who accomplished the task:
First, the DLI team finalized the character design for Slabot, whose physical appearance consists of a youthful figure with a head half-human and half-goat, bright, playful eyes, and a cheerful, non-threatening look on its face. I then used Adobe Character Animator to build the avatar. One of the more difficult aspects of building Slabot was the isolation of the mouth visemes and making sure that these visemes correlated with the phonetics of the audio track. It should be pointed out that a viseme is a visual representation of a phoneme, which is a basic unit of speech sound. In other words, a viseme comprises the mouth shape and facial expression that correspond to a particular sound or word when lip-reading or animating a character. Visemes are important for lip reading, animation, and audio-visual speech recognition. During the process of isolation, constantly checking the timing between mouth visemes and the audio was crucial. In sum, I was able to generate a “rough” animation with the audio files provided and clean up the timing of Slabot’s mouth by modifying visemes within the timeline.
The audio files to which Michael Eilers refers above contain the discourse that would be spoken by Slabot to students. This discourse was designed to engage students in Spanish by always ending its replies to students with a question in Spanish. In addition, two basic functions for Slabot were established:
1. Greeting students
2. Asking students to discuss themes related to El laberinto del Fauno
During a stage in this project that will be completed in the fall of 2025, the two basic functions will be manifested by a greeting and five questions asked by Slabot. Students will control the pace of the interview by manually pushing stop and play buttons and will be able to hear each question more than once if needed. However, students will only be permitted one recorded response per question.
The first five questions will be formulated to inspire students to progress through levels of profundity in their responses, from repetitive details to speculative discourse:
Hola, voy a hacerte cinco preguntas sobre la película El laberinto del Fauno.
(Hello, I will ask you five questions about the Pan’s Labyrinth movie.)
Primera pregunta: En la película, el papel de Fauno es guiar a Ofelia en sus tres tareas. ¿Cuáles son esas tareas?
(First question: In the movie, Fauno’s role is to guide Ofelia in her three tasks. What are those three tasks?”)
Segunda pregunta: El director de la película, Guillermo del Toro, mezcla un mundo real con otro imaginario, ¿qué tienen en común ambos mundos?
(Second question: The director of the film, Guillermo del Toro, mixes a real world with an imaginary one: what do the two worlds have in common?)
Tercera pregunta: ¿Cuáles son algunas características del capitán Vidal?
(Third question: What are some of the characteristics of Captain Vidal?)
Cuarta pregunta: ¿Cómo trata el capitán Vidal a Ofelia?
(Fourth question: How does Captain Vidal treat Ofelia?)
Quinta pregunta: En tu opinión, ¿cuál es la moraleja de la película?
(Fifth question: In your opinion, what is the film’s moral?)
The final two questions will require responses from students about an unrehearsed context, namely, a video (
https://www.youtube.com/watch?v=T-j-G2GiE6I) about which they know nothing but which they will view after their interaction with Slabot. In the video, an individual who experienced the Spanish Civil War and Franco’s dictatorship firsthand, Miguel Muñoz (b. 1925-d. 2022), is interviewed and asked to explain some of the hardships that he endured during the 1940s (
El laberinto del fauno takes place in 1944). Slabot will ask the following two questions:
Sexta pregunta: “El laberinto del Fauno tiene lugar en 1944. En unos días verás un vídeo sobre Miguel Muñoz, un hombre que vivió en España durante los años 40. ¿Qué crees que Miguel dirá sobre cómo era su vida entonces?”
(Sixth question: “El laberinto del Fauno takes place in 1944. In a few days you'll see a video about Miguel Muñoz, a man who lived in Spain during the 1940s. What do you think Miguel will say about what his life was like back then?”)
Séptima pregunta: “Miguel Muñoz nació en 1925. ¿Cómo le describirías a Miguel cómo su vida podría verse afectada por los acontecimientos mundiales —políticos, sociales, económicos o militares— entre 1925 y 1945?
(Seventh question: “Miguel Muñoz was born in 1925. How would you describe to Miguel how his life might be affected by world events—political, social, economic, or military—between 1925 and 1945?”)
The capacity of Slabot to evoke substantive responses from students by guiding them through five contextualized questions may be evident in their recorded responses, which will be evaluated in the fall of 2025 according to the ACTFL ratings.
The ACTFL ratings are used at the University of Tennessee and at many other higher education institutions to assess the oral abilities of students in SLA courses. The ratings are assigned during an Oral Proficiency Interview (OPI):
The ACTFL Oral Proficiency Interview (OPI) is a valid and reliable means of assessing how well a person speaks a language. It is a 15-30-minute one-on-one interview between you and a certified ACTFL tester. The OPI is an assessment that is carried out in the form of an interview, but follows an established structure and protocol in order to elicit a ratable speech sample. (ACTFL OPI Examinee Handbook, 2024, p. 4)
Based on an evaluation during the OPI of what ACTFL guidelines describe as “the ability to use language that reflects practical communication tasks and that has been learned and practiced in an instructional or other structured setting” (ACTFL Proficiency Guidelines, 2024, p. 7), the assessor can award an ACTFL proficiency rating: Distinguished, Superior, Advanced High, Advanced Mid, Advanced Low, Intermediate High, Intermediate Mid, Intermediate Low, Novice High, Novice Mid, Novice Low. Descriptions of how each of these ratings are assigned are found in ACTFL Proficiency Guidelines (2024, p. 15-23).
In addition to an in-person OPI, ACTFL also offers a virtual option:
The ACTFL Oral Proficiency Interview-Computer (ACTFL OPIc) is a proctored, internet-delivered test of oral communication. It imitates the experience of a “live” ACTFL Oral Proficiency Interview (OPI) in a virtual format. Interview questions are selected by a carefully designed computer program and delivered using a virtual avatar. On average, the ACTFL OPIc takes 20 to 40 minutes to complete.
The goal of the ACTFL OPIc is the same as that of the ACTFL OPI: to obtain a speech sample that a rater can evaluate in relation to the ACTFL Proficiency Guidelines 2024—Speaking in order to assign a rating. The recordings of the test taker’s responses are made available electronically through a secure internet site to ACTFL-certified OPIc raters. The ACTFL OPIc measures a range of proficiency on the ACTFL scale from Novice to Superior.
The ACTFL OPIc was developed in response to increasing worldwide demand for oral language proficiency testing that is appropriate for both small-group and large-scale testing. It provides valid and reliable oral proficiency assessment in a format that allows hundreds of examinees to take the test online at the same time; it can be completed on demand from anywhere in the world, and at a time that is convenient for both the candidate and the proctor. (ACTFL Oral Proficiency Interview-Computer, 2024, p. 3)
The ACTFL OPIc involves integration with an avatar, samples of which can be seen in
Figure 5:
The ACTFL OPIc interviewer is personified by an avatar, whose name and image varies depending on the language being tested…Having a picture of the avatar on the screen helps to engage the test takers and mimics a one-on-one conversation with a live tester, as in the ACTFL OPI. (ACTFL Oral Proficiency Interview-Computer, 2024, p. 5)
When the test taker is ready, the ACTFL OPIc begins with the avatar stating: “Let’s start the interview now. Tell me something about yourself.” This serves as a warm-up and an opportunity for the test taker to begin using the language and to interact with the avatar before the main test begins. This warm-up activity is not rated.
The ACTFL OPIc then proceeds with the avatar asking randomly selected questions from within the predetermined pool of prompts, and the test taker providing responses. After completing the last response, the test taker sees an ending screen with the message “Congratulations! You have successfully completed your test.” The test taker’s recorded speech sample is then automatically uploaded to a secure rater site.” (ACTFL Oral Proficiency Interview-Computer, 2024, p. 5)
Like the in-person OPI, the ACTFL OPIc measures what ACTFL classifies as proficiency:
Proficiency describes an individual’s ability to use the language in all types of situations, with regard to topics that may or may not be familiar and in contexts that may or may not have been encountered previously. Proficiency refers to what an individual is able to do regardless of the setting, or where, when, and how the language was learned. (ACTFL Proficiency Guidelines, 2024, p. 6)
As in the case of the ACTFL OPIc, the seven questions that Slabot will ask involve “topics that may or may not be familiar and in contexts that may or may not have been encountered previously.” In particular, students in Spanish 434 will be required to discuss El laberinto del Fauno, which they have viewed, and Miguel Muñoz, with whom they are not familiar.
Results
An initial test of Slabot was performed in asynchronous Spanish 331 in late April 2025. The test was associated with two asynchronous classes and was an optional activity, whose successful completion counted for 1% of extra credit. During the two asynchronous classes, students viewed El laberinto del Fauno. Students were then given several days to respond to Slabot’s first question: “Primera pregunta: En la película, el papel de Fauno es guiar a Ofelia en sus tres tareas. ¿Cuáles son esas tareas? (First question: In the movie, Fauno’s role is to guide Ofelia in her three tasks. What are those three tasks?)”.
Out of 34 students in asynchronous Spanish 331, five students completed the extra credit assignment. The times of the five recorded responses, in chronological order of submission, were as follows: 17 seconds (May 5, 2025); 33 seconds (May 2, 2025); 41 seconds (May 8, 2025); 11 seconds (May 9, 2025); 1 minute, 10 seconds May 9, 2025. Their responses to Slabot’s question therefore ranged in time from 11 seconds to 1 minute and 10 seconds. Although five students do not constitute a large sample, the fact that one student’s response reached 1 minute and 10 seconds indicates the potential for Slabot to perform a pioneering function for an avatar chatbot, namely, to elicit ACTFL ratable speech samples from students.
Slabot’s potential is evident in the response that lasted for 1 minute and 10 seconds:
Hola, me llamo XXXXXX, en la película de El laberinto del Fauno, la Fauno le da a Ofelia tres tareas para completar. La primera tarea es ella necesita obtener un llave y, la llave, una llave. Y la llave está en el estómago de un sapo. Y el sapo está alrededor un arbol. La segunda tarea es ella necesita obtener un daga y, una daga, y la daga está con el hombre palido. El hombre palido es muy alta y tiene, tiene, el tiene ojos en sus manos. Y la tarea tercera es ella necesita mostrar su obediencia y pureza, pureza, porque, porque la Fauna pide que, pide que Ofilia mata, mata su hermano. Y porque él es un bebé ella, ella no mata su, su hermano y este es una manera para mostrar, para mostrar su obediencia y pureza. Gracias.
(Hello, my name is XXXXXX. In the movie Pan's Labyrinth, the Faun gives Ophelia three tasks to complete. The first task is she needs to get a key, and the key, a key. And the key is in the stomach of a toad. And the toad is around a tree. The second task is she needs to get a dagger, a dagger, and the dagger is with the pale man. The pale man is very tall and has eyes on his hands. And the third task is she needs to show her obedience and purity, purity, because, because Fauna asks that Ophelia kill, kills her brother. And because he is a baby, she, she doesn't kill his, her brother, and this is a way to show, to show her obedience and purity. Thank you.)
The instructor was also able to offer advice to the student concerning grammatical errors, which are indicated in bold type below in the corrected version of the response above:
Hola, me llamo XXXXXX, en la película de El laberinto del Fauno, la Fauno le da a Ofelia tres tareas para completar. La primera tarea es [que] ella necesita obtener un llave y, la llave, una llave. Y la llave está en el estómago de un sapo. Y el sapo está alrededor [de] un árbol [arból]. La segunda tarea es [que] ella necesita obtener un daga y, una daga, y la daga está con el hombre pálido [palído]. El hombre pálido [palído] es muy alta y tiene, tiene, el tiene ojos en sus manos. Y la tarea tercera es [que] ella necesita mostrar su obediencia y pureza, pureza, porque, porque la Fauna pide que, pide que Ofelia [Ofilia] mata [mate], mata [mate] [a] su hermano. Y porque él es un bebé ella, ella no mata [a] su, su hermano y este es una manera para mostrar, para mostrar su obediencia y pureza. Gracias.
In addition to receiving by email the corrected version of the response, the student was also provided with the following advice:
Thank you for your response. Please be aware of several grammatical errors:
Gender disagreement:
“la Fauno” instead of “el Fauno”
“un llave” instead of “una llave”
“un daga” instead of “una daga”
“alta” instead of “alto”
“la Fauna” instead of “el Fauno”
“este es una manera” instead of “esta es una manera
Missing conjunctions:
“es [que] ella necesita obtener”
“es [que] ella necesita mostrar”
Missing prepositions:
[de] un árbol
[a] su hermano
Accentuation and vowel pronunciation during oral discourse:
“árbol” [not “arból”]
“pálido” [not “palído”]
“Ofelia” [not Ofilia]
Syntax:
“tarea tercera” instead of “tercera tarea.”
Mood:
“pide que, pide que Ofelia mata [mate], mata [mate] [a] su hermano” (the third person present subjunctive form “mate” is required after the verb “pide”).
This assessment would have undoubtedly been more difficult to perform after an in-person interview without a recorded response, which underscores an advantage of using an avatar chatbot to conduct the interview. In other words, the fact that the interviewer is not present, and occupies the singular role of observer, rather than serving as both observer and recorder, offers a perspective by which detailed grammatical corrections may be provided in addition to an ACTFL rating.