Submitted:
10 March 2026
Posted:
13 March 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Translog-II: Components and Functions
2.1. Components of Translog-II
2.2. Functions of Translog-II
2.3. Replay and Visualization
3. Data Processing and Keystroke-to-word Mapping
3.1. CRITT TPR-DB
3.2. Keystroke logging data
- Type: type of the keyboard activity: insert, delete, return, navigation, IME
- Text: deleted text string, or ‘###’ if it cannot be reconstructed in the editor.
- Value: value of the keystroke. This is usually the inserted character and the logged text modification in the case of IME
- X / Y: position of the top left corner of the character on the screen
- Width: width of the inserted character
- Height: height of the inserted character
3.3. Text Diff Logging and Input Method Editor (IME)
NIOem5TUNAGABackBackBackBackTUNAGATTASpace
SHBackBackD4BAISpaceBackBackSHUUSINKEISpace
3.4. Keystroke to Word Mapping
4. Research Case and Analysis
4.1. Data Collection

4.1.1. Create a new project



4.1.2. Open and run a keylogging experiment

4.2. Data Processing
4.2.1. Replay log files

4.2.2. Upload to the TPR-DB

4.2.3. Word-alignment and manual annotation


4.3. Data Analysis
4.3.1. TPR-DB tables


4.3.2. Data analysis on Jupyter notebook


4.3.3. Data visualization on Shiny R interface


5. Discussion and Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Alves, F., & Vale, D. (2009). Probing the unit of translation in time: Aspects of the design and development of a web application for storing, annotating, and querying translation process data. Across languages and cultures, 10(2), 251–273. [CrossRef]
- Alves, F., & Vale, D. (2011). On drafting and revision in translation: A corpus linguistics oriented analysis of translation process data. Translation: Computation, corpora, cognition, 1(1), 89–110.
- Angelone, E., Ehrensberger-Dow, M., & Massey, G. (2015). Cognitive processes. Researching translation and interpreting, 43–57. Routledge.
- Carl, M., Dragsted, B., & Jakobsen, A. L. (2011). A taxonomy of human translation styles. Translation journal, 16(2), 155–168.
- Carl, M. (2012a). Translog-II: A Program for Recording User Activity Data for Empirical Translation Process Research. The 8th International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey.
- Carl, M. (2012b). The CRITT TPR-DB 1.0: A database for empirical human translation process research. Workshop on post-editing technology and practice.
- Carl, M. (2014). Produkt-und prozesseinheiten in der critt translation process research database. In Translationswissenschaftliches Kolloquium III: Beiträge zur Übersetzungs-und Dolmetschwissenschaft (Köln/Germersheim) (pp. 247–266). Peter Lang.
- Carl, M., Schaeffer, M., & Bangalore, S. (2016). The CRITT translation process research database. In Carl, M., Schaeffer, M., & Bangalore, S. (Ed.), New directions in empirical translation process research: Exploring the CRITT TPR-DB (pp. 37–50). Cham: Springer International Publishing.
- Carl, M. (2021). Micro units and the first translational response universal. In Explorations in empirical translation process research (pp. 233–257). Cham: Springer International Publishing.
- Catford, J. C. (1965). A linguistic theory of translation. London: Oxford University Press.
- Dragsted, B. (2010). Coordination of reading and writing processes in translation. Translation and Cognition, 15, 41.
- Germann, U. (2008). Yawat: yet another word alignment tool. The ACL-08: HLT demo session.
- Jääskeläinen, R. (2010). Think-aloud protocol. In Gambier, Y., & Doorslaer, L. (Ed.), Handbook of translation studies (pp. 371–374). John Benjamins Publishing Company.
- Koby, G. S. (2015). The ATA flowchart and framework as a differentiated error-marking scale in translation teaching. In Handbook of research on teaching methods in language translation and interpretation (pp. 220–253). IGI Global Scientific Publishing.
- Kotze, H. (2019). Converging what and how to find out why: An outlook on empirical translation studies. In New empirical perspectives on translation and interpreting (pp. 333–371). Routledge.
- Lommel, A., Burchardt, A., Popović, M., Harris, K., Avramidis, E., & Uszkoreit, H. (2014). Using a new analytic measure for the annotation and analysis of MT errors on real data. The 7th Annual conference of the European Association for Machine Translation.
- Muñoz Martín, R., & Cardona Guerra, J. M. (2019). Translating in fits and starts: Pause thresholds and roles in the research of translation processes. Perspectives, 27(4), 525–551. [CrossRef]
- Muñoz Martín, R., Sun, S., Du, Z. & Puerini, S. (2025). Keylogging. In A. Rojo López & R. Muñoz Martín (Ed.) Research Methods in Cognitive Translation and Interpreting Studies (pp. 157–182). John Benjamins Publishing Company.
- Schaeffer, M., & Carl, M. (2017). Language processing and translation. In Hansen-Schirra, S., Czulo, O., & Hofmann, S. (Ed.), Empirical Modeling of Translation and Interpreting (pp. 117–154). Language Science Press.
- Schou, L., Dragsted, B., & Carl, M. (2009). Ten years of Translog. In Mees, I., Alves, F., & Göpferich, S. (Ed.), Copenhagen Studies in Language (pp. 13–54). Copenhagen Business School.
- Shreve, G. M., & Lacruz, I. (2014). Translation as higher-order text processing. A companion to translation studies, 107–118.
- Sun, S. (2015). Measuring translation difficulty: Theoretical and methodological considerations. Across languages and cultures, 16(1), 29–54. [CrossRef]
- Sun, S., & Wen, J. (2017). Translation process research: An overview. In The Routledge handbook of Chinese translation (pp. 275–290). Routledge.
- Vale, D. C., Neumann, S., & Niemietz, P. (2016). Automatic recognition of linguistic replacements in text series generated from keystroke logs. The Tenth International Conference on Language Resources and Evaluation (LREC’16).
- Vinay, J. P., & Darbelnet, J. (1995). Comparative stylistics of French and English. John Benjamins. (Original work published 1958).
- Xiao, R. (2010). How different is translated Chinese from native Chinese?: A corpus-based study of translation universals. International journal of corpus linguistics, 15(1), 5–35. [CrossRef]
- Yamada, M., Mizowaki, T., Zou, L., & Carl, M. (2022). Trados-to-Translog-II: Adding gaze and Qualitivity data to the CRITT TPR-DB. The 23rd Annual Conference of the European Association for Machine Translation, Ghent, Belgium.
- Zou, L., & Carl, M. (2022). Trados and the CRITT TPR-DB: Translation process research in an ecologically valid environment. Model building in empirical translation studies: TRICKLET Conference (2022), Aachen, Germany.
- Zou, L., Carl, M., & Gilbert, D. (2023). Integrating Trados-qualitivity data to the CRITT TPR-DB: Measuring post-editing process data in an ecologically valid setting. In Pan, J., & Laviosa, S. (Ed.), Corpora and translation education: Advances and challenges (pp. 63–86). Singapore: Springer Nature Singapore.
- Zou, L. (2024). Cognitive Processes in Human-ChatGPT Interaction during Machine Translation Post-editing (Doctoral dissertation, Kent State University).
| 1 | To register for a temporary test account, check the link: https://sites.google.com/site/centretranslationinnovation/tpr-db/getting-started. |
| 2 | The data can be downloaded free of charge from the CRITT website: https://sites.google.com/site/centretranslationinnovation/tpr-db. |
| 3 | Many of which are listed on the CRITT website: https://sites.google.com/site/centretranslationinnovation/tpr-db-publications. |
| 4 | Currently, Eyelink, Tobii, and SMI eyetracker are supported. |
| 5 | |
| 6 | In some rare cases the logging information is not complete, most likely due to some unanticipated keystroke combination. |
| 7 | The tool can be downloaded from the CRITT website: https://sites.google.com/site/centretranslationinnovation/tpr-db/auxilliary-tools
|
| 8 | |
| 9 | It is usually recommended to use a font without serifs. |
| 10 | See naming conventions of the TPR-DB via the following link: https://sites.google.com/site/centretranslationinnovation/tpr-db/uploading?authuser=0#h.p_ID_175. |
| 11 | |
| 12 | Check the interactive table describing all the current features supported by the TPR-DB on https://sites.google.com/site/centretranslationinnovation/tpr-db/features?authuser=0. |
| 13 |
| <Key Time ="323531" Cursor ="769" Text =" die Motive für die Morde Norris " Type =" |
| delete " Value ="[ Back ]" /> |
| <Key Time ="324248" Cursor ="769" Type =" insert " Value ="N" /> |
| <Key Time ="323531" Cursor ="769" Text =" die Motive für die Morde Norris " Type =" |
| insert " Value ="N" /> |
| <Key Time="538625" Type="IME" Value="[N]" /> |
| <Key Time="538796" Type="IME" Value="[E]" /> |
| <Key Time="538952" Type="IME" Value="[N]" /> |
| <Key Time="538952" Type="IME" Value="[N]" /> |
| <Key Time="539046" Type="IME" Value="[Space]" /> |
| <Key Time="539451" Cursor="148" IMEtext="NENNSpace" Type="insert" Value="年" /> |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).