Submitted:
21 February 2025
Posted:
24 February 2025
You are already at the latest version
Abstract
This paper examines the impact of artificial intelligence (AI) on digital humanities through a colonial lens, analyzing how AI can both reinforce and challenge colonial power dynamics. AI tools in digital humanities, such as text mining and language preservation, often perpetuate Western epistemologies and marginalize non-Western perspectives due to biases in data and algorithms. Using case studies, such as the Slave Voyages database and indigenous language preservation projects, this paper highlights AI’s dual role as both a potential perpetuator of colonial legacies and a tool for decolonization. It recommends inclusive AI design, community-driven data governance, and the integration of alternative epistemologies to mitigate AI’s colonial biases and promote more equitable knowledge production.
Keywords:
1. Introduction
- How do AI-based techniques in digital humanities reproduce or challenge colonial power structures?
- In what ways do AI-driven projects in digital humanities favor Western epistemologies and exclude marginalized or indigenous perspectives?
- How can AI be employed as a tool for decolonizing digital humanities, ensuring that non-Western voices are included in global knowledge production?
2. Theoretical Framework
- Machine Learning (ML): A subset of AI, machine learning involves training algorithms on large datasets to identify patterns, classify information, and make predictions without explicit programming. ML in digital humanities can assist in analyzing cultural artifacts by recognizing patterns across large datasets, such as identifying trends in historical texts, categorizing images, or clustering themes across vast archives. Common techniques in ML include supervised learning (where models are trained on labeled data), unsupervised learning (pattern detection without labels), and deep learning (which uses neural networks for more complex pattern recognition).
- Natural Language Processing (NLP): NLP refers to AI’s ability to understand, interpret, and generate human language. This is particularly relevant in the analysis of historical documents, literature, and digitized texts. NLP is used in tasks like text mining, sentiment analysis, and automated translation. Projects involving the digitization of colonial archives often apply NLP techniques to mine vast quantities of text, identifying recurring themes or uncovering previously hidden connections within historical records.
- Computer Vision: AI’s computer vision capabilities allow machines to interpret and categorize visual information, making this technology especially useful for analyzing and preserving visual artifacts like paintings, photographs, and sculptures [15]. In digital humanities, computer vision can be applied to recognize patterns in art history, detect visual elements in colonial-era photographs, or even reconstruct incomplete artifacts.
- Generative AI: Generative models, like Generative Adversarial Networks (GANs), are increasingly being used in the humanities to generate art or text based on learned data. For example, these models can be trained on a dataset of colonial-era art or texts and then generate new, derivative works that reflect historical styles or content [7].
2.1. Colonialism: Definition and Framework
2.2. Colonialism in Action
2.2.1. The Slave Voyages Database
2.2.2. Google’s AI for Indigenous Language Preservation
2.2.3. Mapping Colonial India Through Image Recognition
2.2.4. The Colonial Despatches Project
3. AI’s Role in Reinforcing Colonial Structures
4. Decolonizing AI in Digital Humanities
4.1. Practical Examples
4.2. Challenges and Solutions for Decolonial AI
4.2.1. Community-Led Data Curation:
4.2.2. Inclusive AI Design:
4.2.3. Critical AI Literacy for Digital Humanists:
5. Conlusions
- Ethical AI Development: Develop AI systems that are critically aware of the biases in their training data and seek to counterbalance these biases by incorporating diverse datasets and voices. This can include training NLP systems on indigenous languages or ensuring that visual recognition algorithms are not skewed toward Western art and cultural norms.
- Community-Centric Approaches: AI projects in digital humanities should engage directly with the communities whose data is being used, ensuring that these communities have control over how their knowledge is represented and used. This includes respecting indigenous data sovereignty, following ethical frameworks like Te Mana Raraunga, and fostering collaborations that benefit all parties.
- Interdisciplinary Collaboration: Bridging the gap between technologists and humanists is crucial for creating more equitable AI systems. Scholars in digital humanities should work alongside AI developers to ensure that ethical and epistemological concerns are embedded into the design of AI tools from the beginning. This interdisciplinary approach can help identify potential biases and blind spots in AI applications before they become entrenched.
- Inclusive Epistemologies: AI-driven digital humanities projects must move beyond Western frameworks of knowledge and embrace non-Western epistemologies. This involves recognizing the validity of oral histories, communal memory, and alternative understandings of history and culture. By integrating these perspectives into AI systems, digital humanities can help decolonize knowledge production and representation.
References
- Miriyam Aouragh and Paula Chakravartty. Infrastructures of empire: Towards a critical geopolitics of media and information studies. Media, culture & society 2016, 38, 559–575.
- Ruha Benjamin. Race After Technology: Abolitionist Tools for the New Jim Code, Polity, 2019.
- Nicholas Blomley. Unsettling the city: Urban land and the politics of property, Routledge, 2004.
- Dipesh Chakrabarty. Provincializing Europe: Postcolonial Thought and Historical Difference, Princeton University Press, 2000.
- Nick Couldry and Ulises, A. Mejias. The Costs of Connection: How Data Is Colonizing Human Life and Appropriating It for Capitalism, Stanford University Press, 2019. [Google Scholar]
- Nick Couldry and Ulises, A. Mejias. Data colonialism: Rethinking big data’s relation to the contemporary subject. Television & New Media 2019, 20, 336–349. [Google Scholar]
- Roddy Cowie, Catherine Pelachaud, and Paolo Petta. Emotion-oriented systems: the Humaine handbook, Springer, 2011.
- David Eltis and David Richardson. Atlas of the Transatlantic Slave Trade, Yale University Press, 2010.
- Miranda Fricker. Epistemic injustice: Power and the ethics of knowing, Oxford University Press, 2007.
- Marie-Odile Junker and Terry Stewart. A linguistic atlas for endangered languages: www. atlas-ling. ca. In Edulearn11 Proceedings, pages 3366–3376. IATED, 2011.
- Kostas Karpouzis. Plato’s shadows in the digital cave: Controlling cultural bias in generative ai. Electronics 2024, 13, 1457. [CrossRef]
- Tahu Kukutai and John Taylor. Indigenous data sovereignty: Toward an agenda, ANU press, 2016.
- Walter Mignolo. The darker side of western modernity: Global futures, decolonial options, Duke University Press, 2011.
- Safiya Umoja Noble. Algorithms of oppression: How search engines reinforce racism. In Algorithms of oppression. New York university press, 2018.
- Maja Pantic, George Caridakis, Elisabeth André, Jonghwa Kim, Kostas Karpouzis, and Stefanos Kollias. Multimodal emotion recognition from low-level cues. In Emotion-oriented systems: The humaine handbook; 2011; pp. 115–132.
- Christopher Pinney. Camera Indica: the social life of Indian photographs, University of Chicago Press, 1997.
- Aníbal Quijano. Coloniality of power, eurocentrism, and latin america. Nepantla: Views from South, 2000; 1, 533–580.
- Roopika Risam. New Digital Worlds: Postcolonial Digital Humanities in Theory, Praxis, and Pedagogy, Northwestern University Press, 2018.
- Edward, W. Said. Orientalism, Pantheon Books, 1978. [Google Scholar]
- Yannis Skarpelos, Sophia Messini, Elina Roinioti, Kostas Karpouzis, Stavros Kaperonis, and Michaela-Gavriela Marazoti. Emotions during the pandemic’s first wave: The case of greek tweets. Digital 2024, 4, 126–151.
- Maggie Walter. The voice of indigenous data: Beyond the markers of disadvantage. Griffith Review 2018, 256–263.
| 1 | Slave Voyages: The Trans-Atlantic Slave Trade Database, Emory University, https://www.slavevoyages.org
|
| 2 | Colonial Despatches: The Colonial Despatches of Vancouver Island and British Columbia 1846-1871, University of Victoria, https://www.colonialdespatches.ca
|
| 3 | Endangered Archives Programme, https://eap.bl.uk/
|
| 4 | Algonquian Linguistic Atlas, https://www.atlas-ling.ca
|
| 5 | Te Mana Raraunga: Māori Data Sovereignty Network, https://www.temanararaunga.maori.nz
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).