Submitted:
06 July 2025
Posted:
08 July 2025
You are already at the latest version
Abstract
Keywords:
Chapter One: Introduction
1.1. Background of the Study
1.2. Statement of the Problem
1.3. Objectives of the Study
- To identify and categorize the key NLP techniques and models used in poetic generation and literary analysis.
- To analyze the capabilities of these systems in mimicking or enhancing human creativity in language.
- To assess the effectiveness and limitations of existing evaluation metrics in capturing creativity, aesthetic quality, and semantic depth.
- To examine interdisciplinary contributions that combine computer science, literary theory, and digital humanities in exploring AI-assisted literary creativity.
- To highlight ethical, philosophical, and cultural considerations arising from the use of NLP in creative domains.
- To propose a future research agenda that integrates technological development with aesthetic theory and cultural critique.
1.4. Research Questions
- What are the major NLP models and techniques applied in computational poetics and literary analysis from 2000 to 2025?
- In what ways do NLP systems simulate, enhance, or collaborate in the creation of poetic or literary content?
- What are the current challenges and limitations of these systems in terms of thematic coherence, originality, metaphorical complexity, and interpretive depth?
- How are machine-generated or machine-interpreted literary texts evaluated, and are these evaluation methods adequate from a creative and aesthetic standpoint?
- What ethical and philosophical questions emerge from using NLP in creative writing and critical analysis?
- How can interdisciplinary collaboration between AI researchers, literary scholars, and artists foster more meaningful and inclusive applications?
1.5. Significance of the Study
1.6. Scope and Delimitation
1.7. Structure of the Review
- Chapter One introduces the study, outlines the problem, objectives, and significance, and sets the scope.
- Chapter Two presents a historical and conceptual review of literature related to NLP in creative writing and literary studies.
- Chapter Three details the methodological framework for the systematic review, including inclusion criteria, data sources, and thematic analysis approach.
- Chapter Four presents and organizes the findings into categories of generation, interpretation, and interdisciplinary collaboration.
- Chapter Five discusses the implications of the findings in terms of theory, practice, and ethics.
- Chapter Six concludes the review and proposes future research directions for the field.
1.8. Conclusions
Chapter Two: Literature Review
2.1. Introduction
2.2. Historical Evolution of NLP in Creative Texts
2.2.1. From Rule-Based Systems to Statistical Methods
2.2.2. Emergence of Neural Networks and Deep Learning
2.3. Computational Poetics: NLP in Poetry Generation
2.3.1. Poetic Structure and Form
2.3.2. Semantic and Figurative Language
2.3.3. Evaluation of Machine-Generated Poetry
2.4. NLP in Literary Analysis and Interpretation
2.4.1. Stylometry and Authorship Attribution
2.4.2. Topic Modeling and Distant Reading
2.4.3. Sentiment and Emotion Analysis in Literary Texts
2.5. Interdisciplinary Contributions and Theoretical Frameworks
- Cognitive poetics provides a psychological grounding for how readers process metaphor, narrative, and literary style, offering insights for NLP modeling.
- Semiotics and deconstructionist theory challenge the notion of fixed meaning, influencing how NLP systems deal with ambiguity and polysemy.
- Philosophy of language, particularly from thinkers like Searle and Derrida, critiques the assumption that language can be reduced to computable form—an issue directly relevant to NLP's limitations.
2.6. Limitations in the Literature
- Cultural Bias in Training Data: Most NLP models are trained on English-centric, Western literary corpora, marginalizing non-Western poetics and indigenous forms.
- Lack of Semantic Depth: While surface fluency has improved, models still struggle with deep semantic cohesion and symbolic layering.
- Evaluation Standardization: There is no consensus on how to measure literary creativity or interpretive accuracy in computational outputs.
- Neglect of Reader Reception: Few studies examine how readers perceive or respond to AI-generated texts, which is crucial for assessing creative impact.
2.7. Emerging Trends
- Multilingual and Cross-Cultural NLP is expanding the reach of literary AI to include diverse poetic traditions and global narratives.
- Zero-shot and Few-shot Learning are enabling models to generate high-quality literary content with minimal supervision.
- Explainable NLP seeks to make model decisions more transparent, fostering better integration into scholarly interpretation.
- Co-creative Systems position AI as a collaborator in the writing process, rather than an autonomous creator—reshaping notions of authorship.
2.8. Summary of the Chapter
Chapter Three: Methodology
3.1. Introduction
3.2. Research Design
3.3. Research Questions Revisited
- What are the major NLP models and techniques applied in computational poetics and literary analysis between 2000 and 2025?
- In what ways do NLP systems simulate, enhance, or collaborate in the creation or interpretation of poetic or literary content?
- What are the evaluation methods used for assessing creativity and interpretive depth in NLP-generated texts?
- What theoretical, ethical, or cultural issues emerge in these applications?
3.4. Inclusion and Exclusion Criteria
-
Inclusion Criteria:
- ○
- Peer-reviewed journal articles, conference proceedings, technical reports, and book chapters.
- ○
- Published between January 2000 and March 2025.
- ○
- Written in English.
- ○
- Focused explicitly on NLP in poetry generation, stylistic modeling, metaphor analysis, literary interpretation, or related topics.
-
Exclusion Criteria:
- ○
- Non-peer-reviewed blog posts, editorials, or non-scholarly commentaries.
- ○
- Works focused exclusively on non-literary genres (e.g., dialogue systems for customer service).
- ○
- Studies without methodological or empirical grounding.
3.5. Data Sources and Search Strategy
- ACM Digital Library
- IEEE Xplore
- Scopus
- Google Scholar
- SpringerLink
- arXiv.org (for preprints)
3.6. Data Extraction and Organization
- Title and authors
- Year of publication
- Publication type (journal, conference, etc.)
- NLP model or method used
- Literary or poetic application
- Evaluation method (human, automated, hybrid)
- Key findings and contributions
3.7. Quality Assessment
- Methodological transparency
- Relevance to research questions
- Contribution to theory or practice
- Replicability of results
- Acknowledgment of limitations
3.8. Limitations of the Methodology
- Potential publication bias toward positive or novel results.
- Language restriction to English may exclude key global contributions.
- Difficulty in quantifying creative output using traditional NLP metrics.
- Subjectivity in thematic classification despite standard coding protocols.
3.9. Summary
Chapter Four: Results and Thematic Analysis
4.1. Introduction
4.2. NLP Models in Creative Text Generation
- Rule-Based and Template Systems: Early works relied on manually crafted templates to emulate poetic forms (e.g., haiku generators using syllabic constraints).
- Statistical Models: n-gram models and Markov chains were used in earlier studies to generate pseudo-poetry.
- Neural Models: The majority of recent studies use LSTMs, GRUs, and Transformers. GPT-2 and GPT-3 dominate contemporary poetry generation, capable of producing grammatically fluent and stylistically aware texts.
- Multimodal and Interactive Systems: A small subset incorporated visual or affective cues to guide poem generation (e.g., poems based on images or sentiment prompts).
4.3. NLP in Literary Analysis
- Stylometry and Authorial Attribution: Use of function word frequency, sentence structure, and embedding models for authorship detection (e.g., in Shakespearean corpus).
- Topic Modeling and Thematic Mapping: Latent Dirichlet Allocation (LDA) used to uncover recurring motifs in novels and poetry.
- Metaphor and Figurative Language Detection: Leveraging resources like MetaNet and FrameNet to identify complex semantic structures.
- Narrative and Sentiment Analysis: Tracing emotional arcs and character sentiment shifts within texts using BERT-based sentiment classifiers.
4.4. Evaluation of NLP-Created Literary Works
- Human Judgment Studies: Participants rated outputs for coherence, creativity, emotional depth, and novelty.
- Automated Metrics: Use of BLEU, ROUGE, METEOR—though frequently criticized for being inadequate for aesthetic judgment.
- Hybrid Models: Combined automated metrics with crowd-sourced or expert evaluations.
- Emerging Measures: Novel proposals for evaluating creativity include the Creativity Assessment Index and surprise-novelty-consistency scales.
4.5. Ethical, Cultural, and Theoretical Dimensions
- Cultural Representation: Highlighted the Western bias in training corpora and neglect of non-Western literary traditions.
- Authorship and Originality: Debates about algorithmic authorship, especially in contexts of publication or monetization.
- Reader Perception: Studies probing whether readers can distinguish between human and machine-created texts.
- Philosophical Frameworks: Engagement with post-structuralist theories and the philosophy of language in interpreting algorithmic outputs.
4.6. Summary of Findings
- The field has seen explosive growth post-2017 due to transformer models.
- Poetry and short-form prose are more commonly studied than long-form narratives.
- Evaluation remains a methodological weak point.
- There is significant theoretical potential in bridging NLP with literary theory, but collaboration remains rare.
Chapter Five: Discussion
5.1. Introduction
5.2. Bridging Creativity and Computation
5.3. Evaluation Challenges and Subjectivity
5.4. The Role of Human-AI Collaboration
5.5. Interdisciplinary Integration
5.6. Ethical and Philosophical Considerations
5.7. Implications for Education and Pedagogy
5.8. Summary
Chapter Six: Conclusion and Future Directions
6.1. Overview
6.2. Summary of Key Findings
- NLP tools have advanced significantly in generating creative text, especially with the advent of transformers.
- Literary analysis using NLP is increasingly sophisticated, enabling thematic, stylistic, and emotional exploration of texts.
- Evaluation remains a major methodological challenge, particularly for aesthetic and affective dimensions.
- Ethical and cultural issues must be addressed more robustly in both model design and deployment.
- Human-AI collaboration offers a productive middle ground that supports creativity without displacing it.
6.3. Contributions of the Study
- Synthesizing literature across NLP, poetics, and literary theory.
- Identifying conceptual gaps and proposing new evaluation paradigms.
- Highlighting the role of co-creative systems in redefining literary authorship.
- Advocating for cross-cultural and interdisciplinary research methodologies.
6.4. Recommendations for Future Research
- Develop Interdisciplinary Evaluation Metrics: Bridging literary theory with computational metrics to better assess creative outputs.
- Expand Multilingual and Multicultural Datasets: Including diverse poetic traditions to mitigate cultural bias.
- Explore Reader Reception Studies: Empirically assess how human audiences engage with machine-generated texts.
- Enhance Explainability in NLP Models: Making creative decisions of AI systems more transparent and interpretable.
- Foster Cross-Disciplinary Collaborations: Joint projects between literary scholars and AI researchers to produce hybrid models of creativity.
6.5. Final Reflections
References
- Rahman, M. H.; Kazi, M.; Hossan, K. M. R.; Hassain, D. The Poetry of Programming: Utilizing Natural Language Processing for Creative Expression, 2023.
- Boden, M. A. The Creative Mind: Myths and Mechanisms, 2nd ed.; Routledge, 2004. [Google Scholar]
- Colton, S.; Pease, A.; Ritchie, G. The Effect of Input Knowledge on Creativity. In In Proceedings of the AISB Symposium on AI and Creativity in Arts and Science; 2001. [Google Scholar]
- Veale, T.; Hao, Y. A Fluid Knowledge Representation for Understanding and Generating Creative Metaphors. Knowledge-Based Systems 2008, 21(7), 614–622. [Google Scholar]
- Ghazvininejad, M.; Shi, X.; Choi, Y.; Knight, K. Hafez: An Interactive Poetry Generation System. In Proceedings of ACL 2017, System Demonstrations 2017, 43–48. [Google Scholar]
- Lau, J. H.; Cohn, T.; Baldwin, T. Deep-speare: A joint neural model of poetic language, meter and rhyme. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics; 2018; pp. 1948–1958. [Google Scholar]
- Fan, A.; Lewis, M.; Dauphin, Y. Hierarchical Neural Story Generation. In Proceedings of ACL 2018; 2018; pp. 889–898. [Google Scholar]
- Radford, A.; Wu, J.; Child, R.; Luan, D.; Amodei, D.; Sutskever, I. Language Models are Unsupervised Multitask Learners; OpenAI Technical Report, 2019. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A. N.; Polosukhin, I. Attention is All You Need. In Advances in Neural Information Processing Systems; 2017; pp. 5998–6008. [Google Scholar]
- Pillutla, K.; Li, S.; Zettlemoyer, L. MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers. In Advances in Neural Information Processing Systems (NeurIPS); 2021. [Google Scholar]
- Holyoak, K. J.; Thagard, P. Mental Leaps: Analogy in Creative Thought; MIT Press, 1995. [Google Scholar]
- Dennett, D. Consciousness Explained; Little, Brown and Co, 1991. [Google Scholar]
- Searle, J. R. Minds, Brains, and Programs. Behavioral and Brain Sciences 1980, 3(3), 417–457. [Google Scholar]
- McCormack, J.; Gifford, T.; Hutchings, P. Autonomy, Authenticity, Authorship and Intention in Computer Generated Art. IEEE Transactions on Affective Computing 2019, 10(3), 351–363. [Google Scholar]
- Yang, Z.; Dai, Z.; Yang, Y.; Carbonell, J.; Salakhutdinov, R.; Le, Q. V. XLNet: Generalized Autoregressive Pretraining for Language Understanding. In Advances in Neural Information Processing Systems; 2019. [Google Scholar]
- Holton, R. AI and the ‘Art’ of Creativity. Leonardo 2009, 42(5), 418–423. [Google Scholar]
- Elgammal, A.; Liu, B.; Elhoseiny, M.; Mazzone, M. CAN: Creative Adversarial Networks, Generating ‘Art’ by Learning About Styles and Deviating from Style Norms. In In Proceedings of the 8th International Conference on Computational Creativity; 2017. [Google Scholar]
- Lin, C.-Y. ROUGE: A Package for Automatic Evaluation of Summaries. In In Proceedings of the Workshop on Text Summarization Branches Out (WAS 2004); 2004. [Google Scholar]
- Papineni, K.; Roukos, S.; Ward, T.; Zhu, W.-J. BLEU: A Method for Automatic Evaluation of Machine Translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics; 2002; pp. 311–318. [Google Scholar]
- Dincer, B.; Aydın, C. C. The Ethics of AI-generated Literature: Issues of Responsibility, Accountability, and Meaning. AI & Society 2021, 36, 1105–1118. [Google Scholar]
- Floridi, L. The Logic of Information: A Theory of Philosophy as Conceptual Design; Oxford University Press, 2019. [Google Scholar]
- Ha, D.; Eck, D. A Neural Representation of Sketch Drawings. In International Conference on Learning Representations (ICLR); 2018. [Google Scholar]
- Manjavacas, E.; Koolen, C. Stylometry for Authorship Attribution: A Literature Review. Digital Scholarship in the Humanities 2021, 36 Supplement_1, i108–i126. [Google Scholar]
- Sharma, R.; Kaur, H. A Review of Deep Learning Techniques in Poetry Generation. International Journal of Advanced Computer Science and Applications 2022, 13(1), 527–536. [Google Scholar]
- Zhang, Y.; Sun, S.; Galley, M.; Chen, Y. C.; Brockett, C.; Gao, X.; Dolan, B. DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations; 2020; pp. 270–278. [Google Scholar]
- Hutchinson, B.; Prabhakaran, V.; Denton, E.; Webster, K.; Zhong, Y.; D'Amour, A. Towards Transparent and Accountable NLP: A Review of Bias in Language Model Applications. In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency; 2021; pp. 629–644. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).