Submitted:
21 June 2026
Posted:
23 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction: A Question Demanding Clarification
2. Core Framework: Alignment and Evocation
2.1. Alignment: Aesthetics as Statistical Fitting
2.2. Evocation: Aesthetic Experience at the Fragment Recombination Level
2.3. The Relationship Between Alignment and Evocation
3. Neural Pathways: The Neurological Basis of Three Art Forms
3.1. The Auditory-Emotional Direct Pathway (Music)
3.2. The Visual-Semantic Construction Pathway (Painting)
3.3. The Multimodal-Scenario Simulation Pathway (Literature)
3.4. The Unique Position of Multimodal Integrated Art (Film)
4. The Ring Scale: A Quantification Dimension for Aesthetic Intensity
| Ring | Experience Description | Neural Correspondence | Mechanism Attribute | Evocation Level |
| Miss | No response, even aversion | Prediction error too large or too small | Alignment failure | — |
| 1-3 Rings | Recognition ("It's okay," "Listenable") | Familiar paradigm matched, no new connections | Pure alignment | — |
| 4-6 Rings | Touched ("Good," "Interesting") | Local fragments activated, no cross-regional integration | Alignment dominant | B+: Preliminary evocation |
| 7-8 Rings | Struck ("Hit," "Goosebumps," "Tears welling") | Multimodal fragments synchronously activated, Yijing emerges | Evocation dominant | A: Large-scale evocation |
| 9 Rings | Transformation ("Brain reorganized") | Large-scale neural synchronization, self-boundary dissolution, sustained cognitive restructuring | Deep evocation | A+: Extreme evocation |
| 10 Rings | Ineffable | Rare peak experience, loss of time sense, dissolution of self | Limit of evocation | A+: Extreme evocation |
| 1 Tables may have a footer. | ||||
5. The Capability Boundaries of AI
6. From Description to Guidance: Three Strategic Shifts
6.1. Shift One: From Substituting Humans to Complementing Humans
6.2. Shift Two: From Exhausting Known Imagery to Detecting Individual Fragments
6.3. Shift Three: From Pursuing Ring Scores to Pursuing Transformation
7. Grounding in Practice: Imagination-Driven Prompt Engineering
7.1. Strategy One: From "Emotion Labels" to "Physiological Arousal Descriptions"
7.2. Strategy Two: From "Imagery Accumulation" to "Multimodal Simulation Prompts"
7.3. Strategy Three: Strategic Omission—From "Pursuing Perfection" to "Preserving Fissures"
8. AI Hallucination: A Potential Catalyst for Imagination
9. Conclusion
10. Hypotheses, Verification, and Call for Empirical Research
10.1. List of Testable Hypotheses
- Hypothesis One: Physiological arousal descriptions outperform emotion labels. In prompts for AI music/sound effect creation, using physiological arousal descriptions enhances the audience's emotional experience intensity and physiological arousal level more effectively than using emotion labels.
- Hypothesis Two: Multimodal simulation prompts outperform purely visual imagery. In prompts for AI literature/film script creation, incorporating multi-sensory embodied details enhances the audience's multimodal scenario simulation intensity and Yijing experience depth more effectively than purely visual imagery accumulation.
- Hypothesis Three: Strategic blank-leaving has an optimal range. In AI art creation, instructing AI to manufacture controlled imperfections or blanks produces superior evocation effects compared to "flawless" output; however, the degree of blank-leaving has an optimal range—too little produces no effect, too much becomes genuine defect.
- Hypothesis Four: Screened output from AI hallucination can serve as a trigger source for imagination. The unconventional imagery combinations produced by AI hallucination, after human creator screening and secondary processing, can trigger the human creator's imagination and ultimately elevate the evocation level of the work.
- Hypothesis Five: Evocation has levels, and higher levels require the participation of a latent evocative reservoir. Aesthetic evocation is a spectrum from shallow to deep (B+ → A → A+); higher-level evocation (A level and above) requires the participation of the audience's latent evocative reservoir, and therefore depends more heavily on individual differences among audience members.
10.2. Verification Framework
10.3. Call for Empirical Research
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Barsalou, Lawrence W. Perceptual symbol systems. Behav. Brain Sci. 1999, 22(4), 577–660. [Google Scholar] [CrossRef] [PubMed]
- Friston, Karl. The free-energy principle: A unified brain theory? Nat. Rev. Neurosci. 2010, 11(2), 127–138. [Google Scholar] [CrossRef] [PubMed]
- Gallese, Vittorio; Lakoff, George. The brain’s concepts: The role of the sensory-motor system in conceptual knowledge. Cogn. Neuropsychol. 2005, 22(3–4), 455–479. [Google Scholar] [CrossRef] [PubMed]
- Holm-Hadulla, Rainer M.; Bertolino, Aljoscha. Creativity, alcohol and drug abuse: The pop icon Jim Morrison. Psychopathology 2014, 47(3), 167–173. [Google Scholar] [CrossRef] [PubMed]
- Iser, Wolfgang. The Act of Reading: A Theory of Aesthetic Response; Johns Hopkins University Press: Baltimore, 1978. [Google Scholar]
- Koestler, Arthur. The Act of Creation; Hutchinson: London, 1964. [Google Scholar]
- Ludwig, Arnold M. Alcohol input and creative output. Br. J. Addctn. 1990, 85(7), 953–963. [Google Scholar] [CrossRef] [PubMed]
- Martindale, Colin. The Clockwork Muse: The Predictability of Artistic Change; Basic Books: New York, 1992. [Google Scholar]
- Norlander, Torsten; Gustafson, Roland. Effects of alcohol on a divergent figural fluency test during the illumination phase of the creative process. Creat. Res. J. 1998, 11(3), 265–274. [Google Scholar] [CrossRef] [PubMed]
- Ramachandran, V. S.; Hirstein, William. The science of art: A neurological theory of aesthetic experience. J. Conscious. Stud. 1999, 6(6–7), 15–51. [Google Scholar]
- Sayette, Michael A.; et al. Alcohol and creativity: A meta-analysis. Psychol. Bull. 2012, 138(4), 637–663. [Google Scholar] [CrossRef] [PubMed]
- Substance Abuse and Mental Health Services Administration (SAMHSA). Substance Use and Substance Use Disorders by Industry. SAMHSA: Washington, DC, 2007. Available online: https://store.samhsa.gov/product/substance-use-and-substance-use-disorders-by-industry/sma07-4294.
- Yi, Xianqun. The Serendipity of Imagery: On the Isomorphism between AI Hallucination and Aesthetic Mechanisms Lyric Writing. In Forthcoming; in Chinese; 2026. [Google Scholar]
- Zeki, Semir. Inner Vision: An Exploration of Art and the Brain; Oxford University Press: Oxford, 1999. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).