Submitted:
23 December 2024
Posted:
07 January 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Errors
2.1. AI Does Not Understand
2.1.1. Tic Tac Toe
-
ChatGPT 4o-latest (accessed 12/13/24): “The most critical spot to consider is the top-left corner (row 1, column 1), because placing "O" there would create a vertical line from top to bottom in the second column (O in 1,3, 2,2, and 3,1).”Comment: Playing an O in the top left corner does not create a vertical line from top to bottom in the second column. It loses immediately.
-
ChatGPT 4o-mini (accessed 12/13/24): “The next best move for O is to place an O in the top left corner (first row, first column). This will create two potential winning lines for O in the next move:”Comment: Playing an O in the top left corner does not create two potential winning lines for the O in the next move. It loses immediately.
-
Claude-3.5-Sonnet (accessed 12/13/24): “For O to win or prevent X from winning, O should play in the top left corner (first row, first column)”Comment: Playing in the top left corner loses immediately.
2.1.2. Dinner Party: Optimal Mixing
2.1.3. Repeat After Me
2.2. AI Is Biased
2.2.1. The “42” Error
2.2.2. AI is Gullible
2.2.3. Error Sentiment Analysis of Customer Feedback
- The phrase "not bad" suggests mild positivity, but it is countered by "not great," which reflects a lack of enthusiasm.
- The expression "kind of does the job, I guess" conveys a lukewarm or indifferent attitude, implying that the product meets basic expectations but doesn’t impress.
- Neutral sentiment: The phrases "not bad" and "does the job, I guess" indicate a lukewarm or indifferent attitude toward the product’s performance.
- Negative sentiment: The statement "not great either" and "priced pretty high" suggest dissatisfaction with the product’s value for money and overall quality.
2.3. AI Can’t Calculate
2.3.1. Length of Passage
2.3.2. Number of Vowels
3. Conclusions
Acknowledgments
Appendix A. Tic-Tac-Toe Problem


Appendix B. Dinner Party Problem



References
- Lebovitz, S.; Lifshitz-Assaf, H.; Levina, N. To Engage or Not to Engage with AI for Critical Judgments: How Professionals Deal with Opacity When Using AI for Medical Diagnosis. Organization Science 2022, 33, 126–148. [Google Scholar] [CrossRef]
- Oniani, D.; Hilsman, J.; Peng, Y.; Poropatich, R.K.; Pamplin, J.C.; Legault, G.L.; Wang, Y. Adopting and expanding ethical principles for generative artificial intelligence from military to healthcare. NPJ Digital Medicine 2023, 6, 225. [Google Scholar] [CrossRef] [PubMed]
- Mollick, E. Co-intelligence. Penguin, 2024. [Google Scholar]
- Leniolabs. 42: GPT’s answer to life, the universe, and everything, 2023.
- Dell’Acqua, F.; McFowland III, E.; Mollick, E.R.; Lifshitz-Assaf, H.; Kellogg, K.; Rajendran, S.; Krayer, L.; Candelon, F.; Lakhani, K.R. Navigating the jagged technological frontier: Field experimental evidence of the effects of AI on knowledge worker productivity and quality. Harvard Business School Technology & Operations Mgt. Unit Working Paper, 2023. [Google Scholar]
- Jenni, AI. ChatGPT errors: Their underlying causes and fixes, 2023.
- Heikkilä, M. AI language models are rife with different political biases, 2023.
- Borji, A. A categorical archive of ChatGPT failures. arXiv preprint arXiv:2302.03494 2023. [CrossRef]







Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).