Submitted:
21 January 2026
Posted:
22 January 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. The Stability Problem in Agentic AI Systems
1.2. Scope, Limitations, and Agentic Boundary Definition
1.3. System Class and Failure Envelope
2. The Shift from Control-Dominated Failures to Intent-Dominated Failures
2.1. Traditional Assumptions in Agent Design
2.2. Agentic Execution as an Active System Component
2.3. The Agentic Stack in Practice
3. Why Classical Metrics Fail
3.1. The Measurement Gap in Agentic Systems
3.2. The Invisibility of Agentic Incoherence
3.3. Why Intent Consistency Cannot Be Measured Locally
4. Agentic Incoherence as a Structural Phenomenon
4.1. Coherence Versus Incoherence in Agentic Systems
4.2. Soft Degradation Versus Hard Failure

4.3. Non-Local Coupling and Emergent Effects
5. The Economic Dimension: Where Real Costs Arise
5.1. Direct Cost Effects
5.2. Hidden and Systemic Costs: Ghost Work in Agentic Systems
5.3. Measured Metrics Versus Actual System Cost
5.4. Irreproducibility and Audit Risk
6. Why Scaling Is Not a Solution
6.1. More Agents, More Incoherence
6.2. Larger Context Windows, Deeper Drift
6.3. More Tool Access, More Amplification Paths
7. Toward a Structural Perspective
7.1. The Need for System-Level Analysis
7.2. Operator-Based Approaches to Agentic Stability
7.2.1. Auditability and Traceability of Agentic Decisions
8. Discussion
8.1. Implications for Architecture and Deployment Decisions
8.2. Open Research Questions
8.3. Outlook on Empirical Validation
9. Conclusions
- ai.01 Interconnect Stability: Physical coupling, including hardware synchronization, communication fabric behavior, and data movement constraints.
- ai.04 Runtime Control Coherence: Logical coupling, encompassing schedulers, orchestrators, runtime engines, and policy enforcement mechanisms.
- ai.13 Agentic System Stability: Semantic coupling, involving intent propagation, planning dynamics, tool use, and multi-agent coordination.
Data Availability Statement
Acknowledgments
Conflicts of Interest
Use of Artificial Intelligence
References
- Wegener, G. H. SORT-AI: A Projection-Based Structural Framework for AI Safety—Alignment Stability, Drift Detection, and Scalable Oversight. Preprints 2025, 2024121334. [Google Scholar] [CrossRef]
- Wegener, G. H. SORT-CX: A Projection-Based Structural Framework for Complex Systems—Operator Geometry, Non-Local Kernels, Drift Diagnostics, and Emergent Stability. Preprints 2025, 2024121431. [Google Scholar] [CrossRef]
- Wegener, G. H. SORT-AI: A Structural Safety and Reliability Framework for Advanced AI Systems with Retrieval-Augmented Generation as a Diagnostic Testbed. Preprints 2025, 2024121345. [Google Scholar] [CrossRef]
- Wegener, G. H. SORT-AI: Interconnect Stability and Cost per Performance in Large-Scale AI Infrastructure—A Structural Analysis of Runtime Instability in Distributed Systems. Preprints 2025, 2026010161. [Google Scholar] [CrossRef]
- Wegener, G. H. SORT-AI: Runtime Control Coherence in Large-Scale AI Systems—Structural Causes of Cost, Instability, and Non-Determinism Beyond Interconnect Failures. Preprints 2025, 2026010298. [Google Scholar] [CrossRef]
- Wooldridge, M. An Introduction to MultiAgent Systems, 2nd ed.; John Wiley & Sons, 2009; ISBN 978-0-470-51946-2. [Google Scholar]
- Russell, S.; Norvig, P. Artificial Intelligence: A Modern Approach, 4th ed.; Pearson, 2021; ISBN 978-0-13-461099-3. [Google Scholar]
- Stone, P., et al. (2016). Artificial Intelligence and Life in 2030. One Hundred Year Study on Artificial Intelligence: Report of the 2015–2016 Study Panel. Stanford University. ai100.stanford.edu/2016-report.
- Shoham, Y.; Leyton-Brown, K. Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations; Cambridge University Press, 2009; ISBN 978-0-521-89943-7. [Google Scholar]
- Dorri, A.; Kanhere, S. S.; Jurdak, R. Multi-Agent Systems: A Survey. IEEE Access 2018, 6, 28573–28593. [Google Scholar] [CrossRef]
- Jennings, N. R. On Agent-Based Software Engineering. Artificial Intelligence 2000, 117(2), 277–296. [Google Scholar] [CrossRef]
- Yao, S.; et al. ReAct: Synergizing Reasoning and Acting in Language Models. Proceedings of ICLR 2023, 2023. [Google Scholar]
- Shinn, N.; et al. Reflexion: Language Agents with Verbal Reinforcement Learning. Proceedings of NeurIPS, 2023. [Google Scholar]
- Wu, Q.; et al. AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation. arXiv 2023, arXiv:2308.08155. [Google Scholar]
- Schick, T.; et al. Toolformer: Language Models Can Teach Themselves to Use Tools. Proceedings of NeurIPS, 2023. [Google Scholar]
- Wei, J.; et al. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. Proceedings of NeurIPS 2022, 2022. [Google Scholar]
- Xi, Z.; et al. The Rise and Potential of Large Language Model Based Agents: A Survey. arXiv 2023, arXiv:2309.07864. [Google Scholar] [CrossRef]
- Wang, L.; et al. A Survey on Large Language Model Based Autonomous Agents. Frontiers of Computer Science 2024, 18(6), 186345. [Google Scholar] [CrossRef]
- Mialon, G.; et al. Augmented Language Models: A Survey. In Transactions on Machine Learning Research; 2023. [Google Scholar]
- Qin, Y.; et al. Tool Learning with Foundation Models. arXiv 2023, arXiv:2304.08354. [Google Scholar] [CrossRef]
- Shen, Y.; et al. HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face. Proceedings of NeurIPS 2023, 2024. [Google Scholar]
- Park, J. S.; et al. Generative Agents: Interactive Simulacra of Human Behavior. In Proceedings of UIST 2023; 2023; pp. 1–22. [Google Scholar] [CrossRef]
- Lewis, P.; et al. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. Proceedings of NeurIPS 2020, 2020. [Google Scholar]
- Patil, S. G.; et al. Gorilla: Large Language Model Connected with Massive APIs. arXiv 2023, arXiv:2305.15334. [Google Scholar] [CrossRef]
- Hao, S.; et al. ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings. Proceedings of NeurIPS 2023, 2024. [Google Scholar]
- Ruan, J.; et al. TPTU: Large Language Model-Based AI Agents for Task Planning and Tool Usage. arXiv 2024, arXiv:2308.03427. [Google Scholar]
- Huang, W.; et al. Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents. Proceedings of ICML 2022, 2022. [Google Scholar]
- Gao, L.; et al. PAL: Program-Aided Language Models. Proceedings of ICML 2023, 2023. [Google Scholar]
- De Wolf, T.; Holvoet, T. Emergence Versus Self-Organisation: Different Concepts but Promising When Combined. In Engineering Self-Organising Systems; Springer, 2005; pp. 1–15. [Google Scholar] [CrossRef]
- Holland, J. H. Hidden Order: How Adaptation Builds Complexity; Addison-Wesley, 1995; ISBN 978-0-201-44230-4. [Google Scholar]
- Levin, S. A. Complex Adaptive Systems: Exploring the Known, the Unknown and the Unknowable. Bulletin of the American Mathematical Society 2003, 40(1), 3–19. [Google Scholar] [CrossRef]
- Mitchell, M. Complexity: A Guided Tour; Oxford University Press, 2009; ISBN 978-0-19-512441-5. [Google Scholar]
- Bar-Yam, Y. Dynamics of Complex Systems; Addison-Wesley, 1997; ISBN 978-0-201-55748-1. [Google Scholar]
- Newman, M. Networks: An Introduction; Oxford University Press, 2010; ISBN 978-0-19-920665-0. [Google Scholar]
- Albert, R.; Barabási, A.-L. Statistical Mechanics of Complex Networks. Rev. Mod. Phys. 2002, 74, 47–97. [Google Scholar] [CrossRef]
- Scheffer, M.; et al. Early-Warning Signals for Critical Transitions. Nature 2009, 461, 53–59. [Google Scholar] [CrossRef]
- Strogatz, S. H. Exploring Complex Networks. Nature 2001, 410, 268–276. [Google Scholar] [CrossRef] [PubMed]
- Watts, D. J.; Strogatz, S. H. Collective Dynamics of `Small-World’ Networks. Nature 1998, 393, 440–442. [Google Scholar] [CrossRef]
- Panait, L.; Luke, S. Cooperative Multi-Agent Learning: The State of the Art. Autonomous Agents and Multi-Agent Systems 2005, 11(3), 387–434. [Google Scholar] [CrossRef]
- Stone, P.; Veloso, M. Multiagent Systems: A Survey from a Machine Learning Perspective. Autonomous Robots 2000, 8(3), 345–383. [Google Scholar] [CrossRef]
- Durfee, E. H. Distributed Problem Solving and Planning. In Multi-Agent Systems and Applications; Springer, 1999; pp. 118–149. [Google Scholar]
- Tumer, K.; Agogino, A. K. Distributed Agent-Based Air Traffic Flow Management. In Proceedings of AAMAS 2007; 2007; pp. 1–8. [Google Scholar] [CrossRef]
- Busoniu, L.; Babuska, R.; De Schutter, B. A Comprehensive Survey of Multiagent Reinforcement Learning. IEEE Transactions on Systems, Man, and Cybernetics, Part C 2008, 38(2), 156–172. [Google Scholar] [CrossRef]
- Saltzer, J. H.; Reed, D. P.; Clark, D. D. End-to-End Arguments in System Design. ACM Transactions on Computer Systems 1984, 2(4), 277–288. [Google Scholar] [CrossRef]
- Ousterhout, J. A Philosophy of Software Design; Yaknyam Press, 2018; ISBN 978-1-7321022-0-0. [Google Scholar]
- Lamport, L. Time, Clocks, and the Ordering of Events in a Distributed System. Communications of the ACM 1978, 21(7), 558–565. [Google Scholar] [CrossRef]
- Jain, R. The Art of Computer Systems Performance Analysis; John Wiley & Sons, 1991; ISBN 978-0-471-50336-1. [Google Scholar]
- Hellerstein, J. L.; Diao, Y.; Parekh, S.; Tilbury, D. M. Feedback Control of Computing Systems; John Wiley & Sons, 2004; ISBN 978-0-471-26637-2. [Google Scholar]
- Avizienis, A.; et al. Basic Concepts and Taxonomy of Dependable and Secure Computing. IEEE Transactions on Dependable and Secure Computing 2004, 1(1), 11–33. [Google Scholar] [CrossRef]
- Ananthanarayanan, G.; et al. Effective Straggler Mitigation: Attack of the Clones. Proceedings of NSDI’13, 2013; pp. 185–198. [Google Scholar]
- Dean, J.; Barroso, L. A. The Tail at Scale. Communications of the ACM 2013, 56(2), 74–80. [Google Scholar] [CrossRef]
- Gunawi, H. S.; et al. What Bugs Live in the Cloud? A Study of 3000+ Issues in Cloud Systems. 2014, Proceedings of SoCC’14, 1–14. [Google Scholar] [CrossRef]
- Perrow, C. Normal Accidents: Living with High-Risk Technologies; Basic Books, 1984; ISBN 978-0-465-05142-9. [Google Scholar]
- Brynjolfsson, E.; Hitt, L. M. Beyond Computation: Information Technology, Organizational Transformation and Business Performance. Journal of Economic Perspectives 2000, 14(4), 23–48. [Google Scholar] [CrossRef]
- Barroso, L. A.; Clidaras, J.; Hölzle, U. The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines. In Synthesis Lectures on Computer Architecture, 2nd ed.; 2013; Volume 8, 3, pp. 1–154. [Google Scholar] [CrossRef]
- Barroso, L. A.; Hölzle, U.; Ranganathan, P. The Datacenter as a Computer: Designing Warehouse-Scale Machines. In Synthesis Lectures on Computer Architecture, 3rd ed.; 2019; Volume 13, 3, pp. 1–189. [Google Scholar] [CrossRef]
- Patterson, D. A.; Hennessy, J. L. Computer Organization and Design: The Hardware/Software Interface, 5th ed.; Morgan Kaufmann, 2016; ISBN 978-0-12-407726-3. [Google Scholar]
- Kannan, R. S.; et al. GrandSLAm: Guaranteeing SLAs for Jobs in Microservices Execution Frameworks. In Proceedings of EuroSys’ 19; 2019; pp. 1–16. [Google Scholar] [CrossRef]
- Sculley, D.; et al. Hidden Technical Debt in Machine Learning Systems. Proceedings of NeurIPS 2015, 2015; pp. 2503–2511. [Google Scholar]
- Amershi, S.; et al. Software Engineering for Machine Learning: A Case Study. In Proceedings of ICSE-SEIP’19; 2019; pp. 291–300. [Google Scholar] [CrossRef]
- Zaharia, M.; et al. Accelerating the Machine Learning Lifecycle with MLflow. IEEE Data Engineering Bulletin 2018, 41(4), 39–45. [Google Scholar]
- Delimitrou, C.; Kozyrakis, C. Quasar: Resource-Efficient and QoS-Aware Cluster Management. Proceedings of ASPLOS’ 14 2014, 127–144. [Google Scholar] [CrossRef]
- Kephart, J. O.; Chess, D. M. The Vision of Autonomic Computing. IEEE Computer 2003, 36(1), 41–50. [Google Scholar] [CrossRef]
- Alon, U. Network Motifs: Theory and Experimental Approaches. Nature Reviews Genetics 2007, 8, 450–461. [Google Scholar] [CrossRef] [PubMed]
- Gilbert, S.; Lynch, N. Brewer’s Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services. ACM SIGACT News 2002, 33(2), 51–59. [Google Scholar] [CrossRef]
- Kroll, J. A.; et al. Accountable Algorithms. University of Pennsylvania Law Review 2017, 165(3), 633–705. [Google Scholar]
- Wachter, S.; Mittelstadt, B.; Floridi, L. Why a Right to Explanation of Automated Decision-Making Does Not Exist in the General Data Protection Regulation. International Data Privacy Law 2017, 7(2), 76–99. [Google Scholar] [CrossRef]
- Doshi-Velez, F.; Kim, B. Towards A Rigorous Science of Interpretable Machine Learning. arXiv 2017, arXiv:1702.08608. [Google Scholar] [CrossRef]
- Brundage, M.; et al. Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims. arXiv 2020, arXiv:2004.07213. [Google Scholar] [CrossRef]
- Bommasani, R.; et al. On the Opportunities and Risks of Foundation Models. arXiv 2021, arXiv:2108.07258. [Google Scholar] [CrossRef]
- Pineau, J.; et al. Improving Reproducibility in Machine Learning Research. Journal of Machine Learning Research 2021, 22(164), 1–20. [Google Scholar]
- Rabinovich, E.; et al. Scaling Distributed Machine Learning with In-Network Aggregation. Proceedings of NSDI’23, 2023; pp. 1–19. [Google Scholar]
- Liu, X.; et al. AgentBench: Evaluating LLMs as Agents. Proceedings of ICLR, 2024; 2023. [Google Scholar]
- Chang, Y.; et al. A Survey on Evaluation of Large Language Models. ACM Transactions on Intelligent Systems and Technology 2024, 15(3), 1–45. [Google Scholar] [CrossRef]
- Lilja, D. J. Measuring Computer Performance: A Practitioner’s Guide; Cambridge University Press, 2000; ISBN 978-0-521-64105-4. [Google Scholar]
- Kapoor, S.; Narayanan, A. Leakage and the Reproducibility Crisis in ML-Based Science. Patterns 2024, 5(4), 100804. [Google Scholar] [CrossRef]
- Liang, P.; et al. Holistic Evaluation of Language Models. In Transactions on Machine Learning Research; 2023. [Google Scholar]
- Liu, N. F.; et al. Lost in the Middle: How Language Models Use Long Contexts. In Transactions on Machine Learning Research; 2024. [Google Scholar]
- Sumers, T. R.; et al. Cognitive Architectures for Language Agents. In Transactions on Machine Learning Research; 2024. [Google Scholar]




Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).