Beyond Scaling: A Survey on Data-Efficient Agentic Learning

Yaqing Wang; Zhenlin Luo; Peiyao Zhao; Yunfeng Cai; Quanming Yao

doi:10.20944/preprints202605.0477.v1

Submitted:

07 May 2026

Posted:

08 May 2026

You are already at the latest version

Abstract

LLM-based agents are increasingly deployed across web and GUI automation, embodied decision making, and scientific workflows, yet their progress is often constrained by limited data and interaction. High-quality supervision is costly, and real-environment interactions are expensive, risky, and quickly invalidated by environment drift. This survey studies how to obtain and improve LLM-based agents with fewer samples, fewer labels, and fewer/ cheaper interactions. We view agentic learning as a closed-loop decision process where experience arises from both external supervision and online interactions, and data efficiency requires maximizing information yield per unit cost. We then introduce a unified agentic learning framework and organize the literature along three complementary dimensions: experience augmentation, agent structural design, and learning paradigms. This perspective connects design choices to where learning signals come from, how they are utilized, and how adaptation is performed under bounded budgets. We summarize representative benchmarks and synthesize key open challenges, aiming to clarify the emerging landscape and support future progress in data-efficient agentic learning.

Keywords:

machine learning

;

few-shot learning

;

agent-based and multi-agent systems

Subject:

Computer Science and Mathematics - Artificial Intelligence and Machine Learning

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Beyond Scaling: A Survey on Data-Efficient Agentic Learning

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe