UI-OCEANUS: Scaling GUI Agents with Synthetic Environmental Dynamics

Mengzhou Wu; Yuzhe Guo; Yuan Cao; Haochuan Lu; Songhe Zhu; Pingzhe Qu; Xin Chen; Kang Qin; Zhongpu Wang; Xiaode Zhang; Xinyi Wang; Wei Dai; Gang Cao; Yuetang Deng; Zhi Gong; Dezhi Ran; Linyi Li; Wei Yang; Tao Xie

doi:10.20944/preprints202603.0980.v1

Submitted:

11 March 2026

Posted:

12 March 2026

You are already at the latest version

Abstract

Scaling generalist GUI agents is hindered by the data scalability bottleneck of expensive human demonstrations and the ``distillation ceiling'' of synthetic teacher supervision. To transcend these limitations, we propose UI-Oceanus, a framework that shifts the learning focus from mimicking high-level trajectories to mastering interaction physics via ground-truth environmental feedback. Through a systematic investigation of self-supervised objectives, we identify that forward dynamics, defined as the generative prediction of future interface states, acts as the primary driver for scalability and significantly outweighs inverse inference. UI-Oceanus leverages this insight by converting low-cost autonomous exploration, which is verified directly by system execution, into high-density generative supervision to construct a robust internal world model. Experimental evaluations across a series of models demonstrate the decisive superiority of our approach: models utilizing Continual Pre-Training (CPT) on synthetic dynamics outperform non-CPT baselines with an average success rate improvement of 7% on offline benchmarks, which amplifies to a 16.8% gain in real-world online navigation. Furthermore, we observe that navigation performance scales with synthetic data volume. These results confirm that grounding agents in forward predictive modeling offers a superior pathway to scalable GUI automation with robust cross-domain adaptability and compositional generalization.

Keywords:

GUI agents

;

world models

;

vision-language models

;

synthetic data

;

continual pre-training

Subject:

Computer Science and Mathematics - Artificial Intelligence and Machine Learning

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

UI-OCEANUS: Scaling GUI Agents with Synthetic Environmental Dynamics

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe