Preprint
Article

This version is not peer-reviewed.

Path-Sensitive AGI Alignment: Cognitive Integrity, Escape Cost, and Trajectory Risk in Augmented State Space

Submitted:

12 May 2026

Posted:

14 May 2026

You are already at the latest version

Abstract
AGI alignment is often evaluated at a snapshot: a system is judged by its current outputs, policy profile, benchmark behavior, or apparent corrigibility. Snapshot evaluation misses a central risk of advanced deployment: a good endpoint can still be reached by a bad journey. Two trajectories may arrive in similar behavioral regions while differing in reversibility, opacity, intervention cost, memory entanglement, institutional dependency, and the quality of human judgment left available for oversight. This paper develops a path-sensitive alternative. It represents AGI development as motion through an augmented state space Z containing model and environment state, world-model structure, policy state, memory and provenance traces, governance affordances, institutional embedding, and human evaluative capacity. Cognitive integrity — the capacity of individuals, teams, or institutions to sustain calibrated attention, trust, contestability, and decision under pressure [1] — is introduced here as an alignment-relevant state variable rather than assumed as a familiar metric. The formal contribution is a scaffold of definitions: controlled transition laws over augmented state, escape cost, path-level alignment functionals, viability floors, forbidden regions, and trajectory classes distinguished by lock-in, basin structure, retargetability, and integrity preservation. The result does not supply a calibrated empirical model of deployed AGI systems. It specifies what such a model must track if alignment evidence is to cover both present behavior and the remaining possibility of legible, reversible, and cognitively intact correction.
Keywords: 
;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated