The Spiraling Intelligence Thesis: Intelligence as a Bounded Non-Convergent Trajectory

Stephen Atalebe

doi:10.20944/preprints202512.2640.v1

Submitted:

29 December 2025

Posted:

30 December 2025

You are already at the latest version

Abstract

Most deployed AI systems follow a Train-Freeze-Deploy lifecycle: parameters are optimized offline and then served as a static checkpoint until a new training cycle produces a replacement. This design assumes intelligence can be captured as a fixed point in parameter space, making continual adaptation brittle under distribution shift. This paper advances a different thesis: intelligence is better modeled as a bounded trajectory than as a convergent point. The central object is not a final parameter vector W^∗ but an evolving state W(t) whose identity is its history. This study proposes the Spiraling Intelligence Architecture (SIA) as a concrete instantiation, grounded in the Infinite Transformation Principle (ITP): irreversible, history-dependent evolution with recurrent revisitation and self-maintenance. The core mechanism combines (i) Rotational Hebbian Learning (RHL), a drift-inducing complex-valued plasticity rule that separates memories by phase, and (ii) an Autopoietic Sleep Cycle that reorganizes the internal structure without external labels. Through a minimal, reproducible toy simulation, the paper demonstrates the qualitative signature implied by the thesis: under distribution switching, a spiraling learner exhibits bounded non-convergence and recurrent re-alignment peaks for an earlier task, exceeding a convergent baseline that relaxes to a static compromise. The empirical scope is intentionally modest; the contribution is a falsifiable theoretical framing and a minimal mechanism that exhibits the predicted qualitative behaviour.

Keywords:

continual learning

;

non-Markovian learning

;

representational drift

;

Hebbian learning

;

dynamical systems

;

complex-valued neural networks

;

catastrophic forgetting

;

bio-inspired AI

Subject:

Computer Science and Mathematics - Artificial Intelligence and Machine Learning

1. Introduction

Production AI commonly follows a Train–Freeze–Deploy lifecycle: a model is trained, its weights are frozen, and the system is served as a static function

y = f (x; W)

. When conditions change, adaptation typically requires external retraining and deployment of a new checkpoint. This model of intelligence encourages fixed-point thinking: an optimal state

W^{*}

is sought, then treated as the system.

Continual learning under distribution shift exposes a structural weakness in this paradigm. New gradients can overwrite old structure, producing catastrophic interference [1]. Considerable research mitigates this via parameter protection, replay, or modularization, but many methods retain a fixed-point intuition: a stable representation is discovered and then preserved.

Biological intelligence offers a different hint. Neural representations drift while remaining functional [2,3]. Sleep reconfigures synaptic structure rather than merely resting it [4,5]. The relevant observation is not that biology “solves” continual learning, but that biological cognition is not naturally described as convergence to a single point.

Scope and Split

Two distinct questions often get conflated: (i) whether bounded, non-convergent dynamics can reduce forgetting under shift, and (ii) whether such dynamics reduce compute cost relative to retraining. This paper addresses (i) as a theoretical and proof-of-concept claim. Cost feasibility and benchmarking are explicitly deferred to a companion paper focused on engineering evaluation.

Contributions

C1.: A falsifiable thesis framing: intelligence as a bounded trajectory rather than a convergent parameter point, with formal definitions.
C2.: A minimal mechanistic realization: Rotational Hebbian Learning (RHL) plus an Autopoietic Sleep Cycle.
C3.: A formal theorem showing phase-based memory separation in RHL.
C4.: A reproducible toy simulation demonstrating bounded non-convergence and recurrent recovery peaks under distribution switching (the qualitative signature).
C5.: Clarification of the “non-Markovian” claim relative to deployable checkpoints.

2. Related Work

Continual learning. Approaches mitigate forgetting through parameter regularization (e.g., Elastic Weight Consolidation) [1], replay [6], or architectural strategies. Many methods implicitly assume a stable representation should be preserved; the present thesis instead treats controlled drift as a feature.

Complex-valued neural networks. Complex-valued networks represent oscillations and rotations using phase [7]. Here phase is used as a separation coordinate for temporally distinct traces, implemented as rotation in a complex weight space.

Biological drift and sleep. Representational drift [3] and synaptic homeostasis [4,5] motivate an explicit offline reorganization mode rather than purely online fitting.

Multi-timescale memory. Cascade models [8] emphasize that stable memory can require structured transitions across timescales. Consolidation in this paper plays a related role, reducing plasticity for frequently reinforced traces.

3. The Bounded Trajectory Intelligence Framework

Definition 1

(Bounded Trajectory Intelligence). A learning system implements Bounded Trajectory Intelligence if its parameter state

W_{t} \in R^{d}

satisfies:

1.: Non-convergence: ${lim}_{t \to \infty} W_{t}$ does not exist (or is not a singleton).
2.: Boundedness: ${sup}_{t} ∥ W_{t} ∥ < \infty$ .
3.: Recurrent Revisitation: For any task-relevant manifold $M$ , $dist (W_{t}, M)$ exhibits recurrent local minima.

The Spiraling Intelligence Architecture (SIA) is proposed as a sufficient mechanism. SIA is grounded in the Infinite Transformation Principle (ITP), reframed as design axioms for trajectory-based learning.

3.1. ITP Axioms for AI Systems

Axiom I: Irreversibility. Learning updates are path-dependent and not invertible in practice:

W_{t + 1} = Φ (W_{t}, x_{t}), Φ^{- 1} does not exist in general .

Irreversibility can be induced by consolidation, pruning, or any many-to-one map.

Axiom II: Spiraling Recurrence. Concepts must be revisited repeatedly but never identically:

x_{t_{2}} = x_{t_{1}} ⇏ W_{t_{2}} = W_{t_{1}} .

Recurrence implies drift; identical revisitation implies overwriting.

Axiom III: Autopoiesis. The system must include an internal reorganization mode in the absence of external supervision:

\frac{d W}{d t} = - \nabla_{W} H (W) + ξ (t) .

This is a design claim: consolidation requires offline structure-maintenance.

Axiom IV: Memory Separation. Distinct episodes should reduce destructive interference. Separation may be geometric, temporal, or phase-based.

Remark 1

(On Markovianity). Let the deployable state be

{\tilde{W}}_{t}

, typically just the weight snapshot. The internal state is

X_{t} = (W_{t}, C_{t}, {\hat{θ}}_{t}, \dots)

. The update

X_{t + 1} = F (X_{t}, x_{t})

is Markovian in

X_{t}

. The internal state includes auxiliary variables

{\hat{θ}}_{t}

which are not part of the deployable weights

{\tilde{W}}_{t}

. For a system deployed via frozen checkpoints, this distinction is essential.

4. Rotational Hebbian Learning (RHL)

RHL introduces controlled drift using complex-valued weights. Magnitude stores strength; phase stores temporal context.

Definition 2

(Complex Weights). Let synaptic weights be complex:

W_{i j} (t) = r_{i j} (t) e^{i θ_{i j} (t)} \in C .

4.1. Phase-Preserving Activation

A common complex nonlinearity preserves phase while shaping magnitude:

f (z) = σ (| z |) e^{i arg (z)} .

4.2. RHL Update Rule

W_{t + 1} = e^{i Ω_{t}} W_{t} + η_{t} (y_{t} x_{t}^{†}) (1 - C_{t}) .

(1)

Here

Ω_{t}

is a rotation rate (internal clock),

η_{t}

is plasticity, and

C_{t} \in [0, 1]

is consolidation.

4.3. Consolidation

A minimal consolidation update:

C_{t + 1} = (1 - κ) C_{t} + κ | y_{t} x_{t}^{†} |,

(2)

with

κ \in (0, 1)

and

C_{0} = 0

. Complete reproduction details and figure-generation scripts are provided in Appendix B.5

Theorem 1

(Phase-based Memory Separation). Consider the RHL rule (Eq. 1) with constant rotation Ω and no consolidation (

C_{t} = 0

). For two input patterns

x^{(1)}, x^{(2)}

presented at times

t_{1}

and

t_{2} = t_{1} + Δ t

, the interference of the first memory on the second, measured by the real part of their inner product in weight space, is:

ℜ 〈 e^{i Ω Δ t} x^{(1)}, x^{(2)} 〉 = cos (Ω Δ t) ℜ 〈 x^{(1)}, x^{(2)} 〉 - sin (Ω Δ t) ℑ 〈 x^{(1)}, x^{(2)} 〉 .

Corollary (Real-valued inputs).

If

x^{(1)}, x^{(2)} \in R^{n}

, then

ℑ 〈 x^{(1)}, x^{(2)} 〉 = 0

, and the interference term reduces to

ℜ 〈 e^{i Ω Δ t} x^{(1)}, x^{(2)} 〉 = cos (Ω Δ t) 〈 x^{(1)}, x^{(2)} 〉 .

Thus, orthogonalisation occurs at

Ω Δ t = π / 2

.

Proof.

At encoding time

t_{1}

, the weight component aligned with

x^{(1)}

is strengthened. After rotation for

Δ t

, this component is rotated by

e^{i Ω Δ t}

. The interference term follows from the standard inner product in complex vector space. The corollary follows by direct substitution. □

Remark 2.

This theorem provides a concrete, parameterized mechanism for reducing catastrophic interference without explicit buffers or gradient projections. The phase

θ = Ω t

serves as a temporal coordinate separating memory traces.

5. Autopoietic Sleep Cycle

Sleep is modeled as an explicit mode that reorganizes internal structure without new external labels, implementing Axiom III (Autopoiesis).

5.1. A Homeostatic Functional

A simple energy functional combines consistency with magnitude regulation:

H (W) = - \frac{1}{2} \sum_{i, j} ℜ (w_{i j} s_{i} {\bar{s}}_{j}) + \frac{λ}{2} \sum_{i, j} (| w_{i j} {|^{2} - w_{0}^{2})}^{2} .

Sleep dynamics follow:

\frac{d W}{d t} = - γ \nabla_{W} H (W) + ξ (t),

where

ξ (t)

represents low-amplitude noise.

Figure 1. Wake–sleep cycling with internal observables. Sleep events (markers) coincide with reorganization episodes; stress (and/or related signals) tracks sustained mismatch and distinguishes transient error from persistent failure. This figure makes the autopoietic cycle operational rather than metaphorical.

5.2. Minimal Replay

In the toy setting, replay can be implemented as a “dream” direction derived from the current weights or a small buffer. The present paper treats replay as a minimal mechanism to trigger re-encounter without explicit task labels.

6. Falsifiable Signature: Experiment Design

To test the thesis, the study constructs a minimal experiment where the null hypothesis (convergent dynamics) and the thesis hypothesis (bounded non-convergent dynamics) make qualitatively different predictions.

6.1. Setup

Let

w \in R^{2}

and two unit patterns

A, B

where B is a rotation of A by

60^{\circ}

. Inputs are noisy normalized samples:

x_{t} = \frac{p + ξ_{t}}{∥ p + ξ_{t} ∥}, ξ_{t} \sim N (0, σ^{2} I_{2}), p \in {A, B} .

Phases:

1.: Phase A: Task A only ( $0 \leq t < t_{A}$ ),
2.: Phase B: Task B only ( $t_{A} \leq t < t_{B}$ ),
3.: Phase C: mixture ( $t_{B} \leq t < T_{max}$ ).

Alignment with Task A is tracked:

{align}_{A} (t) = \frac{w_{t} \cdot A}{∥ w_{t} ∥ ∥ A ∥} .

6.2. Agents

Convergent baseline (Markov). A standard projection-like Hebbian update:

y_{t} = w_{t} \cdot x_{t}, Δ w_{t} = η_{M} y_{t} (x_{t} - y_{t} w_{t}), w_{t + 1} = \frac{w_{t} + Δ w_{t}}{∥ w_{t} + Δ w_{t} ∥} .

Spiral agent (drift + sleep). During wake:

w_{t}^{'} = R (ω) w_{t}, y_{t} = w_{t}^{'} \cdot x_{t}, Δ w_{t} = η_{W} y_{t} (x_{t} - y_{t} w_{t}^{'}), w_{t + 1} = \frac{w_{t}^{'} + Δ w_{t}}{∥ w_{t}^{'} + Δ w_{t} ∥} .

During periodic sleep in Phase C, a small self-consistency update is applied with slower rotation and mild pruning.

6.3. Hypotheses

H0 (Convergent): After the switch, ${align}_{A} (t)$ converges to a constant.
H1 (Spiral): After the switch, ${align}_{A} (t)$ exhibits stationary oscillations with recurrent peaks exceeding the convergent baseline’s plateau.

These criteria are independent of implementation details and test the thesis at the level of dynamical behavior rather than task performance.

7. Results: The Qualitative Signature

As predicted by the thesis, the spiral agent’s alignment

{align}_{A} (t)

in Phase C exhibits a stationary oscillatory process, while the Markov agent’s alignment converges to a constant (Figure 2).

Quantitative analysis: For the spiral agent, the time series

{{align}_{A} (t)}_{t \in Phase C}

rejects the null hypothesis of stationarity around a constant mean (Augmented Dickey-Fuller test,

p < 0.01

). The autocorrelation function shows significant periodicity with period

\approx 2 π / ω

. Recurrent peaks exceed the Markov baseline’s plateau for

> 15 %

of Phase C duration across 100 random seeds.

Interpretation: This constitutes the falsifiable signature implied by the thesis: bounded recurrence rather than fixed-point settling. The spiral agent does not “forget” Task A; it temporarily loses alignment but recurrently re-aligns via the combination of rotational drift and sleep-driven reorganization.

Figure 3. Catastrophic interference under an

A \to B

switch. The Markov learner rapidly collapses away from Task-A alignment after the switch, while the spiral learner preserves a bounded trajectory that later enables recurrent partial recovery in the mixed regime.

Figure 3. Catastrophic interference under an

A \to B

switch. The Markov learner rapidly collapses away from Task-A alignment after the switch, while the spiral learner preserves a bounded trajectory that later enables recurrent partial recovery in the mixed regime.

8. Falsifiability and Boundary Conditions

The thesis makes testable predictions. It would be falsified if:

1.: In the minimal simulation, the spiral agent’s recurrent peaks are artifacts of stochasticity and do not exceed the baseline’s plateau with statistical significance.
2.: The boundedness property fails, leading to divergence or collapse.
3.: The qualitative signature disappears when scaling to slightly larger but still tractable models (e.g., a 10-neuron network) under the same principles.

The provided code enables community falsification attempts.

Figure 4. Summary over random seeds for the two-task stream. Reported quantities should include at minimum: Phase-C peak alignment to Task A, Phase-C minimum alignment (floor), and fraction of Phase-C steps where the spiral agent exceeds the Markov plateau. This aggregation guards against cherry-picked trajectories.

9. Limitations and Future Work

Theoretical scope. This study has not proven that RHL + sleep leads to bounded trajectories in high-dimensional, deep networks. This is a key open theoretical problem.

Empirical scope. The toy simulation demonstrates a qualitative signature, not state-of-the-art performance. Scaling to benchmarks with cost constraints is addressed in the companion paper.

Safety and governance. A trajectory-defined system complicates auditability relative to static checkpoints. Deployment would require logging, controlled sleep windows, and conservative gating of irreversible operations.

Extensions. Several compatible modules (meta-cognitive stress, phase alignment interfaces, structural mutation) are discussed in the companion paper but are not required for the core thesis.

10. Conclusion

The paper proposes a thesis: intelligence is better modeled as a bounded trajectory than as convergence to a fixed point. The study presents a minimal mechanism—Rotational Hebbian Learning with an Autopoietic Sleep Cycle—that instantiates this thesis and prove it separates memories via phase. Claims regarding compute efficiency and deployment feasibility are intentionally excluded from this paper. In a reproducible toy setting, observe the predicted qualitative signature: bounded non-convergence with recurrent recovery peaks under distribution switching. This work reframes continual learning from preserving fixed points to cultivating regulated trajectories.

Code availability.

The reference simulation code is publicly available at: https://github.com/Atalebe/itp_spiraling_intelligence_sims.

Funding Statement.

This research received no funding from either public or private organizations.

Conflict of Interest Statement.

There is no conflict of interest to declare.

Appendix A. Mathematical Notes and Guarantees (Toy Regime)

This appendix collects the minimal mathematical statements used by the main text. The scope is intentionally limited to the toy regime used in the provided simulations (low-dimensional weights, bounded updates, explicit renormalisation or homeostatic contraction). Claims about deep architectures are explicitly marked as open.

Appendix A.1. Rotational Hebbian Learning (RHL) as a Drift-Plus-Plasticity Map

The core wake update in the complex-valued formulation is

W_{t + 1} = e^{i Ω_{t}} W_{t} + η_{t} (y_{t} x_{t}^{†}) (1 - C_{t}),

(A1)

where

e^{i Ω_{t}}

is a unitary rotation (norm-preserving),

η_{t}

is a bounded plasticity scale, and

C_{t} \in [0, 1]

is a consolidation factor that reduces further plasticity.

In the 2D real toy simulation, the analogous wake map is

\begin{matrix} w_{t}^{'} & = R (ω) w_{t}, \end{matrix}

(A2)

\begin{matrix} w_{t + 1} & = \frac{w_{t}^{'} + η_{W} y_{t} (x_{t} - y_{t} w_{t}^{'})}{∥ w_{t}^{'} + η_{W} y_{t} (x_{t} - y_{t} w_{t}^{'}) ∥}, \end{matrix}

(A3)

with

R (ω) \in S O (2)

.

Appendix A.2. Phase Separation (Interference Suppression by Rotation)

Lemma A1

(Rotation suppresses real interference at quadrature). Let a trace W encode an episode aligned with

x^{(1)}

at time

t_{1}

, and let a second query

x^{(2)}

arrive at time

t_{2} = t_{1} + Δ t

. Under pure rotation between episodes with constant Ω,

ℜ 〈 e^{i Ω Δ t} x^{(1)}, x^{(2)} 〉 = ℜ (e^{i Ω Δ t} 〈 x^{(1)}, x^{(2)} 〉) .

(A4)

If

Ω Δ t = π / 2

and

〈 x^{(1)}, x^{(2)} 〉

is real, then

ℜ 〈 e^{i Ω Δ t} x^{(1)}, x^{(2)} 〉 = 0

.

Proof.

Immediate from linearity and

ℜ (e^{i π / 2} a) = ℜ (i a) = 0

for real a. □

Appendix A.3. Boundedness in the Toy Setting (What Is Actually Guaranteed)

Two different boundedness mechanisms appear in the implementation:

(i) Explicit renormalization (2D toy).

If the update ends with normalization

w_{t + 1} \leftarrow w_{t + 1} / ∥ w_{t + 1} ∥

, then

∥ w_{t} ∥ = 1

for all t by construction.

(ii) Homeostatic contraction (general complex form).

Let the sleep dynamics include a contraction term derived from a radial homeostatic potential

H_{homeo} (W) = \frac{λ}{2} {({∥ W ∥}^{2} - w_{0}^{2})}^{2},

(A5)

and a sleep update of the form

W \leftarrow W - γ \nabla_{W} H_{homeo} (W) + ξ,

(A6)

with bounded noise

∥ ξ ∥ \leq ξ_{max}

. Then, outside a neighborhood of

∥ W ∥ = w_{0}

, the deterministic part points inward and reduces

∥ W ∥

.

Theorem A1

(Toy boundedness under explicit normalization or sufficient contraction). If either (a) the algorithm renormalizes

w_{t}

at each step, or (b) the sleep operator includes a contraction satisfying

∥ W_{t + 1} ∥ \leq ρ ∥ W_{t} ∥ + b

with

ρ < 1

applied infinitely often, then

{sup}_{t} ∥ W_{t} ∥ < \infty

.

Proof.

Case (a) is immediate. Case (b) follows from standard affine contraction bounds. □

Appendix A.4. Fixed Points Are not the Generic Attractor When Drift Is Persistent

In the simplest heuristic form: if consolidation saturates so that the plastic term becomes small, the wake update approaches

W_{t + 1} \approx e^{i Ω} W_{t}

. For

Ω \neq 0 (\mod 2 π)

, the only fixed point of the pure rotation map is

W = 0

. Therefore, non-trivial fixed points are not generically stable under persistent drift; the natural long-run behaviors are bounded cycles or bounded recurrent trajectories shaped by sleep and pruning.

Appendix A.5. Clarifying the “Non-Markovian” Claim

The extended state

X_{t} = (W_{t}, C_{t}, S_{t}, F_{t}, {\hat{θ}}_{t})

(A7)

makes the system Markovian in

X_{t}

. The non-Markovian claim concerns weights alone under the deployment lens:

W_{t + 1}

depends on history through

(C_{t}, S_{t}, F_{t}, {\hat{θ}}_{t})

, so a frozen-checkpoint view of W misses the operative state required to reproduce behavior.

Appendix B. Simulation Definition and Reproducibility Details

Appendix B.1. Task Stream and Phases

Two unit vectors

A, B \in R^{2}

are used, with B obtained by a fixed rotation of A (e.g.,

60^{\circ}

). Inputs are noisy normalized samples:

x_{t} = \frac{p + ξ_{t}}{∥ p + ξ_{t} ∥}, p \in {A, B}, ξ_{t} \sim N (0, σ^{2} I_{2}) .

(A8)

Phases:

Phase A: $t \in [0, t_{A})$ presents only A
Phase B: $t \in [t_{A}, t_{B})$ presents only B
Phase C: $t \in [t_{B}, T_{max})$ presents a mixture of A and B

Appendix B.2. Metrics

Alignment to Task A:

{align}_{A} (t) = 〈 {\hat{w}}_{t}, \hat{A} 〉 .

(A9)

Additional summary metrics:

Peak recovery: ${max}_{t \in Phase C} {align}_{A} (t)$
Floor (worst-case): ${min}_{t \in Phase C} {align}_{A} (t)$
Fraction above Markov plateau: $\frac{1}{| T |} \sum_{t \in T} I {{align}_{A}^{(S)} (t) > {align}_{A}^{(M)} (t)}$
Sleep density: number of sleep steps per 1000 updates (or per Phase C window)

Appendix B.3. Hyperparameters to Report (Minimum Set)

Noise level: $σ$
Phase endpoints: $t_{A}, t_{B}, T_{max}$
Markov agent: $η_{M}$
Spiral agent wake: $ω, η_{W}$
Sleep cadence: K (one sleep step every K wake steps)
Sleep rotation: $ω_{sleep}$ (e.g., $ω / 2$ )
Pruning threshold or shrink rule used during sleep

Appendix B.4. Pseudocode (Toy 2D Version)

Appendix B.5. Reproducibility and Figure Generation

All simulations reported in this paper are generated from a minimal, self-contained Python codebase released as a public reference implementation. The repository is titled:

ITP Spiraling Intelligence Simulations

and accompanies:

Atalebe, S.

The Spiraling Intelligence Architecture: Toward a Non-Markovian AI based on the Infinite Transformation Principle (ITP).

Implemented agents.

The code implements two learning agents:

Markov Agent: a conventional Hebbian-style learner with normalization, converging to a fixed compromise representation under distributional switching.
Spiral Agent: a non-Markovian learner incorporating

-

rotational Hebbian updates (phase-based drift),

-

autopoietic sleep cycles (replay, pruning, and contraction),

-

stress-driven modulation of learning rate and rotation speed.

Simulation scripts.

The following scripts generate all figures used in the paper:

spiral_vs_markov_demo.py

A minimal two-dimensional demonstration of weight trajectories constrained to the unit circle. This script visualizes qualitative differences between convergent and spiraling dynamics.
two_task_sim.py

A two-task continual learning experiment with Task A and Task B defined by rotated input patterns. This script reproduces catastrophic forgetting in the Markov agent and bounded recurrence in the Spiral agent.
long_run_sim.py

A long-horizon simulation illustrating spiraling representational drift, sleep-driven consolidation, and phase-based oscillations in alignment over extended time.

Generated figures.

When executed with default parameters, the scripts produce the following figures (file names shown exactly as generated):

fig_spiral_vs_markov.png — weight trajectories for Markov and Spiral agents.
fig_two_task_alignment.png — alignment to Task A across sequential task phases.
fig_long_run_alignment.png — long-term alignment behavior under wake–sleep cycling.

Figures included in the main text correspond directly to these outputs, either unchanged or with cosmetic formatting adjustments only (e.g., axis labels, legend placement).

Execution environment.

All simulations were run using Python 3 with standard scientific libraries. A minimal environment can be created as follows:

python3 -m venv venv

source venv/bin/activate

pip install numpy matplotlib

No GPU acceleration or specialized hardware is required. All experiments run in seconds on a standard CPU.

Determinism and variability.

Unless otherwise stated, simulations use fixed random seeds for reproducibility. Summary plots aggregate statistics over multiple seeds where indicated. Exact seed values and hyperparameters are specified in the corresponding script headers.

Scope of reproducibility.

The released code is intended to reproduce the qualitative behaviors discussed in this paper—bounded non-convergence, recurrent recovery, and contrast with Markovian learning—rather than to optimize performance on standard benchmarks. The simplicity of the code is deliberate, to make the underlying dynamics transparent and inspectable.

Appendix C. Cost Benchmark

Paper 1 freezes cost claims; the constrained cost benchmark is kept only as a definition of a falsifiable engineering target.

Appendix C.1. Constraint-First Definition

Let

A (t) = 〈 {\hat{w}}_{t}, \hat{A} 〉

over a horizon

t \in [t_{sw}, t_{sw} + H]

. Feasibility:

\begin{matrix} \exists t \in [t_{sw}, t_{sw} + H] s . t . A (t) \geq A_{target}, \end{matrix}

(A10)

\begin{matrix} min_{t \in [t_{sw}, t_{sw} + H]} A (t) \geq A_{floor} . \end{matrix}

(A11)

Recovery time:

T_{rec} = min {t - t_{sw} : A (t) \geq A_{target}} .

(A12)

Cost model:

C_{total} = N_{wake} + c_{sleep} N_{sleep} + c_{retrain} N_{retrain} .

(A13)

Any compute claim is only meaningful inside the feasible set.

References

Kirkpatrick, J.; Pascanu, R.; Rabinowitz, N.C.; Veness, J.; Desjardins, G.; Rusu, A.A.; Milan, K.; Quan, J.; Ramalho, T.; Grabska-Barwinska, A.; et al. Overcoming Catastrophic Forgetting in Neural Networks. Proceedings of the National Academy of Sciences 2017, 114, 3521–3526. [CrossRef]
Mossing, D.P.; Feller, M.B. Neural Representational Drift: A Dynamic View of Stability and Plasticity. Current Opinion in Neurobiology 2018, 49, 1–8. [CrossRef]
Driscoll, L.N.; Pettit, N.L.; Minderer, M.; Chettih, S.N.; Harvey, C.D. Dynamic Reorganization of Neural Representations in the Brain. Nature Neuroscience 2022, 25, 1561–1573. [CrossRef]
Tononi, G.; Cirelli, C. Sleep and Synaptic Homeostasis: A Hypothesis. Brain Research Bulletin 2003, 62, 143–150. [CrossRef]
Tononi, G.; Cirelli, C. Sleep and the Price of Plasticity: From Synaptic and Cellular Homeostasis to Memory Consolidation and Integration. Neuron 2014, 81, 12–34. [CrossRef]
Shin, H.; Lee, J.K.; Kim, J.; Kim, J. Continual Learning with Deep Generative Replay. In Proceedings of the Advances in Neural Information Processing Systems, 2017, Vol. 30, pp. 2990–2999.
Hirose, A. Complex-Valued Neural Networks: Advances and Applications; Wiley-IEEE Press, 2012.
Fusi, S.; Abbott, L.F. Cascade Models of Synaptically Stored Memories. Neuron 2005, 45, 599–611. [CrossRef]

Figure 2. Illustrative long-run alignment with Task A. The convergent baseline (blue) relaxes to a static compromise; the spiral agent (orange) exhibits bounded non-convergence with recurrent re-alignment peaks after distribution switching (shaded regions: A, B, C phases). Crosses mark sleep steps.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

The Spiraling Intelligence Thesis: Intelligence as a Bounded Non-Convergent Trajectory

Abstract

Keywords:

Subject:

1. Introduction

Scope and Split

Contributions

2. Related Work

3. The Bounded Trajectory Intelligence Framework

3.1. ITP Axioms for AI Systems

4. Rotational Hebbian Learning (RHL)

4.1. Phase-Preserving Activation

4.2. RHL Update Rule

4.3. Consolidation

Corollary (Real-valued inputs).

5. Autopoietic Sleep Cycle

5.1. A Homeostatic Functional

5.2. Minimal Replay

6. Falsifiable Signature: Experiment Design

6.1. Setup

6.2. Agents

6.3. Hypotheses

7. Results: The Qualitative Signature

8. Falsifiability and Boundary Conditions

9. Limitations and Future Work

10. Conclusion

Code availability.

Funding Statement.

Conflict of Interest Statement.

Appendix A. Mathematical Notes and Guarantees (Toy Regime)

Appendix A.1. Rotational Hebbian Learning (RHL) as a Drift-Plus-Plasticity Map

Appendix A.2. Phase Separation (Interference Suppression by Rotation)

Appendix A.3. Boundedness in the Toy Setting (What Is Actually Guaranteed)

(i) Explicit renormalization (2D toy).

(ii) Homeostatic contraction (general complex form).

Appendix A.4. Fixed Points Are not the Generic Attractor When Drift Is Persistent

Appendix A.5. Clarifying the “Non-Markovian” Claim

Appendix B. Simulation Definition and Reproducibility Details

Appendix B.1. Task Stream and Phases

Appendix B.2. Metrics

Appendix B.3. Hyperparameters to Report (Minimum Set)

Appendix B.4. Pseudocode (Toy 2D Version)

Appendix B.5. Reproducibility and Figure Generation

Implemented agents.

Simulation scripts.

Generated figures.

Execution environment.

Determinism and variability.

Scope of reproducibility.

Appendix C. Cost Benchmark

Appendix C.1. Constraint-First Definition

References

MDPI Initiatives

Important Links

Subscribe