Submitted:
24 December 2025
Posted:
25 December 2025
You are already at the latest version
Abstract
Keywords:
I. Introduction
II. Related Literature
III. The Paradox of the "Bad" Predictive Model
A. The Advantage of the Market Taker
B. The Decorrelation Objective
IV. Mathematical Derivations of System Controllability
A. The Kalman Reachability Criterion
B. The Controllability Gramian and Energy Requirements
| Property | Mathematical Indicator | Practical Interpretation |
|---|---|---|
| Full Controllability | Any portfolio target can be reached | |
| Directional Energy | The input effort required to steer factor i | |
| System Stability | Largest eigenvalue indicates tendency toward stability or divergence | |
| Linear Independence | Data features provide non-redundant control signals |
C. Nonlinear and Fractional-Order Extensions
V. Estimation Versus Control in Financial Markets

A. The Breakdown of the Separation Principle
B. Actuator Dynamics and Liquidity Constraints
1. Price Impact as a Nonlinear Limit
2. Anti-Windup Control in Trading
| Feature | Estimation-Centric Paradigm | Control-Centric Paradigm |
|---|---|---|
| Primary Goal | Minimize forecast error (RMSE/MAE) | Maximize utility / reach target state |
| System View | Market is an exogenous signal | Market is a feedback-driven plant |
| Input Analysis | Ignored (assumes zero impact) | Modeled as control effort with saturation |
| State Feedback | Open-loop (predict then trade) | Closed-loop (trade, observe, adapt) |
| Risk Metric | Forecast variance | System stability and windup potential |
VI. The Prediction–Profit Non-Monotonicity Result

A. The Hubáček and Šír Decorrelation Objective
B. Forecast Evaluation Beyond RMSE: Association and Dispersion
- Association (Spearman Correlation): Measures how well the forecast captures the shape and rank-ordering of the daily price curve rather than absolute price levels. For a Battery Energy Storage System (BESS), profitability is driven by exploiting the spread between daily lows and highs; therefore, correctly timing price peaks and troughs is more critical than minimizing pointwise price error.
- Dispersion (Log-Determinant of Error Covariance): Measures how forecast errors are distributed across time. A model whose errors are concentrated during non-profitable hours and exhibits high accuracy during peak-value periods is significantly more valuable than a model with uniformly distributed errors, even when aggregate error magnitudes are comparable.Table IV. Relationship Between Statistical Metrics and Economic Profitability.
Statistical Metric Correlation with Economic Profit Implication for System Design RMSE Weak / Low Over-penalizes outliers that may be economically profitable MAE Weak Does not account for the direction or timing of errors Spearman () High Prioritizes timing accuracy and price-curve topology Log Det () High Focuses on error structure, dispersion, and diversity
VII. Market Controllability and Liquidity as Rank
A. Kalman Reachability in Thin Markets
B. Approximate Controllability of Fractional Markets
VIII. Separated Control Under Partial Observability
A. The Duncan-Mortensen-Zakai Equation in Finance
B. Maximum Likelihood Recursive Estimation
IX. Non-Markovian Dynamics and Path-Dependent Control
A. Signature Portfolios and Rough Path Theory
B. Optimization as a Convex Quadratic Problem
| Portfolio Type | Information Source | Mathematical Form | Practical Advantage |
|---|---|---|---|
| Markowitz | Asset covariance | Static quadratic form | Simple and well-understood |
| Functionally Generated | Current market weights | Log-gradient of a scalar function | Robust performance and long-term growth |
| Signature Portfolio | Full path history | Linear function of iterated integrals | Universal approximation of path-dependent strategies |
X. Actuator Saturation and the Geometry of Market Impact

A. The Missile Guidance Analogy: From Homing to Hedging
B. Anti-Windup and Model-Recovery Networks
C. Linear Parameter-Varying (LPV) Control for Impact
- Defining the Parameter Space: Construct a grid over relevant operating conditions, such as Mach number, altitude, and angle of attack in the missile guidance case, or volatility, bid–ask spread, and market depth in the trading case.
- Synthesizing Vertex Controllers: Design locally optimal control laws at each grid point using convex optimization techniques, typically formulated as Linear Matrix Inequalities (LMIs).
- Self-Scheduling: Blend or interpolate among the vertex controllers in real time as the scheduling parameter evolves across the grid, yielding a gain-scheduled or LPV control policy.
XI. Implications for Quantitative Trading System Design
A. Integrated Decision-Focused Architecture
- Incorporating the Control Law in Training: The loss function should be the negative expected profit (or utility) of the trades generated by the model, accounting for market impact and transaction costs:
- Actuator-Aware Alpha: Alpha models should be “aware” of liquidity constraints. High-conviction signals that cannot be executed due to saturation should be down-weighted in favor of lower-conviction but more “controllable” signals.
- Decorrelation from Consensus: To avoid crowded trades and market maker traps, the training objective can include a penalty term:
- Signature-Based Feature Encoding: For assets with complex dependencies, path signatures capture the historical context of the price action, providing a robust input for control policies compared to simple time-lagged features.
B. Integrated Trajectory and Execution Planning
C. The Role of Fuzzy Adaptive Gain Scheduling
D. Decorrelation as a Risk Management Strategy
E. The Role of Agent-Based Modeling (ABM)
XII. Competitive controllability and and Adversarial Feedback
A. From Passive Plants to Heterogeneous Controllers
B. The Control Fragility Index (CFI) and Metastability
- Metastable States: Bajpai K. (2024)[12] demonstrates that markets can exist in “metastable” configurations—appearing statistically stable while remaining critically vulnerable to small transient errors.
- Disintegration of Control: This explains why the “Market Control Illusion” is most dangerous at high levels of predictive accuracy. High-gain controllers optimized for local accuracy can unintentionally push the system toward a Critical Feedback Instability (CFI [12] threshold, where control is lost almost instantaneously.
C. Endogenous Instability as a Product of Local Optimality
- Reflexive Resonance: Algorithmic synchronization that amplifies noise into apparent signals.
- Liquidity Phase Transitions: The sudden collapse of the controllability matrix when adversarial feedback laws trigger a collective withdrawal of market-making actuators (Bajpai, 2024)[12].
D. Shifting the Design Paradigm
XIII. Empirical Case Study: Battery Energy Storage and the Accuracy-Value Gap
A. The Weak Correlation of RMSE and Revenue
| Metric Divergence | Metric Type | Mathematical Definition | Relationship to BESS Profit |
|---|---|---|---|
| Statistical (RMSE) | Weak; ignores timing and shape | 1 | |
| Dispersion (Cov-e) | Strong; reflects error consistency | 20 | |
| Association (Corr-f) | Average Spearman | Very Strong; captures the price curve | 2 |
| Decision-Focused | Optimal; incorporates the task | — |
XIV. Algorithmic Implementation: The Self-Scheduling LPV Controller
| Algorithm 1: Fuzzy Path-Dependent LPV Control for HFT |
![]() |
XV. Latency & Computational Analysis
A. Signature Update Complexity
- Traditional ML (RNN/LSTM): where N is sequence length.
- Signature Update: where d is asset count and k is signature depth.
B. Re-Optimization Latency
| Hardware/Stack | Latency () | Suitability |
|---|---|---|
| Python / NumPy | Mid-Frequency / Execution Algorithms | |
| C++ / Eigen | High-Frequency Trading (HFT) | |
| FPGA (Hardware) | Ultra-Low Latency / Market Making |
XVI. Forensic Case Study: Integrator Windup and the Anatomy of a Flash Crash
A. The Mechanism: Control in a Singular Region
B. The 2010 Flash Crash as a Feedback-Induced Instability
- Saturation: As the primary sell algorithm hit its volume-participation limits, it saturated the available bid-side liquidity.
- Windup: Competing market-making controllers, sensing a Liquidity Phase Transition, widened their spreads or triggered “kill switches.”
- The Spiral: The primary algorithm, seeing its “fills” drop to zero while its “error” skyrocketed, reached a state of maximum “windup energy.”
C. The Massive Overshoot: The Recovery Phase
D. From Forensic to Proactive: The Anti-Windup Solution
XVII. Path Signatures and the Challenge of Non-Markovian Markets
A. Universal Portfolio Approximation via Signatures
B. Robustness to Frictions

XVIII. Results: Simulating the Control Advantage
A. BESS Arbitrage: The Profit of "Shape" Over "Accuracy"
| Forecast Type | RMSE (Avg) | Spearman Association | Realized Annual Revenue (1MW) |
|---|---|---|---|
| ARX (Linear) | 12.5 | 0.45 | €145,000 |
| NN (Nonlinear) | 10.2 | 0.52 | €168,000 |
| Perfect Foresight | 0.0 | 1.00 | €200,000 |
| “Bad” Decorrelated | 14.8 | 0.58 | €182,000 |
B. Signature Portfolios: Path-Dependent Outperformance
| Market Index | Benchmark Sharpe | Signature Portfolio Sharpe (Net) | Max Drawdown Reduction |
|---|---|---|---|
| NASDAQ | 0.65 | 0.91 | 18% |
| SMI | 0.52 | 0.84 | 22% |
| S&P 500 | 0.58 | 0.79 | 15% |
XIX. Unified Signature-LPV Control Law
A. Path-Dependent Scheduling and the Signature Saturation Indicator
B. Geometry-Aware Gains: Roughness as a Control Parameter
- High-Roughness Regimes: When price paths exhibit high roughness, the probability of “integrator windup” increases. In these states, the LPV controller automatically dampens its gains to prevent the “Execution Death Spiral” identified in previous sections.
- Lead-Lag Anticipation: Using the signature’s “Area” term (the second-order iterated integral), the controller can detect whether the current price move is “leading” or “lagging” the broader market flow. This enables Geometry-Aware Gains, where the controller preemptively scales back trade aggression if the path geometry indicates that the agent is becoming the dominant (and thus price-moving) force in the adversarial environment.
C. Predictive Anti-Windup via Path Geometry
XX. The Controllability-Adjusted Sharpe Ratio ()
A. Mathematical Formulation
- is the expected net return.
- is the volatility of returns.
- is the Controllability Gramian, representing the “volume” of the state space reachable by the controller.
- represents the Control Energy, i.e., the “effort” required to steer the market price.
B. Interpretation: Penalizing "Expensive" Alpha
- High-Effort Alpha: An “accurate” model that is highly correlated with the consensus requires high control energy (high ) because it competes for the same limited liquidity “actuator” B. Even if the profit is high, will be low, reflecting high Control Fragility.
- Low-Effort Alpha: A “bad” or decorrelated model may have lower raw profit, but if it operates in a subspace where the Gramian W is well-conditioned (large eigenvalues), the “effort” term is small. This results in a superior .
C. Comparative Results: Standard vs. Controllability-Adjusted
| Strategy | Standard Sharpe | Control Energy () | (Adjusted) |
|---|---|---|---|
| MSE-Optimal (Consensus) | 1.22 | 8.41 | 0.14 |
| BESS Shape-Optimizer | 0.94 | 1.15 | 0.81 |
| Signature Portfolio | 0.91 | 0.88 | 1.03 |
| Decorrelated Model | 0.76 | 0.42 | 1.80 |
D. Conclusion on Metric Utility
XXI. Computational Feasibility and the Latency-Stability Trade-off
A. Complexity of Path Signatures and One-Pass Updates
B. The Information-Latency Frontier
- Low-Latency Regime: In high-frequency trading (HFT) environments, we recommend the use of truncated path signatures at low orders (typically levels 2 or 3) to minimize computational overhead, particularly the cost of estimating and inverting the Controllability Gramian under strict latency constraints.
- High-Capacity Regime: For institutional-scale execution problems, such as Battery Energy Storage System (BESS) arbitrage, higher-order path signatures are computationally feasible. In this regime, the dominant limitation arises from actuator saturation and physical constraints rather than execution speed, allowing richer non-Markovian representations.
C. Hardware Acceleration via FPGA
XXII. Conclusion
References
- Hubáček, O.; Šír, G. Beating the market with a bad predictive model. Work on decorrelation and market taker advantage cited in "The Market Control Illusion" 2018.
- Gatheral, J.; Jaisson, T.; Rosenbaum, M. Volatility is rough. Quantitative Finance 2018, 18, 933–949. [Google Scholar] [CrossRef]
- Zakai, M. On the optimal filtering of diffusion processes. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete 1969, 11, 230–243. [Google Scholar] [CrossRef]
- Duncan, T.E. Probability densities for diffusion processes with applications to nonlinear filtering theory. PhD Thesis, Stanford University, 1967. [Google Scholar]
- Lyons, T. Rough paths, signatures and the modelling of functions on streams. International Congress of Mathematicians 2014.
- Shamma, J.S.; Athans, M. Guaranteed performance of gain-scheduled control for linear parameter-varying plants. Automatica 1991, 27, 949–964. [Google Scholar] [CrossRef]
- Kalman, R.E. On the general theory of control systems. In Proceedings of the First International Congress of the International Federation of Automatic Control; 1960. [Google Scholar]
- Perdomo, J.; Richardson, T.; Dwivedi, R.; Ebert-Uphoff, I.; Re, C. Performative Prediction. International Conference on Machine Learning (ICML); 2020. [Google Scholar]
- Unknown (Cited in, Bajpai. Multi-market optimization for Battery Energy Storage Systems in the Swedish energy sector. Empirical study on Association and Dispersion metrics; 2024. [Google Scholar]
- Zhou, Y.; Jiao, F. Controllability of fractional evolution boundary value problems. Computers & Mathematics with Applications 2012, 64, 3120–3124. [Google Scholar]
- Tarbouriech, S.; Garcia, G.; Gomes da Silva Jr, J.; Queinnec, I. Control design for systems with saturation; Springer Science & Business Media, 2011. [Google Scholar]
- Bajpai, K. Markets as Adversarial Control Systems: Stability, Metastability, and Feedback-Induced Instability. https://doi.org/10.21203/rs.3.rs-8422090/v1, 2025. Preprint (Version 1), available at Research Square. [CrossRef]
| Control Theory Term | Trading / Financial Equivalent | Practical Business Meaning |
|---|---|---|
| Plant (The System) | The Market Microstructure | The external environment (Limit Order Book) that reacts to your orders. |
| Actuator Saturation | Liquidity Depth / Top-of-Book | “You cannot execute an infinitely large trade; the “physical” limit of the book stops your “motor.”” |
| Integrator Windup | Position Bloat / Toxic Inventory | When a strategy keeps buying into a falling market because the “error” (price gap) persists, leading to a massive, unmanageable position. |
| Anti-Windup Logic | Hard Risk Constraints / Stops | Clamping the execution logic so the system stops “accumulating” a position when the market isn’t responding. |
| Observer (Estimator) | Alpha Signal / Price Predictor | Your attempt to guess the “true state” of the market based on noisy data (the “Forecast”). |
| Control Law | Execution / OMS Logic | The rule that decides exactly how many shares to buy/sell based on the current signal and current position. |
| Controllability Gramian | Liquidity / Market Depth Matrix | A mathematical measure of whether you actually have enough “room” in the market to move your P&L to a desired state. |
| Closed-Loop Feedback | Market Impact / Slippage | The phenomenon where your own trading moves the price against you, changing the “state” you were trying to predict. |
| LPV (Linear Parameter-Varying) | Regime-Switching Alpha | A model that changes its behavior depending on external “scheduling variables” like Volatility or Volume. |
| Information State | Signature / Feature Vector | The minimal set of data (price paths, signatures) needed to make an optimal decision right now. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
