Submitted:
12 December 2025
Posted:
16 December 2025
You are already at the latest version
Abstract

Keywords:
1. Introduction
2. Theoretical Foundations and Core Issues
2.1. The Economic Significance of Statistical Arbitrage
2.2. Key Issues and Controversies
3. Methodological Evolution: From Classical Models to Artificial Intelligence

2.3. Statistical Arbitrage in the Era of Classical Econometric Model
2.3.1. Data Preprocessing
2.3.2. Statistical Relationship Identification
2.3.3. Factor and Dimension Optimization
2.3.4. Dynamic Forecasting and Signal Capture
2.3.5. Volatility and Risk Control
2.3.6. Deepening Multivariate Dependencies
2.3.7. Summary of This Era
2.4. The Extension in the Era of AI
2.4.1. Supervised Learning and Unsupervised Learning
2.4.2. Reinforcement Learning
2.4.3. Deep Learning
2.4.4. Summary of the Extension in This Era
3. Cross-Market Applications and Empirical Performance
3.1. Stock Market
3.2. FX Market
3.3. Crypto Market
3.4. Derivative Market
3.5. ETF&LETF Market
4. Practical Challenges and Solutions Based on Human-AI Collaboration
4.1. Challenges in the Real Trading
4.2. Construction of Statistical Arbitrage Trading Model Coordinated by AI and Human Beings
4.2.1. Limitations of Statistical Arbitrage with Machine Learning as the Main Body
4.2.2. Compensatory Advantages and Irreplaceability of Human Wisdom
4.2.3. The Design of Statistical Arbitrage System of Human-Computer Collaboration
4.2.4. Operational Human-AI Control Algorithm


4.2.5. The Robustness of Human-AI Collaboration and Its Prospects
| Dimension | Traditional Statistical Arbitrage | Machine Learning-Based Statistical Arbitrage | Human-AI Collaborative Framework (This study) |
| Core Decision Logic | Rule-based, econometric or parametric assumptions | Data-driven black-box or semi-black-box models | Hybrid: Algorithmic prediction + human strategic judgment |
| Interpretability | High (transparent mathematical structure) | Low to medium(model-dependent) | Medium to high (AI + human oversight + explanability tools) |
| Tail Risk Control | Weak or indirect | Medium (depends on optimization constraints) | Strong (explicit CVaR screening + drawdown monitoring + human intervention) |
| Robustness to Structural Breaks | Low | Medium (If retrained frequently) | Strong (explicit CVaR screening + drawdown monitoring + human intervention) |
| Response to Extreme Events | Poor | Unreliable/unstable | Adaptive and flexible (human-in-the-loop decision override) |
| Overfitting Risk | Low to medium | High (especially in deep models) | Controlled (human validation + stress scenario filtering) |
| Regulatory Transparency | Medium | Low | High (traceable logic + explainable risk process) |
| Ethical/Accountability Layer | Limited | Ambiguous | Clear (human accountability embedded in decision chain) |
| Practical Implementability | High | Medium to high | High (combines automation with portfolio manager control) |
| Long-term Stability | Medium | Uncertain | High (multi-layer risk governance loop) |
5. Conclusion
5.1. Summary
5.2. Limitations
Acknowledgments
References
- Alexander, C., Chen, X., Deng, J., & Wang, T. (2024). Arbitrage opportunities and efficiency tests in crypto derivatives. Journal of Financial Markets, 71, 100930. [CrossRef]
- Avellaneda, M., & Lee, J.-H. (2010). Statistical arbitrage in the US equities market. Quantitative Finance, 10(7), 761–782. [CrossRef]
- Balladares, K., Ramos-Requena, J. P., Trinidad-Segovia, J. E., & Sánchez-Granero, M. A. (2021). Statistical Arbitrage in Emerging Markets: A Global Test of Efficiency. Mathematics, 9(2), 179. [CrossRef]
- Bollerslev, T. (1986). Generalized autoregressive conditional heteroskedasticity. Journal of Econometrics, 31(3), 307–327. [CrossRef]
- Caneo, F., & Kristjanpoller, W. (2021). Improving statistical arbitrage investment strategy: Evidence from Latin American stock markets. International Journal of Finance & Economics, 26(3), 4424–4440. [CrossRef]
- Ciacci, A., Sueshige, T., Takayasu, H., Christensen, K., & Takayasu, M. (2020). The microscopic relationships between triangular arbitrage and cross-currency correlations in a simple agent based model of foreign exchange markets. PLOS ONE, 15(6), e0234709. [CrossRef]
- Coache, A., Jaimungal, S., & Cartea, Á. (2023). Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning. SIAM Journal on Financial Mathematics, 14(4), 1249–1289. [CrossRef]
- De Moura, C. E., Pizzinga, A., & Zubelli, J. (2016). A pairs trading strategy based on linear state space models and the Kalman filter. Quantitative Finance, 16(10), 1559–1573. [CrossRef]
- Dickey, D. A., & Fuller, W. A. (1979). Distribution of the Estimators for Autoregressive Time Series with a Unit Root. Journal of the American Statistical Association, 74(366a), 427–431.
- Do, B., & Faff, R. (2010). Does Simple Pairs Trading Still Work? Financial Analysts Journal, 66(4), 83–95. [CrossRef]
- Elliott, R. J., Van Der Hoek , J., & Malcolm, W. P. (2005). Pairs trading. Quantitative Finance, 5(3), 271–276.
- Engle, R. (2002). Dynamic Conditional Correlation: A Simple Class of Multivariate Generalized Autoregressive Conditional Heteroskedasticity Models. Journal of Business & Economic Statistics, 20(3), 339–350.
- Engle, R. F., & Granger, C. W. J. (1987). Co-Integration and Error Correction: Representation, Estimation, and Testing. Econometrica, 55(2), 251. https://doi.org/10.2307/1913236. [CrossRef]
- Frino, A., Mollica, V., Webb, R. I., & Zhang, S. (2017). The impact of latency sensitive trading on high frequency arbitrage opportunities. Pacific-Basin Finance Journal, 45, 91–102. [CrossRef]
- Gatev, E., Goetzmann, W. N., & Rouwenhorst, K. G. (2006). Pairs Trading: Performance of a Relative-Value Arbitrage Rule. Review of Financial Studies, 19(3), 797–827. [CrossRef]
- Greenwald, B. C., & Stein, J. C. (1991). Transactional Risk, Market Crashes, and the Role of Circuit Breakers. The Journal of Business, 64(4), 443. [CrossRef]
- Han, C., He, Z., & Toh, A. J. W. (2023). Pairs trading via unsupervised learning. European Journal of Operational Research, 307(2), 929–947. [CrossRef]
- Hochreiter, S., & Schmidhuber, J. (1997). Long Short-Term Memory. Neural Computation, 9(8), 1735–1780.
- Hogan, S., Jarrow, R., Teo, M., & Warachka, M. (2004). Testing market efficiency using statistical arbitrage with applications to momentum and value strategies. Journal of Financial Economics, 73(3), 525–565. [CrossRef]
- Hong, H., & Stein, J. C. (2007). Disagreement and the Stock Market. Journal of Economic Perspectives, 21(2), 109–128.
- Jaimungal, S., Pesenti, S. M., Wang, Y. S., & Tatsat, H. (2022). Robust Risk-Aware Reinforcement Learning. SIAM Journal on Financial Mathematics, 13(1), 213–226. [CrossRef]
- Jarrow, R., Teo, M., Tse, Y. K., & Warachka, M. (2012). An improved test for statistical arbitrage. Journal of Financial Markets, 15(1), 47–80. [CrossRef]
- Jegadeesh, N., & Titman, S. (1993). Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency. The Journal of Finance, 48(1), 65–91.
- Johansen, S. (1988). Statistical analysis of cointegration vectors. Journal of Economic Dynamics and Control, 12(2–3), 231–254. [CrossRef]
- Krauss, C., Do, X. A., & Huck, N. (2017). Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500. European Journal of Operational Research, 259(2), 689–702. [CrossRef]
- Lin, B., & Tan, Z. (2023). Exploring arbitrage opportunities between China’s carbon markets based on statistical arbitrage pairs trading strategy. Environmental Impact Assessment Review, 99, 107041. [CrossRef]
- Lütkebohmert, E., & Sester, J. (2021). Robust statistical arbitrage strategies. Quantitative Finance, 21(3), 379–402. [CrossRef]
- Marshall, B. R., Nguyen, N. H., & Visaltanachoti, N. (2013). ETF arbitrage: Intraday evidence. Journal of Banking & Finance, 37(9), 3486–3498. [CrossRef]
- Murphy, N., & Gebbie, T. (2021). Learning the dynamics of technical trading strategies. Quantitative Finance, 21(8), 1325–1349. [CrossRef]
- Nasekin, S., & Härdle, W. K. (2019). Model-driven statistical arbitrage on LETF option markets. Quantitative Finance, 19(11), 1817–1837. [CrossRef]
- Piotrowski, E. W., & Sladkowski, J. (2004). Arbitrage risk induced by transaction costs. Physica A: Statistical Mechanics and Its Applications, 331(1–2), 233–239. [CrossRef]
- Ramos-Requena, J. P., Trinidad-Segovia, J. E., & Sánchez-Granero, M. Á. (2020). Some Notes on the Formation of a Pair in Pairs Trading. Mathematics, 8(3), 348. [CrossRef]
- Rousseeuw, P. J., & Croux, C. (1993). Alternatives to the Median Absolute Deviation. Journal of the American Statistical Association, 88(424), 1273–1283.
- Serrano, C., & Hoesli, M. (2012). Fractional Cointegration Analysis of Securitized Real Estate. The Journal of Real Estate Finance and Economics, 44(3), 319–338. [CrossRef]
- Stephenson, J., Vanstone, B., & Hahn, T. (2021). A Unifying Model for Statistical Arbitrage: Model Assumptions and Empirical Failure. Computational Economics, 58(4), 943–964. [CrossRef]
- Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Botstein, D., & Altman, R. B. (2001). Missing value estimation methods for DNA microarrays. Bioinformatics, 17(6), 520–525. [CrossRef]
- Vergara, G., & Kristjanpoller, W. (2024). Deep reinforcement learning applied to statistical arbitrage investment strategy on cryptomarket. Applied Soft Computing, 153, 111255. [CrossRef]
- Zhang, W., Li, S., Guo, Z., & Yang, Y. (2023). A hybrid forecasting model based on deep learning feature extraction and statistical arbitrage methods for stock trading strategies. Journal of Forecasting, 42(7), 1729–1749. [CrossRef]
- Kanamura, T. (2025). Sustainability arbitrage pricing of ESG derivatives. International Review of Financial Analysis, 104, 104177. [CrossRef]
- Bailey, D. H., & Lopez De Prado, M. (2014). The Deflated Sharpe Ratio: Correcting for Selection Bias, Backtest Overfitting and Non-Normality. SSRN Electronic Journal.
- Fischer, T., & Krauss, C. (2018). Deep learning with long short-term memory networks for financial market predictions. European Journal of Operational Research, 270(2), 654–669. [CrossRef]
- Flori, A., & Regoli, D. (2021). Revealing Pairs-trading opportunities with long short-term memory networks. European Journal of Operational Research, 295(2), 772–791. [CrossRef]
- Huck, N. (2019). Large data sets and machine learning: Applications to statistical arbitrage. European Journal of Operational Research, 278(1), 330–342. [CrossRef]
- Krauss, C., Do, X. A., & Huck, N. (2017). Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500. European Journal of Operational Research, 259(2), 689–702. [CrossRef]
- Chow, Y., Ghavamzadeh, M., Janson, L., & Pavone, M. (2018). Risk-Constrained Reinforcement Learning with Percentile Risk Criteria. Journal of Machine Learning Research, 18, 1–51.
- Rockafellar, R. T., & Uryasev, S. (2000). Optimization of Conditional Value-at-Risk. Journal of Risk, 2(3), 21–42. [CrossRef]
- Ning, B. (2024). Advanced Statistical Arbitrage with Reinforcement Learning. arXiv preprint arXiv:2403.12180. [CrossRef]
- Rockafellar, R. T., & Uryasev, S. (2002). Conditional Value-at-Risk for General Loss Distributions. Journal of Banking & Finance, 26(7), 1443–1471. [CrossRef]
- Yuan, H. (2024). Alpha-GPT 2.0: Human-in-the-Loop AI for Quantitative Investment. arXiv preprint arXiv:2402.09746.
- Mosqueira-Rey, E., Hernández-Pereira, E., Alonso-Ríos, D., Bobes-Bascarán, J., & Fernández-Leal, Á. (2023). Human-in-the-loop Machine Learning: A State of the Art. Artificial Intelligence Review, 56, 3005–3054. [CrossRef]
- OECD & Financial Stability Board (FSB). (2024). OECD–FSB Roundtable on Artificial Intelligence (AI) in Finance: Summary of Key Findings.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).