Submitted:
20 June 2023
Posted:
20 June 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Residual Neural Networks as a Mean Field Optimal Control Problem
2.1. The Supervised Learning paradigm
- denotes the inputs of the NN;
- denotes the outputs of the NN;
- the corresponding targets.
2.2. Empirical Risk Minimization
2.3. Population Risk Minimization as Mean Field Optimal Control Problem
- going from layer index T to continuous parameter t;
- passing from a discrete set of inputs/output to distribution that represents the joint distribution in modelling the input-label distribution;
- passing from empirical risk minimization to population risk (i.e. minimization over expectation ).
- , , are bounded;
- f, L, are Lipschitz continuous with respect to x, with the Lipschitz constants of f and L being independent of parameters , and
- has finite support in
3. Stochastic Neural Network as a Stochastic Optimal Control Problem
4. Mean Field Neural Network as a Mean Field Optimal Transport
4.1. Mean Field Game
- a finite time horizon ;
- is the state space;
- the space of probability measure over ;
- describes the agent state, the mean-field term and the agent control;
- , and , provides the running and, resp., the terminal cost;
- represents the drift function ;
- is the volatility of the state.
- 1.
-
minimizes over α:where solves the SDEW being a d-dimensional Brownian motion and has distribution ;
- 2.
- for all , is the probability distribution of .
4.2. Mean Field Control
4.3. Mean Field Optimal Transport
- 1.
- Optimal control via direct approximation of controls v;
- 2.
- Deep Galerkin Method for solving a forward-backward systems of PDEs;
- 3.
- Augmented Lagrangian Method with Deep Learning exploiting the variational formulation of MFOT and the primal/dual approach.
4.4. Other approaches for learning Mean Field function
- the first data-driven approach, presented in [1], has been considered to solve a stochastic optimal control problem, where the unknown model parameters were estimated in real-time using a direct filter method. This method involves transitioning from the stochastic maximum principle to approximate the conditional probability density functions of the parameters given an observation, which is a set of random samples;
- in [26], the authors report a map that by operating over an appropriate classes of neural networks, specifically the Bin density-based approximation and Cylindrical approximation, is able to reconstruct a mapping between the Wasserstein space of probability measures and an infinite dimensional function space on a similar setting to MFG.
5. Conclusions and further directions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| DL | Deep Learning |
| HJB | Hamilton-Jacobi-Bellman |
| MFC | Mean Field Control |
| MFG | Mean Field Games |
| MFOCP | Mean Field Optimal Control Problem |
| ML | Machine Learning |
| MFOT | Mean Field Optimal Transport |
| NN | Neural Network |
| ODE | Ordinary Differential Equation |
| OT | Optimal Transport |
| SDE | Stochastic Differential Equation |
| SNN | Stochastic Neural Network |
| SGD | Stochastic Gradient Descent |
References
- Archibald, R.; Bao, F.; Yong, J. An Online Method for the Data Driven Stochastic Optimal Control Problem with Unknown Model Parameters, arXiv e-prints, 2022.
- Archibald, R.; Bao, F.; Cao, Y.; Zhang, H. ; A backward SDE method for uncertainty quantification in deep learning. Discrete and Continuous Dynamical Systems, 2022, 15(10): 2807-2835. [CrossRef]
- Baudelet, S.; Frénais, B.; Laurière, M.; Machtalay, A.; Zhu, Y. Deep Learning for Mean Field Optimal Transport. arXiv e-prints 2023. [CrossRef]
- Benamou, J. D; Carlier, G.; Santambrogio, F; Variational Mean Field Games. Nicola Bellomo, Pierre Degond, Eitan Tadmor. Active Particles, Volume 1, Springer, pp.141-171, 2017. 0129. [Google Scholar]
- Backhoff-Veraguas, J.; Bartl, D.; Beiglblock, M.; Wiesel, J. Estimating processes in adapted Wasserstein distance. Ann. Appl. Probab. 32 (1) 529 - 550, February 2022. [CrossRef]
- Bao, F.; Cao, Y.; Archibald, R.; Zhang, H. Uncertainty quantification for deep learning through stochastic maximum principle. arXiv: 3489122, 2021.
- Bonnet, B.; Cipriani, C.; Fornasier, M.; Huang, H. measure theoretical approach to the mean-field maximum principle for training NeurODEs, Nonlinear Analysis, Volume 227, 2023, 113161, ISSN 0362-546X. [CrossRef]
- Benoît, B. A Pontryagin Maximum Principle in Wasserstein spaces for constrained optimal control problems. ESAIM: COCV, 25 2019 52. [CrossRef]
- Cao, H. , Guo, X., Laurière, M., Connecting GANs, MFGs, and OT, arXiv e-prints, 2020.
- Carmona, R. , Lauriere, M. Deep Learning for Mean Field Games and Mean Field Control with Applications to Finance. In A. Capponi & C. Lehalle (Eds.), Machine Learning and Data Sciences for Financial Markets: A Guide to Contemporary Practices (pp. 369-392). Cambridge: Cambridge University Press. 2023. [Google Scholar] [CrossRef]
- Chizat, L.; Bach, F. , On the global convergence of gradient descent for overparameterized models using optimal transport. In Advances in neural information processing systems, 2018, pages 3040–305.
- Chizat, L.; Colombo, M. , Fernández-Real, X.; and Figalli, A. Infinite-width limit of deep linear neural networks, arXiv e-prints, 2022. [CrossRef]
- Cuturi, M. Sinkhorn distances: Lightspeed computation of optimal transport. In Advances in Neural Information Processing Systems, 2013 pages 2292–2300.
- de Bie, G.; Peyré, G.; Cuturi, M. Stochastic Deep Networks, Proceedings of the 36th International Conference on Machine Learning, 2019. Long Beach, California, PMLR 97.
- Di Persio, L. , Garbelli M. Deep Learning and Mean-Field Games: A Stochastic Optimal Control Perspective. Symmetry 2021 13(1):14.
- E, W.; Han, J.; Li, Q. A mean-field optimal control formulation of deep learning. Res Math Sci 6, 10. 2019. [Google Scholar] [CrossRef]
- Fernández-Real, X. : Figalli, A. The Continuous Formulation of Shallow Neural Networks as Wasserstein-Type Gradient Flows. In: Avila, A., Rassias, M.T., Sinai, Y. (eds) Analysis at Large. Springer, Cham. 2022. [Google Scholar] [CrossRef]
- Gangbo, W.; Mayorga, S.; Swiech, A. Finite Dimensional Approximations of Hamilton-Jacobi Bellman Equations in Spaces of Probability Measures. SIAM Journal on Mathematical Analysis 2021 53:2, 1320-1356.
- He, K.; Zhang, X.; Ren, S.; Sun, J. ; "Deep Residual Learning for Image Recognition," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 770-778. [CrossRef]
- Han Huang, Jiajia Yu, Jie Chen, Rongjie Lai, Bridging mean-field games and normalizing flows with trajectory regularization, Journal of Computational Physics, Volume 487, 2023, 112155, ISSN 0021-9991. [CrossRef]
- Jimenez, C. ; A. Marigonda A.; Quincampoix, M. Dynamical systems and Hamilton-Jacobi-Bellman equations on the Wasserstein space and their L2 representations, 2022. Preprint at https://cvgmt.sns.it/media/doc/paper/5584/AMCJMQ_HJB_2022-03-30.pdf.
- Hashimoto, T.; Gifford, D.; Jaakkola, T. Learning population-level diffusions with generative rnns. International Conference on Machine Learning, 2016, pages 2417–2426.
- Li, Q.; Lin, T.; Shen, Z. . Deep Learning via Dynamical Systems: An Approximation Perspective. 2019; arXiv:1912.10382v1. [Google Scholar]
- Li, Q., Long, C., Cheng, T.; E, W. Maximum principle based algorithms for deep learning. J. Mach. Learn. Res. 2017, 18, 5998–6026.
- Mikami, T. 4: Two End Points Marginal Problem by Stochastic Optimal Transportation SIAM Journal on Control and Optimization 2015 53:4, 2449-2461.
- Pham, H.; Warin, X. Mean-field neural networks: learning mappings on Wasserstein space, 2022 arXiv e-prints.
- Peyré, G.; Cuturi, M. Computational Optimal Transport: With Applications to Data Science, Foundations and Trends in Machine Learning, 2019. Vol. 11: No. 5-6, pp 355-607. [CrossRef]
- Sirignano, J.; Spiliopoulos, K. Mean Field Analysis of Deep Neural Networks. Mathematics of Operations Research 2021, 47(1):120-152.
- Tan, X.; Nizar Touzi, N. ; Optimal transportation under controlled stochastic dynamics." The Annals of Probability, 41(5) 3201-3240 2013. [CrossRef]
- Wojtowytsch, S. , On the Convergence of Gradient Descent Training for Two-layer ReLU-networks in the Mean Field Regime, arXiv e-prints, 2020. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).