Submitted:
18 February 2025
Posted:
19 February 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. What Is Generative Modeling
2.1. Definition and Purpose
2.2. Types of Generative Models
3. Mathematical Intuition of Variational Autoencoders

3.1. The Generative Process
3.2. The Inference Process
3.3. The Evidence Lower Bound (ELBO)
3.4. The Loss Function
- Reconstruction Loss: The first term measures how well the model can reconstruct the input data from the latent representation.
- Regularization Loss: The second term penalizes the divergence between the variational distribution and the prior distribution, ensuring that the learned latent space follows the desired prior distribution (standard normal distribution).
3.5. Sampling from the Latent Space
4. Option Pricing and Implied Volatility
- r: risk-free rate
- q: dividend yield
- : speed of mean reversion. A high means rapid mean reversion, and stable long-dated volatilities.
- : long-term variance
- : volatility of variance. A high means more pronounced volatility smile
- : correlation coefficient a negative would depict downward-sloping skew (equity markets) and a positive would indicate an upward-sloping skew (some FX pairs)
4.1. Feller Condition
4.2. No Arbitrage Conditions
- Buy one call (or put) option of strike
- sell two call (or put) option of strike K
- Buy one call (or put) option of strike
5. Training VAE Model on Synthetic Volatility Surfaces
| Parameters | v0 | ||||
|---|---|---|---|---|---|
| Lower bound | 0.025 | -0.87 | 0.5 | 0.08 | 0.1 |
| Upper bound | 0.035 | -0.067 | 1.5 | 0.1 | 2.2 |
| VAE ARCHITECTURE | ||||
|---|---|---|---|---|
| Dimension | Activation Function | Number of batches | Number of Epochs | |
| Encoder | (255, 128, 64, 32) | 100 | 100 | |
| Latent Space Dim. | 16 | |||
| Decoder | (16, 32, 64, 128, 255) |
6. Completion of Volatility Surface by Latent Space Optimization
7. Concluding Remarks
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
Abbreviations
| VAE | Variational Autoencoder |
| VAEs | Variational Autoencoders |
References
- Alexander, C., Market Risk Analysis, Volume III: Pricing, Hedging and Trading Financial Instruments, Wiley, (2008).
- Bergeron, M., Fung, N., Hull, J., Poulos, Z., Variational autoencoders: A hands-off approach to volatility, Quantitative Finance, Computational Finance, arXiv:2102.03945v1 [q-fin.CP], (2021). [CrossRef]
- Black, F., Scholes, M., The Pricing of Options and Corporate Liabilities, Journal of Political Economy, 81(3), 637-654, (1973).
- Broussard, R. P., Atanasov, N., Oosterlee, C. W., Completing the Volatility Surface with Deep Generative Models, Journal of Financial Econometrics, (2022).
- Chataigner, L., Hull, J. C., White, H., Completing individual volatility surfaces using autoencoders, Journal of Economic Dynamics and Control, 130, 104261, (2021).
- Cont, R., Vuletić, M., Simulation of Arbitrage-Free Implied Volatility Surfaces, Applied Mathematical Finance, 30, 2, 94-121, (2023). [CrossRef]
- Derman, E., Miller, M. B., The Volatility Smile, Wiley Finance, (2016).
- Azemtsa Donfack, H., Wafo Soh, C., and Kotze, A., Volatility smile interpolation with Radial basis functions, International Journal of Theoretical and Applied Finance 25, Issue 07n08, (2022).
- Dumoulin, V., Belghazi, I., Poole, B., Lamb, A., Arjovsky, M., Mastropietro, O., Courville, A., Adversarially learned inference, International Conference on Learning Representations, (2017).
- Dupire, B., Pricing with a Smile, Risk, 7(1), 18-20, (1994).
- Fengler, M. R., Arbitrage-free smoothing of the implied volatility surface Quantitative Finance, 9(4), 417-428, (2009).
- Follmer, H., Kupiainen, A., Lescourret, F., Variational Autoencoders for Volatility Surfaces, arXiv preprint arXiv:2102.03945, (2021).
- Fung, C., Variational Autoencoders for Volatility Surfaces, Unpublished Masters thesis, Graduate Department of Electrical & Computer Engineering, University of Toronto, (2021).
- Gatheral, J., The Volatility Surface: A Practitioner’s Guide, Wiley Finance, (2006).
- Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y., Generative adversarial nets, Advances in Neural Information Processing Systems, 2672-2680, (2014).
- Gregor, K., Danihelka, I., Graves, A., Rezende, D., Wierstra, D., DRAW: A Recurrent Neural Network For Image Generation, International Conference on Machine Learning, 1462-1471, (2015).
- Grover, A., Dhar, M., Ermon, S., Flow-GAN: Combining maximum likelihood and adversarial learning in generative models, AAAI Conference on Artificial Intelligence, (2018). [CrossRef]
- Guo, Y., Heston, S., Lubke, T., Implied Volatility Surface Completion with Variational Autoencoder and Continuous-Time Stochastic Differential Equation, arXiv preprint arXiv:2108.04941, (2021).
- Heston, S.L., A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options, The Review of Financial Studies, 6(2), 327-343, (1993).
- Johnson, M., Duvenaud, D. K., Wiltschko, A., Adams, R. P., Datta, S. R., Composing graphical models with neural networks for structured representations and fast inference, Advances in Neural Information Processing Systems, 2946-2954, (2016).
- Kingma, D. P., Welling, M., Auto-Encoding Variational Bayes, arXiv:1312.6114, (2013).
- Kingma, D. P., Salimans, T., Jozefowicz, R., Chen, X., Sutskever, I., Welling, M., Improved variational inference with inverse autoregressive flow, Advances in Neural Information Processing Systems, 4743-4751, (2016).
- Mahdavi Damghani, B., Kos, A., De-arbitraging With a Weak Smile: Application to Skew Risk, Wilmott magazines, 2013, No.64, 1-60, (2013). [CrossRef]
- Niu, Q., No arbitrage conditions and characters of implied volatility, A review for implied volatility modelers, Princeton University thesis, (2015).
- Pascal François, R., Galarneau-Vincent, G., Gauthier, F., Godin, Venturing into uncharted territory: An extensible implied volatility surface model, Journal of Futures Markets, 42, 1912-1940, (2022).
- Rosca, M., Lakshminarayanan, B., Mohamed, S., Distribution matching in variational inference, arXiv preprint arXiv:1802.06847, (2018).
- Ulrich, M., Zimmer, L., Merbecks, C., Implied volatility surfaces: A comprehensive analysis using half a billion option prices, Review of Derivatives Research, 26, 135-169, (2023). [CrossRef]
- Wang, X., Zhao, Y., Bao, Y., Arbitrage-free conditions for implied volatility surface by Delta, North American Journal of Economics and Finance, 48, 819-834, (2019). [CrossRef]
- Zhang, W., Li, L., Zhang, G., A two-step framework for arbitrage-free prediction of the implied volatility surface, Quantitative Finance, 23, No. 1, 21-34, (2023). [CrossRef]
- Zheng, G., Wojciech, F., Tiranti, R., A new encoding of implied volatility surfaces for their synthetic generation, arXiv:2211.12892 [q-fin.PR], (2023).




| Original |
| Reconstructed |






| Heston surface total variance |
| VAE reconstructed surface total variance |
| % holes | nbr of holes | Training set | Validation set | ||
| Error Recons | Error Latent Opt | Error Recons | Error Latent Opt | ||
| 20% | 51 | 5 | 3.71 | 3.619 | 3.616 |
| 40% | 102 | 6.59 | 3.91 | 3.62 | 3.616 |
| 60% | 153 | 6.59 | 4.39 | 3.621 | 3.617 |
| 80% | 204 | 1.008 | 5.40 | 3.621 | 3.617 |
| 96% | 243 | 1.95 | 5.21 | 3.622 | 3.617 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).