Submitted:
27 September 2023
Posted:
29 September 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Generative Modelling
1.2. Trilemma of Generative Learning

2. Background
2.1. Classical Image Synthesis
2.1.1. Restricted Boltzmann Machine

2.1.2. Variational Autoencoder

2.1.3. Generative Adversarial Network

2.1.4. Denoising Diffusion Probabilistic Model

- Increasing depth versus width, holding model size relatively constant.
- Increasing the number of attention heads.
- Using attention at 32×32, 16×16, and 8×8 resolutions rather than only at 16×16.
- Using the BigGAN residual block for upsampling and downsampling the activations, following
- Rescaling residual connections with
2.2. Quantum Machine Learning
2.2.1. Quantum Boltzmann Machine
2.2.2. Image Synthesis
3. Methods
3.1. Goal
- perform the image synthesis directly on the QBM
- evaluate the performance of the QBM against a: RBM, VAE, GAN, & DDPM
- evaluate various generative modelling methods on FID, KID, and Inception scores
- model a richer image dataset, CIFAR-10
3.2. Data

3.3. Classical Models
3.4. Quantum Model
3.5. Hyper-Parameters
3.6. Metrics
3.6.1. Inception Score
3.6.2. Fréchet Inception Distance (FID)
3.6.3. Kernel Inception Distance (KID)
3.6.4. Quantitative Metrics
| Metric | Description | Performance |
|---|---|---|
| Inception | KL-Divergence between conditional and marginal label distributions over generated data | Higher is better |
| FID | Wasserstein-2 distance between multivariate Gaussians fitted to data embedded into a feature space | Lower is better |
| KID | Measures the dissimilarity between two probability distributions and using samples drawn independently from each distribution | Lower is better |
3.6.5. Qualitative Metrics
4. Results
4.1. Restricted Boltzmann Machine (RBM)
4.2. Variational Autoencoder (VAE)
4.3. Generative Adversarial Network (GAN)
4.4. Denoising Diffusion Probabilistic Model (DDPM)

4.5. Quantum Boltzmann Machine (QBM)
5. Analysis
5.1. Scores
5.1.1. Inception Score
5.1.2. Fréchet Inception Distance
5.1.3. Kernel Inception Distance
5.2. Feature Extraction
5.3. Trilemma of Generative Learning
5.3.1. High-Quality Sampling
5.3.2. Mode Coverage & Diversity
5.3.3. Fast Sampling
5.3.4. Conclusion
6. Conclusion & Future Work
- Restricted Boltzmann Machine
- Variational Autoencoder
- Generative Adversarial Network
- Denoising Diffusion Probabilistic Model
6.1. Image Preprocessing
6.2. Quantum Computing
Acknowledgments
References
- Weng, L. What are diffusion models? lilianweng.github.io/lil-log.
- Ho, J.; Jain, A.; Abbeel, P. 2020; arXiv:cs.LG/2006.11239].
- Dhariwal, P.; Nichol, A. 2021; arXiv:cs.LG/2105.05233].
- Jain, S.; Ziauddin, J.; Leonchyk, P.; Yenkanchi, S.; Geraci, J. Quantum and classical machine learning for the classification of non-small-cell lung cancer patients. SN Applied Sciences 2020, 2. [Google Scholar] [CrossRef]
- Thulasidasan, S. Generative Modeling for Machine Learning on the D-Wave. Technical report, 2016. [CrossRef]
- Amin, M.H.; Andriyash, E.; Rolfe, J.; Kulchytskyy, B.; Melko, R. Quantum Boltzmann Machine. Physical Review X 2018, 8. [Google Scholar] [CrossRef]
- Xiao, Z.; Kreis, K.; Vahdat, A. Tackling the Generative Learning Trilemma with Denoising Diffusion GANs. In Proceedings of the International Conference on Learning Representations (ICLR); 2022. [Google Scholar]
- Smolensky, P. Information processing in dynamical systems: foundations of harmony theory. 1986.
- Freund, Y.; Haussler, D. Unsupervised learning of distributions on binary vectors using two layer networks. In Proceedings of the Advances in Neural Information Processing Systems; Moody, J.; Hanson, S.; Lippmann, R., Eds. Morgan-Kaufmann, Vol. 4. 1991. [Google Scholar]
- Hopfield, J.J. Neural networks and physical systems with emergent collective computational abilities. Proceedings of the National Academy of Sciences 1982, 79, 2554–2558. [Google Scholar] [CrossRef] [PubMed]
- Hinton, G.E. A Practical Guide to Training Restricted Boltzmann Machines. In Lecture Notes in Computer Science; Springer Berlin Heidelberg, 2012; pp. 599–619. [CrossRef]
- Carreira-Perpiñán, M.Á.; Hinton, G.E. On Contrastive Divergence Learning. In Proceedings of the AISTATS; 2005. [Google Scholar]
- Kingma, D.P.; Welling, M. 2014; arXiv:stat.ML/1312.6114].
- Rocca, J. Understanding variational autoencoders (VAES), 2021.
- Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. 2014; arXiv:stat.ML/1406.2661].
- A beginner’s guide to generative adversarial networks (gans).
- Common problems | generative adversarial networks | google developers.
- Hinton, G.E. Training Products of Experts by Minimizing Contrastive Divergence. Neural Comput. 2002, 14, 1771–1800. [Google Scholar] [CrossRef] [PubMed]
- What is quantum annealing?
- Sleeman, J.; Dorband, J.; Halem, M. A Hybrid Quantum enabled RBM Advantage: Convolutional Autoencoders For Quantum Image Compression and Generative Learning 2020.
- Krizhevsky, A.; Nair, V.; Hinton, G. CIFAR-10 (Canadian Institute for Advanced Research).
- Krizhevsky, A. Learning multiple layers of features from tiny images. Technical report, 2009.
- Peter Eckersley, Y.N.e.a. EFF AI Progress Measurement Project, 2017.
- Mack, D. A simple explanation of the inception score, 2019.
- Szegedy, C.; Vanhoucke, V.; Ioffe, S.; Shlens, J.; Wojna, Z. Rethinking the Inception Architecture for Computer Vision, 2015. [CrossRef]
- Salimans, T.; Goodfellow, I.; Zaremba, W.; Cheung, V.; Radford, A.; Chen, X. Improved Techniques for Training GANs, 2016. [CrossRef]
- Heusel, M.; Ramsauer, H.; Unterthiner, T.; Nessler, B.; Hochreiter, S. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium. In Proceedings of the Advances in Neural Information Processing Systems; Guyon, I.; Luxburg, U.V.; Bengio, S.; Wallach, H.; Fergus, R.; Vishwanathan, S.; Garnett, R., Eds. Curran Associates, Inc., Vol. 30. 2017. [Google Scholar]
- Heusel, M.; Ramsauer, H.; Unterthiner, T.; Nessler, B.; Hochreiter, S. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium 2017. [CrossRef]
- Bińkowski, M.; Sutherland, D.J.; Arbel, M.; Gretton, A. Demystifying MMD GANs, 2018. [CrossRef]
- Cloud tensor processing units (tpus) | google cloud.
- Dhillon, P.S.; Foster, D.; Ungar, L. Transfer Learning Using Feature Selection, 2009. [CrossRef]








| QBM | RBM | VAE | GAN | DDPM | |
|---|---|---|---|---|---|
| Epochs | 10 | 10 | 50 | 50 | 30000 |
| Batch Size | 256 | 256 | 512 | 128 | - |
| # of Hidden Nodes | 128 | 2500 | 32 | 64 | 32 |
| Learning Rate () | 0.0035 | 0.0035 | 0.2 | 0.2 | 0.2 |
| QBM | RBM | VAE | GAN | DDPM | |
|---|---|---|---|---|---|
| Inception | 1.77 | 3.84 | 7.87 | 2.72 | 3.319 |
| FID | 210.83 | 379.65 | 93.48 | 122.49 | 307.51 |
| KID | 0.068 | 0.191 | 0.024 | 0.033 | 0.586 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).