Investigation of D-Wave quantum annealing for training Restricted Boltzmann Machines and mitigating catastrophic forgetting

Generative AI & LLMs
Published: arXiv: 2508.15697v1
Authors

Abdelmoula El-Yazizi Yaroslav Koshka

Abstract

Modest statistical differences between the sampling performances of the D-Wave quantum annealer (QA) and the classical Markov Chain Monte Carlo (MCMC), when applied to Restricted Boltzmann Machines (RBMs), are explored to explain, and possibly address, the absence of significant and consistent improvements in RBM trainability when the D-Wave sampling was used in previous investigations. A novel hybrid sampling approach, combining the classical and the QA contributions, is investigated as a promising way to benefit from the modest differences between the two sampling methods. No improvements in the RBM training are achieved in this work, thereby suggesting that the differences between the QA-based and MCMC sampling, mainly found in the medium-to-low probability regions of the distribution, which are less important for the quality of the sample, are insufficient to benefit the training. Difficulties in achieving sufficiently high quality of embedding RBMs into the lattice of the newer generation of D-Wave hardware could be further complicating the task. On the other hand, the ability to generate samples of sufficient variety from lower-probability parts of the distribution has a potential to benefit other machine learning applications, such as the mitigation of catastrophic forgetting (CF) during incremental learning. The feasibility of using QA-generated patterns of desirable classes for CF mitigation by the generative replay is demonstrated in this work for the first time. While the efficiency of the CF mitigation using the D-Wave QA was comparable to that of the classical mitigation, both the speed of generating a large number of distinct desirable patterns and the potential for further improvement make this approach promising for a variety of challenging machine learning applications.

Paper Summary

Problem
The main problem addressed in this research paper is the lack of significant improvements in training Restricted Boltzmann Machines (RBMs) using the D-Wave quantum annealer (QA). Despite initial promise, previous studies failed to achieve substantial improvements in RBM trainability when using the D-Wave QA for sampling.
Key Innovation
The key innovation of this work is the development of a novel hybrid sampling approach that combines the classical Markov Chain Monte Carlo (MCMC) method with the QA contribution. This approach aims to benefit from the modest differences between the two sampling methods and potentially address the lack of improvements in RBM training.
Practical Impact
The research could have a significant impact on various machine learning applications, particularly in the mitigation of catastrophic forgetting (CF) during incremental learning. The QA-generated patterns of desirable classes can be used for CF mitigation using generative replay, which could be beneficial for challenging machine learning tasks. Additionally, the approach could be used to generate samples of sufficient variety from lower-probability parts of the distribution, which could be useful in other machine learning applications.
Analogy / Intuitive Explanation
Imagine trying to find a needle in a haystack. The D-Wave QA is like a special kind of searchlight that can shine on the haystack and highlight the areas where the needle is likely to be. However, the searchlight may not always shine perfectly, and the needle may still be difficult to find. The hybrid sampling approach is like using multiple searchlights, including the D-Wave QA and the classical MCMC method, to cover more ground and increase the chances of finding the needle.
Paper Information
Categories:
cs.LG quant-ph stat.ML
Published Date:

arXiv ID:

2508.15697v1

Quick Actions