Fisher meets Feynman: score-based variational inference with a product of experts

Generative AI & LLMs
Published: arXiv: 2510.21598v1
Authors

Diana Cai Robert M. Gower David M. Blei Lawrence K. Saul

Abstract

We introduce a highly expressive yet distinctly tractable family for black-box variational inference (BBVI). Each member of this family is a weighted product of experts (PoE), and each weighted expert in the product is proportional to a multivariate $t$-distribution. These products of experts can model distributions with skew, heavy tails, and multiple modes, but to use them for BBVI, we must be able to sample from their densities. We show how to do this by reformulating these products of experts as latent variable models with auxiliary Dirichlet random variables. These Dirichlet variables emerge from a Feynman identity, originally developed for loop integrals in quantum field theory, that expresses the product of multiple fractions (or in our case, $t$-distributions) as an integral over the simplex. We leverage this simplicial latent space to draw weighted samples from these products of experts -- samples which BBVI then uses to find the PoE that best approximates a target density. Given a collection of experts, we derive an iterative procedure to optimize the exponents that determine their geometric weighting in the PoE. At each iteration, this procedure minimizes a regularized Fisher divergence to match the scores of the variational and target densities at a batch of samples drawn from the current approximation. This minimization reduces to a convex quadratic program, and we prove under general conditions that these updates converge exponentially fast to a near-optimal weighting of experts. We conclude by evaluating this approach on a variety of synthetic and real-world target distributions.

Paper Summary

Problem
The main problem addressed by this research paper is the challenge of variational inference (VI) in machine learning. VI is a method used to approximate an intractable probability density p by a simpler, more tractable density q. However, current methods for black-box variational inference (BBVI) face several challenges, including the trade-off between the expressivity of the variational family Q and its ease of use.
Key Innovation
The key innovation of this research paper is the introduction of a new family for score-based BBVI that navigates these trade-offs in an appealing fashion. The densities in this family are highly expressive and yet manageably tractable, and the researchers are able to provide certain theoretical guarantees for the optimizations required for score-based VI. Each density in this family is a product of experts (PoE), and each (weighted) expert is proportional to a multivariate t-distribution.
Practical Impact
This research has several practical impacts. First, it provides a new family of densities for BBVI that are highly expressive and yet manageably tractable. This means that researchers and practitioners can use this family to approximate complex probability distributions in a wide range of applications, from machine learning to statistics. Second, the researchers provide certain theoretical guarantees for the optimizations required for score-based VI, which means that users can have confidence in the accuracy of their approximations. Finally, the researchers show how to sample from and evaluate the densities in this family, which is a crucial requirement for using them for VI.
Analogy / Intuitive Explanation
One way to think about this research is to imagine a team of experts working together to approximate a complex probability distribution. Each expert has a different opinion about the distribution, and the team needs to find a way to combine these opinions in a way that is both accurate and efficient. The researchers in this paper introduce a new way of combining these opinions, using a product of experts (PoE) that is proportional to a multivariate t-distribution. This allows the team to approximate the complex distribution in a highly expressive and yet manageably tractable way.
Paper Information
Categories:
stat.ML cs.LG
Published Date:

arXiv ID:

2510.21598v1

Quick Actions