Abstract: We study a natural Wasserstein gradient flow on manifolds of probability distributions with discrete sample spaces. We derive the Riemannian structure for the probability simplex from the dynamical formulation of the Wasserstein distance on a weighted graph. We pull back the geometric structure to the parameter space of any given probability model, which allows us to define a natural gradient flow there. In contrast to the natural Fisher-Rao gradient, the natural Wasserstein gradient incorporates a ground metric on sample space. We illustrate the analysis of elementary exponential family examples and demonstrate an application of the Wasserstein natural gradient to maximum likelihood estimation.
Recommendations
Cites work
- scientific article; zbMATH DE number 3897897 (Why is no real title available?)
- scientific article; zbMATH DE number 3761167 (Why is no real title available?)
- scientific article; zbMATH DE number 3894218 (Why is no real title available?)
- A computational fluid mechanics solution to the Monge-Kantorovich mass transfer problem
- A gradient structure for reaction–diffusion systems and for energy-drift-diffusion systems
- An Extended Cencov Characterization of the Information Metric
- Axiomatic Geometry of Conditional Models
- Computations of optimal transport distance with Fisher information regularization
- Constrained steepest descent in the 2-Wasserstein metric
- Entropy dissipation of Fokker-Planck equations on graphs
- Fokker-Planck equations for a free energy functional or Markov process on a graph
- Geodesics of minimal length in the set of probability measures on graphs
- Geometry of matrix decompositions seen through optimal transport and information geometry
- Gradient flows of the entropy for finite Markov chains
- Information geometry
- Information geometry and its applications
- Information geometry connecting Wasserstein distance and Kullback-Leibler divergence via the entropy-relaxed transportation problem
- Information geometry of Wasserstein divergence
- Natural gradient flow in the mixture geometry of a discrete exponential family
- On the Fisher metric of conditional probability polytopes
- Optimal Transport
- Ricci curvature for parametric statistics via optimal transport
- Some geometric calculations on Wasserstein space
- THE GEOMETRY OF DISSIPATIVE EVOLUTION EQUATIONS: THE POROUS MEDIUM EQUATION
- The Density Manifold and Configuration Space Quantization
- Towards the geometry of estimation of distribution algorithms based on the exponential family
- Wasserstein geometry of Gaussian measures
Cited in
(33)- Analysis of asymptotic escape of strict saddle sets in manifold optimization
- Ricci curvature for parametric statistics via optimal transport
- Affine statistical bundle modeled on a Gaussian Orlicz-Sobolev space
- Geometry and convergence of natural policy gradient methods
- Optimal transport natural gradient for statistical manifolds with continuous sample space
- Wasserstein proximal of GANs
- Quantum statistical learning via quantum Wasserstein natural gradient
- Wasserstein gradients for the temporal evolution of probability distributions
- Neural parametric Fokker-Planck equation
- Interacting Langevin diffusions: gradient structure and ensemble Kalman sampler
- Natural gradient for combined loss using wavelets
- Weyl geometric approach to the gradient-flow equations in information geometry
- Wasserstein information matrix
- Mirror descent algorithms for minimizing interacting free energy
- Affine natural proximal learning
- The information geometry of mirror descent
- Pseudo-Riemannian geometry encodes information geometry in optimal transport
- Transport information geometry: Riemannian calculus on probability simplex
- Hessian transport gradient flows
- Lagrangian and Hamiltonian dynamics for probabilities on the statistical bundle
- Information geometry of physics-informed statistical manifolds and its use in data assimilation
- Online natural gradient as a Kalman filter
- Information geometry of smooth densities on the Gaussian space: Poincaré inequalities
- Accelerated information gradient flow
- High order spatial discretization for variational time implicit schemes: Wasserstein gradient flows and reaction-diffusion systems
- Transport information Bregman divergences
- Fisher information regularization schemes for Wasserstein gradient flows
- When optimal transport meets information geometry
- Gradient flow of the stochastic relaxation on a generic exponential family
- Sparse optimization on measures with over-parameterized gradient descent
- Hessian metric via transport information geometry
- Invariance properties of the natural gradient in overparametrised systems
- Efficient Natural Gradient Descent Methods for Large-Scale PDE-Based Optimization Problems
This page was built for publication: Natural gradient via optimal transport
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1713655)