Diffusion maps, spectral clustering and reaction coordinates of dynamical systems
From MaRDI portal
Publication:2497982
Abstract: A central problem in data analysis is the low dimensional representation of high dimensional data, and the concise description of its underlying geometry and density. In the analysis of large scale simulations of complex dynamical systems, where the notion of time evolution comes into play, important problems are the identification of slow variables and dynamically meaningful reaction coordinates that capture the long time evolution of the system. In this paper we provide a unifying view of these apparently different tasks, by considering a family of {em diffusion maps}, defined as the embedding of complex (high dimensional) data onto a low dimensional Euclidian space, via the eigenvectors of suitably defined random walks defined on the given datasets. Assuming that the data is randomly sampled from an underlying general probability distribution , we show that as the number of samples goes to infinity, the eigenvectors of each diffusion map converge to the eigenfunctions of a corresponding differential operator defined on the support of the probability distribution. Different normalizations of the Markov chain on the graph lead to different limiting differential operators. One normalization gives the Fokker-Planck operators with the same potential U(x), best suited for the study of stochastic differential equations as well as for clustering. Another normalization gives the Laplace-Beltrami (heat) operator on the manifold in which the data resides, best suited for the analysis of the geometry of the dataset, regardless of its possibly non-uniform density.
Recommendations
Cites work
- scientific article; zbMATH DE number 3686457 (Why is no real title available?)
- scientific article; zbMATH DE number 964896 (Why is no real title available?)
- Diffusion Maps, Reduction Coordinates, and Low Dimensional Representation of Stochastic Systems
- Diffusion maps
- Equation-free, coarse-grained multiscale computation: enabling microscopic simulators to perform system-level analysis
- Extracting macroscopic dynamics: model problems and algorithms
- Extracting macroscopic stochastic dynamics: Model problems
- Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps
- Handbook of stochastic methods for physics, chemistry and the natural sciences.
- Laplacian Eigenmaps for Dimensionality Reduction and Data Representation
- Learning Theory
- Learning Theory
- Machine Learning: ECML 2004
- The Fokker-Planck equation. Methods of solution and applications.
- The elements of statistical learning. Data mining, inference, and prediction
Cited in
(97)- Point cloud discretization of Fokker-Planck operators for committor functions
- Modern Koopman theory for dynamical systems
- Doubly stochastic normalization of the Gaussian kernel is robust to heteroskedastic noise
- Fuzzy diffusion distance learning for cartoon similarity estimation
- Local kernels and the geometric structure of data
- Equation-free model reduction in agent-based computations: coarse-grained bifurcation and variable-free rare event analysis
- Data-driven efficient solvers for Langevin dynamics on manifold in high dimensions
- Equation free projective integration: a multiscale method applied to a plasma ion acoustic wave
- Physics-agnostic and physics-infused machine learning for thin films flows: modelling, and predictions from small data
- Computational coarse graining of a randomly forced one-dimensional Burgers equation
- Time-series forecasting using manifold learning, radial basis function interpolation, and geometric harmonics
- Diffusion-based kernel methods on Euclidean metric measure spaces
- Variable bandwidth diffusion kernels
- Diffusion maps
- Non-linear independent component analysis with diffusion maps
- Parsimonious representation of nonlinear dynamical systems through manifold learning: a chemotaxis case study
- Transition manifolds of complex metastable systems. Theory and data-driven computation of effective dynamics
- Nonlinear Laplacian spectral analysis for time series with intermittency and low-frequency variability
- Patch-to-tensor embedding
- Local and global perspectives on diffusion maps in the analysis of molecular systems
- Orientability and diffusion maps
- ATLAS: a geometric approach to learning high-dimensional stochastic systems near manifolds
- Tipping points of evolving epidemiological networks: machine learning-assisted, data-driven effective modeling
- Understanding the geometry of transport: diffusion maps for Lagrangian trajectory data unravel coherent sets
- Data-driven model reduction and transfer operator approximation
- Towards effective dynamics in complex systems by Markov kernel approximation
- Dynamics-adapted cone kernels
- An equivalence between the limit smoothness and the rate of convergence for a general contraction operator family
- Go with the flow, on Jupiter and snow. Coherence from model-free video data without trajectories
- Measure-based diffusion grid construction and high-dimensional data discretization
- Towards hybrid system modeling of uncertain complex dynamical systems
- Continuous-time random walks for the numerical solution of stochastic differential equations
- An efficient tree-based computation of a metric comparable to a natural diffusion distance
- Diffusion representation for asymmetric kernels
- Coarse analysis of collective motion with different communication mechanisms
- An experimental investigation of kernels on graphs for collaborative recommendation and semisupervised classification
- Recovering hidden components in multimodal data with composite diffusion operators
- Diffusion maps tailored to arbitrary non-degenerate Itô processes
- A geometrical method for low-dimensional representations of simulations
- Numerical bifurcation analysis of PDEs from lattice Boltzmann model simulations: a parsimonious machine learning approach
- Learning Binary Hash Codes for Large-Scale Image Search
- Robust Inference of Manifold Density and Geometry by Doubly Stochastic Scaling
- Time-Inhomogeneous Diffusion Geometry and Topology
- Kernel Analog Forecasting: Multiscale Test Problems
- A bag-of-paths framework for network data analysis
- Hearing the clusters of a graph: A distributed algorithm
- Towards mesoscopic ergodic theory
- Introducing User-Prescribed Constraints in Markov Chains for Nonlinear Dimensionality Reduction
- Grassmannian diffusion maps based surrogate modeling via geometric harmonics
- Functional diffusion maps
- Imaging geometry through dynamics: the observable representation
- A spectral notion of Gromov-Wasserstein distance and related methods
- Scalable extended dynamic mode decomposition using random kernel approximation
- Probing multipartite entanglement, coherence and quantum information preservation under classical Ornstein-Uhlenbeck noise
- A data-driven approximation of the koopman operator: extending dynamic mode decomposition
- Landmark diffusion maps (L-dMaps): accelerated manifold learning out-of-sample extension
- Learning the geometry of common latent variables using alternating-diffusion
- Understanding Graph Neural Networks with Generalized Geometric Scattering Transforms
- Some extensions of E. Stein's work on Littlewood-Paley theory applied to symmetric diffusion semigroups
- Extracting Sparse High-Dimensional Dynamics from Limited Data
- An equation-free approach to analyzing heterogeneous cell population dynamics
- Gaussian Process Landmarking for Three-Dimensional Geometric Morphometrics
- An equation-free approach to coarse-graining the dynamics of networks
- Fuzzy spectral clustering by PCCA+: application to Markov state models and data classification
- Thomas Bayes' walk on manifolds
- Data-driven prediction of multistable systems from sparse measurements
- Poincaré maps for multiscale physics discovery and nonlinear Floquet theory
- Transient anisotropic kernel for probabilistic learning on manifolds
- Diffusion Maps, Reduction Coordinates, and Low Dimensional Representation of Stochastic Systems
- Mathematics of smoothed particle hydrodynamics: a study via nonlocal Stokes equations
- Some remarks on diffusion distances
- Coarse analysis of collective behaviors: bifurcation analysis of the optimal velocity model for traffic jam formation
- Latent common manifold learning with alternating diffusion: analysis and applications
- Metric-based upscaling
- An equation-free computational approach for extracting population-level behavior from individual-based models of biological dispersal
- A framework for self-evolving computational material models inspired by deep learning
- Diffusion state distances: multitemporal analysis, fast algorithms, and applications to biological networks
- Parameter rating by diffusion gradient
- Data-driven control of agent-based models: an equation/variable-free machine learning approach
- Clustering Dynamics on Graphs: From Spectral Clustering to Mean Shift Through Fokker–Planck Interpolation
- A multiscale environment for learning by diffusion
- Dimensionality reduction of complex metastable systems via kernel embeddings of transition manifolds
- Reduction methods in climate dynamics -- a brief review
- Geometric fluid approximation for general continuous-time Markov chains
- Coarse-grained computation for particle coagulation and sintering processes by linking quadrature method of moments with Monte-Carlo
- A tailored convolutional neural network for nonlinear manifold learning of computational physics data using unstructured spatial discretizations
- Physics-constrained, data-driven discovery of coarse-grained dynamics
- Dimensionality reduction: an interpretation from manifold regularization perspective
- Manifold learning for the emulation of spatial fields from computational models
- A stochastic multiscale model for electricity generation capacity expansion
- Manifold learning for organizing unstructured sets of process observations
- Nonlinear Laplacian spectral analysis: capturing intermittent and low‐frequency spatiotemporal patterns in high‐dimensional data
- Geometric scattering on measure spaces
- Nonparametric uncertainty quantification for stochastic gradient flows
- Maximally predictive states: from partial observations to long timescales
- Data clustering based on Langevin annealing with a self-consistent potential
- Data-Driven Discovery of Governing Equations for Coarse-Grained Heterogeneous Network Dynamics
This page was built for publication: Diffusion maps, spectral clustering and reaction coordinates of dynamical systems
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2497982)