Efficient computation of optimal actions

From MaRDI portal
Publication:3069218


DOI10.1073/pnas.0710743106zbMath1203.68327WikidataQ37249250 ScholiaQ37249250MaRDI QIDQ3069218

Emanuel Todorov

Publication date: 24 January 2011

Published in: Proceedings of the National Academy of Sciences (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1073/pnas.0710743106


90Cxx: Mathematical programming

68W99: Algorithms in computer science


Related Items

Nonlinear stochastic receding horizon control: stability, robustness and Monte Carlo methods for control approximation, Ordinary Differential Equation Methods for Markov Decision Processes and Application to Kullback--Leibler Control Cost, Tensor Decomposition Methods for High-dimensional Hamilton--Jacobi--Bellman Equations, Unnamed Item, A diffusion wavelets-based multiscale framework for inverse optimal control of stochastic systems, SympOCnet: Solving Optimal Control Problems with Applications to High-Dimensional Multiagent Path Planning Problems, Distributed Control of Uncertain Systems Using Superpositions of Linear Operators, Variational approach to rare event simulation using least-squares regression, Linearly Solvable Stochastic Control Lyapunov Functions, A Reward-Maximizing Spiking Neuron as a Bounded Rational Decision Maker, Adaptive path-integral autoencoder: representation learning and planning for dynamical systems, Transition Path Theory for Langevin Dynamics on Manifolds: Optimal Control and Data-Driven Solver, Reward Maximization Through Discrete Active Inference, Optimal control of probabilistic Boolean control networks: A scalable infinite horizon approach, Adaptive importance sampling for control and inference, Bio-inspired feedback-circuit implementation of discrete, free energy optimizing, winner-take-all computations, Bayesian models of brain and behaviour, Optimal control as a graphical model inference problem, Stochastic optimal control via forward and backward stochastic differential equations and importance sampling, Overcoming the curse of dimensionality for some Hamilton-Jacobi partial differential equations via neural network architectures, Precautionary price stickiness, Pursuit of food \textit{versus} pursuit of information in a Markovian perception-action loop model of foraging, Balancing control: a Bayesian interpretation of habitual and goal-directed behavior, On a probabilistic approach to synthesize control policies from example datasets, On some neural network architectures that can represent viscosity solutions of certain high dimensional Hamilton-Jacobi partial differential equations, Bayesian inverse reinforcement learning for collective animal movement, Clustering and the efficient use of cognitive resources, Information projection on Banach spaces with applications to state independent KL-weighted optimal control, Robust policy schemes for differential R\&D games with asymmetric information, Efficient computation of optimal open-loop controls for stochastic systems, Design of biased random walks on a graph with application to collaborative recommendation, Neural network architectures using min-plus algebra for solving certain high-dimensional optimal control problems and Hamilton-Jacobi PDEs, A grid-based tool for optimal performance monitoring of a glycemic regulator, Moving least-squares approximations for linearly-solvable stochastic optimal control problems, A Cost/Speed/Reliability Tradeoff to Erasing, Action selection in growing state spaces: control of network structure growth, Systems of Bounded Rational Agents with Information-Theoretic Constraints, Optimal collision avoidance in swarms of active Brownian particles



Cites Work