Algorithms for reinforcement learning.
From MaRDI portal
Publication:3588852
planning, simulation, stochastic approximation, least-squares methods, Markov decision processes, online learning, reinforcement learning, function approximation, Q-learning, PAC-learning, active learning, natural gradient, bias-variance tradeoff, overfitting, stochastic gradient methods, policy gradient, temporal difference learning, actor-critic methods
Cited in (55)
- Undiscounted reinforcement learning algorithm based on performance potentials
- Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model
- A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic
- Efficient augmentation and relaxation learning for individualized treatment rules using observational data
- Investigating the properties of neural network representations in reinforcement learning
- On learning and branching: a survey
- Model selection in reinforcement learning
- Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm
- Adaptive representations for reinforcement learning.
- Closed-form Approximations in Multi-asset Market Making
- Statistical reinforcement learning. Modern machine learning approaches
- Dynamic treatment regimes: technical challenges and applications
- Convergence of entropy-regularized natural policy gradient with linear function approximation
- Bayesian exploration for approximate dynamic programming
- Markov decision processes with sequential sensor measurements
- Adaptive playouts for online learning of policies during Monte Carlo tree search
- Continuous-action planning for discounted infinite-horizon nonlinear optimal control with Lipschitz values
- Editorial: some recent advances in learning and adaptation for uncertain feedback control systems
- Asymptotic analysis of value prediction by well-specified and misspecified models
- Optimal activation of halting multi‐armed bandit models
- A Reinforcement Learning Neural Network for Robotic Manipulator Control
- Robust adaptive dynamic programming for linear and nonlinear systems: an overview
- Non-parametric policy search with limited information loss
- Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains
- A systematic study on meta-heuristic approaches for solving the graph coloring problem
- Structure in machine learning
- On convergence of value iteration for a class of total cost Markov decision processes
- scientific article; zbMATH DE number 6982305
- A convex optimization approach to dynamic programming in continuous state and action spaces
- TEXPLORE: temporal difference reinforcement learning for robots and time-constrained domains
- Least squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storage
- Efficient model-based reinforcement learning for approximate online optimal control
- Empirical \(Q\)-value iteration
- Reinforcement learning. An introduction
- A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning
- Approximate Q Learning for Controlled Diffusion Processes and Its Near Optimality
- Reinforcement learning agents
- scientific article; zbMATH DE number 1950579
- scientific article; zbMATH DE number 7626721
- Deep reinforcement trading with predictable returns
- Crowd computing as a cooperation problem: An evolutionary approach
- Formalization of methods for the development of autonomous artificial intelligence systems
- scientific article; zbMATH DE number 836011
- Online spatio-temporal matching in stochastic and dynamic domains
- Hypervolume indicator and dominance reward based multi-objective Monte-Carlo tree search
- Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
- Modern Bayesian experimental design
- Deep exploration via randomized value functions
- Finite-time performance of distributed temporal-difference learning with linear function approximation
- Reinforcement learning algorithms with function approximation: recent advances and applications
- Reinforcement learning theory, algorithms and its application
- Proximal algorithms and temporal difference methods for solving fixed point problems
- Fundamental design principles for reinforcement learning algorithms
- A unified framework for stochastic optimization
- Decision making under uncertainty and reinforcement learning. Theory and algorithms