scientific article; zbMATH DE number 7415125
From MaRDI portal
Publication:5159474
Recommendations
- A randomized stochastic approximation algorithm for self-learning
- Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence
- Understanding self-paced learning under concave conjugacy theory
- Learning reward machines: a study in partially observable reinforcement learning
- A probabilistic interpretation of the constant gain learning algorithm
- Stochastic dynamics of reinforcement learning
- A probabilistic description of the learning process
- The dynamics of generalized reinforcement learning
- Reinforcement learning with algorithms from probabilistic structure estimation
Cites work
- scientific article; zbMATH DE number 4048925
- scientific article; zbMATH DE number 7014212
- scientific article; zbMATH DE number 7306877
- A theoretical understanding of self-paced learning
- Bringing up robot: fundamental mechanisms for creating a self-motivated, self-organizing architecture
- Bayesian reinforcement learning: a survey
- Computer vision. Models, learning, and inference. Foreword by Andrew Fitzgibbon.
- End-to-end training of deep visuomotor policies
- Introduction to Derivative-Free Optimization
- Nearly unbiased variable selection under minimax concave penalty
- Non-parametric policy search with limited information loss
- Optimization by simulated annealing
- Probabilistic numerics and uncertainty in computations
- Simulating normalizing constants: From importance sampling to bridge sampling to path sampling
- Transfer learning for reinforcement learning domains: a survey
- Using Expectation-Maximization for Reinforcement Learning
Cited in (2)