Reinforcement learning
From MaRDI portal
Publication:6602227
Recommendations
Cites Work
- scientific article; zbMATH DE number 5957269 (Why is no real title available?)
- scientific article; zbMATH DE number 5547890 (Why is no real title available?)
- scientific article; zbMATH DE number 1753152 (Why is no real title available?)
- scientific article; zbMATH DE number 1753153 (Why is no real title available?)
- scientific article; zbMATH DE number 2243395 (Why is no real title available?)
- 10.1162/1532443041827907
- A generalized Kalman filter for fixed point approximation and efficient temporal-difference learning
- A survey of multi-objective sequential decision-making
- A tutorial on the cross-entropy method
- Bayesian reinforcement learning: a survey
- Coherent measures of risk
- Functional Approximations and Dynamic Programming
- Innovations in multi-agent systems and application -- 1.
- Interactive policy learning through confidence-based autonomy
- Inverse reinforcement learning in partially observable environments
- Kalman temporal differences
- Linear least-squares algorithms for temporal difference learning
- Natural evolution strategies
- Neuroevolution strategies for episodic reinforcement learning
- Polynomial Approximation--A New Computational Technique in Dynamic Programming: Allocation Processes
- Preference-based reinforcement learning: a formal framework and a policy iteration algorithm
- Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm
- Risk measurement with equivalent utility principles
- Risk-Constrained Markov Decision Processes
- Risk-sensitive reinforcement learning
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Stochastic dynamic programming with factored representations
- The Linear Programming Approach to Approximate Dynamic Programming
- The \(K\)-armed dueling bandits problem
- The logic of adaptive behavior. Knowledge representation and algorithms for adaptive sequential decision making under uncertainty in first-order and relational domains.
- Training parsers by inverse reinforcement learning
- Transfer learning for reinforcement learning domains: a survey
- Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques
Cited In (3)
This page was built for publication: Reinforcement learning
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6602227)