Reinforcement learning with algorithms from probabilistic structure estimation
DOI: 10.1016/J.AUTOMATICA.2022.110483; OpenAlex: W3136563486; Wikidata: Q114204749; Scholia: Q114204749; MaRDI QID: Q2165986; FDO: Q2165986
Authors: Jonathan P. Epperlein, Roman Overko, Djallel Bouneffouf, Andrew Cullen, Sergiy M. Zhuk, Christopher King, Robert N. Shorten
Publication date: 23 August 2022
Published in: Automatica
Full work available at URL: https://arxiv.org/abs/2103.08241
Recommendations
- Probabilistic inference for determining options in reinforcement learning
- Reinforcement learning
- k-Certainty Exploration Method: an action selector to identify the environment in reinforcement learning
- A model for system uncertainty in reinforcement learning
- scientific article; zbMATH DE number 1804129
machine learning, Markov decision process, reinforcement learning, decision support system, statistical testing
Cites Work
- Title not available
- The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses
- \({\mathcal Q}\)-learning
- Non-negative matrices and Markov chains.
- Ergodicity coefficients defined by vector norms
- Reinforcement learning. An introduction
- Adaptive control using multiple models
- Title not available
- Title not available
- Decision making under uncertainty. Theory and application. With contributions from Christopher Amato, Girish Chowdhary, Jonathan P. How, Hayley J. Davison Reynolds, Jason R. Thornton, Pedro A. Torres-Carrasquillo, N. Kemal Üre and John Vian
Cited In (7)
- Title not available
- Reinforcement learning, sequential Monte Carlo and the EM algorithm
- Probability matching and reinforcement learning
- Learning structured data from unspecific reinforcement
- Learning reward machines: a study in partially observable reinforcement learning
- Algorithm of stable state spaces in reinforcement learning
- Reinforcement Learning State Estimator