Model-based average reward reinforcement learning
From MaRDI portal
Publication:1128769
DOI10.1016/S0004-3702(98)00002-2zbMATH Open0906.68122OpenAlexW2028357975WikidataQ126645478 ScholiaQ126645478MaRDI QIDQ1128769FDOQ1128769
Authors: Prasad Tadepalli, DoKyeong Ok
Publication date: 13 August 1998
Published in: Artificial Intelligence (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/s0004-3702(98)00002-2
Recommendations
- scientific article; zbMATH DE number 5957504
- Average reward reinforcement learning: foundations, algorithms, and empirical results
- Model-free average reward multi-step reinforcement learning
- scientific article; zbMATH DE number 1501821
- Solving semi-Markov decision problems using average reward reinforcement learning
machine learningBayesian networkslinear regressionaverage rewardreinforcement learningexplorationmodel-basedAGV scheduling
Cites Work
- Title not available (Why is that?)
- Title not available (Why is that?)
- Title not available (Why is that?)
- \({\mathcal Q}\)-learning
- Title not available (Why is that?)
- Title not available (Why is that?)
- A New Value Iteration method for the Average Cost Dynamic Programming Problem
- Average reward reinforcement learning: foundations, algorithms, and empirical results
- Distributed dynamic programming
- The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms
- Elevator group control using multiple reinforcement learning agents
- Practical issues in temporal difference learning
- Computationally efficient algorithms for on-line optimization of Markov decision processes
- Model-based average reward reinforcement learning
Cited In (13)
- Transfer in variable-reward hierarchical reinforcement learning
- Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning
- Model-based average reward reinforcement learning
- Reward machines: exploiting reward function structure in reinforcement learning
- Title not available (Why is that?)
- Title not available (Why is that?)
- Average cost temporal-difference learning
- AI 2003: Advances in Artificial Intelligence
- Title not available (Why is that?)
- Title not available (Why is that?)
- Integrating a partial model into model free reinforcement learning
- Model-free average reward multi-step reinforcement learning
- Average reward reinforcement learning: foundations, algorithms, and empirical results
This page was built for publication: Model-based average reward reinforcement learning
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1128769)