The factored policy-gradient planner
From MaRDI portal
Publication:835832
DOI10.1016/j.artint.2008.11.008zbMath1192.68636OpenAlexW2172261094WikidataQ113442975 ScholiaQ113442975MaRDI QIDQ835832
Olivier Buffet, Douglas Aberdeen
Publication date: 31 August 2009
Published in: Artificial Intelligence (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.artint.2008.11.008
Related Items
Probabilistic planning with clear preferences on missing information, The factored policy-gradient planner, Practical solution techniques for first-order MDPs, Real-time dynamic programming for Markov decision processes with imprecise probabilities, Discovering hidden structure in factored MDPs
Uses Software
Cites Work
- The factored policy-gradient planner
- Practical solution techniques for first-order MDPs
- Natural actor-critic algorithms
- Fast planning through planning graph analysis
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- The Complexity of Decentralized Control of Markov Decision Processes
- LAO*: A heuristic search algorithm that finds solutions with loops
- Relational reinforcement learning
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item