Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning

From MaRDI portal

Publication:3116659

DOI: 10.1287/mnsc.45.4.560, zbMath: 1231.90225, OpenAlex: W2075754841, MaRDI QID: Q3116659

Abhijit Gosavi, Sridhar Mahadevan, Tapas K. Das, Nicholas Marchalleck

Publication date: 12 February 2012

Published in: Management Science

Full work available at URL: https://semanticscholar.org/paper/45295aa0fd92ecbad325ab8ba856ad8207ba3310

zbMATH Keywords

reinforcement learning; semi-Markov decision processes (SMDP)

Mathematics Subject Classification ID

Search theory (90B40)

Related Items (15)

On undiscounted semi-Markov decision processes with absorbing states ⋮ New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system ⋮ Reinforcement learning for joint pricing, lead-time and scheduling decisions in make-to-order systems ⋮ The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems ⋮ A heuristically accelerated reinforcement learning method for maintenance policy of an assembly line ⋮ Approximate dynamic programming for capacity allocation in the service industry ⋮ Reinforcement learning for long-run average cost. ⋮ Application of reinforcement learning to the game of Othello ⋮ The explicit form of the rate function for semi-Markov processes and its contractions ⋮ Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning ⋮ A simulation-based approach to study stochastic inventory-planning games ⋮ A performance-centred approach to optimising maintenance of complex systems ⋮ Representation and Timing in Theories of the Dopamine System ⋮ A Neurocomputational Model for Cocaine Addiction ⋮ Look-ahead control of conveyor-serviced production station by using potential-based online policy iteration
