A least squares temporal difference actor–critic algorithm with applications to warehouse management
From MaRDI portal
Publication:3120552
DOI: 10.1002/nav.21481
zbMath: 1407.90334
OpenAlex: W1964782533
MaRDI QID: Q3120552
Ioannis Ch. Paschalidis, Reza Moazzez Estanjini, Keyong Li
Publication date: 5 March 2019
Published in: Naval Research Logistics (NRL)
Full work available at URL: https://doi.org/10.1002/nav.21481
Keywords: Markov decision processes, vehicle routing, partial observability, actor-critic algorithms, approximate dynamic programming, warehouse management
Related Items (2)
Neural circuits for learning context-dependent associations of stimuli ⋮ Performance optimization for a class of generalized stochastic Petri nets