A performance gradient perspective on gradient‐based policy iteration and a modified value iteration
From MaRDI portal
Publication:3613729
DOI10.1108/17563780810919096zbMath1155.90473OpenAlexW2051169325MaRDI QIDQ3613729
James Dankert, Lei Yang, Jennie Si
Publication date: 12 March 2009
Published in: International Journal of Intelligent Computing and Cybernetics (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1108/17563780810919096
Cites Work