Average cost temporal-difference learning


Publication:1805802


DOI: 10.1016/S0005-1098(99)00099-0; zbMath: 0932.93085; MaRDI QID: Q1805802

John N. Tsitsiklis, Benjamin van Roy

Publication date: 28 February 2000

Published in: Automatica


49L20: Dynamic programming in optimal control and differential games

90C39: Dynamic programming

93E20: Optimal stochastic control

93E35: Stochastic learning and adaptive control

