On the existence of fixed points for approximate value iteration and temporal-difference learning

From MaRDI portal
Publication:1586803