Reinforcement learning with immediate rewards and linear hypotheses

From MaRDI portal
Publication:1762980