A unified approach to adaptive control of average reward Markov decision processes

From MaRDI portal
Publication:1095048