Learning Representation and Control in Markov Decision Processes: New Frontiers

From MaRDI portal

Publication:3580907

Jump to:navigation, search

DOI10.1561/2200000003MaRDI QIDQ3580907zbMATH OpenOpenAlexFDO

Authors Sridhar Mahadevan

Publication date 14 August 2010

Published in Foundations and Trends® in Machine Learning (Search for Journal in Brave)

Full work available at URL https://semanticscholar.org/paper/481dc549e4fc8e0895fe98080dcd6fba138f7136

zbMATH Keywords

Markov decision processes Drazin inverse generic algorithm generalized spectral inverse of Laplacian machine learning framework representation policy iteration solution of sequential decision problems

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40) Hierarchical systems (93A13) Stochastic learning and adaptive control (93E35)

Recommendations

Cited in

(8)

This page was built for publication: Learning Representation and Control in Markov Decision Processes: New Frontiers

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3580907)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3580907&oldid=16986413"