Trading Value and Information in MDPs

From MaRDI portal

Publication:3112566


DOI: 10.1007/978-3-642-24647-0_3 ⋮ zbMath: 1229.91106 ⋮ OpenAlex: W48257500 ⋮ MaRDI QID: Q3112566

Naftali Tishby, Ohad Shamir, Jonathan D. Rubin

Publication date: 11 January 2012

Published in: Intelligent Systems Reference Library

Full work available at URL: https://doi.org/10.1007/978-3-642-24647-0_3

Mathematics Subject Classification ID

Decision theory (91B06) ⋮ Markov and semi-Markov decision processes (90C40) ⋮ General considerations in statistical decision theory (62C05)

Related Items (4)

Design of biased random walks on a graph with application to collaborative recommendation ⋮ A Reward-Maximizing Spiking Neuron as a Bounded Rational Decision Maker ⋮ Sparse randomized shortest paths routing with Tsallis divergence regularization ⋮ Bounded rationality in learning, perception, decision-making, and stochastic games
