Trading Value and Information in MDPs
From MaRDI portal
Publication:3112566
DOI10.1007/978-3-642-24647-0_3zbMath1229.91106OpenAlexW48257500MaRDI QIDQ3112566
Naftali Tishby, Ohad Shamir, Jonathan D. Rubin
Publication date: 11 January 2012
Published in: Intelligent Systems Reference Library (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/978-3-642-24647-0_3
Decision theory (91B06) Markov and semi-Markov decision processes (90C40) General considerations in statistical decision theory (62C05)
Related Items (4)
Design of biased random walks on a graph with application to collaborative recommendation ⋮ A Reward-Maximizing Spiking Neuron as a Bounded Rational Decision Maker ⋮ Sparse randomized shortest paths routing with Tsallis divergence regularization ⋮ Bounded rationality in learning, perception, decision-making, and stochastic games
This page was built for publication: Trading Value and Information in MDPs