Publication:3093299
From MaRDI portal
zbMath1222.68406arXiv1301.0600MaRDI QIDQ3093299
David Heckerman, Guy Shani, Ronen I. Brafman
Publication date: 12 October 2011
Full work available at URL: https://arxiv.org/abs/1301.0600
68T05: Learning and adaptive systems in artificial intelligence
68U35: Computing methodologies for information systems (hypertext navigation, interfaces, decision support, etc.)
Related Items
Risk-Constrained Reinforcement Learning with Percentile Risk Criteria, Multilevel Preconditioners for Temporal-Difference Learning Methods Related to Recommendation Engines, Learning over No-Preferred and Preferred Sequence of Items for Robust Recommendation, The skyline algorithm for POMDP value function pruning, Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems, On the equivalence of optimal recommendation sets and myopically optimal query sets, Sequential event prediction, Topic model for analyzing purchase data with price information