Q5149240 (Q5149240): Difference between revisions

@@ Property / cites work @@
+Linear Thompson sampling revisited
@@ Property / cites work: Linear Thompson sampling revisited / rank @@
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+.1162/153244303765208377
@@ Property / cites work: 10.1162/153244303765208377 / rank @@
+Normal rank
@@ Property / cites work @@
+Finite state Markovian decision processes
@@ Property / cites work: Finite state Markovian decision processes / rank @@
+Normal rank
@@ Property / cites work @@
+Compactification methods in the control of degenerate diffusions: existence of an optimal control
+Normal rank
@@ Property / cites work @@
+On stochastic relaxed control for partially observed diffusions
+Normal rank
@@ Property / cites work @@
+Q4057976
@@ Property / cites work: Q4057976 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4002114
@@ Property / cites work: Q4002114 / rank @@
+Normal rank
@@ Property / cites work @@
+Multi-armed bandits in discrete and continuous time
+Normal rank
@@ Property / cites work @@
+Existence of Markov Controls and Characterization of Optimal Markov Controls
+Normal rank
@@ Property / cites work @@
+Stationary solutions and forward equations for controlled and singular martingale problems
+Normal rank
@@ Property / cites work @@
+Q2810828
@@ Property / cites work: Q2810828 / rank @@
+Normal rank
@@ Property / cites work @@
+Iterative linearization methods for approximately optimal control and estimation of non-linear stochastic system
+Normal rank
@@ Property / cites work @@
+Continuous multi-armed bandits and multiparameter processes
+Normal rank
@@ Property / cites work @@
+Q5214215
@@ Property / cites work: Q5214215 / rank @@
+Normal rank
@@ Property / cites work @@
+Learning to Optimize via Posterior Sampling
@@ Property / cites work: Learning to Optimize via Posterior Sampling / rank @@
+Normal rank
@@ Property / cites work @@
+An analysis of model-based interval estimation for Markov decision processes
+Normal rank
@@ Property / cites work @@
+Q2880979
@@ Property / cites work: Q2880979 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4626283
@@ Property / cites work: Q4626283 / rank @@
+Normal rank
@@ Property / cites work @@
+Continuous‐time mean–variance portfolio selection: A reinforcement learning framework
+Normal rank
@@ Property / cites work @@
+Q4255599
@@ Property / cites work: Q4255599 / rank @@
+Normal rank
@@ Property / cites work @@
+On the Existence of Optimal Relaxed Controls of Stochastic Partial Differential Equations
+Normal rank