Zero-sum Games for Discrete-time Multi-armed Bandit Processes with a Generalized Discount (Q4024144): Difference between revisions

@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1080/02522667.1992.10699109
+Normal rank
@@ Property / OpenAlex ID @@
+W2079206036
@@ Property / OpenAlex ID: W2079206036 / rank @@
+Normal rank
@@ Property / cites work @@
+Evaluating strategies for generalized bandit problems
+Normal rank
@@ Property / cites work @@
+Q4692329
@@ Property / cites work: Q4692329 / rank @@
+Normal rank
@@ Property / cites work @@
+Markov strategies for optimal control problems indexed by a partially ordered set
+Normal rank
@@ Property / cites work @@
+Discrete multiarmed bandits and multiparameter processes
+Normal rank
@@ Property / cites work @@
+Optimal stopping and supermartingales over partially ordered sets
+Normal rank