The set-indexed bandit problem. (Q2574504): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
ReferenceBot (talk | contribs)
Changed an Item
 
(One intermediate revision by one other user not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / cites work
 
Property / cites work: On different topologies for set-indexing collections / rank
 
Normal rank
Property / cites work
 
Property / cites work: Compactness of stopping times / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5522002 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3940289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal Stopping of Two-Parameter Processes on Nonstandard Probability Spaces / rank
 
Normal rank
Property / cites work
 
Property / cites work: Randomization in the two-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4692329 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The optional sampling theorem for processes indexed by a partially ordered set / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4264743 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Random censoring in set-indexed survival analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi-armed bandits in discrete and continuous time / rank
 
Normal rank
Property / cites work
 
Property / cites work: The optional sampling theorem for martingales indexed by directed sets / rank
 
Normal rank
Property / cites work
 
Property / cites work: Continuous multi-armed bandits and multiparameter processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3798562 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic control of two-parameter processes application:the two-armed bandit problem / rank
 
Normal rank

Latest revision as of 13:28, 11 June 2024

scientific article
Language Label Description Also known as
English
The set-indexed bandit problem.
scientific article

    Statements

    The set-indexed bandit problem. (English)
    0 references
    0 references
    0 references
    29 November 2005
    0 references
    This paper proves the existence of an optimal solution to the bandit problem formulated in the framework of set-indexed processes, i.e., the existence of a set-valued increasing path \(\xi \) maximizing the functional \(E(\int _{[0,1]}X_\xi \,\text dV_t)\) with appropriate processes \(X\) and \(V\). The basic idea of the proof is to fuzzify set-valued increasing paths, and then to prove for a fuzzy extension of the above functional that an optimal solution exists and is assumed for fuzzy increasing paths that correspond to the usual (crisp) increasing paths. The concluding part of the proof has been inspired by the proof of an analogical result for continuous-time processes, given by \textit{R. C. Dalang} [Ann. Probab. 18, 218--225 (1990; Zbl 0713.60054)], but the extension to set-indexed processes requires three preparatory steps. First, fuzzy stopping sets and fuzzy increasing paths are defined, and their basic properties are established. Those concepts rely on the usual relationship between random and fuzzy sets, although the formal definition of fuzzy sets in the paper differs from what is common in the fuzzy set theory. Second, the integration on collections of compact connected subsets of compact complete metric spaces is defined and used to extend the considered functional to fuzzy increasing paths. Finally, an extension of the Baxter-Chacon topology is introduced [\textit{J.\ R.\ Baxter} and \textit{R.\ V.\ Chacon}, Z.\ Wahrscheinlichkeitstheorie Verw.\ Geb.\ 40, 169--181 (1977; Zbl 0349.60048)], and the compactness of fuzzy stopping sets and fuzzy increasing paths in that topology is shown. The result presented in the paper is in a sense a continuation of deep results concerning set-indexed processed, many of which have been collected in a recent monograph by \textit{G.\ Ivanoff} and \textit{E.\ Merzbach} [``Set-indexed martingales'' (2000; Zbl 0948.60039)]. Differently to that monograph, the paper is very difficult to read in detail because it contains numerous sources of unintentional confusion (concepts used in a more general setting than in which they were defined, a statement proved in a lemma but forgotten in its formulation, various misprints in notation, and an undiscoverable misprint in a key proposition concerning integration on set collections). Nevertheless, it is well structured and all important steps of the proof are clearly conveyed.
    0 references
    0 references
    set-indexed processes
    0 references
    stochastic control
    0 references
    randomization
    0 references