Reading policies for joins: an asymptotic analysis
From MaRDI portal
Publication:2467117
Abstract: Suppose that observations are made from the distribution and from the distribution . Associate with each pair, from and from , a nonnegative score . An optimal reading policy is one that yields a sequence that maximizes , the expected sum of the observed scores, uniformly in . The alternating policy, which switches between the two sources, is the optimal nonadaptive policy. In contrast, the greedy policy, which chooses its source to maximize the expected gain on the next step, is shown to be the optimal policy. Asymptotics are provided for the case where the and distributions are discrete and according as or not (i.e., the observations match). Specifically, an invariance result is proved which guarantees that for a wide class of policies, including the alternating and the greedy, the variable M(n) obeys the same CLT and LIL. A more delicate analysis of the sequence and the sample paths of M(n), for both alternating and greedy, reveals the slender sense in which the latter policy is asymptotically superior to the former, as well as a sense of equivalence of the two and robustness of the former.
Recommendations
Cites work
- scientific article; zbMATH DE number 3885030 (Why is no real title available?)
- scientific article; zbMATH DE number 3906232 (Why is no real title available?)
- scientific article; zbMATH DE number 3720710 (Why is no real title available?)
- scientific article; zbMATH DE number 3723610 (Why is no real title available?)
- scientific article; zbMATH DE number 3502497 (Why is no real title available?)
- scientific article; zbMATH DE number 700091 (Why is no real title available?)
- scientific article; zbMATH DE number 194374 (Why is no real title available?)
- Approximation Theorems of Mathematical Statistics
- Complete convergence of triangular arrays and the law of the iterated logarithm for U-statistics
- Dynamic productivity improvement in a model with multiple processes
- Extensions of the multiarmed bandit problem: The discounted case
- On the Gittins index for multiarmed bandits
- On the convergence of the empirical mass function
- Optimal policies to obtain the most join results
- Probability. Theory and examples.
- Sample path optimality for a Markov optimization problem
- Some results on increments of the Wiener process with applications to lag sums of i.i.d. random variables
- Tauberian Theory
- Upper and lower functions for martingales and mixing processes
This page was built for publication: Reading policies for joins: an asymptotic analysis
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2467117)