Indices for families of competing Markov decision processes with influence (Q1313068)

scientific article

Language	Label	Description	Also known as
English	Indices for families of competing Markov decision processes with influence	scientific article

Statements

instance of

scholarly article

0 references

title

Indices for families of competing Markov decision processes with influence (English)

0 references

published in

The Annals of Applied Probability

0 references

publication date

14 September 1994

0 references

review text

In 1980 Whittle (a) explained Gittins indices for multiarmed bandit processes (MBP) in terms of equivalent retirement rewards; and (b) gave conditions under which there is an optimal policy of index type for MBP, where each arm has its own decision apparatus. In the same year Nash proved indexation for a further class of MBP in which the current states of all of the arms can influence the reward earned from the active one. In the present paper the author develops analogues of (a) and (b) for Nash's model. He then illustrates the theory by examining stoppable MBP, where each arm has two actions: continue and stop.

0 references

zbMATH Keywords

Gittins indices

0 references

multiarmed bandit processes

0 references

author

Kevin D. Glazebrook

0 references

MaRDI profile type

MaRDI publication profile

0 references