Online Markov Decision Processes Under Bandit Feedback (Q2983230)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Online Markov Decision Processes Under Bandit Feedback
scientific article

    Statements

    Online Markov Decision Processes Under Bandit Feedback (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    16 May 2017
    0 references

    Identifiers