Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management (Q5095166)

From MaRDI portal
scientific article; zbMATH DE number 7568773
Language Label Description Also known as
English
Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management
scientific article; zbMATH DE number 7568773

    Statements

    Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management (English)
    0 references
    0 references
    0 references
    5 August 2022
    0 references
    inventory control problem
    0 references
    censored demand
    0 references
    reinforcement learning
    0 references
    online convex optimization
    0 references
    regret bounds
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references