Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management (Q5095166)
From MaRDI portal
scientific article; zbMATH DE number 7568773
Language | Label | Description | Also known as |
---|---|---|---|
English | Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management | scientific article; zbMATH DE number 7568773 | |
Statements
Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management (English)
5 August 2022
inventory control problem
censored demand
reinforcement learning
online convex optimization
regret bounds