Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management

From MaRDI portal
Publication:5095166

DOI10.1287/opre.2022.2263zbMath1494.90004arXiv1905.04337OpenAlexW2944461362MaRDI QIDQ5095166

Randy Jia, Shipra Agrawal

Publication date: 5 August 2022

Published in: Operations Research (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1905.04337




Related Items (1)



Cites Work


This page was built for publication: Learning in Structured MDPs with Convex Cost Functions: Improved Regret Bounds for Inventory Management