Policy Optimization Using Semiparametric Models for Dynamic Pricing

From MaRDI portal
Publication:6154014

DOI10.1080/01621459.2022.2128359arXiv2109.06368OpenAlexW3199034510MaRDI QIDQ6154014FDOQ6154014


Authors: Jianqing Fan, Yongyi Guo Edit this on Wikidata


Publication date: 19 March 2024

Published in: Journal of the American Statistical Association (Search for Journal in Brave)

Abstract: In this paper, we study the contextual dynamic pricing problem where the market value of a product is linear in its observed features plus some market noise. Products are sold one at a time, and only a binary response indicating success or failure of a sale is observed. Our model setting is similar to Javanmard and Nazerzadeh [2019] except that we expand the demand curve to a semiparametric model and need to learn dynamically both parametric and nonparametric components. We propose a dynamic statistical learning and decision-making policy that combines semiparametric estimation from a generalized linear model with an unknown link and online decision-making to minimize regret (maximize revenue). Under mild conditions, we show that for a market noise c.d.f. F(cdot) with m-th order derivative (mgeq2), our policy achieves a regret upper bound of ildeOd(Tfrac2m+14m1), where T is time horizon and ildeOd is the order that hides logarithmic terms and the dimensionality of feature d. The upper bound is further reduced to ildeOd(sqrtT) if F is super smooth whose Fourier transform decays exponentially. In terms of dependence on the horizon T, these upper bounds are close to Omega(sqrtT), the lower bound where F belongs to a parametric class. We further generalize these results to the case with dynamically dependent product features under the strong mixing condition.


Full work available at URL: https://arxiv.org/abs/2109.06368







Cites Work


Cited In (1)





This page was built for publication: Policy Optimization Using Semiparametric Models for Dynamic Pricing

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6154014)