Nonstationary Bandits with Habituation and Recovery Dynamics

DOI: 10.1287/opre.2019.1918; zbMath: 1455.90095; arXiv: 1707.08423; OpenAlex: W3040804160; MaRDI QID: Q5144777

Yoshimi Fukuoka, Elena Flowers, Anil Aswani, Yonatan Mintz, Philip M. Kaminsky

Publication date: 19 January 2021

Published in: Operations Research

Full work available at URL: https://arxiv.org/abs/1707.08423

zbMATH Keywords

multiarmed bandits; personalized healthcare; adherence

Mathematics Subject Classification ID

Management decision making, including multiple objectives (90B50)

Uses Software

Gurobi

Cites Work

Generalized linear models
Asymptotically efficient adaptive allocation rules
Statistics with set-valued functions: applications to inverse approximate optimization
Behavioral modeling in weight loss interventions
The Complexity of Optimal Queuing Network Control
Non-Stationary Stochastic Optimization
The Knowledge Gradient Algorithm for a General Class of Online Learning Problems
A Dynamic Near-Optimal Algorithm for Online Linear Programming
Statistical Learning of Service-Dependent Demand in a Multiperiod Newsvendor Setting
Approximation algorithms for restless bandit problems
Information Collection on a Graph
Dynamic Assortment with Demand Learning for Seasonal Consumer Goods
The Post-Disaster Debris Clearance Problem Under Incomplete Information
A Learning Approach for Interactive Marketing to a Customer Segment
Revenue Management for a Primary-Care Clinic in the Presence of Patient Choice
Multistate Bayesian Control Chart Over a Finite Horizon
The Optimal Search for a Moving Target When the Search Path Is Constrained
Repeated Principal-Agent Games with Discounting
Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards
State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms
Asymptotic Behavior of Optimal Solutions in Stochastic Programming
Restless Bandits, Linear Programming Relaxations, and a Primal-Dual Index Heuristic
Distributed Learning in Multi-Armed Bandit With Multiple Players
High-Dimensional Statistics
The Nonstochastic Multiarmed Bandit Problem
DOI: 10.1162/153244303321897690
OR Forum—A POMDP Approach to Personalize Mammography Screening Decisions
Optimal M-Switch Surveillance Policies for Liver Cancer in a Hepatitis C–Infected Population
Inverse Optimization with Noisy Data
The Big Data Newsvendor: Practical Insights from Machine Learning
Online Decision Making with High-Dimensional Covariates
Sequential Bayes-Optimal Policies for Multiple Comparisons with a Known Standard
Improving Health Outcomes Through Better Capacity Allocation in a Community-Based Chronic Care Model
Learning to Optimize via Posterior Sampling
Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access
An Adaptive Sampling Algorithm for Solving Markov Decision Processes
Capacity Investment with Demand Learning
Bayesian Optimization via Simulation with Pairwise Sampling and Correlated Prior Beliefs
Finite-time analysis of the multiarmed bandit problem
