The robot routing problem for collecting aggregate stochastic rewards

DOI10.4230/LIPICS.CONCUR.2017.13zbMATH Open1442.68237arXiv1704.05303OpenAlexW2606115451MaRDI QIDQ5111626FDOQ5111626

Authors: Rayna Dimitrova, Ivan Gavran, Rupak Majumdar, Vinayak S. Prabhu, Sadegh Esmaeil Zadeh Soudjani

Publication date: 27 May 2020

Abstract: We propose a new model for formalizing reward collection problems on graphs with dynamically generated rewards which may appear and disappear based on a stochastic model. The *robot routing problem* is modeled as a graph whose nodes are stochastic processes generating potential rewards over discrete time. The rewards are generated according to the stochastic process, but at each step, an existing reward disappears with a given probability. The edges in the graph encode the (unit-distance) paths between the rewards' locations. On visiting a node, the robot collects the accumulated reward at the node at that time, but traveling between the nodes takes time. The optimization question asks to compute an optimal (or epsilon-optimal) path that maximizes the expected collected rewards. We consider the finite and infinite-horizon robot routing problems. For finite-horizon, the goal is to maximize the total expected reward, while for infinite horizon we consider limit-average objectives. We study the computational and strategy complexity of these problems, establish NP-lower bounds and show that optimal strategies require memory in general. We also provide an algorithm for computing epsilon-optimal infinite paths for arbitrary epsilon > 0.

Full work available at URL: https://arxiv.org/abs/1704.05303

Recommendations

zbMATH Keywords

discounting path planning graph games quantitative objectives

Mathematics Subject Classification ID

Probability in computer science (algorithm analysis, random structures, phase transitions, etc.) (68Q87) Graph theory (including graph drawing) in computer science (68R10) Computational difficulty of problems (lower bounds, completeness, difficulty of approximation, etc.) (68Q17) Artificial intelligence for robotics (68T40) Games involving graphs (91A43)

Cites Work

Cited In (2)

This page was built for publication: The robot routing problem for collecting aggregate stochastic rewards

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5111626)