On the Computational Complexity of Stochastic Controller Optimization in POMDPs

DOI 10.1145/2382559.2382563, MaRDI QID Q2947572

Authors Nikos Vlassis, Michael L. Littman, David Barber

Publication date 24 September 2015

Published in ACM Transactions on Computation Theory

Full work available at URL https://arxiv.org/abs/1107.3090

computational complexity, nonlinear optimization, partially observable Markov decision process, computations on polynomials, bilinear program, sum-of-square-roots problem, stochastic controller, Motzkin-Straus theorem, matrix fractional program

Mathematics Subject Classification ID

Analysis of algorithms and problem complexity (68Q25); Computational difficulty of problems (lower bounds, completeness, difficulty of approximation, etc.) (68Q17); Markov and semi-Markov decision processes (90C40)

Abstract: We show that the problem of finding an optimal stochastic 'blind' controller in a Markov decision process is an NP-hard problem. The corresponding decision problem is NP-hard, in PSPACE, and SQRT-SUM-hard, hence placing it in NP would imply breakthroughs in long-standing open problems in computer science. Our result establishes that the more general problem of stochastic controller optimization in POMDPs is also NP-hard. Nonetheless, we outline a special case that is convex and admits efficient global solutions.

Cited in (13)
