scientific article
From MaRDI portal
Publication:3661341
zbMath: 0514.90085
MaRDI QID: Q3661341
Publication date: 1983
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Keywords: average reward; integral representation theorem; total expected reward; Markovian decision model; dynamic programming with Borel state and action space; infinite future; mixtures of deterministic policies; randomized Markovian policy