Gradient-descent for randomized controllers under partial observability
DOI10.1007/978-3-030-94583-1_7zbMath1498.68159arXiv2111.04407OpenAlexW3214634944MaRDI QIDQ2152644
Sebastian Junges, Joshua Moerman, Linus Heck, Jip Spel, Joost-Pieter Katoen
Publication date: 8 July 2022
Full work available at URL: https://arxiv.org/abs/2111.04407
Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) Specification and verification (program logics, model checking, etc.) (68Q60) Probability in computer science (algorithm analysis, random structures, phase transitions, etc.) (68Q87)
Related Items (1)
Uses Software
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Planning and acting in partially observable stochastic domains
- On the undecidability of probabilistic planning and related stochastic optimization problems
- Handbook of weighted automata
- Parametric probabilistic transition systems for system design and analysis
- Verification and control of partially observable probabilistic systems
- Parameter synthesis for Markov models: faster than ever
- Inductive synthesis for probabilistic programs reaches new horizons
- Bayesian inference by symbolic model checking
- Parametric Markov chains: PCTL complexity and fraction-free Gaussian elimination
- Properties of the sign gradient descent algorithms
- The complexity of reachability in parametric Markov decision processes
- Model Repair for Probabilistic Systems
- Perturbation Analysis in Verification of Discrete-Time Markov Chains
- Are Parametric Markov Chains Monotonic?
- Sequential Convex Programming for the Efficient Verification of Parametric MDPs
- Quantitative Model Checking Revisited: Neither Decidable Nor Approximable
- Computationally Feasible Bounds for Partially Observed Markov Decision Processes
- Strategy Synthesis for POMDPs in Robot Planning via Game-Based Abstractions
- Probabilistic ω-automata
- Theoretical Aspects of Computing - ICTAC 2004
- Linear programming. Foundations and extensions
- Reactive control improvisation
- Synthesis in pMDPs: a tale of 1001 parameters
- Accelerated model checking of parametric Markov chains
This page was built for publication: Gradient-descent for randomized controllers under partial observability