Pages that link to "Item:Q814474"
From MaRDI portal
The following pages link to Equivalence notions and model minimization in Markov decision processes (Q814474):
Displaying 19 items.
- Approximation metrics based on probabilistic bisimulations for general state-space Markov processes: a survey (Q271706)
- Extreme state aggregation beyond Markov decision processes (Q329613)
- Adaptive aggregation for reinforcement learning in average reward Markov decision processes (Q378753)
- A uniform framework for modeling nondeterministic, probabilistic, stochastic, or mixed processes and their behavioral equivalences (Q384933)
- Exploiting symmetries for single- and multi-agent partially observable stochastic domains (Q456732)
- Regret bounds for restless Markov bandits (Q465253)
- Algebraic decompositions of DP problems with linear dynamics (Q888809)
- Algebraic results and bottom-up algorithm for policies generalization in reinforcement learning using concept lattices (Q1003553)
- Planning in hybrid relational MDPs (Q1699911)
- Relevant states and memory in Markov chain bootstrapping and simulation (Q1752182)
- Exact finite approximations of average-cost countable Markov decision processes (Q2440756)
- Model Refinement Using Bisimulation Quotients (Q3067468)
- Pseudometrics for State Aggregation in Average Reward Markov Decision Processes (Q3520073)
- (Q5054599)
- (Q5148991)
- A Sufficient Statistic for Influence in Structured Multiagent Environments (Q5856481)
- Robust Control for Dynamical Systems with Non-Gaussian Noise via Formal Abstractions (Q5881801)
- A taxonomy for similarity metrics between Markov decision processes (Q6097106)
- Mixed nondeterministic-probabilistic automata: blending graphical probabilistic models with nondeterminism (Q6201390)