Deterministic policies based on maximum regrets in MDPs with imprecise rewards (Q5069649)

scientific article; zbMATH DE number 7509003

Language	Label	Description	Also known as
English	Deterministic policies based on maximum regrets in MDPs with imprecise rewards	scientific article; zbMATH DE number 7509003

Statements

instance of

scholarly article

0 references

title

Deterministic policies based on maximum regrets in MDPs with imprecise rewards (English)

0 references

0 references

0 references

0 references

0 references

19 April 2022

0 references

zbMATH Keywords

Markov decision process

0 references

minimax regret

0 references

unknown rewards

0 references

branch-and-bound

0 references

deterministic policy

0 references

stochastic policy

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

Sampling Based Approaches for Minimizing Regret in Uncertain Markov Decision Processes (MDPs)

0 references

Regret in Decision Making under Uncertainty

0 references

Partitioning procedures for solving mixed-variables programming problems

0 references

Machine learning and knowledge discovery in databases. European conference, ECML PKDD 2011, Athens, Greece, September 5--9, 2011. Proceedings, Part I

0 references

Preference-based reinforcement learning: a formal framework and a policy iteration algorithm

0 references

Bounded-parameter Markov decision processes

0 references

Robust Dynamic Programming

0 references

Bias and Variance Approximation in Value Function Estimates

0 references

Robust Control of Markov Decision Processes with Uncertain Transition Matrices

0 references

Q4315289

0 references

Q3455618

0 references

Robust Markov Decision Processes

0 references

full work available at URL

https://doi.org/10.3233/aic-190632

0 references

Identifiers

zbMATH Open document ID

1487.68205

0 references

DOI

10.3233/AIC-190632

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:5069649