Second order bounds for Markov decision processes (Q1150312): Difference between revisions
From MaRDI portal
Set profile property. |
ReferenceBot (talk | contribs) Changed an Item |
||
Property / cites work | |||
Property / cites work: Q3266141 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: A modified dynamic programming method for Markovian decision problems / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Improved iterative computation of the expected discounted return in Markov and semi-Markov chains / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q3908795 / rank | |||
Normal rank |
Revision as of 10:45, 13 June 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Second order bounds for Markov decision processes |
scientific article |
Statements
Second order bounds for Markov decision processes (English)
0 references
1981
0 references
second order bounds
0 references
Markov decision process
0 references
value iteration algorithm
0 references
lower bound on the optimal value
0 references
0 references