Q3552451 (Q3552451): Difference between revisions
From MaRDI portal
Set profile property. |
ReferenceBot (talk | contribs) Changed an Item |
||
Property / cites work | |||
Property / cites work: Uniform convergence of value iteration policies for discounted Markov decision processes / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Conditions for the uniqueness of optimal policies of discounted Markov decision processes / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q3428417 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q4255598 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q5305630 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Optimal Stationary Policies in General State Space Markov Decision Chains with Finite Action Sets / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q4002751 / rank | |||
Normal rank |
Latest revision as of 16:57, 2 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | No label defined |
scientific article |
Statements
22 April 2010
0 references
discounted cost
0 references
Markov decision process
0 references
ergodicity condition
0 references
value iteration
0 references
optimal policy
0 references
myopic policies
0 references
0 references
0 references
0 references