Controller exploitation-exploration reinforcement learning architecture for computing near-optimal policies (Q2318167)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Controller exploitation-exploration reinforcement learning architecture for computing near-optimal policies |
scientific article |
Statements
Controller exploitation-exploration reinforcement learning architecture for computing near-optimal policies (English)
0 references
14 August 2019
0 references
reinforcement learning
0 references
architecture
0 references
average cost
0 references
Markov chains
0 references
optimization
0 references
0 references