Policy mirror descent for reinforcement learning: linear convergence, new sampling complexity, and generalized problem classes (Q2687069)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Policy mirror descent for reinforcement learning: linear convergence, new sampling complexity, and generalized problem classes | scientific article | |
Statements
Policy mirror descent for reinforcement learning: linear convergence, new sampling complexity, and generalized problem classes (English)
1 March 2023