Convergence of Finite Memory Q Learning for POMDPs and Near Optimality of Learned Policies Under Filter Stability (Q6122574)
From MaRDI portal
scientific article; zbMATH DE number 7811853
Language | Label | Description | Also known as |
---|---|---|---|
English | Convergence of Finite Memory Q Learning for POMDPs and Near Optimality of Learned Policies Under Filter Stability |
scientific article; zbMATH DE number 7811853 |
Statements
Convergence of Finite Memory Q Learning for POMDPs and Near Optimality of Learned Policies Under Filter Stability (English)
0 references
1 March 2024
0 references
reinforcement learning
0 references
partially observed MDP
0 references
reinforcement learning partially observed MDP
0 references