Offline reinforcement learning with task hierarchies (Q1698854): Difference between revisions
From MaRDI portal
Set OpenAlex properties. |
Normalize DOI. |
||
(One intermediate revision by one other user not shown) | |||
Property / DOI | |||
Property / DOI: 10.1007/s10994-017-5650-8 / rank | |||
Property / cites work | |||
Property / cites work: 10.1162/1532443041827907 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Recent advances in hierarchical reinforcement learning / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Recent advances in hierarchical reinforcement learning / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q4527272 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q3174040 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q5405216 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q4878667 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q5305630 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q4737595 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: \({\mathcal Q}\)-learning / rank | |||
Normal rank | |||
Property / DOI | |||
Property / DOI: 10.1007/S10994-017-5650-8 / rank | |||
Normal rank |
Latest revision as of 04:33, 11 December 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Offline reinforcement learning with task hierarchies |
scientific article |
Statements
Offline reinforcement learning with task hierarchies (English)
0 references
16 February 2018
0 references
reinforcement learning
0 references
hierarchical reinforcement learning
0 references
MAXQ
0 references
least-squares policy iteration (LSPI)
0 references
0 references