Joint learning of reward machines and policies in environments with partially known semantics (Q6579299)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Joint learning of reward machines and policies in environments with partially known semantics |
scientific article; zbMATH DE number 7887414
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | Joint learning of reward machines and policies in environments with partially known semantics |
scientific article; zbMATH DE number 7887414 |
Statements
Joint learning of reward machines and policies in environments with partially known semantics (English)
0 references
25 July 2024
0 references
reinforcement learning
0 references
reward machines
0 references
perception limitations
0 references