Model selection in reinforcement learning (Q415618): Difference between revisions
From MaRDI portal
Property / Mathematics Subject Classification ID: 68T05, 68Q32, 90C40 (normal rank)
Property / zbMATH DE Number: 6031868 (normal rank)
Property / zbMATH Keywords: reinforcement learning, model selection, complexity regularization, adaptivity, offline learning, off-policy learning, finite-sample bounds (normal rank)
Revision as of 19:34, 29 June 2023
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Model selection in reinforcement learning | scientific article | |
Statements
Model selection in reinforcement learning (English)
8 May 2012
reinforcement learning
model selection
complexity regularization
adaptivity
offline learning
off-policy learning
finite-sample bounds