Q4558197 (Q4558197): Difference between revisions
From MaRDI portal
Item:Q4558197
Changed label, description and/or aliases in en, and other parts |
EloiFerrer (talk | contribs) Merged Item into Q3305109 Tag: Replaced |
||||||||||||||
label / en | label / en | ||||||||||||||
description / en | description / en | ||||||||||||||
Property / instance of | |||||||||||||||
Property / instance of: scholarly article / rank | |||||||||||||||
Property / zbMATH Open document ID | |||||||||||||||
Property / zbMATH Open document ID: 1465.90117 / rank | |||||||||||||||
Property / author | |||||||||||||||
Property / author: Huizhen Yu / rank | |||||||||||||||
Property / author | |||||||||||||||
Property / author: Ashique Rupam Mahmood / rank | |||||||||||||||
Property / author | |||||||||||||||
Property / author: Richard S. Sutton / rank | |||||||||||||||
Property / publication date | |||||||||||||||
| |||||||||||||||
Property / publication date: 21 November 2018 / rank | |||||||||||||||
Property / full work available at URL | |||||||||||||||
Property / full work available at URL: https://arxiv.org/abs/1704.04463 / rank | |||||||||||||||
Property / full work available at URL | |||||||||||||||
Property / full work available at URL: http://jmlr.csail.mit.edu/papers/v19/17-283.html / rank | |||||||||||||||
Property / Mathematics Subject Classification ID | |||||||||||||||
Property / Mathematics Subject Classification ID: 90C40 / rank | |||||||||||||||
Property / Mathematics Subject Classification ID | |||||||||||||||
Property / Mathematics Subject Classification ID: 60J20 / rank | |||||||||||||||
Property / Mathematics Subject Classification ID | |||||||||||||||
Property / Mathematics Subject Classification ID: 68T05 / rank | |||||||||||||||
Property / Mathematics Subject Classification ID | |||||||||||||||
Property / Mathematics Subject Classification ID: 90C39 / rank | |||||||||||||||
Property / zbMATH DE Number | |||||||||||||||
Property / zbMATH DE Number: 6982339 / rank | |||||||||||||||
Property / zbMATH Keywords | |||||||||||||||
Property / zbMATH Keywords: Markov decision process / rank | |||||||||||||||
Property / zbMATH Keywords | |||||||||||||||
Property / zbMATH Keywords: approximate policy evaluation / rank | |||||||||||||||
Property / zbMATH Keywords | |||||||||||||||
Property / zbMATH Keywords: generalized Bellman equation / rank | |||||||||||||||
Property / zbMATH Keywords | |||||||||||||||
Property / zbMATH Keywords: reinforcement learning / rank | |||||||||||||||
Property / zbMATH Keywords | |||||||||||||||
Property / zbMATH Keywords: temporal-difference method / rank | |||||||||||||||
Property / zbMATH Keywords | |||||||||||||||
Property / zbMATH Keywords: Markov chain / rank | |||||||||||||||
Property / zbMATH Keywords | |||||||||||||||
Property / zbMATH Keywords: randomized stopping time / rank | |||||||||||||||
Property / describes a project that uses | |||||||||||||||
Property / describes a project that uses: SBEED / rank | |||||||||||||||
Property / MaRDI profile type | |||||||||||||||
Property / MaRDI profile type: MaRDI publication profile / rank | |||||||||||||||
Property / arXiv ID | |||||||||||||||
Property / arXiv ID: 1704.04463 / rank | |||||||||||||||
Property / arXiv classification | |||||||||||||||
Property / arXiv classification: cs.LG / rank | |||||||||||||||
Property / arXiv classification | |||||||||||||||
Property / arXiv classification: math.OC / rank | |||||||||||||||
links / mardi / name | links / mardi / name | ||||||||||||||
Revision as of 10:13, 6 May 2024
No description defined
Language | Label | Description | Also known as |
---|---|---|---|
English | No label defined |
No description defined |