On Generalized Bellman Equations and Temporal-Difference Learning (Q3305109)

scientific article; zbMATH DE number 6982339

Language	Label	Description	Also known as
English	On Generalized Bellman Equations and Temporal-Difference Learning	scientific article; zbMATH DE number 6982339

Statements

instance of

scholarly article

0 references

title

On Generalized Bellman Equations and Temporal-Difference Learning (English)

0 references

zbMATH Open document ID

1454.68135

0 references

1465.90117

0 references

DOI

10.1007/978-3-319-57351-9_1

0 references

author

Huizhen Yu

0 references

Ashique Rupam Mahmood

0 references

Richard S. Sutton

0 references

published in

Advances in Artificial Intelligence

0 references

publication date

5 August 2020

0 references

21 November 2018

0 references

full work available at URL

https://arxiv.org/abs/1704.04463

0 references

http://jmlr.csail.mit.edu/papers/v19/17-283.html

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

Markov decision process

0 references

policy evaluation

0 references

generalized Bellman equation

0 references

temporal differences

0 references

Markov chain

0 references

randomized stopping time

0 references

approximate policy evaluation

0 references

reinforcement learning

0 references

temporal-difference method

0 references

MaRDI profile type

MaRDI publication profile

0 references

0 references

0 references

describes a project that uses

SBEED

0 references

arXiv classification

cs.LG

0 references

math.OC

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:3305109