scientific article; zbMATH DE number 7164724

From MaRDI portal

Publication:5214215

zbMath: 1434.68515, arXiv: 1703.07608, MaRDI QID: Q5214215

Ian Osband, Benjamin van Roy, Zheng Wen, Daniel J. Russo

Publication date: 7 February 2020

Full work available at URL: https://arxiv.org/abs/1703.07608

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

neural network; reinforcement learning; value function; exploration

Mathematics Subject Classification ID

Artificial neural networks and deep learning (68T07); Bayesian inference (62F15); Markov and semi-Markov decision processes (90C40); Sequential statistical analysis (62L10)

Related Items (9)

Unnamed Item ⋮ Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning ⋮ Priors in Bayesian Deep Learning: A Review ⋮ Reinforcement Learning, Bit by Bit ⋮ Deep Reinforcement Learning: A State-of-the-Art Walkthrough ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Sophisticated Inference ⋮ Fundamental design principles for reinforcement learning algorithms
