Mohammad Ghavamzadeh

List of research outcomes

This list is not complete and representing at the moment only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon!

Publication	Date of Publication	Type
Proximal gradient temporal difference learning: stable reinforcement learning with polynomial sample complexity (available as arXiv preprint)	2018-11-30	Paper
Risk-constrained reinforcement learning with percentile risk criteria	2018-11-22	Paper
Risk-constrained reinforcement learning with percentile risk criteria (available as arXiv preprint)	2018-11-22	Paper
Variance-constrained actor-critic algorithms for discounted and average reward MDPs Machine Learning	2018-01-12	Paper
Sequential Decision Making With Coherent Risk IEEE Transactions on Automatic Control	2017-09-21	Paper
Classification-Based Approximate Policy Iteration IEEE Transactions on Automatic Control	2017-05-16	Paper
Regularized policy iteration with nonparametric function spaces Journal of Machine Learning Research (JMLR)	2016-11-22	Paper
Bayesian policy gradient and actor-critic algorithms Journal of Machine Learning Research (JMLR)	2016-06-06	Paper
Analysis of classification-based policy iteration algorithms Journal of Machine Learning Research (JMLR)	2016-06-06	Paper
Bayesian reinforcement learning: a survey Foundations and Trends in Machine Learning	2016-05-30	Paper
scientific article; zbMATH DE number 6542806 (Why is no real title available?)	2016-02-19	Paper
Robust Policy Optimization with Baseline Guarantees	2015-06-15	Paper
Constrained Stochastic Optimal Control with a Baseline Performance Guarantee	2014-10-10	Paper
Finite-sample analysis of least-squares policy iteration	2014-04-01	Paper
Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits Lecture Notes in Computer Science	2011-10-19	Paper
scientific article; zbMATH DE number 5957504 (Why is no real title available?)	2011-10-12	Paper
Natural actor-critic algorithms Automatica	2010-01-08	Paper

Research outcomes over time

This page was built for person: Mohammad Ghavamzadeh