Mohammad Ghavamzadeh

From MaRDI portal
Person:1049134


List of research outcomes

This list is not complete and representing at the moment only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon!

PublicationDate of PublicationType
Proximal gradient temporal difference learning: stable reinforcement learning with polynomial sample complexity
 
2018-11-30Paper
Risk-constrained reinforcement learning with percentile risk criteria
 
2018-11-22Paper
Variance-constrained actor-critic algorithms for discounted and average reward MDPs
Machine Learning
2018-01-12Paper
Sequential Decision Making With Coherent Risk
IEEE Transactions on Automatic Control
2017-09-21Paper
Classification-Based Approximate Policy Iteration
IEEE Transactions on Automatic Control
2017-05-16Paper
Regularized policy iteration with nonparametric function spaces
Journal of Machine Learning Research (JMLR)
2016-11-22Paper
Bayesian policy gradient and actor-critic algorithms
Journal of Machine Learning Research (JMLR)
2016-06-06Paper
Analysis of classification-based policy iteration algorithms
Journal of Machine Learning Research (JMLR)
2016-06-06Paper
Bayesian reinforcement learning: a survey
Foundations and Trends in Machine Learning
2016-05-30Paper
scientific article; zbMATH DE number 6542806 (Why is no real title available?)
 
2016-02-19Paper
Robust Policy Optimization with Baseline Guarantees
 
2015-06-15Paper
Constrained Stochastic Optimal Control with a Baseline Performance Guarantee
 
2014-10-10Paper
Finite-sample analysis of least-squares policy iteration
 
2014-04-01Paper
Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits
Lecture Notes in Computer Science
2011-10-19Paper
scientific article; zbMATH DE number 5957504 (Why is no real title available?)
 
2011-10-12Paper
Natural actor-critic algorithms
Automatica
2010-01-08Paper


Research outcomes over time


This page was built for person: Mohammad Ghavamzadeh