Csaba Szepesvári

From MaRDI portal
Person:399885

Available identifiers

zbMath Open szepesvari.csabaMaRDI QIDQ399885

List of research outcomes





PublicationDate of PublicationType
Optimistic MLE: a generic model-based algorithm for partially observable sequential decision making2024-05-08Paper
https://portal.mardi4nfdi.de/entity/Q50532352022-12-06Paper
Gradient descent for sparse rank-one matrix completion for crowd-sourced aggregation of sparsely interacting workers2020-10-05Paper
Bandit algorithms2020-05-11Paper
A modular analysis of adaptive (non-)convex optimization: optimism, composite objectives, variance reduction, and variational bounds2020-01-29Paper
Mixing time estimation in reversible Markov chains from a single sample path2019-10-22Paper
A modular analysis of adaptive (non-)convex optimization: optimism, composite objectives, and variational bounds2019-01-10Paper
Stochastic Optimization in a Cumulative Prospect Theory Framework2018-09-18Paper
A Linearly Relaxed Approximate Linear Program for Markov Decision Processes2018-06-27Paper
Following the leader and fast rates in online linear prediction: curved constraint sets and other regularities2018-04-17Paper
Online Markov Decision Processes Under Bandit Feedback2017-05-16Paper
Regularized policy iteration with nonparametric function spaces2016-11-22Paper
Partial monitoring -- classification, regret bounds, and algorithms2015-04-24Paper
On Learning the Optimal Waiting Time2015-01-14Paper
Alignment based kernel learning with a continuous set of base kernels2014-08-20Paper
\(X\)-armed bandits2014-02-03Paper
Toward a classification of finite partial-monitoring games2013-03-04Paper
Partial monitoring with side information2012-10-16Paper
Model selection in reinforcement learning2012-05-08Paper
Regularized least-squares regression: learning from a sequence2011-11-10Paper
Finite-time bounds for fitted value iteration2011-11-08Paper
Training parsers by inverse reinforcement learning2010-10-07Paper
Toward a classification of finite partial-monitoring games2010-10-01Paper
Algorithms for reinforcement learning.2010-09-10Paper
Active learning in heteroscedastic noise2010-07-07Paper
Models of active learning in group-structured state spaces2010-04-08Paper
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits2009-05-12Paper
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path2009-03-31Paper
Active Learning in Multi-armed Bandits2008-10-14Paper
Active Learning of Group-Structured Environments2008-10-14Paper
Tuning Bandit Algorithms in Stochastic Environments2008-08-19Paper
Machine Learning: ECML 20042008-03-14Paper
Improved Rates for the Stochastic Continuum-Armed Bandit Problem2008-01-03Paper
Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path2007-09-14Paper
Computer Vision - ECCV 20042005-12-27Paper
Efficient approximate planning in continuous space Markovian decision problems2002-05-02Paper
An asynchronous stochastic approximation theorem and some applications2001-04-01Paper
https://portal.mardi4nfdi.de/entity/Q45152532000-11-13Paper
Convergence results for single-step on-policy reinforcement-learning algorithms2000-06-21Paper
https://portal.mardi4nfdi.de/entity/Q42586511999-09-14Paper
Module-based reinforcement learning: Experiments with a real robot1998-10-13Paper
An integrated architecture for motion-control and path-planning1998-06-08Paper
Robust control using inverse dynamics neurocontrollers1998-05-11Paper
Approximate geometry representations and sensory fusion1997-03-31Paper

Research outcomes over time

This page was built for person: Csaba Szepesvári