scientific article
From MaRDI portal
Publication:3174155
zbMath1222.68253MaRDI QIDQ3174155
Sridhar Mahadevan, Mauro Maggioni
Publication date: 12 October 2011
Full work available at URL: http://www.jmlr.org/papers/v8/mahadevan07a.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Markov decision processesreinforcement learningspectral graph theorymanifold learningvalue function approximation
Related Items (14)
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications ⋮ Markov Reward Models and Markov Decision Processes in Discrete and Continuous Time: Performance Evaluation and Optimization ⋮ Continual curiosity-driven skill acquisition from high-dimensional video inputs for humanoid robots ⋮ AUTOMATIC COMPLEXITY REDUCTION IN REINFORCEMENT LEARNING ⋮ Multi-scale geometric methods for data sets. II: Geometric multi-resolution analysis ⋮ Reinforcement learning for robust adaptive control of partially unknown nonlinear systems subject to unmatched uncertainties ⋮ Reinforcement learning algorithms with function approximation: recent advances and applications ⋮ Optimal Curiosity-Driven Modular Incremental Slow Feature Analysis ⋮ Slowness as a Proxy for Temporal Predictability: An Empirical Comparison ⋮ Diffusion wavelets ⋮ Adaptive critic design with graph Laplacian for online learning control of nonlinear systems ⋮ Regularized feature selection in reinforcement learning ⋮ A Sufficient Statistic for Influence in Structured Multiagent Environments ⋮ Actor-Critic Algorithms with Online Feature Adaptation
This page was built for publication: