Kernel-based reinforcement learning


Publication:1604813

DOI: 10.1023/A:1017928328829
zbMath: 1014.68069
MaRDI QID: Q1604813

Dirk Ormoneit, Śaunak Sen

Publication date: 8 July 2002

Published in: Machine Learning




Related Items (29)

A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
Reinforcement Learning Strategies for Clinical Trials in Nonsmall Cell Lung Cancer
An algorithmic approach to optimal asset liquidation problems
Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm
Algorithms for Optimal Control of Stochastic Switching Systems
Low-discrepancy sampling for approximate dynamic programming with local approximators
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning
Bandit Theory: Applications to Learning Healthcare Systems and Clinical Trials
Hybrid MDP based integrated hierarchical Q-learning
Batch mode reinforcement learning based on the synthesis of artificial trajectories
Efficient algorithms of pathwise dynamic programming for decision optimization in mining operations
Multi-agent DRL-based data-driven approach for PEVs charging/discharging scheduling in smart grid
Deep reinforcement trading with predictable returns
Design of experiments for the calibration of history-dependent models via deep reinforcement learning and an enhanced Kalman filter
From Reinforcement Learning to Deep Reinforcement Learning: An Overview
Approximated multi-agent fitted Q iteration
Reinforcement learning algorithms with function approximation: recent advances and applications
Graph kernels and Gaussian processes for relational reinforcement learning
Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains
Shape constraints in economics and operations research
Towards Min Max Generalization in Reinforcement Learning
Adaptive critic design with graph Laplacian for online learning control of nonlinear systems
An Approximate Dynamic Programming Algorithm for Monotone Value Functions
Fitted Q-iteration by functional networks for control problems
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
SMART: A Stochastic Multiscale Model for the Analysis of Energy Resources, Technology, and Policy
Unnamed Item
Mean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity Analysis
Batch policy learning in average reward Markov decision processes





