Pages that link to "Item:Q1604813"

What links here

⧼whatlinkshere-whatlinkshere-target⧽

Page:

⧼whatlinkshere-whatlinkshere-ns⧽

Namespace:

Invert selection

⧼whatlinkshere-whatlinkshere-filter⧽

Hide transclusions

Hide links

Hide redirects

The following pages link to Kernel-based reinforcement learning (Q1604813):

Displaying 29 items.

Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm (Q300040) (← links)
Low-discrepancy sampling for approximate dynamic programming with local approximators (Q336896) (← links)
Hybrid MDP based integrated hierarchical Q-learning (Q350987) (← links)
Batch mode reinforcement learning based on the synthesis of artificial trajectories (Q378762) (← links)
Reinforcement learning algorithms with function approximation: recent advances and applications (Q903601) (← links)
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path (Q1009248) (← links)
An algorithmic approach to optimal asset liquidation problems (Q1627810) (← links)
Shape constraints in economics and operations research (Q1730901) (← links)
Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains (Q1959632) (← links)
Batch policy learning in average reward Markov decision processes (Q2112817) (← links)
Efficient algorithms of pathwise dynamic programming for decision optimization in mining operations (Q2178364) (← links)
Fitted Q-iteration by functional networks for control problems (Q2293779) (← links)
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning (Q2389624) (← links)
Graph kernels and Gaussian processes for relational reinforcement learning (Q2433177) (← links)
Multi-agent DRL-based data-driven approach for PEVs charging/discharging scheduling in smart grid (Q2667469) (← links)
Adaptive critic design with graph Laplacian for online learning control of nonlinear systems (Q2795795) (← links)
An Approximate Dynamic Programming Algorithm for Monotone Value Functions (Q2797467) (← links)
SMART: A Stochastic Multiscale Model for the Analysis of Energy Resources, Technology, and Policy (Q2815479) (← links)
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630) (← links)
Reinforcement Learning Strategies for Clinical Trials in Nonsmall Cell Lung Cancer (Q2893403) (← links)
Towards Min Max Generalization in Reinforcement Learning (Q3006026) (← links)
Algorithms for Optimal Control of Stochastic Switching Systems (Q3178726) (← links)
(Q4636981) (← links)
Mean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity Analysis (Q5018896) (← links)
Bandit Theory: Applications to Learning Healthcare Systems and Clinical Trials (Q5072150) (← links)
Deep reinforcement trading with predictable returns (Q6098411) (← links)
Design of experiments for the calibration of history-dependent models via deep reinforcement learning and an enhanced Kalman filter (Q6159325) (← links)
Approximated multi-agent fitted Q iteration (Q6174070) (← links)
A kernel-based approximate dynamic programming approach: theory and application (Q6491089) (← links)