Pages that link to "Item:Q1604813"
From MaRDI portal
The following pages link to Kernel-based reinforcement learning (Q1604813):
Displaying 29 items.
- Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm (Q300040) (← links)
- Low-discrepancy sampling for approximate dynamic programming with local approximators (Q336896) (← links)
- Hybrid MDP based integrated hierarchical Q-learning (Q350987) (← links)
- Batch mode reinforcement learning based on the synthesis of artificial trajectories (Q378762) (← links)
- Reinforcement learning algorithms with function approximation: recent advances and applications (Q903601) (← links)
- Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path (Q1009248) (← links)
- An algorithmic approach to optimal asset liquidation problems (Q1627810) (← links)
- Shape constraints in economics and operations research (Q1730901) (← links)
- Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains (Q1959632) (← links)
- Batch policy learning in average reward Markov decision processes (Q2112817) (← links)
- Efficient algorithms of pathwise dynamic programming for decision optimization in mining operations (Q2178364) (← links)
- Fitted Q-iteration by functional networks for control problems (Q2293779) (← links)
- Restricted gradient-descent algorithm for value-function approximation in reinforcement learning (Q2389624) (← links)
- Graph kernels and Gaussian processes for relational reinforcement learning (Q2433177) (← links)
- Multi-agent DRL-based data-driven approach for PEVs charging/discharging scheduling in smart grid (Q2667469) (← links)
- Adaptive critic design with graph Laplacian for online learning control of nonlinear systems (Q2795795) (← links)
- An Approximate Dynamic Programming Algorithm for Monotone Value Functions (Q2797467) (← links)
- SMART: A Stochastic Multiscale Model for the Analysis of Energy Resources, Technology, and Policy (Q2815479) (← links)
- A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630) (← links)
- Reinforcement Learning Strategies for Clinical Trials in Nonsmall Cell Lung Cancer (Q2893403) (← links)
- Towards Min Max Generalization in Reinforcement Learning (Q3006026) (← links)
- Algorithms for Optimal Control of Stochastic Switching Systems (Q3178726) (← links)
- (Q4636981) (← links)
- Mean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity Analysis (Q5018896) (← links)
- Bandit Theory: Applications to Learning Healthcare Systems and Clinical Trials (Q5072150) (← links)
- Deep reinforcement trading with predictable returns (Q6098411) (← links)
- Design of experiments for the calibration of history-dependent models via deep reinforcement learning and an enhanced Kalman filter (Q6159325) (← links)
- Approximated multi-agent fitted Q iteration (Q6174070) (← links)
- A kernel-based approximate dynamic programming approach: theory and application (Q6491089) (← links)