scientific article; zbMATH DE number 5957269
Publication:3093261
Recommendations
- Reinforcement learning trees
- Improving reinforcement learning by using sequence trees
- Tree-based reinforcement learning for estimating optimal dynamic treatment regimes
- Cover tree Bayesian reinforcement learning
- Decision tree algorithm with reinforcement learning strategy
- Model-free reinforcement learning for branching Markov decision processes
- Scalable and efficient Bayes-adaptive reinforcement learning based on Monte-Carlo tree search
Cited in (51)
- Extremely randomized trees
- Reinforcement learning trees
- Tutorial on Amortized Optimization
- Cover tree Bayesian reinforcement learning
- Batch policy learning in average reward Markov decision processes
- The QLBS Q-Learner goes NuQLear: fitted Q iteration, inverse RL, and option portfolios
- A Q-learning algorithm for Markov decision processes with continuous state spaces
- Approximated multi-agent fitted Q iteration
- Hessian matrix distribution for Bayesian policy gradient reinforcement learning
- Model selection in reinforcement learning
- Learning output reference model tracking for higher-order nonlinear systems with unknown dynamics
- Target Network and Truncation Overcome the Deadly Triad in \(\boldsymbol{Q}\)-Learning
- Towards min max generalization in reinforcement learning
- Iteratively extending time horizon reinforcement learning
- Bounds for Multistage Stochastic Programs Using Supervised Learning Strategies
- Recovery of simultaneous low rank and two-way sparse coefficient matrices, a nonconvex approach
- Minimax weight learning for absorbing MDPs
- Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
- Bandit Theory: Applications to Learning Healthcare Systems and Clinical Trials
- Quadratic approximate dynamic programming for input-affine systems
- Data-driven switching modeling for MPC using regression trees and random forests
- Reinforcement learning
- Machine Learning: ECML 2004
- Approximate dynamic programming with a fuzzy parameterization
- Epoch-incremental reinforcement learning algorithms
- Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer
- Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains
- Regularized feature selection in reinforcement learning
- Extreme state aggregation beyond Markov decision processes
- Optimized ensemble value function approximation for dynamic programming
- Evolving interpretable decision trees for reinforcement learning
- Super-learning of an optimal dynamic treatment rule
- Efficient approximate dynamic programming based on design and analysis of computer experiments for infinite-horizon optimization
- Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization
- Estimating optimal shared-parameter dynamic regimens with application to a multistage depression clinical trial
- Making friends on the fly: cooperating with new teammates
- A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning
- Exploiting action impact regularity and exogenous state variables for offline reinforcement learning
- Fitted Q-iteration by functional networks for control problems
- scientific article; zbMATH DE number 7626792
- Deep spatial Q-learning for infectious disease control
- Learning when-to-treat policies
- Multi-agent reinforcement learning: a selective overview of theories and algorithms
- A deep reinforcement learning framework for continuous intraday market bidding
- Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
- Scalable transfer learning in heterogeneous, dynamic environments
- Extremely randomized trees
- Batch mode reinforcement learning based on the synthesis of artificial trajectories
- On sparse representation for optimal individualized treatment selection with penalized outcome weighted learning
- Reinforcement learning algorithms with function approximation: recent advances and applications
- Deep spectral Q-learning with application to mobile health