scientific article; zbMATH DE number 5957269
From MaRDI portal
Publication:3093261
zbMATH Open1222.68193MaRDI QIDQ3093261FDOQ3093261
Authors: D. Ernst, Pierre Geurts, Louis Wehenkel
Publication date: 12 October 2011
Full work available at URL: http://www.jmlr.org/papers/v6/ernst05a.html
Title of this publication is not available (Why is that?)
Recommendations
- Reinforcement learning trees
- Improving reinforcement learning by using sequence trees
- Tree-based reinforcement learning for estimating optimal dynamic treatment regimes
- Cover tree Bayesian reinforcement learning
- Decision tree algorithm with reinforcement learning strategy
- Model-free reinforcement learning for branching Markov decision processes
- Scalable and efficient Bayes-adaptive reinforcement learning based on Monte-Carlo tree search
supervised learningensemble methodsoptimal controlregression treesbatch mode reinforcement learningfitted value iteration
Classification and discrimination; cluster analysis (statistical aspects) (62H30) Learning and adaptive systems in artificial intelligence (68T05)
Cited In (51)
- Reinforcement learning trees
- Tutorial on Amortized Optimization
- Cover tree Bayesian reinforcement learning
- A Q-learning algorithm for Markov decision processes with continuous state spaces
- Approximated multi-agent fitted Q iteration
- The QLBS Q-Learner goes NuQLear: fitted Q iteration, inverse RL, and option portfolios
- Batch policy learning in average reward Markov decision processes
- Hessian matrix distribution for Bayesian policy gradient reinforcement learning
- Target Network and Truncation Overcome the Deadly Triad in \(\boldsymbol{Q}\)-Learning
- Learning output reference model tracking for higher-order nonlinear systems with unknown dynamics
- Model selection in reinforcement learning
- Iteratively extending time horizon reinforcement learning.
- Towards min max generalization in reinforcement learning
- Bounds for Multistage Stochastic Programs Using Supervised Learning Strategies
- Minimax weight learning for absorbing MDPs
- Recovery of simultaneous low rank and two-way sparse coefficient matrices, a nonconvex approach
- Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
- Bandit Theory: Applications to Learning Healthcare Systems and Clinical Trials
- Quadratic approximate dynamic programming for input-affine systems
- Reinforcement learning
- Data-driven switching modeling for MPC using regression trees and random forests
- Machine Learning: ECML 2004
- Epoch-incremental reinforcement learning algorithms
- Approximate dynamic programming with a fuzzy parameterization
- Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer
- Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains
- Regularized feature selection in reinforcement learning
- Extreme state aggregation beyond Markov decision processes
- Optimized ensemble value function approximation for dynamic programming
- Evolving interpretable decision trees for reinforcement learning
- Super-learning of an optimal dynamic treatment rule
- Efficient approximate dynamic programming based on design and analysis of computer experiments for infinite-horizon optimization
- Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization
- Estimating optimal shared-parameter dynamic regimens with application to a multistage depression clinical trial
- Making friends on the fly: cooperating with new teammates
- A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning
- Exploiting action impact regularity and exogenous state variables for offline reinforcement learning
- Fitted Q-iteration by functional networks for control problems
- Title not available (Why is that?)
- Deep spatial Q-learning for infectious disease control
- Learning when-to-treat policies
- Multi-agent reinforcement learning: a selective overview of theories and algorithms
- A deep reinforcement learning framework for continuous intraday market bidding
- Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
- Scalable transfer learning in heterogeneous, dynamic environments
- Extremely randomized trees
- Batch mode reinforcement learning based on the synthesis of artificial trajectories
- On sparse representation for optimal individualized treatment selection with penalized outcome weighted learning
- Deep spectral Q-learning with application to mobile health
- Reinforcement learning algorithms with function approximation: recent advances and applications
- Extremely randomized trees
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3093261)