The following pages link to (Q4626283):
Displayed 50 items.
- A reinforcement learning approach for dynamic multi-objective optimization (Q2055564) (← links)
- Neural precedence recommender (Q2055885) (← links)
- Enhancing gene expression programming based on space partition and jump for symbolic regression (Q2056306) (← links)
- Improving pest monitoring networks using a simulation-based approach to contribute to pesticide reduction (Q2056405) (← links)
- Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems (Q2063829) (← links)
- Fully asynchronous policy evaluation in distributed reinforcement learning over networks (Q2063869) (← links)
- A deep learning model for gas storage optimization (Q2064630) (← links)
- A survey of learning-based control of robotic visual servoing systems (Q2068326) (← links)
- Anticipative dynamic slotting for attended home deliveries (Q2068855) (← links)
- Revisiting the ODE method for recursive algorithms: fast convergence using quasi stochastic approximation (Q2070010) (← links)
- Superquantiles at work: machine learning applications and efficient subgradient computation (Q2070410) (← links)
- A solution to the path planning problem via algebraic geometry and reinforcement learning (Q2071234) (← links)
- Variational learning from implicit bandit feedback (Q2071347) (← links)
- Challenges of real-world reinforcement learning: definitions, benchmarks and analysis (Q2071388) (← links)
- Grounded action transformation for sim-to-real reinforcement learning (Q2071389) (← links)
- Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems (Q2071401) (← links)
- Partially observable environment estimation with uplift inference for reinforcement learning based recommendation (Q2071406) (← links)
- Model-free LQR design by Q-function learning (Q2071928) (← links)
- Reinforcement learning and stochastic optimisation (Q2072112) (← links)
- Interpretable machine learning: fundamental principles and 10 grand challenges (Q2074414) (← links)
- Deep reinforcement learning for inventory control: a roadmap (Q2076812) (← links)
- Deep Q-learning for same-day delivery with vehicles and drones (Q2076870) (← links)
- Learning to select operators in meta-heuristics: an integration of Q-learning into the iterated greedy algorithm for the permutation flowshop scheduling problem (Q2079447) (← links)
- Inverse reinforcement learning for multi-player noncooperative apprentice games (Q2081813) (← links)
- Risk-averse policy optimization via risk-neutral policy optimization (Q2082514) (← links)
- Adaptive output regulation for cyber-physical systems under time-delay attacks (Q2082758) (← links)
- FBSDE based neural network algorithms for high-dimensional quasilinear parabolic PDEs (Q2083635) (← links)
- A PAC algorithm in relative precision for bandit problem with costly sampling (Q2084297) (← links)
- Deep reinforcement learning for \textsf{FlipIt} security game (Q2086680) (← links)
- A taxonomy of surprise definitions (Q2087566) (← links)
- Reinforcement learning for the knapsack problem (Q2089607) (← links)
- Stochastic variance-reduced prox-linear algorithms for nonconvex composite optimization (Q2089785) (← links)
- Does lifelong learning affect mobile robot evolution? (Q2091556) (← links)
- A reinforcement learning model to inform optimal decision paths for HIV elimination (Q2092170) (← links)
- A reinforcement learning algorithm for rescheduling preempted tasks in fog nodes (Q2093184) (← links)
- Hierarchical clustering optimizes the tradeoff between compositionality and expressivity of task structures for flexible reinforcement learning (Q2093367) (← links)
- Simplified risk-aware decision making with belief-dependent rewards in partially observable domains (Q2093380) (← links)
- What may lie ahead in reinforcement learning (Q2094025) (← links)
- Reinforcement learning for distributed control and multi-player games (Q2094026) (← links)
- From reinforcement learning to optimal control: a unified framework for sequential decisions (Q2094027) (← links)
- Fundamental design principles for reinforcement learning algorithms (Q2094028) (← links)
- Mixed density methods for approximate dynamic programming (Q2094030) (← links)
- Adaptive dynamic programming in the Hamiltonian-driven framework (Q2094034) (← links)
- Optimal adaptive control of partially uncertain linear continuous-time systems with state delay (Q2094036) (← links)
- Dissipativity-based verification for autonomous systems in adversarial environments (Q2094038) (← links)
- Multi-agent reinforcement learning: a selective overview of theories and algorithms (Q2094040) (← links)
- A top-down approach to attain decentralized multi-agents (Q2094043) (← links)
- Bounded rationality in learning, perception, decision-making, and stochastic games (Q2094047) (← links)
- Trading utility and uncertainty: applying the value of information to resolve the exploration-exploitation dilemma in reinforcement learning (Q2094051) (← links)
- Reinforcement learning: an industrial perspective (Q2094053) (← links)