The following pages link to (Q4257216):
Displayed 50 items.
- QUANTUM COMPUTATION FOR ACTION SELECTION USING REINFORCEMENT LEARNING (Q3427060) (← links)
- Optimal empty vehicle redistribution for hub‐and‐spoke transportation systems (Q3539892) (← links)
- Robust Optimizers for Nonlinear Programming in Approximate Dynamic Programming (Q3564534) (← links)
- Reward-Modulated Hebbian Learning of Decision Making (Q3568365) (← links)
- A Spiking Neural Network Model of an Actor-Critic Learning Agent (Q3612121) (← links)
- Opportunistic Transmission over Randomly Varying Channels (Q3616977) (← links)
- Simultaneous Optimal Control and Discrete Stochastic Sensor Selection (Q3624562) (← links)
- Value and Policy Function Approximations in Infinite-Horizon Optimization Problems (Q3626047) (← links)
- Challenges in Enterprise Wide Optimization for the Process Industries (Q3638498) (← links)
- Bounds for Multistage Stochastic Programs Using Supervised Learning Strategies (Q3646118) (← links)
- Optimal control of a class of nonlinear stochastic systems (Q3931287) (← links)
- On the structure of value functions for threshold policies in queueing models (Q4462692) (← links)
- Risk-Constrained Reinforcement Learning with Percentile Risk Criteria (Q4558492) (← links)
- From Infinite to Finite Programs: Explicit Error Bounds with Applications to Approximate Dynamic Programming (Q4571046) (← links)
- Ordinary Differential Equation Methods for Markov Decision Processes and Application to Kullback--Leibler Control Cost (Q4602532) (← links)
- (Q4636981) (← links)
- Decomposition Methods for Computing Directional Stationary Solutions of a Class of Nonsmooth Nonconvex Optimization Problems (Q4641680) (← links)
- Optimal Dynamic Treatment Regimes (Q4665861) (← links)
- Computable approximations for average Markov decision processes in continuous time (Q4684960) (← links)
- New Rollout Algorithms for Combinatorial Optimization Problems (Q4709749) (← links)
- (Q4969174) (← links)
- Bayesian Exploration for Approximate Dynamic Programming (Q4971589) (← links)
- Suboptimal Policies for Stochastic $$N$$-Stage Optimization: Accuracy Analysis and a Case Study from Optimal Consumption (Q4979399) (← links)
- (Q4999027) (← links)
- (Q4999029) (← links)
- (Q4999096) (← links)
- Finite-Time Performance of Distributed Temporal-Difference Learning with Linear Function Approximation (Q4999359) (← links)
- Adaptive dynamic programming for model‐free tracking of trajectories with time‐varying parameters (Q5003423) (← links)
- Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation (Q5009779) (← links)
- Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning (Q5027971) (← links)
- Some Limit Properties of Markov Chains Induced by Recursive Stochastic Algorithms (Q5037552) (← links)
- (Q5053195) (← links)
- (Q5054599) (← links)
- ExpertRNA: A New Framework for RNA Secondary Structure Prediction (Q5057995) (← links)
- Actor-Critic–Like Stochastic Adaptive Search for Continuous Simulation Optimization (Q5060521) (← links)
- Scalable Reinforcement Learning for Multiagent Networked Systems (Q5060525) (← links)
- Stochastic Learning Approach for Binary Optimization: Application to Bayesian Optimal Design of Experiments (Q5071443) (← links)
- Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning (Q5076329) (← links)
- Asymptotics of Reinforcement Learning with Neural Networks (Q5084496) (← links)
- Multiple-sets split quasi-convex feasibility problems: Adaptive subgradient methods with convergence guarantee (Q5088832) (← links)
- Automated Reinforcement Learning (AutoRL): A Survey and Open Problems (Q5094025) (← links)
- Flexible FOND Planning with Explicit Fairness Assumptions (Q5094037) (← links)
- (Q5096718) (← links)
- Risk-Sensitive Reinforcement Learning via Policy Gradient Search (Q5102286) (← links)
- Dynamic Stochastic Matching Under Limited Time (Q5106373) (← links)
- Experience replay–based output feedback Q‐learning scheme for optimal output tracking control of discrete‐time linear systems (Q5128864) (← links)
- On the Taylor Expansion of Value Functions (Q5131481) (← links)
- Benchmarking a Scalable Approximate Dynamic Programming Algorithm for Stochastic Control of Grid-Level Energy Storage (Q5131714) (← links)
- Rectified deep neural networks overcome the curse of dimensionality for nonsmooth value functions in zero-sum games of nonlinear stiff systems (Q5132232) (← links)
- Spare Parts Inventory Management with Substitution-Dependent Reliability (Q5136077) (← links)