The following pages link to (Q4257216):
Displayed 50 items.
- Monte Carlo \(TD(\lambda)\)-methods for the optimal control of discrete-time Markovian jump linear systems (Q1596471) (← links)
- A time aggregation approach to Markov decision processes (Q1614322) (← links)
- Solving variational inequality and fixed point problems by line searches and potential optimization (Q1764238) (← links)
- Exploiting structure in adaptive dynamic programming algorithms for a stochastic batch service problem (Q1847251) (← links)
- Online stochastic optimization under time constraints (Q1958625) (← links)
- Modeling and optimization of M/G/1-type queueing networks: an efficient sensitivity analysis approach (Q1958793) (← links)
- Training parsers by inverse reinforcement learning (Q1959536) (← links)
- Guiding exploration by pre-existing knowledge without modifying reward (Q2383522) (← links)
- Restricted gradient-descent algorithm for value-function approximation in reinforcement learning (Q2389624) (← links)
- Optimization of a special case of continuous-time Markov decision processes with compact action set (Q2426563) (← links)
- Dynamic pricing and inventory control: robust vs. stochastic uncertainty models---a computational study (Q2430595) (← links)
- Blood platelet production: optimization by dynamic programming and simulation (Q2433384) (← links)
- Neuro-dynamic trading methods (Q2433500) (← links)
- Converging marriage in honey-bees optimization and application to stochastic dynamic programming (Q2457912) (← links)
- Application of reinforcement learning to the game of Othello (Q2462546) (← links)
- Adaptive stepsize selection for tracking in a regime-switching environment (Q2470042) (← links)
- Efficient sampling in approximate dynamic programming algorithms (Q2477014) (← links)
- Motion planning in uncertain environments with vision-like sensors (Q2477711) (← links)
- Dynamic modeling and control of supply chain systems: A review (Q2483501) (← links)
- A tutorial on the cross-entropy method (Q2485925) (← links)
- Basis function adaptation in temporal difference reinforcement learning (Q2485935) (← links)
- Approximate dynamic programming-based approaches for input--output data-driven control of nonlinear processes (Q2486109) (← links)
- Convergence property of gradient-type methods with non-monotone line search in the presence of perturbations (Q2489332) (← links)
- Linear stochastic approximation driven by slowly varying Markov chains (Q2503529) (← links)
- An actor-critic algorithm for constrained Markov decision processes (Q2504518) (← links)
- Boundedness of iterates in \(Q\)-learning (Q2504669) (← links)
- Learning dynamic prices in electronic retail markets with customer segmentation (Q2507374) (← links)
- A note on linear function approximation using random projections (Q2519761) (← links)
- Dynamic pricing models for electronic business (Q2571441) (← links)
- Monte Carlo methods for pricing financial options (Q2571446) (← links)
- Stochastic iterative dynamic programming: a Monte Carlo approach to dual control (Q2576079) (← links)
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach (Q2576080) (← links)
- Towards Min Max Generalization in Reinforcement Learning (Q3006026) (← links)
- Decomposition of large-scale stochastic optimal control problems (Q3057528) (← links)
- Approximate dynamic programming methods for an inventory allocation problem under uncertainty (Q3424512) (← links)
- QUANTUM COMPUTATION FOR ACTION SELECTION USING REINFORCEMENT LEARNING (Q3427060) (← links)
- Optimal empty vehicle redistribution for hub‐and‐spoke transportation systems (Q3539892) (← links)
- Robust Optimizers for Nonlinear Programming in Approximate Dynamic Programming (Q3564534) (← links)
- Reward-Modulated Hebbian Learning of Decision Making (Q3568365) (← links)
- A Spiking Neural Network Model of an Actor-Critic Learning Agent (Q3612121) (← links)
- Opportunistic Transmission over Randomly Varying Channels (Q3616977) (← links)
- Simultaneous Optimal Control and Discrete Stochastic Sensor Selection (Q3624562) (← links)
- Value and Policy Function Approximations in Infinite-Horizon Optimization Problems (Q3626047) (← links)
- Challenges in Enterprise Wide Optimization for the Process Industries (Q3638498) (← links)
- Bounds for Multistage Stochastic Programs Using Supervised Learning Strategies (Q3646118) (← links)
- Optimal control of a class of nonlinear stochastic systems (Q3931287) (← links)
- On the structure of value functions for threshold policies in queueing models (Q4462692) (← links)
- Optimal Dynamic Treatment Regimes (Q4665861) (← links)
- New Rollout Algorithms for Combinatorial Optimization Problems (Q4709749) (← links)
- Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning (Q5189863) (← links)