Pages that link to "Item:Q4395828"
From MaRDI portal
The following pages link to The policy iteration algorithm for average reward Markov decision processes with general state space (Q4395828):
Displaying 27 items.
- Potential-based least-squares policy iteration for a parameterized feedback control system (Q289143) (← links)
- Weak convergence and fluid limits in optimal time-to-empty queueing control problems (Q434250) (← links)
- Average control of Markov decision processes with Feller transition probabilities and general action spaces (Q450971) (← links)
- Weakly coupled event triggered output feedback system in wireless networked control systems (Q461475) (← links)
- The policy iteration algorithm for average continuous control of piecewise deterministic Markov processes (Q607558) (← links)
- Stochastic control via direct comparison (Q633815) (← links)
- Completion-of-squares: revisited and extended (Q643690) (← links)
- Policy iteration for continuous-time average reward Markov decision processes in Polish spaces (Q963139) (← links)
- Single sample path-based optimization of Markov chains (Q1289394) (← links)
- Approximate receding horizon approach for Markov decision processes: average reward case (Q1414220) (← links)
- Planning for the long run: programming with patient, Pareto responsive preferences (Q1757582) (← links)
- On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies (Q2069795) (← links)
- Dispatching to parallel servers. Solutions of Poisson's equation for first-policy improvement (Q2070675) (← links)
- The policy iteration algorithm for a compound Poisson process applied to optimal dividend strategies under a Cramér-Lundberg risk model (Q2146337) (← links)
- An optimal control approach to day-to-day congestion pricing for stochastic transportation networks (Q2177822) (← links)
- Coding and control for communication networks (Q2269497) (← links)
- A policy improvement method for constrained average Markov decision processes (Q2457257) (← links)
- Reliability by design in distributed power transmission networks (Q2507913) (← links)
- Dynamic load balancing in parallel queueing systems: stability and optimal control (Q2569045) (← links)
- Dynamic safety-stocks for asymptotic optimality in stochastic networks (Q2572911) (← links)
- A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630) (← links)
- (Q3303466) (← links)
- Optimal Inventory Control with Jump Diffusion and Nonlinear Dynamics in the Demand (Q4601236) (← links)
- On Iteration Improvement for Averaged Expected Cost Control for One-Dimensional Ergodic Diffusions (Q5117358) (← links)
- Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies (Q5130921) (← links)
- On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs (Q5220188) (← links)
- A note on the existence of optimal stationary policies for average Markov decision processes with countable states (Q6163982) (← links)