Pages that link to "Item:Q4395828"

From MaRDI portal

← The policy iteration algorithm for average reward Markov decision processes with general state space (Q4395828)

Jump to:navigation, search

The following pages link to The policy iteration algorithm for average reward Markov decision processes with general state space (Q4395828):

Displaying 27 items.

Potential-based least-squares policy iteration for a parameterized feedback control system (Q289143) (← links)
Weak convergence and fluid limits in optimal time-to-empty queueing control problems (Q434250) (← links)
Average control of Markov decision processes with Feller transition probabilities and general action spaces (Q450971) (← links)
Weakly coupled event triggered output feedback system in wireless networked control systems (Q461475) (← links)
The policy iteration algorithm for average continuous control of piecewise deterministic Markov processes (Q607558) (← links)
Stochastic control via direct comparison (Q633815) (← links)
Completion-of-squares: revisited and extended (Q643690) (← links)
Policy iteration for continuous-time average reward Markov decision processes in Polish spaces (Q963139) (← links)
Single sample path-based optimization of Markov chains (Q1289394) (← links)
Approximate receding horizon approach for Markov decision processes: average reward case (Q1414220) (← links)
Planning for the long run: programming with patient, Pareto responsive preferences (Q1757582) (← links)
On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies (Q2069795) (← links)
Dispatching to parallel servers. Solutions of Poisson's equation for first-policy improvement (Q2070675) (← links)
The policy iteration algorithm for a compound Poisson process applied to optimal dividend strategies under a Cramér-Lundberg risk model (Q2146337) (← links)
An optimal control approach to day-to-day congestion pricing for stochastic transportation networks (Q2177822) (← links)
Coding and control for communication networks (Q2269497) (← links)
A policy improvement method for constrained average Markov decision processes (Q2457257) (← links)
Reliability by design in distributed power transmission networks (Q2507913) (← links)
Dynamic load balancing in parallel queueing systems: stability and optimal control (Q2569045) (← links)
Dynamic safety-stocks for asymptotic optimality in stochastic networks (Q2572911) (← links)
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630) (← links)
(Q3303466) (← links)
Optimal Inventory Control with Jump Diffusion and Nonlinear Dynamics in the Demand (Q4601236) (← links)
On Iteration Improvement for Averaged Expected Cost Control for One-Dimensional Ergodic Diffusions (Q5117358) (← links)
Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies (Q5130921) (← links)
On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs (Q5220188) (← links)
A note on the existence of optimal stationary policies for average Markov decision processes with countable states (Q6163982) (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere"