A policy gradient method for semi-Markov decision processes with application to call admission control
From MaRDI portal
(Redirected from Publication:859693)
Recommendations
- Solving semi-Markov decision problems using average reward reinforcement learning
- A basic formula for performance gradient estimation of semi-Markov decision processes
- Performance optimization of semi-Markov decision processes with discounted-cost criteria
- A policy approximation method for the UMTS connection admission control problem modelled as an MDP
- Performance optimization algorithms based on potentials for semi-Markov control processes
Cites work
- scientific article; zbMATH DE number 1206370 (Why is no real title available?)
- scientific article; zbMATH DE number 1321699 (Why is no real title available?)
- scientific article; zbMATH DE number 1753152 (Why is no real title available?)
- scientific article; zbMATH DE number 805121 (Why is no real title available?)
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Gradient Convergence in Gradient methods with Errors
- Integrated voice/data call admission control for wireless DS-CDMA systems
- Markov chains and stochastic stability
- Multiservice loss models for broadband telecommunication networks
- On the convergence of temporal-difference learning with linear function approximation
- Reinforcement learning for long-run average cost.
- Semi-markov decision problems and performance sensitivity analysis
Cited in
(12)- Flow shop scheduling with reinforcement learning
- A reinforcement-learning approach for admission control in distributed network service systems
- Finite horizon semi-Markov decision processes with application to maintenance systems
- Semiconductor final test scheduling with Sarsa\((\lambda , k)\) algorithm
- Look-ahead control of conveyor-serviced production station by using potential-based online policy iteration
- Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning
- Approximate dynamic programming for capacity allocation in the service industry
- Maintenance optimization in a digital twin for industry 4.0
- A minimization problem of the risk probability in first passage semi-Markov decision processes with loss rates
- Solving semi-Markov decision problems using average reward reinforcement learning
- A policy approximation method for the UMTS connection admission control problem modelled as an MDP
- Performance analysis for controlled semi-Markov systems with application to maintenance
This page was built for publication: A policy gradient method for semi-Markov decision processes with application to call admission control
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q859693)