A policy gradient method for semi-Markov decision processes with application to call admission control
From MaRDI portal
Publication:859693
DOI10.1016/J.EJOR.2006.02.023zbMATH Open1163.90790OpenAlexW2169293926MaRDI QIDQ859693FDOQ859693
Authors: Sumeetpal S. Singh, Vladislav B. Tadić, Arnaud Doucet
Publication date: 16 January 2007
Published in: European Journal of Operational Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.ejor.2006.02.023
Recommendations
- Solving semi-Markov decision problems using average reward reinforcement learning
- A basic formula for performance gradient estimation of semi-Markov decision processes
- Performance optimization of semi-Markov decision processes with discounted-cost criteria
- A policy approximation method for the UMTS connection admission control problem modelled as an MDP
- Performance optimization algorithms based on potentials for semi-Markov control processes
Cites Work
- Title not available (Why is that?)
- Markov chains and stochastic stability
- Title not available (Why is that?)
- Reinforcement learning for long-run average cost.
- Title not available (Why is that?)
- Gradient Convergence in Gradient methods with Errors
- Semi-markov decision problems and performance sensitivity analysis
- On the convergence of temporal-difference learning with linear function approximation
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Title not available (Why is that?)
- Multiservice loss models for broadband telecommunication networks
- Integrated voice/data call admission control for wireless DS-CDMA systems
Cited In (12)
- Flow shop scheduling with reinforcement learning
- A reinforcement-learning approach for admission control in distributed network service systems
- Look-ahead control of conveyor-serviced production station by using potential-based online policy iteration
- Finite horizon semi-Markov decision processes with application to maintenance systems
- Semiconductor final test scheduling with Sarsa\((\lambda , k)\) algorithm
- Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning
- Approximate dynamic programming for capacity allocation in the service industry
- Maintenance optimization in a digital twin for industry 4.0
- A minimization problem of the risk probability in first passage semi-Markov decision processes with loss rates
- Solving semi-Markov decision problems using average reward reinforcement learning
- A policy approximation method for the UMTS connection admission control problem modelled as an MDP
- Performance analysis for controlled semi-Markov systems with application to maintenance
This page was built for publication: A policy gradient method for semi-Markov decision processes with application to call admission control
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q859693)