Parallel Nonstationary Direct Policy Search for Risk-Averse Stochastic Optimization
DOI: 10.1287/IJOC.2016.0733; zbMATH Open: 1371.90137; OpenAlex: W2605711652; MaRDI QID: Q5364280; FDO: Q5364280
Authors: Somayeh Moazeni, Warren Powell, Boris Defourny, Belgacem Bouzaiene-Ayari
Publication date: 4 October 2017
Published in: INFORMS Journal on Computing
Full work available at URL: https://doi.org/10.1287/ijoc.2016.0733
Recommendations
- Parallel sequential Monte Carlo for stochastic gradient-free nonconvex optimization
- Risk-averse policy optimization via risk-neutral policy optimization
- Nonconvex policy search using variational inequalities
- Parallel Algorithms for Stochastic Dynamic Programming with Continuous State and Control Variables
- Risk-Sensitive Reinforcement Learning via Policy Gradient Search
- Global Convergence of Policy Gradient Primal–Dual Methods for Risk-Constrained LQRs
- On parallelization of a stochastic dynamic programming algorithm for solving large-scale mixed \(0-1\) problems under uncertainty
- Approximate gradient methods in policy-space optimization of Markov reward processes
- Policy iteration accelerated with Krylov methods
Keywords: learning; dynamic optimization; derivative-free optimization; energy storage; parallel optimization; direct policy search; risk-averse stochastic optimization
Dynamic programming (90C39); Stochastic programming (90C15); Markov and semi-Markov decision processes (90C40)
Cited In (4)