scientific article; zbMATH DE number 4003938
Publication:3755250
zbMATH Open: 0618.90093; MaRDI QID: Q3755250; FDO: Q3755250
Authors: Onésimo Hernández-Lerma
Publication date: 1986
Title of this publication is not available
Recommendations
- scientific article; zbMATH DE number 4045510
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
- Adaptive control of discounted Markov decision chains
- Estimation and control in discounted stochastic dynamic programming
- scientific article; zbMATH DE number 4112513
approximation, adaptive policies, asymptotically optimal policy, infinite-horizon discounted dynamic programming, Polish state and action spaces
Cited In (22)
- Shrinking-horizon dynamic programming
- Dynamic policy programming
- Nearly optimal policies in risk-sensitive positive dynamic programming on discrete spaces.
- Complexity bounds for approximately solving discounted MDPs by value iterations
- Discrete Event Dynamic Programming with Simultaneous Events
- Suboptimal solutions to dynamic optimization problems via approximations of the policy functions
- Approximate policy optimization and adaptive control in regression models
- Adaptive policies for stochastic systems under a randomized discounted cost criterion
- Title not available
- The Compactness of a Policy Space in Dynamic Programming Via an Extension Theorem for Carathéodory Functions
- Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming
- A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces
- Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming
- Adaptive aggregation methods for infinite horizon dynamic programming
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
- The Parameter Iteration Method in Dynamic Programming
- Continuous state dynamic programming via nonexpansive approximation
- Optimal adaptive policies for sequential allocation problems
- Adaptive control of Markov processes with incomplete state information and unknown parameters
- Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system
- Reduced complexity dynamic programming based on policy iteration
- Estimation and control in discounted stochastic dynamic programming