The following pages link to Huizhen Yu (Q378729):
Displayed 25 items.
- Q-learning and policy iteration algorithms for stochastic shortest path problems (Q378731) (← links)
- Projected equation methods for approximate solution of large linear systems (Q1012492) (← links)
- On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies (Q2069795) (← links)
- Soliton molecules, multi-breathers and hybrid solutions in (2+1)-dimensional Korteweg-de Vries-Sawada-Kotera-Ramani equation (Q2113223) (← links)
- Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming (Q2884305) (← links)
- (Q2953645) (← links)
- A Unifying Polyhedral Approximation Framework for Convex Optimization (Q3004988) (← links)
- Error Bounds for Approximations from Projected Linear Equations (Q3169095) (← links)
- On Generalized Bellman Equations and Temporal-Difference Learning (Q3305109) (← links)
- (Q3361926) (← links)
- A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies (Q3465941) (← links)
- (Q4445851) (← links)
- Least Squares Temporal Difference Methods: An Analysis under General Conditions (Q4910565) (← links)
- Convergence Results for Some Temporal Difference Methods Based on Least Squares (Q4974645) (← links)
- On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs (Q5085149) (← links)
- Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies (Q5130921) (← links)
- On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems (Q5169662) (← links)
- On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs (Q5220188) (← links)
- On Near Optimality of the Set of Finite-State Controllers for Average Cost POMDP (Q5388068) (← links)
- (Q5405381) (← links)
- On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes (Q5502179) (← links)
- Stochastic Shortest Path Games and Q-Learning (Q6257695) (← links)
- On Markov Decision Processes with Borel Spaces and an Average Cost Criterion (Q6312308) (← links)
- Average-Cost Optimality Results for Borel-Space Markov Decision Processes with Universally Measurable Policies (Q6364294) (← links)
- On Strategic Measures and Optimality Properties in Discrete-Time Stochastic Control with Universally Measurable Policies (Q6401973) (← links)