Huizhen Yu

From MaRDI portal
Person:378729

Available identifiers

zbMath Open yu.huizhenMaRDI QIDQ378729

List of research outcomes





PublicationDate of PublicationType
On strategic measures and optimality properties in discrete-time stochastic control with universally measurable policies2024-11-07Paper
Soliton molecules, multi-breathers and hybrid solutions in (2+1)-dimensional Korteweg-de Vries-Sawada-Kotera-Ramani equation2023-01-12Paper
On linear programming for constrained and unconstrained average-cost Markov decision processes with countable action spaces and strictly unbounded costs2022-06-27Paper
On Strategic Measures and Optimality Properties in Discrete-Time Stochastic Control with Universally Measurable Policies2022-06-13Paper
On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies2022-01-21Paper
Average-Cost Optimality Results for Borel-Space Markov Decision Processes with Universally Measurable Policies2021-03-31Paper
Average cost optimality inequality for Markov decision processes with Borel spaces and universally measurable policies2020-10-30Paper
On Generalized Bellman Equations and Temporal-Difference Learning2020-08-05Paper
On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs2020-03-11Paper
On Markov Decision Processes with Borel Spaces and an Average Cost Criterion2019-01-10Paper
On Generalized Bellman Equations and Temporal-Difference Learning2018-11-21Paper
Convergence Results for Some Temporal Difference Methods Based on Least Squares2017-08-08Paper
Weak convergence properties of constrained emphatic temporal-difference learning with constant and slowly diminishing stepsize2017-01-05Paper
A mixed value and policy iteration method for stochastic control with universally measurable policies2016-01-29Paper
On convergence of value iteration for a class of total cost Markov decision processes2015-08-18Paper
Stochastic Shortest Path Games and Q-Learning2014-12-30Paper
On boundedness of Q-learning iterates for stochastic shortest path problems2014-07-11Paper
https://portal.mardi4nfdi.de/entity/Q54053812014-04-02Paper
Q-learning and policy iteration algorithms for stochastic shortest path problems2013-11-12Paper
Least squares temporal difference methods: An analysis under general conditions2013-03-19Paper
Q-learning and enhanced policy iteration in discounted dynamic programming2012-05-24Paper
A unifying polyhedral approximation framework for convex optimization2011-06-06Paper
Error bounds for approximations from projected linear equations2011-04-27Paper
Projected equation methods for approximate solution of large linear systems2009-04-21Paper
On Near Optimality of the Set of Finite-State Controllers for Average Cost POMDP2008-05-27Paper
https://portal.mardi4nfdi.de/entity/Q44458512004-02-02Paper
https://portal.mardi4nfdi.de/entity/Q33619261991-01-01Paper

Research outcomes over time

This page was built for person: Huizhen Yu