Huizhen Yu

From MaRDI portal



List of research outcomes

This list is not complete and representing at the moment only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon!

PublicationDate of PublicationType
On strategic measures and optimality properties in discrete-time stochastic control with universally measurable policies
Mathematics of Operations Research
2024-11-07Paper
Soliton molecules, multi-breathers and hybrid solutions in (2+1)-dimensional Korteweg-de Vries-Sawada-Kotera-Ramani equation
Chaos, Solitons and Fractals
2023-01-12Paper
On linear programming for constrained and unconstrained average-cost Markov decision processes with countable action spaces and strictly unbounded costs
Mathematics of Operations Research
2022-06-27Paper
On Strategic Measures and Optimality Properties in Discrete-Time Stochastic Control with Universally Measurable Policies2022-06-13Paper
On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies
Journal of Mathematical Analysis and Applications
2022-01-21Paper
Average-Cost Optimality Results for Borel-Space Markov Decision Processes with Universally Measurable Policies2021-03-31Paper
Average cost optimality inequality for Markov decision processes with Borel spaces and universally measurable policies
SIAM Journal on Control and Optimization
2020-10-30Paper
On generalized Bellman equations and temporal-difference learning
Lecture Notes in Computer Science
2020-08-05Paper
On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs
SIAM Journal on Control and Optimization
2020-03-11Paper
On Markov Decision Processes with Borel Spaces and an Average Cost Criterion2019-01-10Paper
On generalized Bellman equations and temporal-difference learning
Journal of Machine Learning Research (JMLR)
2018-11-21Paper
Convergence Results for Some Temporal Difference Methods Based on Least Squares
IEEE Transactions on Automatic Control
2017-08-08Paper
Weak convergence properties of constrained emphatic temporal-difference learning with constant and slowly diminishing stepsize2017-01-05Paper
Weak convergence properties of constrained emphatic temporal-difference learning with constant and slowly diminishing stepsize
(available as arXiv preprint)
2017-01-05Paper
A mixed value and policy iteration method for stochastic control with universally measurable policies
Mathematics of Operations Research
2016-01-29Paper
On convergence of value iteration for a class of total cost Markov decision processes
SIAM Journal on Control and Optimization
2015-08-18Paper
Stochastic Shortest Path Games and Q-Learning2014-12-30Paper
On boundedness of Q-learning iterates for stochastic shortest path problems
Mathematics of Operations Research
2014-07-11Paper
scientific article; zbMATH DE number 6277636 (Why is no real title available?)2014-04-02Paper
Q-learning and policy iteration algorithms for stochastic shortest path problems
Annals of Operations Research
2013-11-12Paper
Least squares temporal difference methods: An analysis under general conditions
SIAM Journal on Control and Optimization
2013-03-19Paper
Q-learning and enhanced policy iteration in discounted dynamic programming
Mathematics of Operations Research
2012-05-24Paper
A unifying polyhedral approximation framework for convex optimization
SIAM Journal on Optimization
2011-06-06Paper
Error bounds for approximations from projected linear equations
Mathematics of Operations Research
2011-04-27Paper
Projected equation methods for approximate solution of large linear systems
Journal of Computational and Applied Mathematics
2009-04-21Paper
On Near Optimality of the Set of Finite-State Controllers for Average Cost POMDP
Mathematics of Operations Research
2008-05-27Paper
scientific article; zbMATH DE number 2036376 (Why is no real title available?)2004-02-02Paper
scientific article; zbMATH DE number 4215407 (Why is no real title available?)1991-01-01Paper


Research outcomes over time


This page was built for person: Huizhen Yu