Zhaoran Wang

From MaRDI portal
Person:482872

Available identifiers

zbMath Open wang.zhaoranMaRDI QIDQ482872

List of research outcomes





PublicationDate of PublicationType
Federated Offline Reinforcement Learning2024-12-10Paper
Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection2024-03-19Paper
Neural Temporal Difference and Q Learning Provably Converge to Global Optima2024-03-05Paper
Provably Efficient Reinforcement Learning with Linear Function Approximation2024-02-27Paper
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium2024-02-23Paper
Online Bootstrap Inference For Policy Evaluation In Reinforcement Learning2024-01-08Paper
A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic2023-03-30Paper
Provably training overparameterized neural network classifiers with non-convex constraints2022-12-19Paper
Spectrum Truncation Power Iteration for Agnostic Matrix Phase Retrieval2022-09-23Paper
A Primal-Dual Approach to Constrained Markov Decision Processes2021-01-26Paper
Provably Training Overparameterized Neural Network Classifiers with Non-convex Constraints2020-12-30Paper
https://portal.mardi4nfdi.de/entity/Q49691992020-10-05Paper
A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic2020-07-10Paper
Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion2020-07-03Paper
https://portal.mardi4nfdi.de/entity/Q52142462020-02-07Paper
Symmetry, Saddle Points, and Global Optimization Landscape of Nonconvex Matrix Factorization2019-07-19Paper
Sparse Generalized Eigenvalue Problem: Optimal Statistical Rates via Truncated Rayleigh Flow2019-03-06Paper
A convex formulation for high-dimensional sparse sliced inverse regression2018-12-18Paper
Sparse Generalized Eigenvalue Problem: Optimal Statistical Rates via Truncated Rayleigh Flow2016-04-29Paper
Optimal computational and statistical rates of convergence for sparse nonconvex learning problems2015-01-06Paper
A Strictly Contractive Peaceman--Rachford Splitting Method for Convex Programming2014-12-12Paper

Research outcomes over time

This page was built for person: Zhaoran Wang