Alec Koppel

From MaRDI portal
Person:2058901

Available identifiers

zbMath Open koppel.alecMaRDI QIDQ2058901

List of research outcomes





PublicationDate of PublicationType
Occupancy information ratio: infinite-horizon, information-directed, parameterized policy search2024-12-12Paper
Balancing rates and variance via adaptive batch-size for stochastic optimization problems2024-09-12Paper
Escaping saddle points for successive convex approximation2024-09-12Paper
Sparse representations of positive functions via first- and second-order pseudo-mirror descent2024-09-12Paper
Nearly consistent finite particle estimates in streaming importance sampling2024-09-09Paper
Online MCMC Thinning with Kernelized Stein Discrepancy2024-02-12Paper
Achieving zero constraint violation for concave utility constrained reinforcement learning via primal-dual approach2023-12-20Paper
On the sample complexity of actor-critic method for reinforcement learning with function approximation2023-08-22Paper
High-Dimensional Nonconvex Stochastic Optimization by Doubly Stochastic Successive Convex Approximation2022-09-23Paper
Nonparametric Compositional Stochastic Optimization for Risk-Sensitive Kernel Learning2022-09-23Paper
Dynamic Online Learning via Frank-Wolfe Algorithm2022-09-23Paper
Sharpened Quasi-Newton Methods: Faster Superlinear Rate and Larger Local Convergence Neighborhood2022-02-21Paper
Consistent online Gaussian process regression without the sample complexity bottleneck2021-12-10Paper
Global convergence of policy gradient methods to (almost) locally optimal policies2020-12-10Paper
A class of parallel doubly stochastic algorithms for large-scale learning2020-10-05Paper
Projected Stochastic Primal-Dual Method for Constrained Online Learning With Kernels2019-10-28Paper
Parsimonious online learning with kernels via sparse projections in function space2019-05-02Paper
Asynchronous Saddle Point Algorithm for Stochastic Optimization in Heterogeneous Networks2019-03-29Paper
Decentralized Online Learning With Kernels2019-02-12Paper
A Class of Prediction-Correction Methods for Time-Varying Convex Optimization2019-02-08Paper
Proximity Without Consensus in Online Multiagent Optimization2019-02-08Paper
A Saddle Point Algorithm for Networked Online Convex Optimization2018-08-22Paper
Decentralized Prediction-Correction Methods for Networked Time-Varying Convex Optimization2018-06-27Paper
Policy Evaluation in Continuous MDPs with Efficient Kernelized Gradient Temporal Difference2017-09-13Paper
Asynchronous Decentralized Stochastic Optimization in Heterogeneous Networks2017-07-18Paper
A Class of Parallel Doubly Stochastic Algorithms for Large-Scale Learning2016-06-15Paper

Research outcomes over time

This page was built for person: Alec Koppel