Alec Koppel

MaRDI QIDQ2058901zbMATH OpenFDO

List of research outcomes

This list is not complete and representing at the moment only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon!

Publication	Date of Publication	Type
Decentralized upper confidence bound algorithms for homogeneous multi-agent multi-armed bandits IEEE Transactions on Automatic Control	2025-07-14	Paper
Occupancy information ratio: infinite-horizon, information-directed, parameterized policy search SIAM Journal on Control and Optimization	2024-12-12	Paper
Balancing rates and variance via adaptive batch-size for stochastic optimization problems IEEE Transactions on Signal Processing	2024-09-12	Paper
Escaping saddle points for successive convex approximation IEEE Transactions on Signal Processing	2024-09-12	Paper
Sparse representations of positive functions via first- and second-order pseudo-mirror descent IEEE Transactions on Signal Processing	2024-09-12	Paper
Nearly consistent finite particle estimates in streaming importance sampling IEEE Transactions on Signal Processing	2024-09-09	Paper
Online MCMC Thinning with Kernelized Stein Discrepancy SIAM Journal on Mathematics of Data Science	2024-02-12	Paper
Achieving zero constraint violation for concave utility constrained reinforcement learning via primal-dual approach The Journal of Artificial Intelligence Research (JAIR)	2023-12-20	Paper
On the sample complexity of actor-critic method for reinforcement learning with function approximation Machine Learning	2023-08-22	Paper
High-Dimensional Nonconvex Stochastic Optimization by Doubly Stochastic Successive Convex Approximation IEEE Transactions on Signal Processing	2022-09-23	Paper
Nonparametric Compositional Stochastic Optimization for Risk-Sensitive Kernel Learning IEEE Transactions on Signal Processing	2022-09-23	Paper
Dynamic Online Learning via Frank-Wolfe Algorithm IEEE Transactions on Signal Processing	2022-09-23	Paper
Sharpened Quasi-Newton Methods: Faster Superlinear Rate and Larger Local Convergence Neighborhood	2022-02-21	Paper
Consistent online Gaussian process regression without the sample complexity bottleneck Statistics and Computing	2021-12-10	Paper
Global convergence of policy gradient methods to (almost) locally optimal policies SIAM Journal on Control and Optimization	2020-12-10	Paper
A class of parallel doubly stochastic algorithms for large-scale learning (available as arXiv preprint)	2020-10-05	Paper
A class of parallel doubly stochastic algorithms for large-scale learning	2020-10-05	Paper
Projected Stochastic Primal-Dual Method for Constrained Online Learning With Kernels IEEE Transactions on Signal Processing	2019-10-28	Paper
Parsimonious online learning with kernels via sparse projections in function space	2019-05-02	Paper
Parsimonious online learning with kernels via sparse projections in function space (available as arXiv preprint)	2019-05-02	Paper
Asynchronous Saddle Point Algorithm for Stochastic Optimization in Heterogeneous Networks IEEE Transactions on Signal Processing	2019-03-29	Paper
Decentralized Online Learning With Kernels IEEE Transactions on Signal Processing	2019-02-12	Paper
A Class of Prediction-Correction Methods for Time-Varying Convex Optimization IEEE Transactions on Signal Processing	2019-02-08	Paper
Proximity Without Consensus in Online Multiagent Optimization IEEE Transactions on Signal Processing	2019-02-08	Paper
A Saddle Point Algorithm for Networked Online Convex Optimization IEEE Transactions on Signal Processing	2018-08-22	Paper
Decentralized Prediction-Correction Methods for Networked Time-Varying Convex Optimization IEEE Transactions on Automatic Control	2018-06-27	Paper
Policy Evaluation in Continuous MDPs with Efficient Kernelized Gradient Temporal Difference (available as arXiv preprint)	2017-09-13	Paper
Asynchronous Decentralized Stochastic Optimization in Heterogeneous Networks	2017-07-18	Paper
A Class of Parallel Doubly Stochastic Algorithms for Large-Scale Learning (available as arXiv preprint)	2016-06-15	Paper

Research outcomes over time

This page was built for person: Alec Koppel