| Publication | Date of Publication | Type |
|---|
A General Framework for Bandit Problems Beyond Cumulative Objectives Mathematics of Operations Research | 2024-03-01 | Paper |
Inverse reinforcement learning in contextual MDPs Machine Learning | 2022-01-28 | Paper |
Source Estimation in Time Series and the Surprising Resilience of HMMs IEEE Transactions on Information Theory | 2018-09-19 | Paper |
High-Throughput Energy-Efficient LDPC Decoders Using Differential Binary Message Passing IEEE Transactions on Signal Processing | 2018-08-22 | Paper |
Delayed Stochastic Decoding of LDPC Codes IEEE Transactions on Signal Processing | 2018-07-18 | Paper |
Relaxation Dynamics in Stochastic Iterative Decoders IEEE Transactions on Signal Processing | 2018-07-09 | Paper |
Majority-Based Tracking Forecast Memories for Stochastic LDPC Decoding IEEE Transactions on Signal Processing | 2018-07-09 | Paper |
Fully Parallel Stochastic LDPC Decoders IEEE Transactions on Signal Processing | 2018-06-27 | Paper |
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback SIAM Journal on Computing | 2017-12-08 | Paper |
Sequential Decision Making With Coherent Risk IEEE Transactions on Automatic Control | 2017-09-21 | Paper |
The Kernel Recursive Least-Squares Algorithm IEEE Transactions on Signal Processing | 2017-09-08 | Paper |
A Kalman Filter Design Based on the Performance/Robustness Tradeoff IEEE Transactions on Automatic Control | 2017-08-08 | Paper |
Network Formation: Bilateral Contracting and Myopic Dynamics IEEE Transactions on Automatic Control | 2017-08-08 | Paper |
Robust Regression and Lasso IEEE Transactions on Information Theory | 2017-07-27 | Paper |
Design of /spl lscr//sub 1/-optimal controllers with flexible disturbance rejection level IEEE Transactions on Automatic Control | 2017-07-27 | Paper |
Efficiency loss in a network resource allocation game: the case of elastic supply IEEE Transactions on Automatic Control | 2017-07-12 | Paper |
Outlier-Robust PCA: The High-Dimensional Case IEEE Transactions on Information Theory | 2017-06-08 | Paper |
Distinguishing Infections on Different Graph Topologies IEEE Transactions on Information Theory | 2017-04-28 | Paper |
Regularized policy iteration with nonparametric function spaces Journal of Machine Learning Research (JMLR) | 2016-11-22 | Paper |
Reinforcement learning in robust Markov decision processes Mathematics of Operations Research | 2016-11-16 | Paper |
Robust MDPs with \(k\)-rectangular uncertainty Mathematics of Operations Research | 2016-11-16 | Paper |
Statistical optimization in high dimensions Operations Research | 2016-10-31 | Paper |
Learning the variance of the reward-to-go Journal of Machine Learning Research (JMLR) | 2016-06-06 | Paper |
A state action frequency approach to throughput maximization over uncertain wireless channels Internet Mathematics | 2016-05-30 | Paper |
Bayesian reinforcement learning: a survey Foundations and Trends in Machine Learning | 2016-05-30 | Paper |
Oracle-based robust optimization via online learning Operations Research | 2015-11-06 | Paper |
Approximate Value Iteration with Temporally Extended Actions Journal of Artificial Intelligence Research | 2015-08-25 | Paper |
Algorithmic aspects of mean-variance optimization in Markov decision processes European Journal of Operational Research | 2015-07-29 | Paper |
Opportunistic approachability and generalized no-regret problems Mathematics of Operations Research | 2015-04-24 | Paper |
A primal condition for approachability with partial monitoring Journal of Dynamics and Games | 2015-01-05 | Paper |
| Set-valued approachability and online learning with partial monitoring | 2014-12-08 | Paper |
Dynamics in tree formation games Games and Economic Behavior | 2014-02-18 | Paper |
| The sample complexity of dictionary learning | 2014-02-03 | Paper |
The sample complexity of dictionary learning (available as arXiv preprint) | 2014-02-03 | Paper |
Approximately optimal bidding policies for repeated first-price auctions Annals of Operations Research | 2012-11-15 | Paper |
Optimization under probabilistic envelope constraints Operations Research | 2012-11-08 | Paper |
Distributionally robust Markov decision processes Mathematics of Operations Research | 2012-05-24 | Paper |
A distributional interpretation of robust optimization Mathematics of Operations Research | 2012-05-24 | Paper |
Robustness and generalization Machine Learning | 2012-05-23 | Paper |
Robustness and regularization of support vector machines Journal of Machine Learning Research (JMLR) | 2012-04-17 | Paper |
Online learning with sample path constraints Journal of Machine Learning Research (JMLR) | 2012-04-17 | Paper |
| Go Viral, or Not: Rate-Optimal Control for Resource-Constrained Branching Processes | 2012-03-05 | Paper |
Bias and variance approximation in value function estimates Management Science | 2012-02-21 | Paper |
Percentile Optimization for Markov Decision Processes with Parameter Uncertainty Operations Research | 2011-11-24 | Paper |
| Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems | 2011-10-12 | Paper |
| The sample complexity of exploration in the multi-armed bandit problem | 2011-10-12 | Paper |
| scientific article; zbMATH DE number 5957207 (Why is no real title available?) | 2011-10-12 | Paper |
Strategies for Prediction Under Imperfect Monitoring Mathematics of Operations Research | 2011-04-27 | Paper |
Markov decision processes with arbitrary reward processes Mathematics of Operations Research | 2011-04-27 | Paper |
A geometric proof of calibration Mathematics of Operations Research | 2011-04-27 | Paper |
Learning Theory and Kernel Machines Lecture Notes in Computer Science | 2010-03-23 | Paper |
Lower bounds on the sample complexity of exploration in the multi-armed bandit problem. Lecture Notes in Computer Science | 2010-03-23 | Paper |
Multi-agent learning for engineers Artificial Intelligence | 2009-07-09 | Paper |
Approachability in repeated games: Computational aspects and a Stackelberg variant Games and Economic Behavior | 2009-06-08 | Paper |
An Inequality for Nearly Log-Concave Distributions With Applications to Learning IEEE Transactions on Information Theory | 2008-12-21 | Paper |
Regret minimization in repeated matrix games with variable stage duration Games and Economic Behavior | 2008-05-21 | Paper |
Strategies for Prediction Under Imperfect Monitoring Learning Theory | 2008-01-03 | Paper |
Online calibrated forecasts: memory efficiency versus universality for learning in games Machine Learning | 2007-09-20 | Paper |
Online Learning with Constraints Learning Theory | 2007-09-14 | Paper |
Online Learning with Variable Stage Duration Learning Theory | 2007-09-14 | Paper |
A contract-based model for directed network formation Games and Economic Behavior | 2006-10-05 | Paper |
On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies Mathematics of Operations Research | 2005-11-11 | Paper |
The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes Mathematics of Operations Research | 2005-11-11 | Paper |
A tutorial on the cross-entropy method Annals of Operations Research | 2005-08-05 | Paper |
Basis function adaptation in temporal difference reinforcement learning Annals of Operations Research | 2005-08-05 | Paper |
Learning Theory Lecture Notes in Computer Science | 2005-06-13 | Paper |
Learning Theory Lecture Notes in Computer Science | 2005-06-13 | Paper |
10.1162/153244304773936108 CrossRef Listing of Deleted DOIs | 2004-11-23 | Paper |
| scientific article; zbMATH DE number 2089367 (Why is no real title available?) | 2004-08-12 | Paper |
| scientific article; zbMATH DE number 2089371 (Why is no real title available?) | 2004-08-12 | Paper |
| scientific article; zbMATH DE number 1931843 (Why is no real title available?) | 2003-06-20 | Paper |
| scientific article; zbMATH DE number 1931826 (Why is no real title available?) | 2003-06-20 | Paper |
| scientific article; zbMATH DE number 1804118 (Why is no real title available?) | 2002-09-22 | Paper |
| scientific article; zbMATH DE number 1804100 (Why is no real title available?) | 2002-09-22 | Paper |
On the existence of linear weak learners and applications to boosting Machine Learning | 2002-04-11 | Paper |
| scientific article; zbMATH DE number 1944045 (Why is no real title available?) | 2002-01-01 | Paper |