| Publication | Date of Publication | Type |
|---|---|---|
| A General Framework for Bandit Problems Beyond Cumulative Objectives | 2024-03-01 | Paper |
| Inverse reinforcement learning in contextual MDPs | 2022-01-28 | Paper |
| Source Estimation in Time Series and the Surprising Resilience of HMMs | 2018-09-19 | Paper |
| High-Throughput Energy-Efficient LDPC Decoders Using Differential Binary Message Passing | 2018-08-22 | Paper |
| Delayed Stochastic Decoding of LDPC Codes | 2018-07-18 | Paper |
| Relaxation Dynamics in Stochastic Iterative Decoders | 2018-07-09 | Paper |
| Majority-Based Tracking Forecast Memories for Stochastic LDPC Decoding | 2018-07-09 | Paper |
| Fully Parallel Stochastic LDPC Decoders | 2018-06-27 | Paper |
| Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback | 2017-12-08 | Paper |
| Sequential Decision Making With Coherent Risk | 2017-09-21 | Paper |
| The Kernel Recursive Least-Squares Algorithm | 2017-09-08 | Paper |
| A Kalman Filter Design Based on the Performance/Robustness Tradeoff | 2017-08-08 | Paper |
| Network Formation: Bilateral Contracting and Myopic Dynamics | 2017-08-08 | Paper |
| Robust Regression and Lasso | 2017-07-27 | Paper |
| Design of \(\ell_1\)-optimal controllers with flexible disturbance rejection level | 2017-07-27 | Paper |
| Efficiency loss in a network resource allocation game: the case of elastic supply | 2017-07-12 | Paper |
| Outlier-Robust PCA: The High-Dimensional Case | 2017-06-08 | Paper |
| Distinguishing Infections on Different Graph Topologies | 2017-04-28 | Paper |
| Regularized policy iteration with nonparametric function spaces | 2016-11-22 | Paper |
| Reinforcement learning in robust Markov decision processes | 2016-11-16 | Paper |
| Robust MDPs with \(k\)-rectangular uncertainty | 2016-11-16 | Paper |
| Statistical optimization in high dimensions | 2016-10-31 | Paper |
| Learning the variance of the reward-to-go | 2016-06-06 | Paper |
| A state action frequency approach to throughput maximization over uncertain wireless channels | 2016-05-30 | Paper |
| Bayesian reinforcement learning: a survey | 2016-05-30 | Paper |
| Oracle-Based Robust Optimization via Online Learning | 2015-11-06 | Paper |
| Approximate Value Iteration with Temporally Extended Actions | 2015-08-25 | Paper |
| Algorithmic aspects of mean-variance optimization in Markov decision processes | 2015-07-29 | Paper |
| Opportunistic Approachability and Generalized No-Regret Problems | 2015-04-24 | Paper |
| A primal condition for approachability with partial monitoring | 2015-01-05 | Paper |
| https://portal.mardi4nfdi.de/entity/Q2934107 | 2014-12-08 | Paper |
| Dynamics in tree formation games | 2014-02-18 | Paper |
| https://portal.mardi4nfdi.de/entity/Q5396727 | 2014-02-03 | Paper |
| Approximately optimal bidding policies for repeated first-price auctions | 2012-11-15 | Paper |
| Optimization Under Probabilistic Envelope Constraints | 2012-11-08 | Paper |
| Distributionally robust Markov decision processes | 2012-05-24 | Paper |
| A distributional interpretation of robust optimization | 2012-05-24 | Paper |
| Robustness and generalization | 2012-05-23 | Paper |
| Robustness and regularization of support vector machines | 2012-04-17 | Paper |
| Online learning with sample path constraints | 2012-04-17 | Paper |
| Go Viral, or Not: Rate-Optimal Control for Resource-Constrained Branching Processes | 2012-03-05 | Paper |
| Bias and Variance Approximation in Value Function Estimates | 2012-02-21 | Paper |
| Percentile Optimization for Markov Decision Processes with Parameter Uncertainty | 2011-11-24 | Paper |
| https://portal.mardi4nfdi.de/entity/Q3093383 | 2011-10-12 | Paper |
| https://portal.mardi4nfdi.de/entity/Q3093197 | 2011-10-12 | Paper |
| https://portal.mardi4nfdi.de/entity/Q3093188 | 2011-10-12 | Paper |
| Strategies for Prediction Under Imperfect Monitoring | 2011-04-27 | Paper |
| Markov Decision Processes with Arbitrary Reward Processes | 2011-04-27 | Paper |
| A Geometric Proof of Calibration | 2011-04-27 | Paper |
| Learning Theory and Kernel Machines | 2010-03-23 | Paper |
| Learning Theory and Kernel Machines | 2010-03-23 | Paper |
| Multi-agent learning for engineers | 2009-07-09 | Paper |
| Approachability in repeated games: Computational aspects and a Stackelberg variant | 2009-06-08 | Paper |
| An Inequality for Nearly Log-Concave Distributions With Applications to Learning | 2008-12-21 | Paper |
| Regret minimization in repeated matrix games with variable stage duration | 2008-05-21 | Paper |
| Strategies for Prediction Under Imperfect Monitoring | 2008-01-03 | Paper |
| Online calibrated forecasts: memory efficiency versus universality for learning in games | 2007-09-20 | Paper |
| Online Learning with Constraints | 2007-09-14 | Paper |
| Online Learning with Variable Stage Duration | 2007-09-14 | Paper |
| A contract-based model for directed network formation | 2006-10-05 | Paper |
| On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies | 2005-11-11 | Paper |
| The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes | 2005-11-11 | Paper |
| A tutorial on the cross-entropy method | 2005-08-05 | Paper |
| Basis function adaptation in temporal difference reinforcement learning | 2005-08-05 | Paper |
| Learning Theory | 2005-06-13 | Paper |
| Learning Theory | 2005-06-13 | Paper |
| 10.1162/153244304773936108 | 2004-11-23 | Paper |
| https://portal.mardi4nfdi.de/entity/Q3046711 | 2004-08-12 | Paper |
| https://portal.mardi4nfdi.de/entity/Q3046715 | 2004-08-12 | Paper |
| https://portal.mardi4nfdi.de/entity/Q4709211 | 2003-06-20 | Paper |
| https://portal.mardi4nfdi.de/entity/Q4709187 | 2003-06-20 | Paper |
| https://portal.mardi4nfdi.de/entity/Q3148820 | 2002-09-22 | Paper |
| https://portal.mardi4nfdi.de/entity/Q3148802 | 2002-09-22 | Paper |
| On the existence of linear weak learners and applications to boosting | 2002-04-11 | Paper |
| https://portal.mardi4nfdi.de/entity/Q4410094 | 2002-01-01 | Paper |