Shie Mannor

From MaRDI portal
Person:239358

Available identifiers

zbMath Open mannor.shieDBLP20/1669WikidataQ89599584 ScholiaQ89599584MaRDI QIDQ239358

List of research outcomes





PublicationDate of PublicationType
A General Framework for Bandit Problems Beyond Cumulative Objectives2024-03-01Paper
Inverse reinforcement learning in contextual MDPs2022-01-28Paper
Source Estimation in Time Series and the Surprising Resilience of HMMs2018-09-19Paper
High-Throughput Energy-Efficient LDPC Decoders Using Differential Binary Message Passing2018-08-22Paper
Delayed Stochastic Decoding of LDPC Codes2018-07-18Paper
Relaxation Dynamics in Stochastic Iterative Decoders2018-07-09Paper
Majority-Based Tracking Forecast Memories for Stochastic LDPC Decoding2018-07-09Paper
Fully Parallel Stochastic LDPC Decoders2018-06-27Paper
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback2017-12-08Paper
Sequential Decision Making With Coherent Risk2017-09-21Paper
The Kernel Recursive Least-Squares Algorithm2017-09-08Paper
A Kalman Filter Design Based on the Performance/Robustness Tradeoff2017-08-08Paper
Network Formation: Bilateral Contracting and Myopic Dynamics2017-08-08Paper
Robust Regression and Lasso2017-07-27Paper
Design of /spl lscr//sub 1/-optimal controllers with flexible disturbance rejection level2017-07-27Paper
Efficiency loss in a network resource allocation game: the case of elastic supply2017-07-12Paper
Outlier-Robust PCA: The High-Dimensional Case2017-06-08Paper
Distinguishing Infections on Different Graph Topologies2017-04-28Paper
Regularized policy iteration with nonparametric function spaces2016-11-22Paper
Reinforcement learning in robust Markov decision processes2016-11-16Paper
Robust MDPs with \(k\)-rectangular uncertainty2016-11-16Paper
Statistical optimization in high dimensions2016-10-31Paper
Learning the variance of the reward-to-go2016-06-06Paper
A state action frequency approach to throughput maximization over uncertain wireless channels2016-05-30Paper
Bayesian reinforcement learning: a survey2016-05-30Paper
Oracle-Based Robust Optimization via Online Learning2015-11-06Paper
Approximate Value Iteration with Temporally Extended Actions2015-08-25Paper
Algorithmic aspects of mean-variance optimization in Markov decision processes2015-07-29Paper
Opportunistic Approachability and Generalized No-Regret Problems2015-04-24Paper
A primal condition for approachability with partial monitoring2015-01-05Paper
https://portal.mardi4nfdi.de/entity/Q29341072014-12-08Paper
Dynamics in tree formation games2014-02-18Paper
https://portal.mardi4nfdi.de/entity/Q53967272014-02-03Paper
Approximately optimal bidding policies for repeated first-price auctions2012-11-15Paper
Optimization Under Probabilistic Envelope Constraints2012-11-08Paper
Distributionally robust Markov decision processes2012-05-24Paper
A distributional interpretation of robust optimization2012-05-24Paper
Robustness and generalization2012-05-23Paper
Robustness and regularization of support vector machines2012-04-17Paper
Online learning with sample path constraints2012-04-17Paper
Go Viral, or Not: Rate-Optimal Control for Resource-Constrained Branching Processes2012-03-05Paper
Bias and Variance Approximation in Value Function Estimates2012-02-21Paper
Percentile Optimization for Markov Decision Processes with Parameter Uncertainty2011-11-24Paper
https://portal.mardi4nfdi.de/entity/Q30933832011-10-12Paper
https://portal.mardi4nfdi.de/entity/Q30931972011-10-12Paper
https://portal.mardi4nfdi.de/entity/Q30931882011-10-12Paper
Strategies for Prediction Under Imperfect Monitoring2011-04-27Paper
Markov Decision Processes with Arbitrary Reward Processes2011-04-27Paper
A Geometric Proof of Calibration2011-04-27Paper
Learning Theory and Kernel Machines2010-03-23Paper
Learning Theory and Kernel Machines2010-03-23Paper
Multi-agent learning for engineers2009-07-09Paper
Approachability in repeated games: Computational aspects and a Stackelberg variant2009-06-08Paper
An Inequality for Nearly Log-Concave Distributions With Applications to Learning2008-12-21Paper
Regret minimization in repeated matrix games with variable stage duration2008-05-21Paper
Strategies for Prediction Under Imperfect Monitoring2008-01-03Paper
Online calibrated forecasts: memory efficiency versus universality for learning in games2007-09-20Paper
Online Learning with Constraints2007-09-14Paper
Online Learning with Variable Stage Duration2007-09-14Paper
A contract-based model for directed network formation2006-10-05Paper
On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies2005-11-11Paper
The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes2005-11-11Paper
A tutorial on the cross-entropy method2005-08-05Paper
Basis function adaptation in temporal difference reinforcement learning2005-08-05Paper
Learning Theory2005-06-13Paper
Learning Theory2005-06-13Paper
10.1162/1532443047739361082004-11-23Paper
https://portal.mardi4nfdi.de/entity/Q30467112004-08-12Paper
https://portal.mardi4nfdi.de/entity/Q30467152004-08-12Paper
https://portal.mardi4nfdi.de/entity/Q47092112003-06-20Paper
https://portal.mardi4nfdi.de/entity/Q47091872003-06-20Paper
https://portal.mardi4nfdi.de/entity/Q31488202002-09-22Paper
https://portal.mardi4nfdi.de/entity/Q31488022002-09-22Paper
On the existence of linear weak learners and applications to boosting2002-04-11Paper
https://portal.mardi4nfdi.de/entity/Q44100942002-01-01Paper

Research outcomes over time

This page was built for person: Shie Mannor