Shie Mannor

From MaRDI portal



List of research outcomes

This list is incomplete: at the moment it covers only items from zbMATH Open and arXiv. We are working on adding further sources, so please check back soon!

Publication | Venue | Date of Publication | Type
A General Framework for Bandit Problems Beyond Cumulative Objectives | Mathematics of Operations Research | 2024-03-01 | Paper
Inverse reinforcement learning in contextual MDPs | Machine Learning | 2022-01-28 | Paper
Source Estimation in Time Series and the Surprising Resilience of HMMs | IEEE Transactions on Information Theory | 2018-09-19 | Paper
High-Throughput Energy-Efficient LDPC Decoders Using Differential Binary Message Passing | IEEE Transactions on Signal Processing | 2018-08-22 | Paper
Delayed Stochastic Decoding of LDPC Codes | IEEE Transactions on Signal Processing | 2018-07-18 | Paper
Relaxation Dynamics in Stochastic Iterative Decoders | IEEE Transactions on Signal Processing | 2018-07-09 | Paper
Majority-Based Tracking Forecast Memories for Stochastic LDPC Decoding | IEEE Transactions on Signal Processing | 2018-07-09 | Paper
Fully Parallel Stochastic LDPC Decoders | IEEE Transactions on Signal Processing | 2018-06-27 | Paper
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback | SIAM Journal on Computing | 2017-12-08 | Paper
Sequential Decision Making With Coherent Risk | IEEE Transactions on Automatic Control | 2017-09-21 | Paper
The Kernel Recursive Least-Squares Algorithm | IEEE Transactions on Signal Processing | 2017-09-08 | Paper
A Kalman Filter Design Based on the Performance/Robustness Tradeoff | IEEE Transactions on Automatic Control | 2017-08-08 | Paper
Network Formation: Bilateral Contracting and Myopic Dynamics | IEEE Transactions on Automatic Control | 2017-08-08 | Paper
Robust Regression and Lasso | IEEE Transactions on Information Theory | 2017-07-27 | Paper
Design of \(\ell_1\)-optimal controllers with flexible disturbance rejection level | IEEE Transactions on Automatic Control | 2017-07-27 | Paper
Efficiency loss in a network resource allocation game: the case of elastic supply | IEEE Transactions on Automatic Control | 2017-07-12 | Paper
Outlier-Robust PCA: The High-Dimensional Case | IEEE Transactions on Information Theory | 2017-06-08 | Paper
Distinguishing Infections on Different Graph Topologies | IEEE Transactions on Information Theory | 2017-04-28 | Paper
Regularized policy iteration with nonparametric function spaces | Journal of Machine Learning Research (JMLR) | 2016-11-22 | Paper
Reinforcement learning in robust Markov decision processes | Mathematics of Operations Research | 2016-11-16 | Paper
Robust MDPs with \(k\)-rectangular uncertainty | Mathematics of Operations Research | 2016-11-16 | Paper
Statistical optimization in high dimensions | Operations Research | 2016-10-31 | Paper
Learning the variance of the reward-to-go | Journal of Machine Learning Research (JMLR) | 2016-06-06 | Paper
A state action frequency approach to throughput maximization over uncertain wireless channels | Internet Mathematics | 2016-05-30 | Paper
Bayesian reinforcement learning: a survey | Foundations and Trends in Machine Learning | 2016-05-30 | Paper
Oracle-based robust optimization via online learning | Operations Research | 2015-11-06 | Paper
Approximate Value Iteration with Temporally Extended Actions | Journal of Artificial Intelligence Research | 2015-08-25 | Paper
Algorithmic aspects of mean-variance optimization in Markov decision processes | European Journal of Operational Research | 2015-07-29 | Paper
Opportunistic approachability and generalized no-regret problems | Mathematics of Operations Research | 2015-04-24 | Paper
A primal condition for approachability with partial monitoring | Journal of Dynamics and Games | 2015-01-05 | Paper
Set-valued approachability and online learning with partial monitoring |  | 2014-12-08 | Paper
Dynamics in tree formation games | Games and Economic Behavior | 2014-02-18 | Paper
The sample complexity of dictionary learning |  | 2014-02-03 | Paper
The sample complexity of dictionary learning | (available as arXiv preprint) | 2014-02-03 | Paper
Approximately optimal bidding policies for repeated first-price auctions | Annals of Operations Research | 2012-11-15 | Paper
Optimization under probabilistic envelope constraints | Operations Research | 2012-11-08 | Paper
Distributionally robust Markov decision processes | Mathematics of Operations Research | 2012-05-24 | Paper
A distributional interpretation of robust optimization | Mathematics of Operations Research | 2012-05-24 | Paper
Robustness and generalization | Machine Learning | 2012-05-23 | Paper
Robustness and regularization of support vector machines | Journal of Machine Learning Research (JMLR) | 2012-04-17 | Paper
Online learning with sample path constraints | Journal of Machine Learning Research (JMLR) | 2012-04-17 | Paper
Go Viral, or Not: Rate-Optimal Control for Resource-Constrained Branching Processes |  | 2012-03-05 | Paper
Bias and variance approximation in value function estimates | Management Science | 2012-02-21 | Paper
Percentile Optimization for Markov Decision Processes with Parameter Uncertainty | Operations Research | 2011-11-24 | Paper
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems |  | 2011-10-12 | Paper
The sample complexity of exploration in the multi-armed bandit problem |  | 2011-10-12 | Paper
scientific article; zbMATH DE number 5957207 |  | 2011-10-12 | Paper
Strategies for Prediction Under Imperfect Monitoring | Mathematics of Operations Research | 2011-04-27 | Paper
Markov decision processes with arbitrary reward processes | Mathematics of Operations Research | 2011-04-27 | Paper
A geometric proof of calibration | Mathematics of Operations Research | 2011-04-27 | Paper
Learning Theory and Kernel Machines | Lecture Notes in Computer Science | 2010-03-23 | Paper
Lower bounds on the sample complexity of exploration in the multi-armed bandit problem. | Lecture Notes in Computer Science | 2010-03-23 | Paper
Multi-agent learning for engineers | Artificial Intelligence | 2009-07-09 | Paper
Approachability in repeated games: Computational aspects and a Stackelberg variant | Games and Economic Behavior | 2009-06-08 | Paper
An Inequality for Nearly Log-Concave Distributions With Applications to Learning | IEEE Transactions on Information Theory | 2008-12-21 | Paper
Regret minimization in repeated matrix games with variable stage duration | Games and Economic Behavior | 2008-05-21 | Paper
Strategies for Prediction Under Imperfect Monitoring | Learning Theory | 2008-01-03 | Paper
Online calibrated forecasts: memory efficiency versus universality for learning in games | Machine Learning | 2007-09-20 | Paper
Online Learning with Constraints | Learning Theory | 2007-09-14 | Paper
Online Learning with Variable Stage Duration | Learning Theory | 2007-09-14 | Paper
A contract-based model for directed network formation | Games and Economic Behavior | 2006-10-05 | Paper
On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies | Mathematics of Operations Research | 2005-11-11 | Paper
The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes | Mathematics of Operations Research | 2005-11-11 | Paper
A tutorial on the cross-entropy method | Annals of Operations Research | 2005-08-05 | Paper
Basis function adaptation in temporal difference reinforcement learning | Annals of Operations Research | 2005-08-05 | Paper
Learning Theory | Lecture Notes in Computer Science | 2005-06-13 | Paper
Learning Theory | Lecture Notes in Computer Science | 2005-06-13 | Paper
10.1162/153244304773936108 | CrossRef Listing of Deleted DOIs | 2004-11-23 | Paper
scientific article; zbMATH DE number 2089367 |  | 2004-08-12 | Paper
scientific article; zbMATH DE number 2089371 |  | 2004-08-12 | Paper
scientific article; zbMATH DE number 1931843 |  | 2003-06-20 | Paper
scientific article; zbMATH DE number 1931826 |  | 2003-06-20 | Paper
scientific article; zbMATH DE number 1804118 |  | 2002-09-22 | Paper
scientific article; zbMATH DE number 1804100 |  | 2002-09-22 | Paper
On the existence of linear weak learners and applications to boosting | Machine Learning | 2002-04-11 | Paper
scientific article; zbMATH DE number 1944045 |  | 2002-01-01 | Paper



