scientific article

From MaRDI portal

Publication:2921693

zbMath: 1297.90117; MaRDI QID: Q2921693

Abraham D. Flaxman, H. Brendan McMahan, Adam Tauman Kalai

Publication date: 13 October 2014


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items

No Regret Learning in Oligopolies: Cournot vs. Bertrand
Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems
A theoretical and empirical comparison of gradient approximations in derivative-free optimization
Finite Difference Gradient Approximation: To Randomize or Not?
Generalized mirror descents in congestion games
Zeroth-Order Regularized Optimization (ZORO): Approximately Sparse Gradients and Adaptive Sampling
Stochastic online optimization. Single-point and multi-point non-linear multi-armed bandits. Convex and strongly-convex case
Random gradient-free minimization of convex functions
An Accelerated Method for Derivative-Free Smooth Stochastic Convex Optimization
Portfolio selection algorithm under financial crisis: a case study with Bursa Malaysia
Minimax efficient finite-difference stochastic gradient estimators using black-box function evaluations
Decentralized online convex optimization based on signs of relative states
Personalized optimization with user's feedback
Parallel distributed block coordinate descent methods based on pairwise comparison oracle
Continuous Assortment Optimization with Logit Choice Probabilities and Incomplete Information
A mixed finite differences scheme for gradient approximation
Online strongly convex optimization with unknown delays
Zeroth-order optimization with orthogonal random directions
Gradient-free federated learning methods with \(l_1\) and \(l_2\)-randomization for non-smooth convex stochastic optimization problems
Online distributed detection of sensor networks with delayed information
Distributed online bandit linear regressions with differential privacy
Zeroth-order feedback optimization for cooperative multi-agent systems
Online bandit convex optimisation with stochastic constraints via two-point feedback
On poisoned Wardrop equilibrium in congestion games
Unnamed Item
Unnamed Item
Decentralized online convex optimization with compressed communications
Nonsmooth optimization by Lie bracket approximations into random directions
Stochastic Saddle Point Problems with Decision-Dependent Distributions
Online distributed dual averaging algorithm for multi-agent bandit optimization over time-varying general directed networks
Distributed bandit online optimisation for energy management in smart grids
Online Sequential Optimization with Biased Gradients: Theory and Applications to Censored Demand
Complexity guarantees for an implicit smoothing-enabled method for stochastic MPECs
Event-triggered distributed online convex optimization with delayed bandit feedback
Technical Note—Nonstationary Stochastic Optimization Under Lp,q-Variation Measures
Learning in games with continuous action sets and unknown payoff functions
An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions
Recent Theoretical Advances in Non-Convex Optimization
Data-Driven Decisions for Problems with an Unspecified Objective Function
Exploratory distributions for convex functions
Online linear optimization and adaptive routing
Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon
Smoothed functional-based gradient algorithms for off-policy reinforcement learning: a non-asymptotic viewpoint
Regret bounded by gradual variation for online convex optimization
A Linearly Convergent Variant of the Conditional Gradient Algorithm under Strong Convexity, with Applications to Online and Stochastic Optimization
Non-Stationary Stochastic Optimization
Truthful Mechanisms with Implicit Payment Computation
The Data-Driven Newsvendor Problem: New Bounds and Insights
Mini-batch stochastic approximation methods for nonconvex stochastic composite optimization
Analysis of Hannan consistent selection for Monte Carlo tree search in simultaneous move games
Accelerating reinforcement learning with a directional-Gaussian-smoothing evolution strategy
A new one-point residual-feedback oracle for black-box learning and control
Derivative-free optimization methods
Perspectives on multiagent learning
Warranty optimization in a dynamic environment
Robust Power Management via Learning and Game Design
Global Convergence Rate Analysis of a Generic Line Search Algorithm with Noise
Partial Monitoring—Classification, Regret Bounds, and Algorithms
Derivative-free optimization over multi-user MIMO networks
Unnamed Item
Distributed online bandit optimization under random quantization
Unnamed Item
Noisy zeroth-order optimization for non-smooth saddle point problems
On Gradient-Based Learning in Continuous Games
On two continuum armed bandit problems in high dimensions