An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback (Q5361319): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Import recommendations run Q6767936
 
(4 intermediate revisions by 3 users not shown)
label / enlabel / en
 
An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback
Property / MaRDI profile type
 
Property / MaRDI profile type: Publication / rank
 
Normal rank
Property / arXiv classification
 
cs.LG
Property / arXiv classification: cs.LG / rank
 
Normal rank
Property / arXiv classification
 
math.OC
Property / arXiv classification: math.OC / rank
 
Normal rank
Property / arXiv classification
 
stat.ML
Property / arXiv classification: stat.ML / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1507.08752 / rank
 
Normal rank
Property / Recommended article
 
Property / Recommended article: Online bandit convex optimisation with stochastic constraints via two-point feedback / rank
 
Normal rank
Property / Recommended article: Online bandit convex optimisation with stochastic constraints via two-point feedback / qualifier
 
Similarity Score: 0.9173385
Amount0.9173385
Unit1
Property / Recommended article: Online bandit convex optimisation with stochastic constraints via two-point feedback / qualifier
 
Property / Recommended article
 
Property / Recommended article: Stochastic Convex Optimization with Bandit Feedback / rank
 
Normal rank
Property / Recommended article: Stochastic Convex Optimization with Bandit Feedback / qualifier
 
Similarity Score: 0.90070385
Amount0.90070385
Unit1
Property / Recommended article: Stochastic Convex Optimization with Bandit Feedback / qualifier
 
Property / Recommended article
 
Property / Recommended article: Improved regret for zeroth-order adversarial bandit convex optimisation / rank
 
Normal rank
Property / Recommended article: Improved regret for zeroth-order adversarial bandit convex optimisation / qualifier
 
Similarity Score: 0.87965727
Amount0.87965727
Unit1
Property / Recommended article: Improved regret for zeroth-order adversarial bandit convex optimisation / qualifier
 
Property / Recommended article
 
Property / Recommended article: A minimax and asymptotically optimal algorithm for stochastic bandits / rank
 
Normal rank
Property / Recommended article: A minimax and asymptotically optimal algorithm for stochastic bandits / qualifier
 
Similarity Score: 0.87755615
Amount0.87755615
Unit1
Property / Recommended article: A minimax and asymptotically optimal algorithm for stochastic bandits / qualifier
 
Property / Recommended article
 
Property / Recommended article: Q4999102 / rank
 
Normal rank
Property / Recommended article: Q4999102 / qualifier
 
Similarity Score: 0.86266863
Amount0.86266863
Unit1
Property / Recommended article: Q4999102 / qualifier
 
Property / Recommended article
 
Property / Recommended article: A Note on Optimal Strategies of a Generalized Two-Stage Bandit Problem / rank
 
Normal rank
Property / Recommended article: A Note on Optimal Strategies of a Generalized Two-Stage Bandit Problem / qualifier
 
Similarity Score: 0.8588209
Amount0.8588209
Unit1
Property / Recommended article: A Note on Optimal Strategies of a Generalized Two-Stage Bandit Problem / qualifier
 
Property / Recommended article
 
Property / Recommended article: An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems / rank
 
Normal rank
Property / Recommended article: An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems / qualifier
 
Similarity Score: 0.8563884
Amount0.8563884
Unit1
Property / Recommended article: An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems / qualifier
 
Property / Recommended article
 
Property / Recommended article: Two stochastic optimization algorithms for convex optimization with fixed point constraints / rank
 
Normal rank
Property / Recommended article: Two stochastic optimization algorithms for convex optimization with fixed point constraints / qualifier
 
Similarity Score: 0.8532512
Amount0.8532512
Unit1
Property / Recommended article: Two stochastic optimization algorithms for convex optimization with fixed point constraints / qualifier
 
Property / Recommended article
 
Property / Recommended article: An asymptotically optimal strategy for constrained multi-armed bandit problems / rank
 
Normal rank
Property / Recommended article: An asymptotically optimal strategy for constrained multi-armed bandit problems / qualifier
 
Similarity Score: 0.8530696
Amount0.8530696
Unit1
Property / Recommended article: An asymptotically optimal strategy for constrained multi-armed bandit problems / qualifier
 
Property / Recommended article
 
Property / Recommended article: An Efficient Algorithm for Learning with Semi-bandit Feedback / rank
 
Normal rank
Property / Recommended article: An Efficient Algorithm for Learning with Semi-bandit Feedback / qualifier
 
Similarity Score: 0.8528775
Amount0.8528775
Unit1
Property / Recommended article: An Efficient Algorithm for Learning with Semi-bandit Feedback / qualifier
 
links / mardi / namelinks / mardi / name
 

Latest revision as of 15:47, 4 April 2025

scientific article; zbMATH DE number 6781373
Language Label Description Also known as
English
An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback
scientific article; zbMATH DE number 6781373

    Statements

    0 references
    27 September 2017
    0 references
    zero-order optimization
    0 references
    bandit optimization
    0 references
    stochastic optimization
    0 references
    gradient estimator
    0 references
    cs.LG
    0 references
    math.OC
    0 references
    stat.ML
    0 references

    Identifiers