The following pages link to 10.1162/153244303321897663 (Q4825350):
Displayed 50 items.
- Geiringer theorems: from population genetics to computational intelligence, memory evolutive systems and Hebbian learning (Q269771) (← links)
- Algorithm portfolios for noisy optimization (Q276539) (← links)
- Knows what it knows: a framework for self-aware learning (Q413843) (← links)
- Learning with stochastic inputs and adversarial outputs (Q439998) (← links)
- The \(K\)-armed dueling bandits problem (Q440003) (← links)
- Reducing reinforcement learning to KWIK online regression (Q616761) (← links)
- Multi-objective simultaneous optimistic optimization (Q781163) (← links)
- Effective hybrid system falsification using Monte Carlo tree search guided by QB-robustness (Q832206) (← links)
- An analysis of model-based interval estimation for Markov decision processes (Q959899) (← links)
- Randomized prediction of individual sequences (Q1733293) (← links)
- Bayesian optimization of pump operations in water distribution systems (Q1754464) (← links)
- Multiclass classification with bandit feedback using adaptive regularization (Q1945036) (← links)
- Hyperparameter optimization for recommender systems through Bayesian optimization (Q2033691) (← links)
- Gorthaur-EXP3: bandit-based selection from a portfolio of recommendation algorithms balancing the accuracy-diversity dilemma (Q2055544) (← links)
- Regret lower bound and optimal algorithm for high-dimensional contextual linear bandit (Q2074307) (← links)
- Two-armed bandit problem and batch version of the mirror descent algorithm (Q2081125) (← links)
- Stochastic continuum-armed bandits with additive models: minimax regrets and adaptive algorithm (Q2091834) (← links)
- Trading utility and uncertainty: applying the value of information to resolve the exploration-exploitation dilemma in reinforcement learning (Q2094051) (← links)
- Ballooning multi-armed bandits (Q2238588) (← links)
- A survey on kriging-based infill algorithms for multiobjective simulation optimization (Q2289950) (← links)
- New bounds on the price of bandit feedback for mistake-bounded online multiclass learning (Q2290693) (← links)
- Anticipatory action selection for human-robot table tennis (Q2407448) (← links)
- Customization of J. Bather's UCB strategy for a Gaussian multiarmed bandit (Q2689638) (← links)
- (Q4558206) (← links)
- (Q4998871) (← links)
- Statistical Inference for Online Decision Making via Stochastic Gradient Descent (Q4999148) (← links)
- (Q5043718) (← links)
- (Q5053317) (← links)
- Setting Reserve Prices in Second-Price Auctions with Unobserved Bids (Q5060778) (← links)
- Ranking and Selection with Covariates for Personalized Decision Making (Q5084611) (← links)
- Regret bounds for Narendra-Shapiro bandit algorithms (Q5086451) (← links)
- Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning (Q5089723) (← links)
- Dynamic Learning and Decision Making via Basis Weight Vectors (Q5095179) (← links)
- Online Resource Allocation with Personalized Learning (Q5106359) (← links)
- MNL-Bandit: A Dynamic Learning Approach to Assortment Selection (Q5129205) (← links)
- Bandits with Global Convex Constraints and Objective (Q5129206) (← links)
- Online Decision Making with High-Dimensional Covariates (Q5130496) (← links)
- (Q5149015) (← links)
- Learning Enabled Constrained Black-Box Optimization (Q5153491) (← links)
- A linear response bandit problem (Q5168867) (← links)
- Derivative-free optimization methods (Q5230522) (← links)
- Per-Round Knapsack-Constrained Linear Submodular Bandits (Q5380603) (← links)
- Statistical Inference for Online Decision Making: In a Contextual Bandit Setting (Q5857145) (← links)
- Model-based Reinforcement Learning: A Survey (Q5870792) (← links)
- Robust sequential design for piecewise-stationary multi-armed bandit problem in the presence of outliers (Q5880072) (← links)
- Greedy Algorithm Almost Dominates in Smoothed Contextual Bandits (Q5890034) (← links)
- Online learning of network bottlenecks via minimax paths (Q6097144) (← links)
- Multi-armed bandits with censored consumption of resources (Q6097147) (← links)
- Dealing with expert bias in collective decision-making (Q6103665) (← links)
- Online learning of energy consumption for navigation of electric vehicles (Q6157210) (← links)