10.1162/153244303321897663

From MaRDI portal
Revision as of 02:55, 8 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:4825350

DOI10.1162/153244303321897663zbMath1084.68543OpenAlexW4243522562MaRDI QIDQ4825350

Peter Auer

Publication date: 28 October 2004

Published in: CrossRef Listing of Deleted DOIs (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1162/153244303321897663




Related Items (57)

Robust sequential design for piecewise-stationary multi-armed bandit problem in the presence of outliersGeiringer theorems: from population genetics to computational intelligence, memory evolutive systems and Hebbian learningAlgorithm portfolios for noisy optimizationGreedy Algorithm Almost Dominates in Smoothed Contextual BanditsEffective hybrid system falsification using Monte Carlo tree search guided by QB-robustnessA linear response bandit problemSetting Reserve Prices in Second-Price Auctions with Unobserved BidsRanking and Selection with Covariates for Personalized Decision MakingRegret bounds for Narendra-Shapiro bandit algorithmsAnticipatory action selection for human-robot table tennisFeel-Good Thompson Sampling for Contextual Bandits and Reinforcement LearningDynamic Learning and Decision Making via Basis Weight VectorsReducing reinforcement learning to KWIK online regressionA set‐based approach for hierarchical optimization problem using Bayesian active learningAdaptive resources allocation CUSUM for binomial count data monitoring with application to COVID-19 hotspot detectionOnline Resource Allocation with Personalized LearningOnline learning of network bottlenecks via minimax pathsMulti-armed bandits with censored consumption of resourcesDealing with expert bias in collective decision-makingUnnamed ItemKnows what it knows: a framework for self-aware learningNearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset SelectionOnline learning of energy consumption for navigation of electric vehiclesAn asynchronous parallel high-throughput model calibration framework for crystal plasticity finite element constitutive modelsCustomization of J. Bather's UCB strategy for a Gaussian multiarmed banditMulticlass classification with bandit feedback using adaptive regularizationTransfer learning for contextual multi-armed banditsMulti-armed linear bandits with latent biasesLearning with stochastic inputs and adversarial outputsThe \(K\)-armed dueling bandits problemMNL-Bandit: A Dynamic Learning Approach to Assortment SelectionBandits with Global Convex Constraints and ObjectiveOnline Decision Making with High-Dimensional CovariatesUnnamed ItemPer-Round Knapsack-Constrained Linear Submodular BanditsRandomized prediction of individual sequencesUnnamed ItemBallooning multi-armed banditsLearning Enabled Constrained Black-Box OptimizationUnnamed ItemBayesian optimization of pump operations in water distribution systemsAn analysis of model-based interval estimation for Markov decision processesUnnamed ItemHyperparameter optimization for recommender systems through Bayesian optimizationA survey on kriging-based infill algorithms for multiobjective simulation optimizationNew bounds on the price of bandit feedback for mistake-bounded online multiclass learningGorthaur-EXP3: bandit-based selection from a portfolio of recommendation algorithms balancing the accuracy-diversity dilemmaDerivative-free optimization methodsRegret lower bound and optimal algorithm for high-dimensional contextual linear banditTwo-armed bandit problem and batch version of the mirror descent algorithmMulti-objective simultaneous optimistic optimizationStatistical Inference for Online Decision Making via Stochastic Gradient DescentUnnamed ItemStochastic continuum-armed bandits with additive models: minimax regrets and adaptive algorithmTrading utility and uncertainty: applying the value of information to resolve the exploration-exploitation dilemma in reinforcement learningStatistical Inference for Online Decision Making: In a Contextual Bandit SettingModel-based Reinforcement Learning: A Survey




This page was built for publication: 10.1162/153244303321897663