scientific article

From MaRDI portal
Revision as of 04:17, 6 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:4057976

zbMath0303.62064MaRDI QIDQ4057976

No author found.

Publication date: 1974


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (93)

Index-based policies for discounted multi-armed bandits on parallel machines.Optimal learning and experimentation in bandit problems.Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpointsA bisection/successive approximation method for computing Gittins indicesJob search with related information and wage signallingSelecting among scheduled projectsStochastic scheduling and forwards inductionMulti-armed bandits with simple armsEvaluating the effects of machine breakdowns in stochastic scheduling problemsOpen Bandit Processes with Uncountable States and Time-Backward EffectsOptimal selection of obsolescence mitigation strategies using a restless bandit modelBandit and covariate processes, with finite or non-denumerable set of armsIncentivizing Exploration with Heterogeneous Value of MoneyMulti-armed bandit processes with optimal selection of the operating timesOn Gittins' index theorem in continuous timeResource capacity allocation to stochastic dynamic competitors: knapsack problem for perishable items and index-knapsack heuristicFour proofs of Gittins' multiarmed bandit theoremOptimal decision indices for R\&D project evaluation in the pharmaceutical industry: Pearson index versus Gittins indexMULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONSScheduling of multi-class multi-server queueing systems with abandonmentsSensor Scheduling for Space Object Tracking and Collision AlertThe multi-armed bandit, with constraintsDerman's book as inspiration: some results on LP for MDPsThe archievable region method in the optimal control of queueing systems; formulations, bounds and policiesSimulation-based optimization of Markov decision processes: an empirical process theory approachCombining multiple strategies for multiarmed bandit problems and asymptotic optimalityCommon value experimentationA more general Pandora rule?Index policy for multiarmed bandit problem with dynamic risk measuresEncounters with Martingales in Statistics and Stochastic OptimizationA central limit theorem, loss aversion and multi-armed banditsA perpetual search for talents across overlapping generations: a learning processReinforcement Learning, Bit by BitMULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENTINDEXABILITY AND OPTIMAL INDEX POLICIES FOR A CLASS OF REINITIALISING RESTLESS BANDITSEmpirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problemsConsumer strategy, vendor strategy and equilibrium in duopoly markets with production costsOptimal Dynamic Information AcquisitionOpen Problem—M/G/1 Scheduling with Preemption DelaysOptimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary RewardsA confirmation of a conjecture on Feldman’s two-armed bandit problemNonparametric learning rules from bandit experiments: the eyes have it!An adversarial model for scheduling with testingApproximations to Stochastic Dynamic Programs via Information Relaxation DualityThe extraction of natural resources from two sites under uncertaintyOptimal hysteresis for a class of deterministic deteriorating two-armed bandit problem with switching costs.Branching bandits: A sequential search process with correlated pay-offs.A unified framework for stochastic optimizationA common value experimentation with multiarmed banditsControl: a perspectiveHerbert Robbins and sequential analysisMinimizing the time to a decisionAmbiguity aversion in multi-armed bandit problemsBANDIT STRATEGIES EVALUATED IN THE CONTEXT OF CLINICAL TRIALS IN RARE LIFE-THREATENING DISEASESOptimal Online Learning for Nonlinear Belief Models Using Discrete PriorsSimple Bayesian Algorithms for Best-Arm IdentificationStochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocationOnline linear optimization and adaptive routingStochastic scheduling in an in-forestOptimal switching between cash-flow streamsOptimal stopping for Brownian motion with applications to sequential analysis and option pricingLocal information and the design of sequential hypothesis testsAllocation and scheduling of conditional task graphsOn the optimal amount of experimentation in sequential decision problemsUsing adaptive learning in credit scoring to estimate take-up probability distributionOptimal screening designs with flexible cost and constraint structuresExplicit Gittins Indices for a Class of Superdiffusive ProcessesSome best possible results for a discounted one armed banditCoupled bisection for root orderingLearning while searching for the best alternativeGood news and bad news in two-armed banditsAn asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action banditsPricing to accelerate demand learning in dynamic assortment planning for perishable productsChoosing a good toolkit. I: Prior-free heuristicsChoosing a good toolkit. II: Bayes-rule based heuristicsOptimal learning for sequential sampling with non-parametric beliefsUn ordonnancement dynamique de tâches stochastiques sur un seul processeurGeneralized Bandit ProblemsOptimizing a Unimodal Response Function for Binary VariablesGittins' theorem under uncertaintyTechnical Note—A Note on the Equivalence of Upper Confidence Bounds and Gittins Indices for Patient AgentsFinite state multi-armed bandit problems: Sensitive-discount, average-reward and average-overtaking optimalityDismemberment and design for controlling the replication variance of regret for the multi-armed banditUnnamed ItemMulti-armed bandits in discrete and continuous timeSmall-sample performance of Bernoulli two-armed bandit Bayesian strategiesFrom reinforcement learning to optimal control: a unified framework for sequential decisionsMatrices -- compensating the loss of anschauungAttributesOn the improvement of allocation rules for multi-armed bandit problemA General Theory of MultiArmed Bandit Processes with Constrained Arm SwitchesOptimal stopping problems for multiarmed bandit processes with arms' independenceA Restless Bandit Model for Resource Allocation, Competition, and Reservation







This page was built for publication: