
Interactive Thompson sampling for multi-objective multi-armed bandits

From MaRDI portal
Publication:1990281

DOI: 10.1007/978-3-319-67504-6_2
zbMATH Open: 1398.90082
OpenAlex: W2759684794
MaRDI QID: Q1990281
FDO: Q1990281


Authors: Diederik M. Roijers, Luisa M. Zintgraf, Ann Nowé


Publication date: 25 October 2018


Full work available at URL: https://doi.org/10.1007/978-3-319-67504-6_2




Recommendations

  • Hypervolume indicator and dominance reward based multi-objective Monte-Carlo tree search
  • Efficient multi-objective reinforcement learning via multiple-gradient descent with iteratively discovered weight-vector sets
  • A Survey of Preference-Based Online Learning with Bandit Algorithms
  • Multi-objective reinforcement learning using sets of Pareto dominating policies
  • scientific article; zbMATH DE number 6276176


Mathematics Subject Classification ID

  • Management decision making, including multiple objectives (90B50)
  • Utility theory (91B16)
  • Software, source code, etc. for problems pertaining to operations research and mathematical programming (90-04)



Cited In (1)

  • FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization Approach for Deep Neural Networks





This page was last edited on 1 February 2024, at 17:07. Warning: Page may not contain recent updates.