Hypervolume indicator and dominance reward based multi-objective Monte-Carlo tree search (Q374142): Difference between revisions
From MaRDI portal
Created a new Item |
ReferenceBot (talk | contribs) Changed an Item |
||
(8 intermediate revisions by 6 users not shown) | |||
Property / author | |||
Property / author: Wei-Jia Wang / rank | |||
Property / author | |||
Property / author: Wei-Jia Wang / rank | |||
Normal rank | |||
Property / Mathematics Subject Classification ID | |||
Property / Mathematics Subject Classification ID: 68T05 / rank | |||
Normal rank | |||
Property / Mathematics Subject Classification ID | |||
Property / Mathematics Subject Classification ID: 90C29 / rank | |||
Normal rank | |||
Property / Mathematics Subject Classification ID | |||
Property / Mathematics Subject Classification ID: 65C05 / rank | |||
Normal rank | |||
Property / zbMATH DE Number | |||
Property / zbMATH DE Number: 6217874 / rank | |||
Normal rank | |||
Property / zbMATH Keywords | |||
reinforcement learning | |||
Property / zbMATH Keywords: reinforcement learning / rank | |||
Normal rank | |||
Property / zbMATH Keywords | |||
Monte-Carlo tree search | |||
Property / zbMATH Keywords: Monte-Carlo tree search / rank | |||
Normal rank | |||
Property / zbMATH Keywords | |||
multi-objective optimization | |||
Property / zbMATH Keywords: multi-objective optimization / rank | |||
Normal rank | |||
Property / zbMATH Keywords | |||
sequential decision making | |||
Property / zbMATH Keywords: sequential decision making / rank | |||
Normal rank | |||
Property / describes a project that uses | |||
Property / describes a project that uses: CMA-ES / rank | |||
Normal rank | |||
Property / describes a project that uses | |||
Property / describes a project that uses: SMS-EMOA / rank | |||
Normal rank | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank | |||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1007/s10994-013-5369-0 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W1974819972 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Theory of the hypervolume indicator / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: SMS-EMOA: multiobjective selection based on dominated hypervolume / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Markov Decision Processes with Multiple Long-Run Average Objectives / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q2723294 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q4424324 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q5405225 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q3093188 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Algorithms for Reinforcement Learning / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: NP-complete scheduling problems / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Workflow Scheduling Algorithms for Grid Computing / rank | |||
Normal rank | |||
links / mardi / name | links / mardi / name | ||
Latest revision as of 00:02, 7 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Hypervolume indicator and dominance reward based multi-objective Monte-Carlo tree search |
scientific article |
Statements
Hypervolume indicator and dominance reward based multi-objective Monte-Carlo tree search (English)
0 references
22 October 2013
0 references
reinforcement learning
0 references
Monte-Carlo tree search
0 references
multi-objective optimization
0 references
sequential decision making
0 references