Hypervolume indicator and dominance reward based multi-objective Monte-Carlo tree search (Q374142): Difference between revisions

@@ Property / author @@
-Wei-Jia Wang
@@ Property / author: Wei-Jia Wang / rank @@
-Normal rank
@@ Property / author @@
+Wei-Jia Wang
@@ Property / author: Wei-Jia Wang / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+T05
@@ Property / Mathematics Subject Classification ID: 68T05 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+C29
@@ Property / Mathematics Subject Classification ID: 90C29 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+C05
@@ Property / Mathematics Subject Classification ID: 65C05 / rank @@
+Normal rank
@@ Property / zbMATH DE Number @@
+6217874
@@ Property / zbMATH DE Number: 6217874 / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+reinforcement learning
@@ Property / zbMATH Keywords: reinforcement learning / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+Monte-Carlo tree search
@@ Property / zbMATH Keywords: Monte-Carlo tree search / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+multi-objective optimization
@@ Property / zbMATH Keywords: multi-objective optimization / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+sequential decision making
@@ Property / zbMATH Keywords: sequential decision making / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+CMA-ES
@@ Property / describes a project that uses: CMA-ES / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+SMS-EMOA
@@ Property / describes a project that uses: SMS-EMOA / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1007/s10994-013-5369-0
+Normal rank
@@ Property / OpenAlex ID @@
+W1974819972
@@ Property / OpenAlex ID: W1974819972 / rank @@
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+Theory of the hypervolume indicator
@@ Property / cites work: Theory of the hypervolume indicator / rank @@
+Normal rank
@@ Property / cites work @@
+SMS-EMOA: multiobjective selection based on dominated hypervolume
+Normal rank
@@ Property / cites work @@
+Markov Decision Processes with Multiple Long-Run Average Objectives
+Normal rank
@@ Property / cites work @@
+Q2723294
@@ Property / cites work: Q2723294 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4424324
@@ Property / cites work: Q4424324 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5405225
@@ Property / cites work: Q5405225 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3093188
@@ Property / cites work: Q3093188 / rank @@
+Normal rank
@@ Property / cites work @@
+Algorithms for Reinforcement Learning
@@ Property / cites work: Algorithms for Reinforcement Learning / rank @@
+Normal rank
@@ Property / cites work @@
+NP-complete scheduling problems
@@ Property / cites work: NP-complete scheduling problems / rank @@
+Normal rank
@@ Property / cites work @@
+Workflow Scheduling Algorithms for Grid Computing
@@ Property / cites work: Workflow Scheduling Algorithms for Grid Computing / rank @@
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:374142