Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets (Q5145843): Difference between revisions

From MaRDI portal
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
 
(3 intermediate revisions by 3 users not shown)
Property / describes a project that uses
 
Property / describes a project that uses: TensorFlow / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1613/jair.1.12270 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W3121322767 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multiple-gradient descent algorithm (MGDA) for multiobjective optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Steepest descent methods for multicriteria optimization. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3103669 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On min-norm and min-max methods of multi-objective optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093188 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation-based optimization of Markov reward processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Interactive bundle-based method for nondifferentiable multiobjeective optimization: nimbus<sup>§</sup> / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sequential Approximate Multiobjective Optimization Using Computational Intelligence / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Descent algorithm for nonsmooth stochastic multiobjective optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Scenarios and Policy Aggregation in Optimization Under Uncertainty / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Survey of Multi-Objective Sequential Decision-Making / rank
 
Normal rank
Property / cites work
 
Property / cites work: Computing Convex Coverage Sets for Faster Multi-objective Coordination / rank
 
Normal rank
Property / cites work
 
Property / cites work: A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multivariate stochastic approximation using a simultaneous perturbation gradient approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average cost temporal-difference learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finding intrinsic rewards by embodied evolution and constrained reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Generalized properly efficient solutions of vector optimization problems / rank
 
Normal rank

Latest revision as of 10:03, 24 July 2024

scientific article; zbMATH DE number 7299934
Language Label Description Also known as
English
Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets
scientific article; zbMATH DE number 7299934

    Statements

    Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets (English)
    0 references
    0 references
    0 references
    22 January 2021
    0 references
    reinforcement learning
    0 references
    Markov decision processes
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references