Policy iterations for reinforcement learning problems in continuous time and space -- fundamental theory and methods (Q2664203): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
 
(4 intermediate revisions by 4 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W3128350768 / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1705.03520 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach / rank
 
Normal rank
Property / cites work
 
Property / cites work: Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive dynamic programming and optimal control of nonlinear nonaffine systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4255465 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stabilization with discounted optimal control / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5440982 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3266141 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2771497 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Construction of Suboptimal Control Sequences / rank
 
Normal rank
Property / cites work
 
Property / cites work: Policy iterations for reinforcement learning problems in continuous time and space -- fundamental theory and methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Linear Quadratic Tracking Control of Partially-Unknown Continuous-Time Systems Using Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4736137 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5526189 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Approximation Theory of Optimal Control for Trainable Manipulators / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 01:28, 25 July 2024

scientific article
Language Label Description Also known as
English
Policy iterations for reinforcement learning problems in continuous time and space -- fundamental theory and methods
scientific article

    Statements

    Policy iterations for reinforcement learning problems in continuous time and space -- fundamental theory and methods (English)
    0 references
    0 references
    0 references
    0 references
    20 April 2021
    0 references
    policy iteration
    0 references
    reinforcement learning
    0 references
    optimization under uncertainties
    0 references
    continuous time and space
    0 references
    iterative schemes
    0 references
    adaptive systems
    0 references

    Identifiers