Pages that link to "Item:Q903601"
From MaRDI portal
The following pages link to Reinforcement learning algorithms with function approximation: recent advances and applications (Q903601):
Displayed 16 items.
- Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming (Q507750) (← links)
- Optimal scheduling for data transmission between mobile devices and cloud (Q528742) (← links)
- Embedded adaptive fuzzy controller based on reinforcement learning for DC motor with flexible shaft (Q1637893) (← links)
- An integrated data-driven Markov parameters sequence identification and adaptive dynamic programming method to design fault-tolerant optimal tracking control for completely unknown model systems (Q1661214) (← links)
- Two-phase iteration for value function approximation and hyperparameter optimization in Gaussian-kernel-based adaptive critic design (Q1666524) (← links)
- Reinforcement learning endowed with safe veto policies to learn the control of linked-multicomponent robotic systems (Q1749908) (← links)
- Optimal distributed synchronization control for continuous-time heterogeneous multi-agent differential graphical games (Q1749910) (← links)
- Neural circuits for learning context-dependent associations of stimuli (Q2182880) (← links)
- Data-based robust optimal control of continuous-time affine nonlinear systems with matched uncertainties (Q2282884) (← links)
- A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning (Q2633537) (← links)
- A systematic study on meta-heuristic approaches for solving the graph coloring problem (Q2664279) (← links)
- A lexicographic approach to constrained MDP admission control (Q2792714) (← links)
- Some recent advances in learning and adaptation for uncertain feedback control systems (Q2795789) (← links)
- An approximate dynamic programming approach to resource management in multi-cloud scenarios (Q2978074) (← links)
- A Q-Learning Approach for Investment Decisions (Q4606784) (← links)
- Dual heuristic programming with just‐in‐time modeling for self‐learning fault‐tolerant control of mobile robots (Q6078791) (← links)