Pages that link to "Item:Q2901057"
From MaRDI portal
The following pages link to Reinforcement Learning: A Tutorial Survey and Recent Advances (Q2901057):
Displaying 17 items.
- Dynamic capacity planning using strategic slack valuation (Q323107)
- A reinforcement learning approach to convoy scheduling on a contested transportation network (Q895789)
- Approximate stochastic annealing for online control of infinite horizon Markov decision processes (Q1937498)
- Q-learning-based target selection for bearings-only autonomous navigation (Q2235617)
- An aggregation-based approximate dynamic programming approach for the periodic review model with random yield (Q2333003)
- Policy sharing between multiple mobile robots using decision trees (Q2446395)
- A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning (Q2633537)
- Tabu search guided by reinforcement learning for the max-mean dispersion problem (Q2666721)
- Approximate Dynamic Programming based on High Dimensional Model Representation (Q2868781)
- On Incomplete Learning and Certainty-Equivalence Control (Q4971399)
- Actor-Critic–Like Stochastic Adaptive Search for Continuous Simulation Optimization (Q5060521)
- Continuous Action Generation of Q‐Learning in Multi‐Agent Cooperation (Q5416856)
- A critical review of the most popular types of neuro control (Q5745674)
- Scalable estimation strategies based on stochastic approximations: classical results and new insights (Q5963780)
- Literature reviews in operations research: a new taxonomy and a meta review (Q6106580)
- A self‐adaptive SAC‐PID control approach based on reinforcement learning for mobile robots (Q6139916)
- A survey for deep reinforcement learning in Markovian cyber-physical systems: common problems and solutions (Q6488715)