scientific article; zbMATH DE number 1348743
From MaRDI portal
Publication:4264774
zbMath0924.68157MaRDI QIDQ4264774
Alan C. Schultz, D. E. Moriarty, John J. Grefenstette
Publication date: 10 October 1999
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Learning and adaptive systems in artificial intelligence (68T05) Parallel algorithms in computer science (68W10)
Related Items (13)
Zeroth-order algorithms for nonconvex-strongly-concave minimax problems with improved complexities ⋮ From Reinforcement Learning to Deep Reinforcement Learning: An Overview ⋮ More precise runtime analyses of non-elitist evolutionary algorithms in uncertain environments ⋮ A heuristically accelerated reinforcement learning method for maintenance policy of an assembly line ⋮ Neuron as a reward-modulated combinatorial switch and a model of learning behavior ⋮ Epoch-incremental reinforcement learning algorithms ⋮ Optimized look-ahead tree policies: a bridge between look-ahead tree policies and direct policy search ⋮ Distribution of waiting time for dynamic pickup and delivery problems ⋮ Counter-Factual Reinforcement Learning: How to Model Decision-Makers That Anticipate the Future ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Does lifelong learning affect mobile robot evolution? ⋮ Learning classifier systems: New models, successful applications
This page was built for publication: