scientific article; zbMATH DE number 1348743

From MaRDI portal

Publication:4264774

Jump to:navigation, search

zbMath0924.68157MaRDI QIDQ4264774

Alan C. Schultz, D. E. Moriarty, John J. Grefenstette

Publication date: 10 October 1999

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

reinforcement learning problems

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Parallel algorithms in computer science (68W10)

Related Items (13)

Zeroth-order algorithms for nonconvex-strongly-concave minimax problems with improved complexities ⋮ From Reinforcement Learning to Deep Reinforcement Learning: An Overview ⋮ More precise runtime analyses of non-elitist evolutionary algorithms in uncertain environments ⋮ A heuristically accelerated reinforcement learning method for maintenance policy of an assembly line ⋮ Neuron as a reward-modulated combinatorial switch and a model of learning behavior ⋮ Epoch-incremental reinforcement learning algorithms ⋮ Optimized look-ahead tree policies: a bridge between look-ahead tree policies and direct policy search ⋮ Distribution of waiting time for dynamic pickup and delivery problems ⋮ Counter-Factual Reinforcement Learning: How to Model Decision-Makers That Anticipate the Future ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Does lifelong learning affect mobile robot evolution? ⋮ Learning classifier systems: New models, successful applications

This page was built for publication:

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4264774&oldid=18170092"