scientific article; zbMATH DE number 3898638
From MaRDI portal
Publication:3677539
zbMATH Open0563.90100MaRDI QIDQ3677539FDOQ3677539
Authors: Yousri M. El-Fattah
Publication date: 1983
Title of this publication is not available (Why is that?)
Recommendations
- Learning algorithms for Markov decision processes
- AVERAGE-OPTIMAL ADAPTIVE POLICIES IN SEMI-MARKOV DECISION PROCESSES INCLUDING AN UNKNOWN PARAMETER
- A learning algorithm for communicating Markov decision processes with unknown transition matrices
- Solving semi-Markov decision problems using average reward reinforcement learning
- Semi-Markov decision processes with limiting ratio average rewards
Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40)
Cited In (8)
- Recent advances in learning automata
- Finite State Automata Resulting from Temporal Information Maximization and a Temporal Learning Rule
- A sojourn-based approach to semi-Markov reinforcement learning
- Solving semi-Markov decision problems using average reward reinforcement learning
- AVERAGE-OPTIMAL ADAPTIVE POLICIES IN SEMI-MARKOV DECISION PROCESSES INCLUDING AN UNKNOWN PARAMETER
- A learning algorithm for communicating Markov decision processes with unknown transition matrices
- On conditional optimality of a class of learning automata in random environments
- Multiaction learning automata possessing ergodicity of the mean
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3677539)