Probabilistic inference for determining options in reinforcement learning (Q331688): Difference between revisions

@@ Property / cites work @@
+Q5483032
@@ Property / cites work: Q5483032 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3188018
@@ Property / cites work: Q3188018 / rank @@
+Normal rank
@@ Property / cites work @@
+Using Expectation-Maximization for Reinforcement Learning
@@ Property / cites work: Using Expectation-Maximization for Reinforcement Learning / rank @@
+Normal rank
@@ Property / cites work @@
+Q4527272
@@ Property / cites work: Q4527272 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3174169
@@ Property / cites work: Q3174169 / rank @@
+Normal rank
@@ Property / cites work @@
+Policy search for motor primitives in robotics
@@ Property / cites work: Policy search for motor primitives in robotics / rank @@
+Normal rank
@@ Property / cites work @@
+10.1162/1532443041827907
@@ Property / cites work: 10.1162/1532443041827907 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4709211
@@ Property / cites work: Q4709211 / rank @@
+Normal rank
@@ Property / cites work @@
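+Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning
@@ Property / cites work: Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning / rank @@
+Normal rank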
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4737595
@@ Property / cites work: Q4737595 / rank @@
+Normal rank
@@ Property / cites work @@
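+Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
@@ Property / cites work: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning / rank @@
+Normal rank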
@@ Property / cites work @@
+Q2896181
@@ Property / cites work: Q2896181 / rank @@
+Normal rank
@@ Property / cites work @@
+\({\mathcal Q}\)-learning
@@ Property / cites work: \({\mathcal Q}\)-learning / rank @@
+Normal rank