Probabilistic inference for determining options in reinforcement learning (Q331688): Difference between revisions

@@ Property / DOI @@
-.1007/s10994-016-5580-x
@@ Property / DOI: 10.1007/s10994-016-5580-x / rank @@
-Normal rank
@@ Property / describes a project that uses @@
+PRMLT
@@ Property / describes a project that uses: PRMLT / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+Publication
@@ Property / MaRDI profile type: Publication / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1007/s10994-016-5580-x
+Normal rank
@@ Property / OpenAlex ID @@
+W2498991332
@@ Property / OpenAlex ID: W2498991332 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5483032
@@ Property / cites work: Q5483032 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3188018
@@ Property / cites work: Q3188018 / rank @@
+Normal rank
@@ Property / cites work @@
+Using Expectation-Maximization for Reinforcement Learning
+Normal rank
@@ Property / cites work @@
+Q4527272
@@ Property / cites work: Q4527272 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3174169
@@ Property / cites work: Q3174169 / rank @@
+Normal rank
@@ Property / cites work @@
+Policy search for motor primitives in robotics
@@ Property / cites work: Policy search for motor primitives in robotics / rank @@
+Normal rank
@@ Property / cites work @@
+.1162/1532443041827907
@@ Property / cites work: 10.1162/1532443041827907 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4709211
@@ Property / cites work: Q4709211 / rank @@
+Normal rank
@@ Property / cites work @@
+Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4737595
@@ Property / cites work: Q4737595 / rank @@
+Normal rank
@@ Property / cites work @@
+Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
+Normal rank
@@ Property / cites work @@
+Q2896181
@@ Property / cites work: Q2896181 / rank @@
+Normal rank
@@ Property / cites work @@
+\({\mathcal Q}\)-learning
@@ Property / cites work: \({\mathcal Q}\)-learning / rank @@
+Normal rank
@@ Property / DOI @@
+.1007/S10994-016-5580-X
@@ Property / DOI: 10.1007/S10994-016-5580-X / rank @@
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:331688