Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems (Q2071401): Difference between revisions

@@ Property / describes a project that uses @@
+SUMO
@@ Property / describes a project that uses: SUMO / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+CARLA
@@ Property / describes a project that uses: CARLA / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1007/s10994-020-05939-8
@@ Property / full work available at URL: https://doi.org/10.1007/s10994-020-05939-8 / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W3137303899
@@ Property / OpenAlex ID: W3137303899 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4533362
@@ Property / cites work: Q4533362 / rank @@
+Normal rank
@@ Property / cites work @@
+On a routing problem
@@ Property / cites work: On a routing problem / rank @@
+Normal rank
@@ Property / cites work @@
+Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions
@@ Property / cites work: Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions / rank @@
+Normal rank
@@ Property / cites work @@
+Deep hedging
@@ Property / cites work: Deep hedging / rank @@
+Normal rank
@@ Property / cites work @@
+Q3996777
@@ Property / cites work: Q3996777 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4708017
@@ Property / cites work: Q4708017 / rank @@
+Normal rank
@@ Property / cites work @@
+Policy search for motor primitives in robotics
@@ Property / cites work: Policy search for motor primitives in robotics / rank @@
+Normal rank
@@ Property / cites work @@
+A well-conditioned estimator for large-dimensional covariance matrices
@@ Property / cites work: A well-conditioned estimator for large-dimensional covariance matrices / rank @@
+Normal rank
@@ Property / cites work @@
+Q2810828
@@ Property / cites work: Q2810828 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5491447
@@ Property / cites work: Q5491447 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3093299
@@ Property / cites work: Q3093299 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4626283
@@ Property / cites work: Q4626283 / rank @@
+Normal rank
@@ Property / cites work @@
+Simple statistical gradient-following algorithms for connectionist reinforcement learning
@@ Property / cites work: Simple statistical gradient-following algorithms for connectionist reinforcement learning / rank @@
+Normal rank