Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge (Q6126872): Difference between revisions

@@ Property / DOI @@
-10.1016/j.ins.2024.120182
@@ Property / DOI: 10.1016/j.ins.2024.120182 / rank @@
-Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1016/j.ins.2024.120182
+Normal rank
@@ Property / OpenAlex ID @@
+W4391133699
@@ Property / OpenAlex ID: W4391133699 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4626283
@@ Property / cites work: Q4626283 / rank @@
+Normal rank
@@ Property / cites work @@
+MM Optimization Algorithms
@@ Property / cites work: MM Optimization Algorithms / rank @@
+Normal rank
@@ Property / cites work @@
+Simple statistical gradient-following algorithms for connectionist reinforcement learning
+Normal rank
@@ Property / cites work @@
+Overcoming catastrophic forgetting in neural networks
+Normal rank
@@ Property / cites work @@
+Q5148970
@@ Property / cites work: Q5148970 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5053301
@@ Property / cites work: Q5053301 / rank @@
+Normal rank
@@ Property / Wikidata QID @@
+Q129390077
@@ Property / Wikidata QID: Q129390077 / rank @@
+Normal rank
@@ Property / DOI @@
+10.1016/J.INS.2024.120182
@@ Property / DOI: 10.1016/J.INS.2024.120182 / rank @@
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:6126872