A policy-based learning beam search for combinatorial optimization
From MaRDI portal
Publication:6149097
Cites work
- A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
- A large neighborhood search heuristic for the longest common subsequence problem
- Algorithms on Strings, Trees and Sequences
- Finding the longest common subsequence for multiple biological sequences by ant colony optimization
- Reinforcement learning. An introduction
- The Complexity of Some Problems on Subsequences and Supersequences
This page was built for publication: A policy-based learning beam search for combinatorial optimization
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6149097)