Computing human-understandable strategies: deducing fundamental rules of poker strategy (Q1656987): Difference between revisions
From MaRDI portal
Set OpenAlex properties. |
ReferenceBot (talk | contribs) Changed an Item |
||
Property / cites work | |||
Property / cites work: Q2894080 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: The complexity of two-person zero-sum games in extensive form / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Interpretable classifiers using rules and Bayesian analysis: building a better stroke prediction model / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: DeepStack: Expert-level artificial intelligence in heads-up no-limit poker / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q5681545 / rank | |||
Normal rank |
Latest revision as of 07:21, 16 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Computing human-understandable strategies: deducing fundamental rules of poker strategy |
scientific article |
Statements
Computing human-understandable strategies: deducing fundamental rules of poker strategy (English)
0 references
13 August 2018
0 references
Summary: Algorithms for equilibrium computation generally make no attempt to ensure that the computed strategies are understandable by humans. For instance the strategies for the strongest poker agents are represented as massive binary files. In many situations, we would like to compute strategies that can actually be implemented by humans, who may have computational limitations and may only be able to remember a small number of features or components of the strategies that have been computed. For example, a human poker player or military leader may not have access to large precomputed tables when making real-time strategic decisions. We study poker games where private information distributions can be arbitrary (i.e., players are dealt cards from different distributions, which depicts the phenomenon in large real poker games where at some points in the hand players have different distribution of hand strength by applying Bayes' rule given the history of play in the hand thus far). We create a large training set of game instances and solutions, by randomly selecting the information probabilities, and present algorithms that learn from the training instances to perform well in games with unseen distributions. We are able to conclude several new fundamental rules about poker strategy that can be easily implemented by humans.
0 references
game theory
0 references
Nash equilibrium
0 references
human understandable
0 references
imperfect information
0 references
poker
0 references
decision tree
0 references
machine learning
0 references
0 references