A learning algorithm for the finite-time two-armed bandit problem (Q3342234)

From MaRDI portal





scientific article; zbMATH DE number 3876947
Language Label Description Also known as
default for all languages
No label defined
    English
    A learning algorithm for the finite-time two-armed bandit problem
    scientific article; zbMATH DE number 3876947

      Statements

      A learning algorithm for the finite-time two-armed bandit problem (English)
      0 references
      0 references
      0 references
      0 references
      1984
      0 references
      learning algorithm
      0 references
      finite-time two-armed bandit problem
      0 references
      estimating process
      0 references
      controlling process
      0 references

      Identifiers