A Bandit Learning Method for Continuous Games under Feedback Delays with Residual Pseudo-Gradient Estimate (Q6431228)

From MaRDI portal





scientific article; zbMATH DE number 900585774
Language Label Description Also known as
default for all languages
No label defined
    English
    A Bandit Learning Method for Continuous Games under Feedback Delays with Residual Pseudo-Gradient Estimate
    scientific article; zbMATH DE number 900585774

      Statements

      28 March 2023
      0 references
      math.OC
      0 references
      0 references
      0 references

      Identifiers

      0 references