Off-line estimation of controlled Markov chains: minimaxity and sample complexity (Q6903458)

From MaRDI portal
!
WARNING

This is the item page for this Wikibase entity, intended for internal use and editing purposes.

scientific article; zbMATH DE number 8118207
Language Label Description Also known as
default for all languages
No label defined
    English
    Off-line estimation of controlled Markov chains: minimaxity and sample complexity
    scientific article; zbMATH DE number 8118207

      Statements

      Off-line estimation of controlled Markov chains: minimaxity and sample complexity (English)
      0 references
      0 references
      0 references
      0 references
      10 November 2025
      0 references
      reinforcement learning
      0 references
      controlled Markov chains
      0 references
      stochastic processes
      0 references
      policy evaluation
      0 references
      nonparametric statistics
      0 references

      Identifiers

      0 references
      0 references
      0 references
      0 references
      0 references