An information-theoretic analysis of return maximization in reinforcement learning (Q2375396): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot
Changed an Item
 
Property / cites work (each entry added with normal rank):

    The strong ergodic theorem for densities: Generalized Shannon-McMillan-Breiman theorem
    Q3241581
    Discrete Dynamic Programming
    The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning
    The Individual Ergodic Theorem of Information Theory
    Correction Notes: Correction to "The Individual Ergodic Theorem of Information Theory"
    Elements of Information Theory
    The method of types [information theory]
    Q3686615
    The convergence of \(TD(\lambda)\) for general \(\lambda\)
    Model-Based Reinforcement Learning for Partially Observable Games with Sampling-Based State Estimation
    Boundedness of iterates in \(Q\)-learning
    Asymptotically mean stationary measures
    Q4779829
    Approximation theory of output statistics
    A New Optimality Criterion for Nonhomogeneous Markov Decision Processes
    Q3266141
    The asymptotic equipartition property in reinforcement learning and its relation to return maximization
    A simple proof of the Moy-Perez generalization of the Shannon-McMillan theorem
    Q4346705
    The Basic Theorems of Information Theory
    Q4398828
    Generalizations of Shannon-McMillan theorem
    A Mathematical Theory of Communication
    Convergence results for single-step on-policy reinforcement-learning algorithms
    Asynchronous stochastic approximation and Q-learning
    The role of the asymptotic equipartition property in noiseless source coding
    \({\mathcal Q}\)-learning

Latest revision as of 12:26, 6 July 2024

Language: English
Label: An information-theoretic analysis of return maximization in reinforcement learning
Description: scientific article
Also known as: (none listed)

    Statements

    An information-theoretic analysis of return maximization in reinforcement learning (English)
    14 June 2013
    reinforcement learning
    stochastic sequential decision process
    information theory
    asymptotic equipartition property

    Identifiers