Hoeffding's inequality for uniformly ergodic Markov chains (Q1612983): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Stationarity detection in the initial transient problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Weighted sums of certain dependent random variables / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multiplicative ergodicity and large deviations for an irreducible Markov chain. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4881152 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5822308 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Probability Inequalities for Sums of Bounded Random Variables / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov chains and stochastic stability / rank
 
Normal rank
Property / cites work
 
Property / cites work: Kernel-based reinforcement learning in average-cost problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3857500 / rank
 
Normal rank

Latest revision as of 16:37, 4 June 2024

scientific article
Language Label Description Also known as
English
Hoeffding's inequality for uniformly ergodic Markov chains
scientific article

    Statements

    Hoeffding's inequality for uniformly ergodic Markov chains (English)
    0 references
    0 references
    0 references
    5 September 2002
    0 references
    The aim of the present mathematical note is to provide a generalization of Hoeffding's inequality which, in its classical form, gives an exponential bound on partial sums of independent and bounded random variables. The authors propose and prove an extension of the Hoeffding's inequality to the setting of (uniformly ergodic) Markov chains. The deviation of the partial sums (derived from a Markov chain) from their expectation is particularly useful in situations where the uniform control on the constants involved in the exponential bound is required; in particular, within the analysis of reinforcement learning algorithms.
    0 references
    0 references
    Hoeffding's inequality
    0 references
    Markov chains
    0 references
    large deviations
    0 references
    reinforcement learning algorithm
    0 references