Hoeffding's inequality for uniformly ergodic Markov chains (Q1612983): Difference between revisions

The aim of this note is to generalize Hoeffding's inequality, which in its classical form gives an exponential bound on the deviation of partial sums of independent, bounded random variables from their expectation. The authors state and prove an extension of Hoeffding's inequality to uniformly ergodic Markov chains. The resulting bound on the deviation of the partial sums (derived from a Markov chain) from their expectation is particularly useful where uniform control of the constants in the exponential bound is required, in particular in the analysis of reinforcement learning algorithms.
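
For orientation, the classical inequality referred to above can be written as follows. This is the standard textbook form for independent variables only, not the Markov chain extension proved in the reviewed paper; the symbols $X_i$, $a_i$, $b_i$, $S_n$ and $t$ are notation introduced here for illustration and do not come from the review. Let $X_1, \dots, X_n$ be independent with $a_i \le X_i \le b_i$ almost surely, and let $S_n = X_1 + \dots + X_n$. Then, for every $t > 0$,

\[
\Pr\bigl(S_n - \mathbb{E}[S_n] \ge t\bigr) \le \exp\!\left(-\frac{2t^2}{\sum_{i=1}^{n}(b_i - a_i)^2}\right).
\]

When all variables take values in $[0, 1]$ and $t = n\varepsilon$, the right-hand side reduces to $\exp(-2n\varepsilon^2)$, which decays exponentially in $n$.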

reviewed by

Neculai Curteanu

zbMATH Keywords

Hoeffding's inequality

Markov chains

large deviations

reinforcement learning algorithm

MaRDI profile type

MaRDI publication profile

cites work

Stationarity detection in the initial transient problem

Weighted sums of certain dependent random variables

Multiplicative ergodicity and large deviations for an irreducible Markov chain.

Q4881152

Q5822308

Probability Inequalities for Sums of Bounded Random Variables

Markov chains and stochastic stability

Kernel-based reinforcement learning in average-cost problems

Q4315289

Q3857500

full work available at URL

https://doi.org/10.1016/s0167-7152(01)00158-4

Identifiers

zbMATH Open document ID

0999.60019

DOI

10.1016/S0167-7152(01)00158-4

Mathematics Subject Classification ID

Sitelinks

Mathematics (1 entry)

mardi Publication:1612983

@@ Property / full work available at URL @@
+https://doi.org/10.1016/s0167-7152(01)00158-4
+Normal rank
@@ Property / OpenAlex ID @@
+W2082040833
@@ Property / OpenAlex ID: W2082040833 / rank @@
+Normal rank