Hoeffding's inequality for uniformly ergodic Markov chains (Q1612983): Difference between revisions

The aim of the present mathematical note is to provide a generalization of Hoeffding's inequality which, in its classical form, gives an exponential bound on partial sums of independent and bounded random variables. The authors propose and prove an extension of the Hoeffding's inequality to the setting of (uniformly ergodic) Markov chains. The deviation of the partial sums (derived from a Markov chain) from their expectation is particularly useful in situations where the uniform control on the constants involved in the exponential bound is required; in particular, within the analysis of reinforcement learning algorithms.

0 references

reviewed by

Neculai Curteanu

0 references

Mathematics Subject Classification ID

0 references

0 references

Hoeffding's inequality

0 references

Markov chains

0 references

large deviations

0 references

reinforcement learning algorithm

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

Stationarity detection in the initial transient problem

0 references

Weighted sums of certain dependent random variables

0 references

Multiplicative ergodicity and large deviations for an irreducible Markov chain.

0 references

Q4881152

0 references

Q5822308

0 references

Probability Inequalities for Sums of Bounded Random Variables

0 references

Markov chains and stochastic stability

0 references

Kernel-based reinforcement learning in average-cost problems

0 references

Q4315289

0 references

Q3857500

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1612983

@@ Property / cites work @@
+Stationarity detection in the initial transient problem
+Normal rank
@@ Property / cites work @@
+Weighted sums of certain dependent random variables
+Normal rank
@@ Property / cites work @@
+Multiplicative ergodicity and large deviations for an irreducible Markov chain.
+Normal rank
@@ Property / cites work @@
+Q4881152
@@ Property / cites work: Q4881152 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5822308
@@ Property / cites work: Q5822308 / rank @@
+Normal rank
@@ Property / cites work @@
+Probability Inequalities for Sums of Bounded Random Variables
+Normal rank
@@ Property / cites work @@
+Markov chains and stochastic stability
@@ Property / cites work: Markov chains and stochastic stability / rank @@
+Normal rank
@@ Property / cites work @@
+Kernel-based reinforcement learning in average-cost problems
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3857500
@@ Property / cites work: Q3857500 / rank @@
+Normal rank