Hoeffding's inequality for uniformly ergodic Markov chains (Q1612983)

scientific article

Language	Label	Description	Also known as
English	Hoeffding's inequality for uniformly ergodic Markov chains	scientific article

Statements

scholarly article

Hoeffding's inequality for uniformly ergodic Markov chains (English)

Statistics & Probability Letters

publication date

5 September 2002

The aim of this mathematical note is to generalize Hoeffding's inequality, which in its classical form gives an exponential bound on the deviations of partial sums of independent, bounded random variables. The authors propose and prove an extension of Hoeffding's inequality to the setting of uniformly ergodic Markov chains. The resulting bound on the deviation of the partial sums (derived from a Markov chain) from their expectation is particularly useful where uniform control of the constants in the exponential bound is required, in particular in the analysis of reinforcement learning algorithms.
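
For orientation, the classical inequality that the note generalizes can be written as follows. This is only the standard independent-case statement, given in notation assumed here (X_i, a_i, b_i, S_n, t, and ε do not appear in the record above); the Markov-chain version proved in the article, with its chain-dependent constants, is not reproduced.

```latex
% Classical Hoeffding inequality (independent case), stated for reference.
% Assumed notation: X_1, ..., X_n are independent with a_i <= X_i <= b_i
% almost surely, S_n = X_1 + ... + X_n, and t > 0.
\[
  \Pr\bigl(S_n - \mathbb{E}[S_n] \ge t\bigr)
  \le \exp\!\left( -\frac{2 t^{2}}{\sum_{i=1}^{n} (b_i - a_i)^{2}} \right).
\]
% Special case: every X_i takes values in [0, 1] and t = n\varepsilon.
\[
  \Pr\bigl(S_n - \mathbb{E}[S_n] \ge n\varepsilon\bigr)
  \le \exp\!\left( -2 n \varepsilon^{2} \right).
\]
```

In the special case the exponent depends only on n and ε, which is the kind of explicit, uniform control of constants that the review highlights as the motivation for a Markov-chain analogue.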

Neculai Curteanu

zbMATH Keywords

Hoeffding's inequality

Markov chains

large deviations

reinforcement learning algorithm

MaRDI profile type

MaRDI publication profile

Stationarity detection in the initial transient problem

Weighted sums of certain dependent random variables

Multiplicative ergodicity and large deviations for an irreducible Markov chain.

Probability Inequalities for Sums of Bounded Random Variables

Markov chains and stochastic stability

Kernel-based reinforcement learning in average-cost problems

full work available at URL

https://doi.org/10.1016/s0167-7152(01)00158-4

Identifiers

zbMATH Open document ID

10.1016/S0167-7152(01)00158-4

Mathematics Subject Classification ID

zbMATH DE Number

Sitelinks

Mathematics(1 entry)

mardi Publication:1612983
