Hoeffding's inequality for uniformly ergodic Markov chains (Q1612983)
From MaRDI portal
Full work available at URL: https://doi.org/10.1016/s0167-7152(01)00158-4
OpenAlex ID: W2082040833
Latest revision as of 11:31, 30 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Hoeffding's inequality for uniformly ergodic Markov chains | scientific article | |
Statements
Hoeffding's inequality for uniformly ergodic Markov chains (English)
5 September 2002
This mathematical note generalizes Hoeffding's inequality, which in its classical form gives an exponential bound on the deviation of partial sums of independent, bounded random variables from their expectation. The authors state and prove an extension of Hoeffding's inequality to uniformly ergodic Markov chains. The resulting bound on the deviation of the partial sums (derived from a Markov chain) from their expectation is particularly useful where uniform control of the constants in the exponential bound is required, in particular in the analysis of reinforcement learning algorithms.
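For orientation, the classical inequality that the note extends can be written as follows. This is the standard independent-case statement only; the notation (X_i, a_i, b_i, S_n, t) is assumed here for illustration and is not taken from the article, and the constants of the Markov-chain version are not reproduced.

```latex
% Classical Hoeffding inequality (independent case), stated for reference.
% Assumed notation: X_1, ..., X_n independent with a_i <= X_i <= b_i almost surely,
% S_n = X_1 + ... + X_n, and t > 0.
\[
  \mathbb{P}\bigl(S_n - \mathbb{E}[S_n] \ge t\bigr)
  \le \exp\!\left(-\frac{2t^{2}}{\sum_{i=1}^{n}(b_i - a_i)^{2}}\right)
\]
```

In the Markov-chain setting described above, the independence assumption is replaced by uniform ergodicity of the chain, and the constants in the exponent then depend on the chain's mixing properties.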
Hoeffding's inequality
Markov chains
large deviations
reinforcement learning algorithm