Empirical measure large deviations for reinforced chains on finite spaces

From MaRDI portal
Publication:2107618

DOI10.1016/J.SYSCONLE.2022.105379zbMATH Open1505.93251arXiv2205.09291OpenAlexW4297330031MaRDI QIDQ2107618FDOQ2107618


Authors: Amarjit Budhiraja, Adam Waterbury Edit this on Wikidata


Publication date: 2 December 2022

Published in: Systems \& Control Letters (Search for Journal in Brave)

Abstract: Let A be a transition probability kernel on a finite state space Deltao=1,ldots,d such that A(x,y)>0 for all x,yinDeltao. Consider a reinforced chain given as a sequence Xn,;ninmathbbN0 of Deltao-valued random variables, defined recursively according to, L^n = frac{1}{n}sum_{i=0}^{n-1} delta_{X_i}, ;; P(X_{n+1} in cdot mid X_0, ldots, X_n) = L^n A(cdot). We establish a large deviation principle for Ln. The rate function takes a strikingly different form than the Donsker-Varadhan rate function associated with the empirical measure of the Markov chain with transition kernel A and is described in terms of a novel deterministic infinite horizon discounted cost control problem with an associated linear controlled dynamics and a nonlinear running cost involving the relative entropy function. Proofs are based on an analysis of time-reversal of controlled dynamics in representations for log-transforms of exponential moments, and on weak convergence methods.


Full work available at URL: https://arxiv.org/abs/2205.09291




Recommendations




Cites Work


Cited In (1)





This page was built for publication: Empirical measure large deviations for reinforced chains on finite spaces

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2107618)