Semi-parametric regression estimation of the tail index (Q1697475)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Semi-parametric regression estimation of the tail index |
scientific article |
Statements
Semi-parametric regression estimation of the tail index (English)
0 references
20 February 2018
0 references
Let \( X_{1},X_{2},\ldots,X_{n} \) be a random sample with a distribution function \(F\) satisfying \[ \bar{F}\left( x\right) =x^{-\alpha} L\left(x \right), \quad x\rightarrow \infty, \] where \( \bar{F}=1-F \) and \( L\left(x \right) \) is a slowly varying function, satisfying \( L\left(tx \right) / L\left(x \right) \rightarrow 1 \) as \( x\rightarrow \infty, \) for any \( t>0\). The parameter \( \alpha>0 \) is called the tail index or the extreme value index. Using an empirical version of the real part of the characteristic function \( U\left(t \right)= E\left[\cos\left( tX\right) \right], \) i.e. \( U_{n}\left(t \right)= n^{-1} \sum_{j=1}^{n} \cos\left( tX_{j}\right)\), and evaluating \( U_{n}\left( t\right) \) at points \( t_{j}=j/\sqrt{n}, \; j=1,2,\ldots,m,\, m=\left[ n^{\delta}\right], 0<\delta< 1/2, \) the authors consider the regression equation \[ \log\left[\left( 1-U_{n}\left(t_{j} \right) \right) \right] \sim \log\left[g\left( \alpha,t_{j}\right) \right] + \alpha\log t_{j} + \varepsilon_{j}, \quad 0<\alpha\leqslant 2, \] where \( \varepsilon_{j} =\log \frac{1-U_{n}\left(t_j \right) }{1-U\left(t_j \right)}\) and \( g\left(\alpha,t \right)= \pi/2 \left[ \Gamma\left( \alpha\right) \sin\left( \alpha\pi/2 \right) \right]^{-1} L\left(1/t \right) \) if \, \( 0< \alpha<2\) and \( g\left(\alpha,t \right)= \int_{0}^{1/t} x \left[F\left(-x \right) + \bar{F}\left(x \right) \right] \, dx\) if \( \alpha =2. \) By ordinary least squares, they obtain a simple estimator \( \hat{\alpha} \) for \( \alpha \). The bias reduced version of \( \hat{\alpha} \) is also developed. The authors define a procedure to obtain a reduced bias estimator of \( \alpha \) optimized, according to generalized crossvalidation or restricted maximum likelihood, with respect to the choice of \( m, \) i.e. \(\delta\). Theoretical properties of the proposed method are derived and simulations show the performance of this estimator in a wide range of cases. An application to data sets on city sizes, facing the debated issue of distinguishing Pareto-type tails from Log-normal tails, illustrate, show the proposed method works in practice.
0 references
tail index
0 references
heavy-tailed distributions
0 references
regular variation
0 references
empirical characteristic function
0 references
Zipf's law
0 references
0 references
0 references
0 references
0 references