On inference validity of weighted U-statistics under data heterogeneity (Q1786572)

From MaRDI portal

Jump to:navigation, search

scientific article

Language	Label	Description	Also known as
English	On inference validity of weighted U-statistics under data heterogeneity	scientific article

Statements

scholarly article

0 references

On inference validity of weighted U-statistics under data heterogeneity (English)

0 references

0 references

0 references

Electronic Journal of Statistics

0 references

publication date

24 September 2018

0 references

full work available at URL

https://arxiv.org/abs/1804.00034

0 references

https://projecteuclid.org/euclid.ejs/1535681029

0 references

Consider independent (but not necessarily identically distributed) data \(X_1,\ldots,X_n\), and the corresponding U-statistic \[ U_n=\frac{(n-m)!}{n!}\sum_{\underset{1\leq i_1,\ldots,i_m\leq n}{i_j\not=i_k,\text{ }j\not=k}}a_n(i_1,\ldots,i_m)h_n(X_{i_1},\ldots,X_{i_m})\,, \] where no symmetry assumptions are made on the weight function \(a_n\) and the kernel function \(h_n\). The elimination of any assumptions of symmetry and of the IID nature of the underlying data are the key features of this work. The main results of the present paper are a central limit theorem (giving sufficient conditions for the convergence of \(U_n\) to normality as \(n\rightarrow\infty\)), and sufficient conditions for consistent bootstrap variance estimation (including a bounded second moment condition, and control on the heterogeneity of the distributions of the \(X_i\)). These results are applied to the cases of Kendall's tau and the average-precision correlation, defined by \[ \tau_K=\frac{2}{n(n-1)}\sum_{i\not=j}[1(X_i>X_j)1(i < j)+1(X_j>X_i)1(j < i)]-1\,, \] and \[ \tau_{AP}=\frac{2}{n-1}\sum_{i=2}^n\frac{\sum_{j=1}^{i-1}1(X_j>X_i)}{i-1}-1\,, \] respectively, which share the same kernel function, \(1(y>x)\). The work is motivated by the analysis of \(\tau_{AP}\), which appears in an information retrieval setting. Here, the \(X_i\) correspond to the scores given by rankings of a certain webpage, ordered by corresponding human rankings. \(\tau_{AP}\) is a rank correlation measure, designed to evaluate the quality of a given ranking algorithm, where more weight is given to errors at high rankings. Numerical experiments illustrate the main results, and show, for example, that consistency of the bootstrap variance estimation is more sensitive to data heterogeneity than the central limit theorem, and that finite sample bootstrap performance for \(\tau_{AP}\) seems to be generally better than that for \(\tau_K\). The proofs of the main results are combinatorial in flavour.

0 references

0 references

zbMATH Keywords

weighted U-statistics

0 references

bootstrap

0 references

rank correlation

0 references

average-precision correlation

0 references

central limit theorem

0 references

consistency

0 references

MaRDI profile type

MaRDI publication profile

0 references

Some asymptotic theory for the bootstrap

0 references

0 references

Matched-block bootstrap for dependent data

0 references

Asymptotics of randomly weighted image- and image-statistics: application to bootstrap

0 references

Fitting time series models to nonstationary processes

0 references

Central limit theorem and the bootstrap for \(U\)-statistics of strongly mixing data

0 references

Bootstrap methods: another look at the jackknife

0 references

The moving blocks bootstrap and robust inference for linear least squares and quantile regressions

0 references

THE BOOTSTRAP OF THE MEAN FOR DEPENDENT HETEROGENEOUS ARRAYS

0 references

Convergence rates for U-statistics and related statistics

0 references

The bootstrap and Edgeworth expansion

0 references

A Class of Statistics with Asymptotically Normal Distribution

0 references

On weighted \(U\)-statistics for stationary processes.

0 references

A NEW MEASURE OF RANK CORRELATION

0 references

0 references

Bootstrapping Locally Stationary Processes

0 references

The jackknife and the bootstrap for general stationary observations

0 references

On the moving block bootstrap under long range dependence

0 references

0 references

0 references

Bootstrap procedures under some non-i.i.d. models

0 references

Using i.i.d. bootstrap inference for general non-i.i.d. models

0 references

Asymptotic distributions for weighted \(U\)-statistics

0 references

When does bootstrap work! Asymptotic results and simulations

0 references

Asymptotic distributions of weighted \(U\)-statistics of degree 2

0 references

Tapered block bootstrap

0 references

Local block bootstrap

0 references

0 references

Large sample confidence regions based on subsamples under minimal assumptions

0 references

0 references

Heavy-Tail Phenomena

0 references

On the asymptotic behavior of weighted \(U\)-statistics

0 references

Estimates of the Regression Coefficient Based on Kendall's Tau

0 references

Approximation Theorems of Mathematical Statistics

0 references

The Dependent Wild Bootstrap

0 references

Asymptotic normality of permutation statistics derived from weighted sums of bivariate functions

0 references

0 references

Limiting behavior of U-statistics for stationary, absolutely regular processes

0 references

Inference of weighted \(V\)-statistics for nonstationary time series and its applications

0 references

Identifiers

zbMATH Open document ID

0 references

10.1214/18-EJS1462

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

zbMATH DE Number

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1786572

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q1786572&oldid=37841355"