On the empirical multilinear copula process for count data (Q396007): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
 
(5 intermediate revisions by 4 users not shown)
Property / author
 
Property / author: Christian Genest / rank
Normal rank
 
Property / author
 
Property / author: Bruno Rémillard / rank
Normal rank
 
Property / author
 
Property / author: Christian Genest / rank
 
Normal rank
Property / author
 
Property / author: Bruno Rémillard / rank
 
Normal rank
Property / review text
 
The aim of the paper under review is to study the asymptotic behavior of the empirical process associated with the multilinear copula based on \(d\)-variate count data. The main result is sufficient to deduce the limiting distribution of classical statistics for monotone trend such as Spearman's rho and Kendall's tau, and performs well for sparse contingency tables whose dimensions are sample size dependent. The first two authors studied this earlier to show some important dependence properties for the 2-dimensional case. In Section 2, the authors define the multilinear extension copula \(C^*\) of the joint cumulative distribution function \(H\) for a vector \(X=(X_1, \dots, X_d)\) of discrete random variables with density with respect to the Lebesgue measure given by \[ c^*(u_1, \dots, u_d) = \frac{Pr[X_1=A_1(k_1), \dots, X_d=A_d(k_d)]}{Pr[X_1=A_1(k_1)]\cdots Pr[X_d=A_d(k_d)]}, \] for all \(F_j[A_j(k_j-1)]< u_j \leq F_j[A_j(k_j)]\) for some \(k_j\in \mathbb{N}\), where \(F_j\) is the distribution function of \(X_j\) (\(1\leq j \leq d\)). The explicit form of \(C^*\) is given in Proposition 2.1 and is proved in Appendix A, and the multilinear extension copula satisfies Sklar's representation, and is invariant with respect to strictly increasing transformations of the margins. Consider a random sample \({\chi}=\{(X_{11}, \dots, X_{1d}), \dots, (X_{n1}, \dots, X_{nd})\}\) from \(H\) and let \(H_n\) be the corresponding empirical distribution function. One can define \[ c_n^*(u_1, \dots, u_d) = \frac{h_n[A_{n1}(k_1), \dots, A_{nd}(k_d)]}{\Delta F_{n1}[A_{n1}(k_1)]\cdots \Delta F_{nd}[A_{nd}(k_d)]}, \] \[ C_n^*(u_1, \dots, u_d) = \sum_{S\subset\{1, \dots, d\}}\lambda_{H_n, S}(u_1, \dots, u_d)H_n\{ F_{n1}^{-1}(u_{S_1}), \dots, F_{nd}^{-1}(u_{S_d})\}. \] The \(C_n^*(u_1, \dots, u_d)\) is asymptotically equivalent to other versions of the empirical copula commonly used in the literature. Section 3 gives the main result on the asymptotic behavior of \(C_n^*\) as \(n\to \infty\). Theorem 3.1 shows that \(C_n^*\) converges over any compact subset \(K\). Section 4 and Appendix B are devoted to the proof of the main result Theorem 3.1. The first step consists of considering the case where the margins of \(H\) are known. Proposition 4.1 shows that the empirical process \(C_n^*\) converges, and the process \(\hat{C}_n^*\) in which margins are unknown can be decomposed into \(\tilde{C}_n^* + \tilde{D}_n\). Therefore, the next step shows that \(\tilde{C}_n^* - C_n^*\) converges to zero and \(\tilde{D}_n - D_n\) converges to zero, and \(\hat{C}_n^*\) is a uniformly consistent estimator of \(C^*\). Section 5 lists applications of the asymptotic behavior and usefulness of this result. Kendall's tau and Spearman's rho are two classical measures of monotone trend for two-way cross-classifications of ordinal or interval data. Proposition 5.1 shows the weak convergence of tau for \(d=2\), Proposition 5.2 shows the weak convergence of rho for \(d\geq 2\), and Proposition 5.3 shows that a test based on the Cramer-von Mises statistic \(S_n\) is consistent against any alternative. However, the limiting null distribution of \(S_n\) depends upon the margins of \(H\) which are generally unknown. For \(d=2\), Algorithm 5.1 presents how to carry out the test with restoring to resampling techniques as multiplier bootstrap. The paper ends with the conclusion in Section 6. It is nice that all proofs are given in the appendixes.
Property / review text: The aim of the paper under review is to study the asymptotic behavior of the empirical process associated with the multilinear copula based on \(d\)-variate count data. The main result is sufficient to deduce the limiting distribution of classical statistics for monotone trend such as Spearman's rho and Kendall's tau, and performs well for sparse contingency tables whose dimensions are sample size dependent. The first two authors studied this earlier to show some important dependence properties for the 2-dimensional case. In Section 2, the authors define the multilinear extension copula \(C^*\) of the joint cumulative distribution function \(H\) for a vector \(X=(X_1, \dots, X_d)\) of discrete random variables with density with respect to the Lebesgue measure given by \[ c^*(u_1, \dots, u_d) = \frac{Pr[X_1=A_1(k_1), \dots, X_d=A_d(k_d)]}{Pr[X_1=A_1(k_1)]\cdots Pr[X_d=A_d(k_d)]}, \] for all \(F_j[A_j(k_j-1)]< u_j \leq F_j[A_j(k_j)]\) for some \(k_j\in \mathbb{N}\), where \(F_j\) is the distribution function of \(X_j\) (\(1\leq j \leq d\)). The explicit form of \(C^*\) is given in Proposition 2.1 and is proved in Appendix A, and the multilinear extension copula satisfies Sklar's representation, and is invariant with respect to strictly increasing transformations of the margins. Consider a random sample \({\chi}=\{(X_{11}, \dots, X_{1d}), \dots, (X_{n1}, \dots, X_{nd})\}\) from \(H\) and let \(H_n\) be the corresponding empirical distribution function. One can define \[ c_n^*(u_1, \dots, u_d) = \frac{h_n[A_{n1}(k_1), \dots, A_{nd}(k_d)]}{\Delta F_{n1}[A_{n1}(k_1)]\cdots \Delta F_{nd}[A_{nd}(k_d)]}, \] \[ C_n^*(u_1, \dots, u_d) = \sum_{S\subset\{1, \dots, d\}}\lambda_{H_n, S}(u_1, \dots, u_d)H_n\{ F_{n1}^{-1}(u_{S_1}), \dots, F_{nd}^{-1}(u_{S_d})\}. \] The \(C_n^*(u_1, \dots, u_d)\) is asymptotically equivalent to other versions of the empirical copula commonly used in the literature. Section 3 gives the main result on the asymptotic behavior of \(C_n^*\) as \(n\to \infty\). Theorem 3.1 shows that \(C_n^*\) converges over any compact subset \(K\). Section 4 and Appendix B are devoted to the proof of the main result Theorem 3.1. The first step consists of considering the case where the margins of \(H\) are known. Proposition 4.1 shows that the empirical process \(C_n^*\) converges, and the process \(\hat{C}_n^*\) in which margins are unknown can be decomposed into \(\tilde{C}_n^* + \tilde{D}_n\). Therefore, the next step shows that \(\tilde{C}_n^* - C_n^*\) converges to zero and \(\tilde{D}_n - D_n\) converges to zero, and \(\hat{C}_n^*\) is a uniformly consistent estimator of \(C^*\). Section 5 lists applications of the asymptotic behavior and usefulness of this result. Kendall's tau and Spearman's rho are two classical measures of monotone trend for two-way cross-classifications of ordinal or interval data. Proposition 5.1 shows the weak convergence of tau for \(d=2\), Proposition 5.2 shows the weak convergence of rho for \(d\geq 2\), and Proposition 5.3 shows that a test based on the Cramer-von Mises statistic \(S_n\) is consistent against any alternative. However, the limiting null distribution of \(S_n\) depends upon the margins of \(H\) which are generally unknown. For \(d=2\), Algorithm 5.1 presents how to carry out the test with restoring to resampling techniques as multiplier bootstrap. The paper ends with the conclusion in Section 6. It is nice that all proofs are given in the appendixes. / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 62H20 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 62G10 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 62G20 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 62H17 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 60F05 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 60F17 / rank
 
Normal rank
Property / zbMATH DE Number
 
Property / zbMATH DE Number: 6327911 / rank
 
Normal rank
Property / zbMATH Keywords
 
checkerboard copula
Property / zbMATH Keywords: checkerboard copula / rank
 
Normal rank
Property / zbMATH Keywords
 
contingency table
Property / zbMATH Keywords: contingency table / rank
 
Normal rank
Property / zbMATH Keywords
 
count data
Property / zbMATH Keywords: count data / rank
 
Normal rank
Property / zbMATH Keywords
 
empirical process
Property / zbMATH Keywords: empirical process / rank
 
Normal rank
Property / zbMATH Keywords
 
Kendall's tau
Property / zbMATH Keywords: Kendall's tau / rank
 
Normal rank
Property / zbMATH Keywords
 
mid-ranks
Property / zbMATH Keywords: mid-ranks / rank
 
Normal rank
Property / zbMATH Keywords
 
multilinear extension copula
Property / zbMATH Keywords: multilinear extension copula / rank
 
Normal rank
Property / zbMATH Keywords
 
Spearman's rho
Property / zbMATH Keywords: Spearman's rho / rank
 
Normal rank
Property / zbMATH Keywords
 
test of independence
Property / zbMATH Keywords: test of independence / rank
 
Normal rank
Property / zbMATH Keywords
 
Spearsman's rho
Property / zbMATH Keywords: Spearsman's rho / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1407.1200 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Introduction to Categorical Data Analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: Time series: theory and methods. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3855979 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3903872 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Real Analysis and Probability / rank
 
Normal rank
Property / cites work
 
Property / cites work: Weak convergence of empirical copula processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Primer on Copulas for Count Data / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discussion: Statistical models and methods for dependence in insurance data / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the estimation of Spearman's rho and related tests of independence for possibly discontinuous multivariate data / rank
 
Normal rank
Property / cites work
 
Property / cites work: Tests of independence and randomness based on the empirical copula process / rank
 
Normal rank
Property / cites work
 
Property / cites work: An introduction to copulas. / rank
 
Normal rank
Property / cites work
 
Property / cites work: On rank correlation measures for non-continuous random variables / rank
 
Normal rank
Property / cites work
 
Property / cites work: Tests of Multivariate Independence for Ordinal Data / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotic distributions of multivariate rank order statistics / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multivariate extensions of Spearman's rho and related statistics / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotics of empirical copula processes under non-restrictive smoothness assumptions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Weak convergence and empirical processes. With applications to statistics / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 21:39, 8 July 2024

scientific article
Language Label Description Also known as
English
On the empirical multilinear copula process for count data
scientific article

    Statements

    On the empirical multilinear copula process for count data (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    8 August 2014
    0 references
    The aim of the paper under review is to study the asymptotic behavior of the empirical process associated with the multilinear copula based on \(d\)-variate count data. The main result is sufficient to deduce the limiting distribution of classical statistics for monotone trend such as Spearman's rho and Kendall's tau, and performs well for sparse contingency tables whose dimensions are sample size dependent. The first two authors studied this earlier to show some important dependence properties for the 2-dimensional case. In Section 2, the authors define the multilinear extension copula \(C^*\) of the joint cumulative distribution function \(H\) for a vector \(X=(X_1, \dots, X_d)\) of discrete random variables with density with respect to the Lebesgue measure given by \[ c^*(u_1, \dots, u_d) = \frac{Pr[X_1=A_1(k_1), \dots, X_d=A_d(k_d)]}{Pr[X_1=A_1(k_1)]\cdots Pr[X_d=A_d(k_d)]}, \] for all \(F_j[A_j(k_j-1)]< u_j \leq F_j[A_j(k_j)]\) for some \(k_j\in \mathbb{N}\), where \(F_j\) is the distribution function of \(X_j\) (\(1\leq j \leq d\)). The explicit form of \(C^*\) is given in Proposition 2.1 and is proved in Appendix A, and the multilinear extension copula satisfies Sklar's representation, and is invariant with respect to strictly increasing transformations of the margins. Consider a random sample \({\chi}=\{(X_{11}, \dots, X_{1d}), \dots, (X_{n1}, \dots, X_{nd})\}\) from \(H\) and let \(H_n\) be the corresponding empirical distribution function. One can define \[ c_n^*(u_1, \dots, u_d) = \frac{h_n[A_{n1}(k_1), \dots, A_{nd}(k_d)]}{\Delta F_{n1}[A_{n1}(k_1)]\cdots \Delta F_{nd}[A_{nd}(k_d)]}, \] \[ C_n^*(u_1, \dots, u_d) = \sum_{S\subset\{1, \dots, d\}}\lambda_{H_n, S}(u_1, \dots, u_d)H_n\{ F_{n1}^{-1}(u_{S_1}), \dots, F_{nd}^{-1}(u_{S_d})\}. \] The \(C_n^*(u_1, \dots, u_d)\) is asymptotically equivalent to other versions of the empirical copula commonly used in the literature. Section 3 gives the main result on the asymptotic behavior of \(C_n^*\) as \(n\to \infty\). Theorem 3.1 shows that \(C_n^*\) converges over any compact subset \(K\). Section 4 and Appendix B are devoted to the proof of the main result Theorem 3.1. The first step consists of considering the case where the margins of \(H\) are known. Proposition 4.1 shows that the empirical process \(C_n^*\) converges, and the process \(\hat{C}_n^*\) in which margins are unknown can be decomposed into \(\tilde{C}_n^* + \tilde{D}_n\). Therefore, the next step shows that \(\tilde{C}_n^* - C_n^*\) converges to zero and \(\tilde{D}_n - D_n\) converges to zero, and \(\hat{C}_n^*\) is a uniformly consistent estimator of \(C^*\). Section 5 lists applications of the asymptotic behavior and usefulness of this result. Kendall's tau and Spearman's rho are two classical measures of monotone trend for two-way cross-classifications of ordinal or interval data. Proposition 5.1 shows the weak convergence of tau for \(d=2\), Proposition 5.2 shows the weak convergence of rho for \(d\geq 2\), and Proposition 5.3 shows that a test based on the Cramer-von Mises statistic \(S_n\) is consistent against any alternative. However, the limiting null distribution of \(S_n\) depends upon the margins of \(H\) which are generally unknown. For \(d=2\), Algorithm 5.1 presents how to carry out the test with restoring to resampling techniques as multiplier bootstrap. The paper ends with the conclusion in Section 6. It is nice that all proofs are given in the appendixes.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    checkerboard copula
    0 references
    contingency table
    0 references
    count data
    0 references
    empirical process
    0 references
    Kendall's tau
    0 references
    mid-ranks
    0 references
    multilinear extension copula
    0 references
    Spearman's rho
    0 references
    test of independence
    0 references
    Spearsman's rho
    0 references
    0 references