Estimation of heavy-tailed probability density function with applications to Web data (Q1775986)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Estimation of heavy-tailed probability density function with applications to Web data |
scientific article |
Statements
Estimation of heavy-tailed probability density function with applications to Web data (English)
0 references
20 May 2005
0 references
The authors discuss an approach to estimating probability density functions (PDF) using the so-called ``transform-retransform scheme'' which includes transformation of the data before application of specific estimation methods based on appropriate parametric or nonparametric approximation of the cumulative distribution functions (CDF) and estimation of the PDF of the original data applying the reverse transformation of the PDF estimate of the transformed data. In this context the authors focus on the following questions: what family of distributions is a reasonable approximation of the true CDF; which nonparametric estimate better maintains the tail decay rate of the true PDF. An application of the proposed PDF estimators to a classification problem is described, and their performance in cases of different heavy-tailed distributions is investigated by a simulation study. In particular, the PDF estimators are compared by their capacity to solve a classification problem. Simulation results and an application to Web data analysis are presented and discussed.
0 references
nonparametric density estimation
0 references
heavy-tailed density
0 references
kernel estimation
0 references
polygram
0 references
extreme value index
0 references
risk of misclassification
0 references
0 references