Nonparametric Estimation of Repeated Densities with Heterogeneous Sample Sizes
From MaRDI portal
Publication:6153983
Abstract: We consider the estimation of densities in multiple subpopulations, where the available sample size in each subpopulation greatly varies. This problem occurs in epidemiology, for example, where different diseases may share similar pathogenic mechanism but differ in their prevalence. Without specifying a parametric form, our proposed method pools information from the population and estimate the density in each subpopulation in a data-driven fashion. Drawing from functional data analysis, low-dimensional approximating density families in the form of exponential families are constructed from the principal modes of variation in the log-densities. Subpopulation densities are subsequently fitted in the approximating families based on likelihood principles and shrinkage. The approximating families increase in their flexibility as the number of components increases and can approximate arbitrary infinite-dimensional densities. We also derive convergence results of the density estimates with discrete observations. The proposed methods are shown to be interpretable and efficient in simulation as well as applications to electronic medical record and rainfall data.
Cites work
- scientific article; zbMATH DE number 469135 (Why is no real title available?)
- scientific article; zbMATH DE number 1560711 (Why is no real title available?)
- A study of logspline density estimation
- Additive functional regression for densities as responses
- Amplitude and phase variation of point processes
- Bayes Hilbert spaces
- Bayes spaces: use of improper distributions and exponential families
- Dimensionality reduction when data are density functions
- EM algorithms for multivariate Gaussian mixture models with truncated and censored data
- Functional Data Analysis for Sparse Longitudinal Data
- Functional data analysis for density functions by transformation to a Hilbert space
- Functional data analysis for point processes with rare events
- Functional principal component analysis of density families with categorical and continuous data on Canadian entrant manufacturing firms
- High-dimensional statistics. A non-asymptotic viewpoint
- Hilbert space of probability density functions based on Aitchison geometry
- Inference for Density Families Using Functional Principal Component Analysis
- Information geometry and its applications
- Local likelihood density estimation
- On the evolution of the united kingdom price distributions
- Semiparametric exponential families for heavy-tailed data
- Simplicial principal component analysis for density functions in Bayes spaces
This page was built for publication: Nonparametric Estimation of Repeated Densities with Heterogeneous Sample Sizes
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6153983)