Optimal representative sample weighting
From MaRDI portal
Publication:2058720
DOI: 10.1007/S11222-021-10001-1; zbMATH Open: 1475.62017; arXiv: 2005.09065; OpenAlex: W3133615935; MaRDI QID: Q2058720; FDO: Q2058720
Authors: Yanyan Li
Publication date: 9 December 2021
Published in: Statistics and Computing
Abstract: We consider the problem of assigning weights to a set of samples or data records, with the goal of achieving a representative weighting, which happens when certain sample averages of the data are close to prescribed values. We frame the problem of finding representative sample weights as an optimization problem, which in many cases is convex and can be efficiently solved. Our formulation includes as a special case the selection of a fixed number of the samples, with equal weights, i.e., the problem of selecting a smaller representative subset of the samples. While this problem is combinatorial and not convex, heuristic methods based on convex optimization seem to perform very well. We describe rsw, an open-source implementation of the ideas described in this paper, and apply it to a skewed sample of the CDC BRFSS dataset.
Full work available at URL: https://arxiv.org/abs/2005.09065
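As a rough illustration of the convex case described in the abstract, the sketch below reweights a skewed sample so that one weighted sample average equals a prescribed value. It is not the rsw API and not necessarily the method of the paper: it assumes a single feature, an exact-match requirement, and a maximum-entropy criterion, under which the weights take the form w_i proportional to exp(lam * x_i), and it finds lam by bisection. The function name `max_entropy_weights` and the toy data are hypothetical.

```python
import math


def max_entropy_weights(values, target, tol=1e-10, max_iter=200):
    """Return weights on the simplex, proportional to exp(lam * values[i]),
    whose weighted mean of `values` matches `target` (illustrative sketch)."""
    if not min(values) < target < max(values):
        raise ValueError("target must lie strictly between min and max of values")

    def tilted(lam):
        # Shift exponents by their maximum for numerical stability, then normalize.
        exps = [lam * v for v in values]
        m = max(exps)
        raw = [math.exp(e - m) for e in exps]
        total = sum(raw)
        return [r / total for r in raw]

    def mean(w):
        return sum(wi * v for wi, v in zip(w, values))

    # The weighted mean increases with lam, so first bracket the root.
    lo, hi = -1.0, 1.0
    while mean(tilted(lo)) > target:
        lo *= 2
    while mean(tilted(hi)) < target:
        hi *= 2

    # Bisection on lam until the weighted mean is within tol of the target.
    w = tilted(0.5 * (lo + hi))
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        w = tilted(mid)
        gap = mean(w) - target
        if abs(gap) < tol:
            break
        if gap < 0:
            lo = mid
        else:
            hi = mid
    return w


# Hypothetical skewed sample: 1 of 4 records has the attribute,
# while the prescribed population share is one half.
values = [0.0, 0.0, 0.0, 1.0]
weights = max_entropy_weights(values, target=0.5)
print(weights)
```

In this toy case the record with the attribute ends up with weight close to 1/2 and the other three share the rest equally, at about 1/6 each. With several features and approximate matching, a general convex solver would be the more natural tool.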
Mathematics Subject Classification: Computational methods for problems pertaining to statistics (62-08); Sampling theory, sample surveys (62D05)
Cites Work
- CVXPY: a Python-embedded modeling language for convex optimization
- Parameter selection and preconditioning for a graph form solver
- OSQP: an operator splitting solver for quadratic programs
- Title not available
- Discrete Multivariate Analysis Theory and Practice
- Convergence of a block coordinate descent method for nondifferentiable minimization
- Distributed optimization and statistical learning via the alternating direction method of multipliers
- Conic optimization via operator splitting and homogeneous self-dual embedding
- Symmetric Quasidefinite Matrices
- On Information and Sufficiency
- A Generalization of Sampling Without Replacement From a Finite Universe
- Computational optimal transport. With applications to data sciences
- Generalized Raking Procedures in Survey Sampling
- Title not available
- Reducibility among combinatorial problems
- Graph implementations for nonsmooth convex programs
- Atomic decomposition by basis pursuit
- Block splitting for distributed optimization
- Practical tools for designing and weighting survey samples
- A general system for heuristic minimization of convex functions over non-convex sets
- Iterative Proportional Scaling Revisited: A Modern Optimization Perspective
Cited In (2)