A review on design inspired subsampling for big data
From MaRDI portal
Publication:6549149
DOI10.1007/S00362-022-01386-WzbMATH Open1539.62351MaRDI QIDQ6549149FDOQ6549149
Authors: Jun Yu, Mingyao Ai, Zhiqiang Ye
Publication date: 3 June 2024
Published in: Statistical Papers (Search for Journal in Brave)
Recommendations
Statistical aspects of big data and data science (62R07) Sampling theory, sample surveys (62D05) Optimal statistical designs (62K05)
Cites Work
- Are more data always better for factor analysis?
- Title not available (Why is that?)
- Bagging predictors
- The design and analysis of computer experiments.
- The Role of Sampling Weights When Modeling Survey Data
- Title not available (Why is that?)
- Title not available (Why is that?)
- General equivalence theory for optimum designs (approximate theory)
- Local polynomial regresssion estimators in survey sampling.
- Optimum experimental designs, with SAS
- Weighting for Unequal Selection Probabilities in Multilevel Models
- Asymptotic Theory of Rejective Sampling with Varying Probabilities from a Finite Population
- Principal points
- Note on Grouping
- Title not available (Why is that?)
- Model Selection and Multimodel Inference
- Monte Carlo and quasi-Monte Carlo sampling
- Energy statistics: a class of statistics based on distances
- Experiments. Planning, analysis and optimization.
- Speeding Up MCMC by Efficient Data Subsampling
- A statistical perspective on algorithmic leveraging
- Optimal Design of Experiments
- Orthogonal arrays. Theory and applications
- Auction algorithms for network flow problems: A tutorial introduction
- Uniform designs limit aliasing
- Penalized likelihood regression: General formulation and efficient approximation
- On Design Orthogonality, Maximin Distance, and Projection Uniformity for Computer Experiments
- Revisiting the Nyström method for improved large-scale machine learning
- Title not available (Why is that?)
- Orthogonal Column Latin Hypercubes and Their Application in Computer Experiments
- Smoothing spline ANOVA models
- Efficient computation of smoothing splines via adaptive basis sampling
- Title not available (Why is that?)
- Blendenpik: Supercharging LAPACK's Least-Squares Solver
- Fast Monte Carlo Algorithms for Matrices I: Approximating Matrix Multiplication
- A note on generalized aberration in factorial designs
- Some properties of incomplete U-statistics
- Local case-control sampling: efficient subsampling in imbalanced data sets
- On the sequential construction of optimum bounded designs
- Variable Selection for Gaussian Process Models using Experimental Design-Based Subagging
- Monge-Kantorovich depth, quantiles, ranks and signs
- Some results on the convergence of conditional distributions
- Title not available (Why is that?)
- Information-Based Optimal Subdata Selection for Big Data Linear Regression
- Optimal subsampling for large sample logistic regression
- More efficient estimation for logistic regression with optimal subsamples
- Empirical likelihood confidence intervals for complex sampling designs
- Distributed subdata selection for big data via sampling-based approach
- LSRN: A parallel iterative solver for strongly over- or underdetermined systems
- On greedy heuristics for computing D-efficient saturated subsets
- Title not available (Why is that?)
- Admissibility and minimaxity of the uniform design measure in nonparametric regression model
- Extensible Grids: Uniform Sampling on a Space Filling Curve
- Representative points for location-biased datasets
- A general theory for orthogonal array based Latin hypercube sampling
- On the connection between maximin distance designs and orthogonal designs
- Optimal Distributed Subsampling for Maximum Quasi-Likelihood Estimators With Massive Data
- Support points
- Optimal subsampling for large-scale quantile regression
- Local uncertainty sampling for large-scale multiclass logistic regression
- Information-based optimal subdata selection for big data logistic regression
- On computationally tractable selection of experiments in measurement-constrained regression models
- Model-robust subdata selection for big data
- Reverse iterative volume sampling for linear regression
- LowCon: A Design-based Subsampling Approach in a Misspecified Linear Model
- Optimal subsampling for softmax regression
- Optimal subsampling algorithms for big data regressions
- Optimal subsampling for quantile regression in big data
- Optimal subsampling for linear quantile regression models
- Subdata selection algorithm for linear model discrimination
- Optimal Sampling for Generalized Linear Models Under Measurement Constraints
- More efficient approximation of smoothing splines via space-filling basis selection
- Subdata selection based on orthogonal array for big data
- Most likely optimal subsampled Markov chain Monte Carlo
- Smoothing Splines Approximation Using Hilbert Curve Basis Selection
- Optimal subsampling for large‐sample quantile regression with massive data
- FM-criterion for representative points
- Large-Scale Datastreams Surveillance via Pattern-Oriented-Sampling
- An Optimal Transport Approach for Selecting a Representative Subsample with Application in Efficient Kernel Density Estimation
- Subsampling and Jackknifing: A Practically Convenient Solution for Large Data Analysis With Limited Computational Resources
- Sampling-based estimation for massive survival data with additive hazards model
- Feature Screening for Massive Data Analysis by Subsampling
- Deterministic Sampling of Expensive Posteriors Using Minimum Energy Designs
Cited In (3)
This page was built for publication: A review on design inspired subsampling for big data
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6549149)