Optimal Subsampling for Large Sample Logistic Regression
From MaRDI portal
Publication:4962448
DOI10.1080/01621459.2017.1292914zbMath1398.62196arXiv1702.01166OpenAlexW3098603383WikidataQ90762625 ScholiaQ90762625MaRDI QIDQ4962448
Rong Zhu, Ping Ma, Hai Ying Wang
Publication date: 2 November 2018
Published in: Journal of the American Statistical Association (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1702.01166
Asymptotic properties of parametric estimators (62F12) Optimal statistical designs (62K05) Generalized linear models (logistic models) (62J12)
Related Items (66)
Optimal Distributed Subsampling for Maximum Quasi-Likelihood Estimators With Massive Data ⋮ Robust active learning with binary responses ⋮ Communication-efficient distributed estimator for generalized linear models with a diverging number of covariates ⋮ Distributed subdata selection for big data via sampling-based approach ⋮ Optimal subsample selection for massive logistic regression with distributed data ⋮ Score-matching representative approach for big data analysis with generalized linear models ⋮ Functional principal subspace sampling for large scale functional data analysis ⋮ A two-stage optimal subsampling estimation for missing data problems with large-scale data ⋮ Randomized Spectral Clustering in Large-Scale Stochastic Block Models ⋮ Inversion-free subsampling Newton's method for large sample logistic regression ⋮ Optimal Sampling for Generalized Linear Models Under Measurement Constraints ⋮ LowCon: A Design-based Subsampling Approach in a Misspecified Linear Model ⋮ Least-Square Approximation for a Distributed System ⋮ Logistic Regression Models for Aggregated Data ⋮ Unnamed Item ⋮ Statistical inference in massive datasets by empirical likelihood ⋮ Optimal subsampling for composite quantile regression model in massive data ⋮ Online updating of information based model selection in the big data setting ⋮ Surface temperature monitoring in liver procurement via functional variance change-point analysis ⋮ Model Checking in Large-Scale Dataset via Structure-Adaptive-Sampling ⋮ Optimal subsampling for large‐sample quantile regression with massive data ⋮ Fast Calibration for Computer Models with Massive Physical Observations ⋮ Information-based optimal subdata selection for big data logistic regression ⋮ Optimal subsampling for multiplicative regression with massive data ⋮ Online updating method to correct for measurement error in big data streams ⋮ Subsampling spectral clustering for stochastic block models in large-scale networks ⋮ Information-based optimal subdata selection for non-linear models ⋮ A model robust subsampling approach for generalised linear models in big data settings ⋮ Subsampling and Jackknifing: A Practically Convenient Solution for Large Data Analysis With Limited Computational Resources ⋮ Sketched approximation of regularized canonical correlation analysis ⋮ Optimal sampling algorithms for block matrix multiplication ⋮ Model constraints independent optimal subsampling probabilities for softmax regression ⋮ Optimal subsampling for softmax regression ⋮ Applications of robust methods in spatial analysis ⋮ Subdata selection based on orthogonal array for big data ⋮ Generalized linear models for massive data via doubly-sketching ⋮ Three-way sampling for rapid attribute reduction ⋮ Distributed smoothed rank regression with heterogeneous errors for massive data ⋮ A block-randomized stochastic method with importance sampling for CP tensor decomposition ⋮ Optimal subsampling algorithms for composite quantile regression in massive data ⋮ Optimal sampling designs for multidimensional streaming time series with application to power grid sensor data ⋮ Estimating promotion effects in email marketing using a large-scale cross-classified Bayesian joint model for nested imbalanced data ⋮ Optimal decorrelated score subsampling for generalized linear models with massive data ⋮ Communication-efficient surrogate quantile regression for non-randomly distributed system ⋮ Conditional characteristic feature screening for massive imbalanced data ⋮ Subsampling in longitudinal models ⋮ Frugal Gaussian clustering of huge imbalanced datasets through a bin-marginal approach ⋮ Unnamed Item ⋮ Analyzing Big EHR Data—Optimal Cox Regression Subsampling Procedure with Rare Events ⋮ Optimal subsampling for functional quantile regression ⋮ LIC criterion for optimal subset selection in distributed interval estimation ⋮ Testing multivariate quantile by empirical likelihood ⋮ Crawling subsampling for multivariate spatial autoregression model in large-scale networks ⋮ Randomized sketches for kernel CCA ⋮ Learning nonlocal constitutive models with neural networks ⋮ A quasi-Monte Carlo data compression algorithm for machine learning ⋮ Optimal subsampling for large-scale quantile regression ⋮ Multiplicative perturbation bounds for multivariate multiple linear regression in Schatten \(p\)-norms ⋮ Surprise sampling: improving and extending the local case-control sampling ⋮ Bayesian estimation under informative sampling with unattenuated dependence ⋮ Divide-and-conquer information-based optimal subdata selection algorithm ⋮ Parallel-and-stream accelerator for computationally fast supervised learning ⋮ Optimal subsampling for composite quantile regression in big data ⋮ Optimal subsampling for least absolute relative error estimators with massive data ⋮ Model-free global likelihood subsampling for massive data ⋮ Subdata selection algorithm for linear model discrimination
Uses Software
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- CUR matrix decompositions for improved data analysis
- Local case-control sampling: efficient subsampling in imbalanced data sets
- Faster least squares approximation
- Bootstrap methods: another look at the jackknife
- A fast randomized algorithm for overdetermined linear least-squares regression
- Low-Rank Approximation and Regression in Input Sparsity Time
- Sampling algorithms for l2 regression and applications
- Applied Logistic Regression
- Sub-Gaussian random variables
This page was built for publication: Optimal Subsampling for Large Sample Logistic Regression