Optimal Subsampling for Large Sample Logistic Regression

From MaRDI portal

Publication:4962448

Jump to:navigation, search

DOI10.1080/01621459.2017.1292914zbMath1398.62196arXiv1702.01166OpenAlexW3098603383WikidataQ90762625 ScholiaQ90762625MaRDI QIDQ4962448

Rong Zhu, Ping Ma, Hai Ying Wang

Publication date: 2 November 2018

Published in: Journal of the American Statistical Association (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1702.01166

zbMATH Keywords

logistic regression massive data rare event \(A\)-optimality optimal subsampling

Mathematics Subject Classification ID

Asymptotic properties of parametric estimators (62F12) Optimal statistical designs (62K05) Generalized linear models (logistic models) (62J12)

Related Items

Optimal Distributed Subsampling for Maximum Quasi-Likelihood Estimators With Massive Data, Robust active learning with binary responses, Communication-efficient distributed estimator for generalized linear models with a diverging number of covariates, Distributed subdata selection for big data via sampling-based approach, Optimal subsample selection for massive logistic regression with distributed data, Score-matching representative approach for big data analysis with generalized linear models, Functional principal subspace sampling for large scale functional data analysis, A two-stage optimal subsampling estimation for missing data problems with large-scale data, Randomized Spectral Clustering in Large-Scale Stochastic Block Models, Inversion-free subsampling Newton's method for large sample logistic regression, Optimal Sampling for Generalized Linear Models Under Measurement Constraints, LowCon: A Design-based Subsampling Approach in a Misspecified Linear Model, Least-Square Approximation for a Distributed System, Logistic Regression Models for Aggregated Data, Unnamed Item, Statistical inference in massive datasets by empirical likelihood, Optimal subsampling for composite quantile regression model in massive data, Online updating of information based model selection in the big data setting, Surface temperature monitoring in liver procurement via functional variance change-point analysis, Model Checking in Large-Scale Dataset via Structure-Adaptive-Sampling, Optimal subsampling for large‐sample quantile regression with massive data, Fast Calibration for Computer Models with Massive Physical Observations, Information-based optimal subdata selection for big data logistic regression, Optimal subsampling for multiplicative regression with massive data, Online updating method to correct for measurement error in big data streams, Subsampling spectral clustering for stochastic block models in large-scale networks, Information-based optimal subdata selection for non-linear models, A model robust subsampling approach for generalised linear models in big data settings, Subsampling and Jackknifing: A Practically Convenient Solution for Large Data Analysis With Limited Computational Resources, Sketched approximation of regularized canonical correlation analysis, Optimal sampling algorithms for block matrix multiplication, Model constraints independent optimal subsampling probabilities for softmax regression, Optimal subsampling for softmax regression, Applications of robust methods in spatial analysis, Subdata selection based on orthogonal array for big data, Generalized linear models for massive data via doubly-sketching, Three-way sampling for rapid attribute reduction, Distributed smoothed rank regression with heterogeneous errors for massive data, A block-randomized stochastic method with importance sampling for CP tensor decomposition, Optimal subsampling algorithms for composite quantile regression in massive data, Optimal sampling designs for multidimensional streaming time series with application to power grid sensor data, Estimating promotion effects in email marketing using a large-scale cross-classified Bayesian joint model for nested imbalanced data, Optimal decorrelated score subsampling for generalized linear models with massive data, Communication-efficient surrogate quantile regression for non-randomly distributed system, Conditional characteristic feature screening for massive imbalanced data, Subsampling in longitudinal models, Frugal Gaussian clustering of huge imbalanced datasets through a bin-marginal approach, Unnamed Item, Analyzing Big EHR Data—Optimal Cox Regression Subsampling Procedure with Rare Events, Optimal subsampling for functional quantile regression, LIC criterion for optimal subset selection in distributed interval estimation, Testing multivariate quantile by empirical likelihood, Crawling subsampling for multivariate spatial autoregression model in large-scale networks, Randomized sketches for kernel CCA, Learning nonlocal constitutive models with neural networks, A quasi-Monte Carlo data compression algorithm for machine learning, Optimal subsampling for large-scale quantile regression, Multiplicative perturbation bounds for multivariate multiple linear regression in Schatten \(p\)-norms, Surprise sampling: improving and extending the local case-control sampling, Bayesian estimation under informative sampling with unattenuated dependence, Divide-and-conquer information-based optimal subdata selection algorithm, Parallel-and-stream accelerator for computationally fast supervised learning, Optimal subsampling for composite quantile regression in big data, Optimal subsampling for least absolute relative error estimators with massive data, Model-free global likelihood subsampling for massive data, Subdata selection algorithm for linear model discrimination

Uses Software

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4962448&oldid=19391485"