Massive parallelization of serial inference algorithms for a complex generalized linear model
From MaRDI portal
Publication:4635216
Abstract: Following a series of high-profile drug safety disasters in recent years, many countries are redoubling their efforts to ensure the safety of licensed medical products. Large-scale observational databases such as claims databases or electronic health record systems are attracting particular attention in this regard, but present significant methodological and computational concerns. In this paper we show how high-performance statistical computation, including graphics processing units, relatively inexpensive highly parallel computing devices, can enable complex methods in large databases. We focus on optimization and massive parallelization of cyclic coordinate descent approaches to fit a conditioned generalized linear model involving tens of millions of observations and thousands of predictors in a Bayesian context. We find orders-of-magnitude improvement in overall run-time. Coordinate descent approaches are ubiquitous in high-dimensional statistics and the algorithms we propose open up exciting new methodological possibilities with the potential to significantly improve drug safety.
Recommendations
- Parallel statistical computing for statistical inference
- Applications of Parallel Computation to Statistical Inference
- Graphics processing units and high-dimensional optimization
- GPU-accelerated Gibbs sampling: a case study of the horseshoe probit model
- Parallelized integrated nested Laplace approximations for fast Bayesian inference
Cited in
(11)- Generalized linear models for massive data via doubly-sketching
- Massive Parallelization of Massive Sample-Size Survival Analysis
- A scalable surrogate L₀ sparse regression method for generalized linear models with applications to large scale data
- scientific article; zbMATH DE number 5124739 (Why is no real title available?)
- Parallel restricted maximum likelihood estimation for linear models with a dense exogenous matrix
- Parallel partial Gaussian process emulation for computer models with massive output
- A parallel solver for generalised additive models
- A surrogate \(\ell_0\) sparse Cox's regression with applications to sparse high-dimensional massive sample size time-to-event data
- Hierarchical models for multiple, rare outcomes using massive observational healthcare databases
- Scalable Algorithms for Large Competing Risks Data
- High-performance statistical computing in the computing environments of the 2020s
This page was built for publication: Massive parallelization of serial inference algorithms for a complex generalized linear model
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4635216)