A case study competition among methods for analyzing large spatial data
From MaRDI portal
Publication:2272997
Abstract: The Gaussian process is an indispensable tool for spatial data analysts. The onset of the "big data" era, however, has led to the traditional Gaussian process being computationally infeasible for modern spatial data. As such, various alternatives to the full Gaussian process that are more amenable to handling big spatial data have been proposed. These modern methods often exploit low-rank structures and/or multi-core and multi-threaded computing environments to facilitate computation. This study provides, first, an introductory overview of several methods for analyzing large spatial data. Second, this study describes the results of a predictive competition among the described methods as implemented by different groups with strong expertise in the methodology. Specifically, each research group was provided with two training datasets (one simulated and one observed) along with a set of prediction locations. Each group then wrote its own implementation of its method to produce predictions at the given locations, and each implementation was subsequently run on a common computing environment. The methods were then compared in terms of various predictive diagnostics. Supplementary materials regarding implementation details of the methods and code are available for this article online.
Recommendations
- Smoothed full-scale approximation of Gaussian process models for computation of large spatial data sets
- Modified linear projection for large spatial datasets
- Efficient Gaussian process regression for large datasets
- A Full Scale Approximation of Covariance Functions for Large Spatial Data Sets
- Kryging: geostatistical analysis of large-scale datasets using Krylov subspace methods
Cites work
- scientific article; zbMATH DE number 5849508
- scientific article; zbMATH DE number 823069
- A Full Scale Approximation of Covariance Functions for Large Spatial Data Sets
- A Resampling-Based Stochastic Approximation Method for Analysis of Large Geostatistical Data
- A class of multi-resolution approximations for large spatial datasets
- A comparison of spatial predictors when datasets could be very large
- An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach
- Analyzing Nonstationary Spatial Data Using Piecewise Gaussian Processes
- Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations (with discussion)
- Approximate Likelihood for Large Irregularly Spaced Spatial Data
- Approximating Likelihoods for Large Spatial Data Sets
- Asymptotic properties of multivariate tapering for estimation and prediction
- Bayesian Detection of Clusters and Discontinuities in Disease Maps
- Bayesian Inference for the Spatial Random Effects Model
- Conditionally linear models for non-homogeneous spatial random fields
- Covariance approximation for large multivariate spatial data sets with an application to multiple climate model errors
- Covariance tapering for likelihood-based estimation in large spatial data sets
- Covariance tapering for prediction of large spatial data sets in transformed random fields
- Edge effects and efficient parameter estimation for stationary random fields
- Efficient Algorithms for Bayesian Nearest Neighbor Gaussian Processes
- Estimation and prediction using generalized Wendland covariance functions under fixed domain asymptotics
- Fixed Rank Kriging for Very Large Spatial Data Sets
- Fixed-domain asymptotic properties of tapered maximum likelihood estimators
- Gaussian Predictive Process Models for Large Spatial Data Sets
- Geometric median and robust estimation in Banach spaces
- Hierarchical modeling and analysis for spatial data
- Improving the performance of predictive process modeling for large datasets
- Interpolation of spatial data. Some theory for kriging
- Likelihood approximation with hierarchical matrices for large spatial datasets
- Massively parallel approximate Gaussian process regression
- Multi-Resolution Filters for Massive Spatio-Temporal Data
- Nonseparable dynamic nearest neighbor Gaussian process models for large spatio-temporal data with an application to particulate matter analysis
- ON STATIONARY PROCESSES IN THE PLANE
- On fixed-domain asymptotics and covariance tapering in Gaussian random field models
- Parallel inference for massive distributed spatial data using low-rank models
- Parameter estimation for a stationary process on a d-dimensional lattice
- Robust and scalable Bayes via a median of subset posterior measures
- Space and space-time modeling using process convolutions
- Spatial factor models for high-dimensional and large spatial data: an application in forest variable mapping
- Spatio-temporal smoothing and EM estimation for massive remote-sensing data sets
- Spectral density estimation for random fields via periodic embeddings
- Statistical Methods for Spatial Data Analysis
- Statistical analysis of small-area data based on independence, spatial, non-hierarchical, and hierarchical models
- Statistics for spatial data
- Statistics for spatio-temporal data
- Stochastic approximation of score functions for Gaussian processes
- Strictly Proper Scoring Rules, Prediction, and Estimation
Cited in
- Estimating high-resolution red sea surface temperature hotspots, using a low-rank semiparametric spatial model
- VPint: value propagation-based spatial interpolation
- Finite element representations of Gaussian processes: balancing numerical and statistical accuracy
- Unifying compactly supported and Matérn covariance functions in spatial statistics
- New formulation of the logistic-Gaussian process to analyze trajectory tracking data
- Bayesian multiresolution modeling of georeferenced data: an extension of 'LatticeKrig'
- Airflow recovery from thoracic and abdominal movements using synchrosqueezing transform and locally stationary Gaussian process regression
- Spatial regression with non-parametric modeling of Fourier coefficients
- The Rational SPDE Approach for Gaussian Random Fields With General Smoothness
- Actual error rates in linear discrimination of spatial Gaussian data in terms of semivariograms
- Spatial data compression via adaptive dispersion clustering
- Scaled Vecchia approximation for fast computer-model emulation
- Hierarchical sparse Cholesky decomposition with applications to high-dimensional spatio-temporal filtering
- Posterior inference for sparse hierarchical non-stationary models
- Random Forests for Spatially Dependent Data
- Scalable high-resolution forecasting of sparse spatiotemporal events with kernel methods: a winning solution to the NIJ "Real-time crime forecasting challenge"
- Competition on spatial statistics for large datasets
- Discussion on competition on spatial statistics for large datasets
- Probabilistic forecasts of arctic sea ice thickness
- Joint modeling of longitudinal relational data and exogenous variables
- Comparison of deep neural networks and deep hierarchical models for spatio-temporal data
- Spatiotemporal lagged models for variable rate irrigation in agriculture
- Non-stationary multi-layered Gaussian priors for Bayesian inversion
- Understanding the stochastic partial differential equation approach to smoothing
- Multivariate transformed Gaussian processes
- Max-and-smooth: a two-step approach for approximate Bayesian inference in latent Gaussian models
- A multi-resolution approximation via linear projection for large spatial datasets
- Non-Gaussian covariate-dependent spatial measurement error model for analyzing big spatial data
- Discussion on competition for spatial statistics for large datasets
- Discussion on competition on spatial statistics for large datasets
- Scalable penalized spatiotemporal land-use regression for ground-level nitrogen dioxide
- Modified linear projection for large spatial datasets
- Practical Bayesian modeling and inference for massive spatial data sets on modest computing environments
- Response envelopes for linear coregionalization models
- Mapping interstellar dust with Gaussian processes
- A flexible Bayesian framework to estimate age- and cause-specific child mortality over time from sample registration data
- Vecchia-Laplace approximations of generalized Gaussian processes for big non-Gaussian spatial data
- Accounting for survey design in Bayesian disaggregation of survey-based areal estimates of proportions: an application to the American Community Survey
- Towards a complete picture of stationary covariance functions on spheres cross time
- A Bayesian optimization approach to find Nash equilibria
- Vecchia-approximated Deep Gaussian Processes for Computer Experiments
- Diagnostics-driven nonstationary emulators using kernel mixtures
- Bayesian inference for high-dimensional nonstationary Gaussian processes
- Properties and comparison of some kriging sub-model aggregation methods
- A hierarchical multivariate spatio-temporal model for clustered climate data with annual cycles
- Kryging: geostatistical analysis of large-scale datasets using Krylov subspace methods
- Ice model calibration using semicontinuous spatial data
- Improving the performance of predictive process modeling for large datasets
- A memory-free spatial additive mixed modeling for big spatial data
- Assessing the effective sample size for large spatial datasets: a block likelihood approach
- scientific article; zbMATH DE number 7625170
- Making Recursive Bayesian Inference Accessible
- Multi-scale Vecchia approximations of Gaussian processes
- Bayesian finite-population inference with spatially correlated measurements
- The SPDE approach to Matérn fields: graph representations
- Stochastic local interaction model: an alternative to kriging for massive datasets
- Spatial factor modeling: A Bayesian matrix‐normal approach for misaligned data
- Bayesian fixed-domain asymptotics for covariance parameters in a Gaussian process model
- Highly Scalable Bayesian Geostatistical Modeling via Meshed Gaussian Processes on Partitioned Domains
- Fitting Matérn smoothness parameters using automatic differentiation
- Large-scale inference of correlation among mixed-type biological traits with phylogenetic multivariate probit models
- Conjugate Bayesian Regression Models for Massive Geostatistical Data Sets
- Spatio-temporal modeling of global ozone data using convolution
- A general framework for Vecchia approximations of Gaussian processes
- Fixed-domain asymptotics under Vecchia's approximation of spatial process likelihoods
- Bayesian modeling of discrete-time point-referenced spatio-temporal data
- Multilevel approximation of Gaussian random fields: covariance compression, estimation, and spatial prediction
- Bayesian inference for finite populations under spatial process settings
- A high-resolution bilevel skew-\(t\) stochastic generator for assessing Saudi Arabia's wind energy resources
- High-dimensional multivariate geostatistics: a Bayesian matrix-normal approach
- Conjugate sparse plus low rank models for efficient Bayesian interpolation of large spatial data
- Long memory conditional random fields on regular lattices
- Multivariate nearest-neighbors Gaussian processes with random covariance matrices
- The scope of the Kalman filter for spatio-temporal applications in environmental science
- Spatio-temporal downscaling emulator for regional climate models
- PICAR: An Efficient Extendable Approach for Fitting Hierarchical Spatial Models
- An Approach to Incorporate Subsampling Into a Generic Bayesian Hierarchical Model
- Integrating machine learning and Bayesian nonparametrics for flexible modeling of point pattern data
- Nearest neighbors weighted composite likelihood based on pairs for (non-)Gaussian massive spatial data with an application to Tukey-\(hh\) random fields estimation
- DeepKriging: Spatially Dependent Deep Neural Networks for Spatial Prediction
- Bayesian nonparametric generative modeling of large multivariate non-Gaussian spatial fields
- The third competition on spatial statistics for large datasets
- Implementation and analysis of GPU algorithms for Vecchia approximation
- Linear-Cost Covariance Functions for Gaussian Random Fields
- A Statistical Review of Template Model Builder: A Flexible Tool for Spatial Modelling
- Are You All Normal? It Depends!
- Covariance–Based Rational Approximations of Fractional SPDEs for Computationally Efficient Bayesian Inference
- Modeling spatial data using local likelihood estimation and a Matérn to spatial autoregressive translation
- Local scale invariance and robustness of proper scoring rules
- Comparing emulation methods for a high-resolution storm surge model
- On modeling positive continuous data with spatiotemporal dependence
- The Matérn model: a journey through statistics, numerical analysis and machine learning
- Prediction and model evaluation for space–time data
- Modeling Nonstationary and Asymmetric Multivariate Spatial Covariances via Deformations
- Asymptotic analysis of ML-covariance parameter estimators based on covariance approximations
- Partition-Based Nonstationary Covariance Estimation Using the Stochastic Score Approximation
- Guest editors' introduction to the special issue on "Climate and the Earth system"
- Latent multivariate log-gamma models for high-dimensional multitype responses with application to daily fine particulate matter and mortality counts
- Distributed nearest-neighbor Gaussian processes
- Spatial 3D Matérn priors for fast whole-brain fMRI analysis