A case study competition among methods for analyzing large spatial data
From MaRDI portal
Publication:2272997
Abstract: The Gaussian process is an indispensable tool for spatial data analysts. The onset of the "big data" era, however, has lead to the traditional Gaussian process being computationally infeasible for modern spatial data. As such, various alternatives to the full Gaussian process that are more amenable to handling big spatial data have been proposed. These modern methods often exploit low rank structures and/or multi-core and multi-threaded computing environments to facilitate computation. This study provides, first, an introductory overview of several methods for analyzing large spatial data. Second, this study describes the results of a predictive competition among the described methods as implemented by different groups with strong expertise in the methodology. Specifically, each research group was provided with two training datasets (one simulated and one observed) along with a set of prediction locations. Each group then wrote their own implementation of their method to produce predictions at the given location and each which was subsequently run on a common computing environment. The methods were then compared in terms of various predictive diagnostics. Supplementary materials regarding implementation details of the methods and code are available for this article online.
Recommendations
- Smoothed full-scale approximation of Gaussian process models for computation of large spatial data sets
- Modified linear projection for large spatial datasets
- Efficient Gaussian process regression for large datasets
- A Full Scale Approximation of Covariance Functions for Large Spatial Data Sets
- Kryging: geostatistical analysis of large-scale datasets using Krylov subspace methods
Cites work
- scientific article; zbMATH DE number 5849508 (Why is no real title available?)
- scientific article; zbMATH DE number 823069 (Why is no real title available?)
- A Full Scale Approximation of Covariance Functions for Large Spatial Data Sets
- A Resampling-Based Stochastic Approximation Method for Analysis of Large Geostatistical Data
- A class of multi-resolution approximations for large spatial datasets
- A comparison of spatial predictors when datasets could be very large
- An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach
- Analyzing Nonstationary Spatial Data Using Piecewise Gaussian Processes
- Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations (with discussion)
- Approximate Likelihood for Large Irregularly Spaced Spatial Data
- Approximating Likelihoods for Large Spatial Data Sets
- Asymptotic properties of multivariate tapering for estimation and prediction
- Bayesian Detection of Clusters and Discontinuities in Disease Maps
- Bayesian Inference for the Spatial Random Effects Model
- Conditionally linear models for non-homogeneous spatial random fields
- Covariance approximation for large multivariate spatial data sets with an application to multiple climate model errors
- Covariance tapering for likelihood-based estimation in large spatial data sets
- Covariance tapering for prediction of large spatial data sets in transformed random fields
- Edge effects and efficient parameter estimation for stationary random fields
- Efficient Algorithms for Bayesian Nearest Neighbor Gaussian Processes
- Estimation and prediction using generalized Wendland covariance functions under fixed domain asymptotics
- Fixed Rank Kriging for Very Large Spatial Data Sets
- Fixed-domain asymptotic properties of tapered maximum likelihood estimators
- Gaussian Predictive Process Models for Large Spatial Data Sets
- Geometric median and robust estimation in Banach spaces
- Hierarchical modeling and analysis for spatial data
- Improving the performance of predictive process modeling for large datasets
- Interpolation of spatial data. Some theory for kriging
- Likelihood approximation with hierarchical matrices for large spatial datasets
- Massively parallel approximate Gaussian process regression
- Multi-Resolution Filters for Massive Spatio-Temporal Data
- Nonseparable dynamic nearest neighbor Gaussian process models for large spatio-temporal data with an application to particulate matter analysis
- ON STATIONARY PROCESSES IN THE PLANE
- On fixed-domain asymptotics and covariance tapering in Gaussian random field models
- Parallel inference for massive distributed spatial data using low-rank models
- Parameter estimation for a stationary process on a d-dimensional lattice
- Robust and scalable Bayes via a median of subset posterior measures
- Space and space-time modeling using process convolutions
- Spatial factor models for high-dimensional and large spatial data: an application in forest variable mapping
- Spatio-temporal smoothing and EM estimation for massive remote-sensing data sets
- Spectral density estimation for random fields via periodic embeddings
- Statistical Methods for Spatial Data Analysis
- Statistical analysis of small-area data based on independence, spatial, non-hierarchical, and hierarchical models
- Statistics for spatial data
- Statistics for spatio-temporal data
- Stochastic approximation of score functions for Gaussian processes
- Strictly Proper Scoring Rules, Prediction, and Estimation
Cited in
(only showing first 100 items - show all)- Making Recursive Bayesian Inference Accessible
- Response envelopes for linear coregionalization models
- Mapping interstellar dust with Gaussian processes
- An Approach to Incorporate Subsampling Into a Generic Bayesian Hierarchical Model
- Multivariate transformed Gaussian processes
- Improving the performance of predictive process modeling for large datasets
- Practical Bayesian modeling and inference for massive spatial data sets on modest computing environments†
- Competition on spatial statistics for large datasets
- Discussion on competition on spatial statistics for large datasets
- Scalable high-resolution forecasting of sparse spatiotemporal events with kernel methods: a winning solution to the NIJ ``Real-time crime forecasting challenge
- Fast covariance parameter estimation of spatial Gaussian process models using neural networks
- Kryging: geostatistical analysis of large-scale datasets using Krylov subspace methods
- Unifying compactly supported and Matérn covariance functions in spatial statistics
- Covariance–Based Rational Approximations of Fractional SPDEs for Computationally Efficient Bayesian Inference
- Discussion on competition for spatial statistics for large datasets
- Discussion on competition on spatial statistics for large datasets
- On modeling positive continuous data with spatiotemporal dependence
- Correlation-based sparse inverse Cholesky factorization for fast Gaussian-process inference
- Bayesian Spatial Binary Regression for Label Fusion in Structural Neuroimaging
- Bayesian nonstationary and nonparametric covariance estimation for large spatial data (with discussion)
- Gaussian orthogonal latent factor processes for large incomplete matrices of correlated data
- Actual error rates in linear discrimination of spatial Gaussian data in terms of semivariograms
- VPint: value propagation-based spatial interpolation
- 30 years of space-time covariance functions
- Reds: random ensemble deep spatial prediction
- A multi-resolution approximation via linear projection for large spatial datasets
- Vecchia-Laplace approximations of generalized Gaussian processes for big non-Gaussian spatial data
- Comparison of deep neural networks and deep hierarchical models for spatio-temporal data
- Integrating machine learning and Bayesian nonparametrics for flexible modeling of point pattern data
- Nearest neighbors weighted composite likelihood based on pairs for (non-)Gaussian massive spatial data with an application to Tukey-\(hh\) random fields estimation
- Non-Gaussian covariate-dependent spatial measurement error model for analyzing big spatial data
- Multi-scale Vecchia approximations of Gaussian processes
- Bayesian finite-population inference with spatially correlated measurements
- Spatio-temporal modeling of global ozone data using convolution
- Probabilistic forecasts of arctic sea ice thickness
- Bayesian hierarchical modeling and analysis for actigraph data from wearable devices
- Random Forests for Spatially Dependent Data
- Estimating high-resolution red sea surface temperature hotspots, using a low-rank semiparametric spatial model
- Bayesian Modeling with Spatial Curvature Processes
- Distributed Inference for Spatial Extremes Modeling in High Dimensions
- Fixed-Domain Posterior Contraction Rates for Spatial Gaussian Process Model with Nugget
- Valid Model-Free Spatial Prediction
- Properties and comparison of some kriging sub-model aggregation methods
- Scaled Vecchia approximation for fast computer-model emulation
- Modeling spatial data using local likelihood estimation and a Matérn to spatial autoregressive translation
- Linear-Cost Covariance Functions for Gaussian Random Fields
- Spatial factor modeling: A Bayesian matrix‐normal approach for misaligned data
- Bayesian modeling of discrete-time point-referenced spatio-temporal data
- Hierarchical sparse Cholesky decomposition with applications to high-dimensional spatio-temporal filtering
- Accounting for survey design in Bayesian disaggregation of survey-based areal estimates of proportions: an application to the American Community Survey
- The Matérn model: a journey through statistics, numerical analysis and machine learning
- Nearest-neighbor sparse Cholesky matrices in spatial statistics
- Assessing the effective sample size for large spatial datasets: a block likelihood approach
- Comparing emulation methods for a high-resolution storm surge model
- Joint modeling of longitudinal relational data and exogenous variables
- Scalable penalized spatiotemporal land-use regression for ground-level nitrogen dioxide
- A flexible Bayesian framework to estimate age- and cause-specific child mortality over time from sample registration data
- Extending the generalized Wendland covariance model
- Direct Bayesian linear regression for distribution-valued covariates
- Distributed Bayesian inference in massive spatial data
- Finite element representations of Gaussian processes: balancing numerical and statistical accuracy
- Spatial bootstrapped microeconometrics: Forecasting for out‐of‐sample geo‐locations in big data
- Bayesian latent variable co-kriging model in remote sensing for quality flagged observations
- Spatial 3D Matérn priors for fast whole-brain fMRI analysis
- Bayesian multiresolution modeling of georeferenced data: an extension of `LatticeKrig'
- Prediction and model evaluation for space–time data
- Spatiotemporal lagged models for variable rate irrigation in agriculture
- Efficient Construction of an HSS Preconditioner for Symmetric Positive Definite $\mathcal{H}^2$ Matrices
- Large-scale inference of correlation among mixed-type biological traits with phylogenetic multivariate probit models
- DeepKriging: Spatially Dependent Deep Neural Networks for Spatial Prediction
- Bayesian fixed-domain asymptotics for covariance parameters in a Gaussian process model
- Modified linear projection for large spatial datasets
- Spatial data compression via adaptive dispersion clustering
- scientific article; zbMATH DE number 7625170 (Why is no real title available?)
- Conjugate Bayesian Regression Models for Massive Geostatistical Data Sets
- Stochastic PDE representation of random fields for large-scale Gaussian process regression and statistical finite element analysis
- Fixed-domain asymptotics under Vecchia's approximation of spatial process likelihoods
- Multilevel approximation of Gaussian random fields: covariance compression, estimation, and spatial prediction
- Bayesian inference for finite populations under spatial process settings
- A high-resolution bilevel skew-\(t\) stochastic generator for assessing Saudi Arabia's wind energy resources
- High-dimensional multivariate geostatistics: a Bayesian matrix-normal approach
- Conjugate sparse plus low rank models for efficient Bayesian interpolation of large spatial data
- Long memory conditional random fields on regular lattices
- Multivariate nearest-neighbors Gaussian processes with random covariance matrices
- The scope of the Kalman filter for spatio-temporal applications in environmental science
- Spatio-temporal downscaling emulator for regional climate models
- PICAR: An Efficient Extendable Approach for Fitting Hierarchical Spatial Models
- Airflow recovery from thoracic and abdominal movements using synchrosqueezing transform and locally stationary Gaussian process regression
- Distributed nearest-neighbor Gaussian processes
- Guest editors' introduction to the special issue on ``Climate and the Earth system
- Highly Scalable Bayesian Geostatistical Modeling via Meshed Gaussian Processes on Partitioned Domains
- Bayesian inference for high-dimensional nonstationary Gaussian processes
- A Bayesian optimization approach to find Nash equilibria
- Geostatistical modeling of positive‐definite matrices: An application to diffusion tensor imaging
- A hierarchical multivariate spatio-temporal model for clustered climate data with annual cycles
- Accounting for Location Measurement Error in Imaging Data With Application to Atomic Resolution Images of Crystalline Materials
- Physically constrained spatiotemporal modeling: generating clear-sky constructions of land surface temperature from sparse, remotely sensed satellite data
- Max-and-smooth: a two-step approach for approximate Bayesian inference in latent Gaussian models
- Non-stationary multi-layered Gaussian priors for Bayesian inversion
- Deep Latent Factor Model for Spatio-Temporal Forecasting
This page was built for publication: A case study competition among methods for analyzing large spatial data
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2272997)