Literature DB >> 35935897

Highly Scalable Bayesian Geostatistical Modeling via Meshed Gaussian Processes on Partitioned Domains.

Michele Peruzzi1,2, Sudipto Banerjee3, Andrew O Finley1.   

Abstract

We introduce a class of scalable Bayesian hierarchical models for the analysis of massive geostatistical datasets. The underlying idea combines ideas on high-dimensional geostatistics by partitioning the spatial domain and modeling the regions in the partition using a sparsity-inducing directed acyclic graph (DAG). We extend the model over the DAG to a well-defined spatial process, which we call the Meshed Gaussian Process (MGP). A major contribution is the development of a MGPs on tessellated domains, accompanied by a Gibbs sampler for the efficient recovery of spatial random effects. In particular, the cubic MGP (Q-MGP) can harness high-performance computing resources by executing all large-scale operations in parallel within the Gibbs sampler, improving mixing and computing time compared to sequential updating schemes. Unlike some existing models for large spatial data, a Q-MGP facilitates massive caching of expensive matrix operations, making it particularly apt in dealing with spatiotemporal remote-sensing data. We compare Q-MGPs with large synthetic and real world data against state-of-the-art methods. We also illustrate using Normalized Difference Vegetation Index (NDVI) data from the Serengeti park region to recover latent multivariate spatiotemporal random effects at millions of locations. The source code is available at github.com/mkln/meshgp.

Entities:  

Keywords:  Bayesian; domain partitioning; graphical models; large n; sparsity; spatial

Year:  2020        PMID: 35935897      PMCID: PMC9354857          DOI: 10.1080/01621459.2020.1833889

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   4.369


  12 in total

1.  Adaptive Gaussian Predictive Process Models for Large Spatial Datasets.

Authors:  Rajarshi Guhaniyogi; Andrew O Finley; Sudipto Banerjee; Alan E Gelfand
Journal:  Environmetrics       Date:  2011-12       Impact factor: 1.900

2.  NONSEPARABLE DYNAMIC NEAREST NEIGHBOR GAUSSIAN PROCESS MODELS FOR LARGE SPATIO-TEMPORAL DATA WITH AN APPLICATION TO PARTICULATE MATTER ANALYSIS.

Authors:  Abhirup Datta; Sudipto Banerjee; Andrew O Finley; Nicholas A S Hamm; Martijn Schaap
Journal:  Ann Appl Stat       Date:  2016-09-28       Impact factor: 2.083

3.  Hierarchical Spatial Process Models for Multiple Traits in Large Genetic Trials.

Authors:  Sudipto Banerjee; Andrew O Finley; Patrik Waldmann; Tore Ericsson
Journal:  J Am Stat Assoc       Date:  2010-06-01       Impact factor: 5.033

4.  Spatial Factor Models for High-Dimensional and Large Spatial Data: An Application in Forest Variable Mapping.

Authors:  Daniel Taylor-Rodriguez; Andrew O Finley; Abhirup Datta; Chad Babcock; Hans-Erik Andersen; Bruce D Cook; Douglas C Morton; Sudipto Banerjee
Journal:  Stat Sin       Date:  2019       Impact factor: 1.261

5.  High-Dimensional Bayesian Geostatistics.

Authors:  Sudipto Banerjee
Journal:  Bayesian Anal       Date:  2017-05-16       Impact factor: 3.728

6.  Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets.

Authors:  Abhirup Datta; Sudipto Banerjee; Andrew O Finley; Alan E Gelfand
Journal:  J Am Stat Assoc       Date:  2016-08-18       Impact factor: 5.033

7.  Efficient algorithms for Bayesian Nearest Neighbor Gaussian Processes.

Authors:  Andrew O Finley; Abhirup Datta; Bruce C Cook; Douglas C Morton; Hans E Andersen; Sudipto Banerjee
Journal:  J Comput Graph Stat       Date:  2019-04-01       Impact factor: 2.302

8.  Hierarchical factor models for large spatially misaligned data: a low-rank predictive process approach.

Authors:  Qian Ren; Sudipto Banerjee
Journal:  Biometrics       Date:  2013-02-04       Impact factor: 2.571

9.  Gaussian predictive process models for large spatial data sets.

Authors:  Sudipto Banerjee; Alan E Gelfand; Andrew O Finley; Huiyan Sang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-09-01       Impact factor: 4.488

10.  A Case Study Competition Among Methods for Analyzing Large Spatial Data.

Authors:  Matthew J Heaton; Abhirup Datta; Andrew O Finley; Reinhard Furrer; Joseph Guinness; Rajarshi Guhaniyogi; Florian Gerber; Robert B Gramacy; Dorit Hammerling; Matthias Katzfuss; Finn Lindgren; Douglas W Nychka; Furong Sun; Andrew Zammit-Mangion
Journal:  J Agric Biol Environ Stat       Date:  2018-12-14       Impact factor: 1.524

View more
  1 in total

1.  Spatial Multivariate Trees for Big Data Bayesian Regression.

Authors:  Michele Peruzzi; David B Dunson
Journal:  J Mach Learn Res       Date:  2022       Impact factor: 5.177

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.