Literature DB >> 18443645

Computational Techniques for Spatial Logistic Regression with Large Datasets.

Christopher J Paciorek1.   

Abstract

In epidemiological research, outcomes are frequently non-normal, sample sizes may be large, and effect sizes are often small. To relate health outcomes to geographic risk factors, fast and powerful methods for fitting spatial models, particularly for non-normal data, are required. I focus on binary outcomes, with the risk surface a smooth function of space, but the development herein is relevant for non-normal data in general. I compare penalized likelihood models, including the penalized quasi-likelihood (PQL) approach, and Bayesian models based on fit, speed, and ease of implementation.A Bayesian model using a spectral basis representation of the spatial surface via the Fourier basis provides the best tradeoff of sensitivity and specificity in simulations, detecting real spatial features while limiting overfitting and being reasonably computationally efficient. One of the contributions of this work is further development of this underused representation. The spectral basis model outperforms the penalized likelihood methods, which are prone to overfitting, but is slower to fit and not as easily implemented. A Bayesian Markov random field model performs less well statistically than the spectral basis model, but is very computationally efficient. We illustrate the methods on a real dataset of cancer cases in Taiwan.The success of the spectral basis with binary data and similar results with count data suggest that it may be generally useful in spatial models and more complicated hierarchical models.

Entities:  

Year:  2007        PMID: 18443645      PMCID: PMC2350194          DOI: 10.1016/j.csda.2006.11.008

Source DB:  PubMed          Journal:  Comput Stat Data Anal        ISSN: 0167-9473            Impact factor:   1.681


  8 in total

1.  Bayesian prediction of spatial count data using generalized linear mixed models.

Authors:  Ole F Christensen; Rasmus Waagepetersen
Journal:  Biometrics       Date:  2002-06       Impact factor: 2.571

2.  Modeling spatial survival data using semiparametric frailty models.

Authors:  Yi Li; Louise Ryan
Journal:  Biometrics       Date:  2002-06       Impact factor: 2.571

3.  On the analysis of mortality events associated with a prespecified fixed point.

Authors:  A B Lawson
Journal:  J R Stat Soc Ser A Stat Soc       Date:  1993       Impact factor: 2.483

4.  Divergent biases in ecologic and individual-level studies.

Authors:  S Greenland
Journal:  Stat Med       Date:  1992-06-30       Impact factor: 2.373

5.  Regression modeling in back-propagation and projection pursuit learning.

Authors:  J N Hwang; S R Lay; M Maechler; R D Martin; J Schimert
Journal:  IEEE Trans Neural Netw       Date:  1994

6.  Random effects probit and logistic regression models for three-level data.

Authors:  R D Gibbons; D Hedeker
Journal:  Biometrics       Date:  1997-12       Impact factor: 2.571

7.  A random-effects ordinal regression model for multilevel analysis.

Authors:  D Hedeker; R D Gibbons
Journal:  Biometrics       Date:  1994-12       Impact factor: 2.571

8.  Excess cancer mortality among children and adolescents in residential districts polluted by petrochemical manufacturing plants in Taiwan.

Authors:  B J Pan; Y J Hong; G C Chang; M T Wang; F F Cinkotai; Y C Ko
Journal:  J Toxicol Environ Health       Date:  1994-09
  8 in total
  16 in total

1.  Bayesian Smoothing with Gaussian Processes Using Fourier Basis Functions in the spectralGP Package.

Authors:  Christopher J Paciorek
Journal:  J Stat Softw       Date:  2007-04       Impact factor: 6.440

2.  Spatial Modelling Using a New Class of Nonstationary Covariance Functions.

Authors:  Christopher J Paciorek; Mark J Schervish
Journal:  Environmetrics       Date:  2006       Impact factor: 1.900

3.  Hierarchical Multiresolution Approaches for Dense Point-Level Breast Cancer Treatment Data.

Authors:  Shengde Liang; Sudipto Banerjee; Sally Bushhouse; Andrew Finley; Bradley P Carlin
Journal:  Comput Stat Data Anal       Date:  2008-01-20       Impact factor: 1.681

4.  Measurement error caused by spatial misalignment in environmental epidemiology.

Authors:  Alexandros Gryparis; Christopher J Paciorek; Ariana Zeka; Joel Schwartz; Brent A Coull
Journal:  Biostatistics       Date:  2008-10-16       Impact factor: 5.899

5.  Bayesian Modeling for Large Spatial Datasets.

Authors:  Sudipto Banerjee; Montserrat Fuentes
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2012-01

6.  HIERARCHICAL SPATIAL MODELS FOR PREDICTING TREE SPECIES ASSEMBLAGES ACROSS LARGE DOMAINS.

Authors:  Andrew O Finley; Sudipto Banerjee; Ronald E McRoberts
Journal:  Ann Appl Stat       Date:  2009-09-01       Impact factor: 2.083

7.  High Resolution Space-Time Ozone Modeling for Assessing Trends.

Authors:  Sujit K Sahu; Alan E Gelfand; David M Holland
Journal:  J Am Stat Assoc       Date:  2007       Impact factor: 5.033

8.  Hierarchical factor models for large spatially misaligned data: a low-rank predictive process approach.

Authors:  Qian Ren; Sudipto Banerjee
Journal:  Biometrics       Date:  2013-02-04       Impact factor: 2.571

9.  Gaussian predictive process models for large spatial data sets.

Authors:  Sudipto Banerjee; Alan E Gelfand; Andrew O Finley; Huiyan Sang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-09-01       Impact factor: 4.488

10.  Bayesian variable selection for multivariate spatially varying coefficient regression.

Authors:  Brian J Reich; Montserrat Fuentes; Amy H Herring; Kelly R Evenson
Journal:  Biometrics       Date:  2010-09       Impact factor: 2.571

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.