Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Computational Techniques for Spatial Logistic Regression with Large Datasets.

Literature DB >> 18443645

Computational Techniques for Spatial Logistic Regression with Large Datasets.

Abstract

In epidemiological research, outcomes are frequently non-normal, sample sizes may be large, and effect sizes are often small. To relate health outcomes to geographic risk factors, fast and powerful methods for fitting spatial models, particularly for non-normal data, are required. I focus on binary outcomes, with the risk surface a smooth function of space, but the development herein is relevant for non-normal data in general. I compare penalized likelihood models, including the penalized quasi-likelihood (PQL) approach, and Bayesian models based on fit, speed, and ease of implementation.A Bayesian model using a spectral basis representation of the spatial surface via the Fourier basis provides the best tradeoff of sensitivity and specificity in simulations, detecting real spatial features while limiting overfitting and being reasonably computationally efficient. One of the contributions of this work is further development of this underused representation. The spectral basis model outperforms the penalized likelihood methods, which are prone to overfitting, but is slower to fit and not as easily implemented. A Bayesian Markov random field model performs less well statistically than the spectral basis model, but is very computationally efficient. We illustrate the methods on a real dataset of cancer cases in Taiwan.The success of the spectral basis with binary data and similar results with count data suggest that it may be generally useful in spatial models and more complicated hierarchical models.

Entities: Disease

Year: 2007 PMID： 18443645 PMCID： PMC2350194 DOI： 10.1016/j.csda.2006.11.008

Source DB: PubMed Journal: Comput Stat Data Anal ISSN： 0167-9473 Impact factor: 1.681

8 in total

1. Bayesian prediction of spatial count data using generalized linear mixed models.

Authors: Ole F Christensen; Rasmus Waagepetersen
Journal: Biometrics Date: 2002-06 Impact factor: 2.571

2. Modeling spatial survival data using semiparametric frailty models.

Authors: Yi Li; Louise Ryan
Journal: Biometrics Date: 2002-06 Impact factor: 2.571

3. On the analysis of mortality events associated with a prespecified fixed point.

Authors: A B Lawson
Journal: J R Stat Soc Ser A Stat Soc Date: 1993 Impact factor: 2.483

4. Divergent biases in ecologic and individual-level studies.

Authors: S Greenland
Journal: Stat Med Date: 1992-06-30 Impact factor: 2.373

5. Regression modeling in back-propagation and projection pursuit learning.

Authors: J N Hwang; S R Lay; M Maechler; R D Martin; J Schimert
Journal: IEEE Trans Neural Netw Date: 1994

6. Random effects probit and logistic regression models for three-level data.

Authors: R D Gibbons; D Hedeker
Journal: Biometrics Date: 1997-12 Impact factor: 2.571

7. A random-effects ordinal regression model for multilevel analysis.

Authors: D Hedeker; R D Gibbons
Journal: Biometrics Date: 1994-12 Impact factor: 2.571

8. Excess cancer mortality among children and adolescents in residential districts polluted by petrochemical manufacturing plants in Taiwan.

Authors: B J Pan; Y J Hong; G C Chang; M T Wang; F F Cinkotai; Y C Ko
Journal: J Toxicol Environ Health Date: 1994-09

8 in total

16 in total

1. Bayesian Smoothing with Gaussian Processes Using Fourier Basis Functions in the spectralGP Package.

Authors: Christopher J Paciorek
Journal: J Stat Softw Date: 2007-04 Impact factor: 6.440

2. Spatial Modelling Using a New Class of Nonstationary Covariance Functions.

Authors: Christopher J Paciorek; Mark J Schervish
Journal: Environmetrics Date: 2006 Impact factor: 1.900

3. Hierarchical Multiresolution Approaches for Dense Point-Level Breast Cancer Treatment Data.

Authors: Shengde Liang; Sudipto Banerjee; Sally Bushhouse; Andrew Finley; Bradley P Carlin
Journal: Comput Stat Data Anal Date: 2008-01-20 Impact factor: 1.681

4. Measurement error caused by spatial misalignment in environmental epidemiology.

Authors: Alexandros Gryparis; Christopher J Paciorek; Ariana Zeka; Joel Schwartz; Brent A Coull
Journal: Biostatistics Date: 2008-10-16 Impact factor: 5.899

5. Bayesian Modeling for Large Spatial Datasets.

Authors: Sudipto Banerjee; Montserrat Fuentes
Journal: Wiley Interdiscip Rev Comput Stat Date: 2012-01

6. HIERARCHICAL SPATIAL MODELS FOR PREDICTING TREE SPECIES ASSEMBLAGES ACROSS LARGE DOMAINS.

Authors: Andrew O Finley; Sudipto Banerjee; Ronald E McRoberts
Journal: Ann Appl Stat Date: 2009-09-01 Impact factor: 2.083

7. High Resolution Space-Time Ozone Modeling for Assessing Trends.

Authors: Sujit K Sahu; Alan E Gelfand; David M Holland
Journal: J Am Stat Assoc Date: 2007 Impact factor: 5.033

8. Hierarchical factor models for large spatially misaligned data: a low-rank predictive process approach.

Authors: Qian Ren; Sudipto Banerjee
Journal: Biometrics Date: 2013-02-04 Impact factor: 2.571

9. Gaussian predictive process models for large spatial data sets.

Authors: Sudipto Banerjee; Alan E Gelfand; Andrew O Finley; Huiyan Sang
Journal: J R Stat Soc Series B Stat Methodol Date: 2008-09-01 Impact factor: 4.488

10. Bayesian variable selection for multivariate spatially varying coefficient regression.

Authors: Brian J Reich; Montserrat Fuentes; Amy H Herring; Kelly R Evenson
Journal: Biometrics Date: 2010-09 Impact factor: 2.571