Literature DB >> 24358073

Multiple-Shrinkage Multinomial Probit Models with Applications to Simulating Geographies in Public Use Data.

Lane F Burgette1, Jerome P Reiter2.   

Abstract

Multinomial outcomes with many levels can be challenging to model. Information typically accrues slowly with increasing sample size, yet the parameter space expands rapidly with additional covariates. Shrinking all regression parameters towards zero, as often done in models of continuous or binary response variables, is unsatisfactory, since setting parameters equal to zero in multinomial models does not necessarily imply "no effect." We propose an approach to modeling multinomial outcomes with many levels based on a Bayesian multinomial probit (MNP) model and a multiple shrinkage prior distribution for the regression parameters. The prior distribution encourages the MNP regression parameters to shrink toward a number of learned locations, thereby substantially reducing the dimension of the parameter space. Using simulated data, we compare the predictive performance of this model against two other recently-proposed methods for big multinomial models. The results suggest that the fully Bayesian, multiple shrinkage approach can outperform these other methods. We apply the multiple shrinkage MNP to simulating replacement values for areal identifiers, e.g., census tract indicators, in order to protect data confidentiality in public use datasets.

Entities:  

Keywords:  Confidentiality; Dirichlet process; disclosure; spatial; synthetic

Year:  2013        PMID: 24358073      PMCID: PMC3863948          DOI: 10.1214/13-BA816

Source DB:  PubMed          Journal:  Bayesian Anal        ISSN: 1931-6690            Impact factor:   3.728


  5 in total

1.  Bayesian variable selection in multinomial probit models to identify molecular signatures of disease stage.

Authors:  Naijun Sha; Marina Vannucci; Mahlet G Tadesse; Philip J Brown; Ilaria Dragoni; Nick Davies; Tracy C Roberts; Andrea Contestabile; Mike Salmon; Chris Buckley; Francesco Falciani
Journal:  Biometrics       Date:  2004-09       Impact factor: 2.571

2.  MULTIPLE IMPUTATION FOR SHARING PRECISE GEOGRAPHIES IN PUBLIC USE DATA.

Authors:  Hao Wang; Jerome P Reiter
Journal:  Ann Appl Stat       Date:  2012-03-01       Impact factor: 2.083

3.  Sparse multinomial logistic regression: fast algorithms and generalization bounds.

Authors:  Balaji Krishnapuram; Lawrence Carin; Mário A T Figueiredo; Alexander J Hartemink
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2005-06       Impact factor: 6.226

4.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

5.  Bayesian semiparametric multiple shrinkage.

Authors:  Richard F Maclehose; David B Dunson
Journal:  Biometrics       Date:  2009-06-08       Impact factor: 2.571

  5 in total
  2 in total

1.  Protecting Confidentiality in Cancer Registry Data With Geographic Identifiers.

Authors:  Mandi Yu; Jerome Phillip Reiter; Li Zhu; Benmei Liu; Kathleen A Cronin; Eric J Rocky Feuer
Journal:  Am J Epidemiol       Date:  2017-07-01       Impact factor: 4.897

2.  Using the "Hidden" genome to improve classification of cancer types.

Authors:  Saptarshi Chakraborty; Colin B Begg; Ronglai Shen
Journal:  Biometrics       Date:  2020-09-21       Impact factor: 2.571

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.