Literature DB >> 21909346

Semi-parametric Bayesian Inference for Multi-Season Baseball Data.

Fernando A Quintana1, Peter Müler, Gary L Rosner, Mark Munsell.   

Abstract

We analyze complete sequences of successes (hits, walks, and sacrifices) for a group of players from the American and National Leagues, collected over 4 seasons. The goal is to describe how players' performances vary from season to season. In particular, we wish to assess and compare the effect of available occasion-specific covariates over seasons. The data are binary sequences for each player and each season. We model dependence in the binary sequence by an autoregressive logistic model. The model includes lagged terms up to a fixed order. For each player and season we introduce a different set of autologistic regression coefficients, i.e., the regression coefficients are random effects that are specific to each season and player. We use a nonparametric approach to define a random effects distribution. The nonparametric model is defined as a mixture with a Dirichlet process prior for the mixing measure. The described model is justified by a representation theorem for order-k exchangeable sequences. Besides the repeated measurements for each season and player, multiple seasons within a given player define an additional level of repeated measurements. We introduce dependence at this level of repeated measurements by relating the season-specific random effects vectors in an autoregressive fashion. We ultimately conclude that while some covariates like the ERA of the opposing pitcher are always relevant, others like an indicator for the game being into the seventh inning may be significant only for certain seasons, and some others, like the score of the game, can safely be ignored.

Year:  2008        PMID: 21909346      PMCID: PMC3168950          DOI: 10.1214/08-BA312

Source DB:  PubMed          Journal:  Bayesian Anal        ISSN: 1931-6690            Impact factor:   3.728


  2 in total

1.  Bayesian analyses of longitudinal binary data using Markov regression models of unknown order.

Authors:  A Erkanli; R Soyer; A Angold
Journal:  Stat Med       Date:  2001-03-15       Impact factor: 2.373

2.  A semiparametric Bayesian approach to the random effects model.

Authors:  K P Kleinman; J G Ibrahim
Journal:  Biometrics       Date:  1998-09       Impact factor: 2.571

  2 in total
  1 in total

1.  Hitting is contagious in baseball: evidence from long hitting streaks.

Authors:  Joel R Bock; Akhilesh Maewal; David A Gough
Journal:  PLoS One       Date:  2012-12-12       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.