
Decomposition and model selection for large contingency tables.

Corinne Dahinden, Markus Kalisch, Peter Bühlmann.

Abstract

Large contingency tables summarizing categorical variables arise in many areas. One example is biology, where large numbers of biomarkers are cross-tabulated according to their discrete expression levels. Interactions among the variables are of great interest and are generally studied with log-linear models. The structure of a log-linear model can be represented by a graph from which the conditional independence structure can be read off directly. However, since the number of parameters in a saturated model grows exponentially in the number of variables, fitting such models generally comes with a heavy computational burden. Even if we restrict ourselves to models with lower-order interactions or other sparse structures, we are faced with a large number of cells, which play the role of the sample size. This is in sharp contrast to high-dimensional regression or classification procedures because, in addition to a high-dimensional parameter, we also have to deal with the analogue of a huge sample size. Furthermore, high-dimensional tables naturally feature a large number of sampling zeros, which often leads to the nonexistence of the maximum likelihood estimate. We therefore present a decomposition approach, in which we first divide the problem into several lower-dimensional problems and then combine their solutions to form a global solution. Our methodology is computationally feasible for log-linear interaction models with many categorical variables, each or some of which have many levels. We demonstrate the proposed method on simulated data and apply it to a biomedical problem in cancer research.
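To make the scaling argument in the abstract concrete, the following minimal sketch counts the cells of a full cross-table and gives a lower bound on the share of empty cells for a fixed number of observations. It is an illustration only and not the authors' decomposition method. The function names `saturated_cells` and `min_empty_fraction`, the choice of three levels per variable, and the sample size of 1000 are all assumptions made for this example, and the bound assumes the table has no structural zeros.

```python
from math import prod


def saturated_cells(levels):
    """Number of cells in the full cross-table of variables with the given level counts.

    A saturated log-linear model has one free parameter per cell, minus one.
    """
    return prod(levels)


def min_empty_fraction(levels, n_obs):
    """Lower bound on the fraction of empty cells.

    n_obs observations can occupy at most n_obs distinct cells, so at least
    (cells - n_obs) cells must be zero (assuming no structural zeros).
    """
    cells = saturated_cells(levels)
    return max(0.0, 1.0 - n_obs / cells)


if __name__ == "__main__":
    # Hypothetical setting: p variables with 3 levels each, 1000 observations.
    for p in (5, 10, 20, 40):
        levels = [3] * p
        print(p, saturated_cells(levels), min_empty_fraction(levels, 1000))
```

Under these assumptions the cell count is the product of the level counts, so it grows exponentially with the number of variables, while the number of occupied cells can never exceed the number of observations. This is one way to see why sampling zeros dominate in high-dimensional tables.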


Year:  2010        PMID: 20213739     DOI: 10.1002/bimj.200900083

Source DB:  PubMed          Journal:  Biom J        ISSN: 0323-3847            Impact factor:   2.207


