Literature DB >> 26167197

Model selection for degree-corrected block models.

Xiaoran Yan1, Cosma Shalizi2, Jacob E Jensen3, Florent Krzakala4, Cristopher Moore5, Lenka Zdeborová6, Pan Zhang5, Yaojia Zhu7.   

Abstract

The proliferation of models for networks raises challenging problems of model selection: the data are sparse and globally dependent, and models are typically high-dimensional and have large numbers of latent variables. Together, these issues mean that the usual model-selection criteria do not work properly for networks. We illustrate these challenges, and show one way to resolve them, by considering the key network-analysis problem of dividing a graph into communities or blocks of nodes with homogeneous patterns of links to the rest of the network. The standard tool for undertaking this is the stochastic block model, under which the probability of a link between two nodes is a function solely of the blocks to which they belong. This imposes a homogeneous degree distribution within each block; this can be unrealistic, so degree-corrected block models add a parameter for each node, modulating its overall degree. The choice between ordinary and degree-corrected block models matters because they make very different inferences about communities. We present the first principled and tractable approach to model selection between standard and degree-corrected block models, based on new large-graph asymptotics for the distribution of log-likelihood ratios under the stochastic block model, finding substantial departures from classical results for sparse graphs. We also develop linear-time approximations for log-likelihoods under both the stochastic block model and the degree-corrected model, using belief propagation. Applications to simulated and real networks show excellent agreement with our approximations. Our results thus both solve the practical problem of deciding on degree correction and point to a general approach to model selection in network analysis.

Entities:  

Keywords:  clustering techniques; message-passing algorithms; networks; random graphs; statistical inference

Year:  2014        PMID: 26167197      PMCID: PMC4498413          DOI: 10.1088/1742-5468/2014/05/P05007

Source DB:  PubMed          Journal:  J Stat Mech        ISSN: 1742-5468            Impact factor:   2.231


  12 in total

1.  Mixing patterns in networks.

Authors:  M E J Newman
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2003-02-27

2.  Assortative mixing in networks.

Authors:  M E J Newman
Journal:  Phys Rev Lett       Date:  2002-10-28       Impact factor: 9.161

3.  Asymptotic analysis of the stochastic block model for modular networks and its algorithmic applications.

Authors:  Aurelien Decelle; Florent Krzakala; Cristopher Moore; Lenka Zdeborová
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2011-12-12

4.  Food web models: a plea for groups.

Authors:  Stefano Allesina; Mercedes Pascual
Journal:  Ecol Lett       Date:  2009-05-06       Impact factor: 9.492

5.  Hierarchical structure and the prediction of missing links in networks.

Authors:  Aaron Clauset; Cristopher Moore; M E J Newman
Journal:  Nature       Date:  2008-05-01       Impact factor: 49.962

6.  A nonparametric view of network models and Newman-Girvan and other modularities.

Authors:  Peter J Bickel; Aiyou Chen
Journal:  Proc Natl Acad Sci U S A       Date:  2009-11-23       Impact factor: 11.205

7.  Stochastic blockmodels and community structure in networks.

Authors:  Brian Karrer; M E J Newman
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2011-01-21

8.  Inference and phase transitions in the detection of modules in sparse networks.

Authors:  Aurelien Decelle; Florent Krzakala; Cristopher Moore; Lenka Zdeborová
Journal:  Phys Rev Lett       Date:  2011-08-02       Impact factor: 9.161

9.  Mixed Membership Stochastic Blockmodels.

Authors:  Edoardo M Airoldi; David M Blei; Stephen E Fienberg; Eric P Xing
Journal:  J Mach Learn Res       Date:  2008-09       Impact factor: 3.654

10.  The interplay between microscopic and mesoscopic structures in complex networks.

Authors:  Jörg Reichardt; Roberto Alamino; David Saad
Journal:  PLoS One       Date:  2011-08-01       Impact factor: 3.240

View more
  11 in total

1.  Completeness of Community Structure in Networks.

Authors:  Jia-Rong Xie; Pan Zhang; Hai-Feng Zhang; Bing-Hong Wang
Journal:  Sci Rep       Date:  2017-07-13       Impact factor: 4.379

2.  Scalable detection of statistically significant communities and hierarchies, using message passing for modularity.

Authors:  Pan Zhang; Cristopher Moore
Journal:  Proc Natl Acad Sci U S A       Date:  2014-12-08       Impact factor: 11.205

3.  Significance-based community detection in weighted networks.

Authors:  John Palowitch; Shankar Bhamidi; Andrew B Nobel
Journal:  J Mach Learn Res       Date:  2018-04       Impact factor: 3.654

4.  Cross-validation estimate of the number of clusters in a network.

Authors:  Tatsuro Kawamoto; Yoshiyuki Kabashima
Journal:  Sci Rep       Date:  2017-06-12       Impact factor: 4.379

5.  An empirical Bayes approach to stochastic blockmodels and graphons: shrinkage estimation and model selection.

Authors:  Zhanhao Peng; Qing Zhou
Journal:  PeerJ Comput Sci       Date:  2022-07-06

6.  Investigation of patient-sharing networks using a Bayesian network model selection approach for congruence class models.

Authors:  Ravi Goyal; Victor De Gruttola
Journal:  Stat Med       Date:  2021-04-02       Impact factor: 2.373

7.  Modelling sequences and temporal networks with dynamic community structures.

Authors:  Tiago P Peixoto; Martin Rosvall
Journal:  Nat Commun       Date:  2017-09-19       Impact factor: 14.919

8.  Weighted Stochastic Block Models of the Human Connectome across the Life Span.

Authors:  Joshua Faskowitz; Xiaoran Yan; Xi-Nian Zuo; Olaf Sporns
Journal:  Sci Rep       Date:  2018-08-29       Impact factor: 4.379

9.  Loan maturity aggregation in interbank lending networks obscures mesoscale structure and economic functions.

Authors:  Marnix Van Soom; Milan van den Heuvel; Jan Ryckebusch; Koen Schoors
Journal:  Sci Rep       Date:  2019-08-29       Impact factor: 4.379

10.  Scalable Estimation of Epidemic Thresholds via Node Sampling.

Authors:  Anirban Dasgupta; Srijan Sengupta
Journal:  Sankhya Ser A       Date:  2021-07-07
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.