Literature DB >> 26605002

MODEL-BASED CLUSTERING OF LARGE NETWORKS.

Duy Q Vu1, David R Hunter2, Michael Schweinberger3.   

Abstract

We describe a network clustering framework, based on finite mixture models, that can be applied to discrete-valued networks with hundreds of thousands of nodes and billions of edge variables. Relative to other recent model-based clustering work for networks, we introduce a more flexible modeling framework, improve the variational-approximation estimation algorithm, discuss and implement standard error estimation via a parametric bootstrap approach, and apply these methods to much larger data sets than those seen elsewhere in the literature. The more flexible framework is achieved through introducing novel parameterizations of the model, giving varying degrees of parsimony, using exponential family models whose structure may be exploited in various theoretical and algorithmic ways. The algorithms are based on variational generalized EM algorithms, where the E-steps are augmented by a minorization-maximization (MM) idea. The bootstrapped standard error estimates are based on an efficient Monte Carlo network simulation idea. Last, we demonstrate the usefulness of the model-based clustering framework by applying it to a discrete-valued network with more than 131,000 nodes and 17 billion edge variables.

Entities:  

Keywords:  EM algorithms; MM algorithms; Social networks; finite mixture models; generalized EM algorithms; stochastic block models; variational EM algorithms

Year:  2013        PMID: 26605002      PMCID: PMC4655199          DOI: 10.1214/12-AOAS617

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  5 in total

1.  Model for heterogeneous random networks using continuous latent variables and an application to a tree-fungus network.

Authors:  Jean-Jacques Daudin; Laurent Pierre; Corinne Vacher
Journal:  Biometrics       Date:  2010-12       Impact factor: 2.571

2.  Statistical analysis of pair relationships: symmetry, subjective consistency and reciprocity.

Authors:  J A Davis
Journal:  Sociometry       Date:  1968-03

3.  Mixed Membership Stochastic Blockmodels.

Authors:  Edoardo M Airoldi; David M Blei; Stephen E Fienberg; Eric P Xing
Journal:  J Mach Learn Res       Date:  2008-09       Impact factor: 3.654

4.  Instability, Sensitivity, and Degeneracy of Discrete Exponential Families.

Authors:  Michael Schweinberger
Journal:  J Am Stat Assoc       Date:  2012-01-24       Impact factor: 5.033

5.  Disaster Response on September 11, 2001 Through the Lens of Statistical Network Analysis.

Authors:  Michael Schweinberger; Miruna Petrescu-Prahova; Duy Quang Vu
Journal:  Soc Networks       Date:  2014-05
  5 in total
  5 in total

1.  Model-Based Clustering of Nonparametric Weighted Networks with Application to Water Pollution Analysis.

Authors:  Amal Agarwal; Lingzhou Xue
Journal:  Technometrics       Date:  2019-07-05

2.  LCN: a random graph mixture model for community detection in functional brain networks.

Authors:  Christopher Bryant; Hongtu Zhu; Mihye Ahn; Joseph Ibrahim
Journal:  Stat Interface       Date:  2017       Impact factor: 0.582

3.  A Semiparametric Bayesian Approach to Epidemics, with Application to the Spread of the Coronavirus MERS in South Korea in 2015.

Authors:  Michael Schweinberger; Rashmi P Bomiriya; Sergii Babkin
Journal:  J Nonparametr Stat       Date:  2021-09-16       Impact factor: 1.012

4.  Model-based clustering of time-evolving networks through temporal exponential-family random graph models.

Authors:  Kevin H Lee; Lingzhou Xue; David R Hunter
Journal:  J Multivar Anal       Date:  2019-09-05       Impact factor: 1.473

5.  Local dependence in random graph models: characterization, properties and statistical inference.

Authors:  Michael Schweinberger; Mark S Handcock
Journal:  J Am Stat Assoc       Date:  2015-06-01       Impact factor: 5.033

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.