Cosma Rohilla Shalizi, Alessandro Rinaldo.
Abstract
The growing availability of network data and of scientific interest in distributed systems has led to the rapid development of statistical models of network structure. Typically, however, these are models for the entire network, while the data consists only of a sampled sub-network. Parameters for the whole network, which is what is of interest, are estimated by applying the model to the sub-network. This assumes that the model is consistent under sampling, or, in terms of the theory of stochastic processes, that it defines a projective family. Focusing on the popular class of exponential random graph models (ERGMs), we show that this apparently trivial condition is in fact violated by many popular and scientifically appealing models, and that satisfying it drastically limits ERGM's expressive power. These results are actually special cases of more general results about exponential families of dependent random variables, which we also prove. Using such results, we offer easily checked conditions for the consistency of maximum likelihood estimation in ERGMs, and discuss some possible constructive responses.
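The projectivity condition described in the abstract can be checked by brute force on very small graphs. The sketch below is illustrative only and is not taken from the paper: the function names (`ergm`, `marginalize`, `projectivity_gap`) and the parameter values are hypothetical, and the model is assumed to be an ERGM with just two sufficient statistics, the edge count and the triangle count. It builds the exact ERGM distribution on 4 labelled nodes, marginalizes it to the induced sub-network on 3 nodes, and compares the result with the same-parameter ERGM defined directly on 3 nodes.

```python
from itertools import combinations, product
from math import exp


def ergm(n, theta_edges, theta_triangles):
    """Exact ERGM distribution over all graphs on n labelled nodes.

    Assumed sufficient statistics: number of edges and number of triangles.
    Returns a dict mapping each graph (a frozenset of edges) to its probability.
    """
    pairs = list(combinations(range(n), 2))  # all possible edges (a, b), a < b
    weights = {}
    for bits in product((0, 1), repeat=len(pairs)):
        edges = {p for p, b in zip(pairs, bits) if b}
        n_edges = len(edges)
        n_triangles = sum(
            1
            for a, b, c in combinations(range(n), 3)
            if (a, b) in edges and (a, c) in edges and (b, c) in edges
        )
        weights[frozenset(edges)] = exp(
            theta_edges * n_edges + theta_triangles * n_triangles
        )
    z = sum(weights.values())  # normalizing constant
    return {g: w / z for g, w in weights.items()}


def marginalize(dist, m):
    """Distribution of the sub-network induced on nodes 0..m-1."""
    out = {}
    for g, p in dist.items():
        sub = frozenset(e for e in g if e[0] < m and e[1] < m)
        out[sub] = out.get(sub, 0.0) + p
    return out


def projectivity_gap(theta_edges, theta_triangles, small=3, large=4):
    """Largest probability difference between the ERGM defined directly on
    `small` nodes and the one induced by sampling from `large` nodes."""
    direct = ergm(small, theta_edges, theta_triangles)
    induced = marginalize(ergm(large, theta_edges, theta_triangles), small)
    return max(abs(direct[g] - induced[g]) for g in direct)


if __name__ == "__main__":
    print("edges only:       ", projectivity_gap(-0.5, 0.0))
    print("edges + triangles:", projectivity_gap(-0.5, 1.0))
```

With the triangle parameter set to zero the model reduces to independent edges, and the gap is zero up to floating-point error. With a non-zero triangle parameter the two distributions differ, so applying the whole-network model to the sampled sub-network would target the wrong parameters.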
Keywords: Exponential family; exponential random graph model; independent increments; network models; network sampling; projective family; sufficient statistics
Year: 2013 PMID: 26166910 PMCID: PMC4498414 DOI: 10.1214/12-AOS1044
Source DB: PubMed Journal: Ann Stat ISSN: 0090-5364 Impact factor: 4.028