Literature DB >> 28867924

Estimating the Size of a Large Network and its Communities from a Random Sample.

Lin Chen1,2, Amin Karbasi1,2, Forrest W Crawford2,3.   

Abstract

Most real-world networks are too large to be measured or studied directly and there is substantial interest in estimating global network properties from smaller sub-samples. One of the most important global properties is the number of vertices/nodes in the network. Estimating the number of vertices in a large network is a major challenge in computer science, epidemiology, demography, and intelligence analysis. In this paper we consider a population random graph G = (V, E) from the stochastic block model (SBM) with K communities/blocks. A sample is obtained by randomly choosing a subset W ⊆ V and letting G(W) be the induced subgraph in G of the vertices in W. In addition to G(W), we observe the total degree of each sampled vertex and its block membership. Given this partial information, we propose an efficient PopULation Size Estimation algorithm, called PULSE, that accurately estimates the size of the whole population as well as the size of each community. To support our theoretical analysis, we perform an exhaustive set of experiments to study the effects of sample size, K, and SBM model parameters on the accuracy of the estimates. The experimental results also demonstrate that PULSE significantly outperforms a widely-used method called the network scale-up estimator in a wide variety of scenarios.

Entities:  

Year:  2016        PMID: 28867924      PMCID: PMC5578631     

Source DB:  PubMed          Journal:  Adv Neural Inf Process Syst        ISSN: 1049-5258


  8 in total

Review 1.  Community structure in social and biological networks.

Authors:  M Girvan; M E J Newman
Journal:  Proc Natl Acad Sci U S A       Date:  2002-06-11       Impact factor: 11.205

2.  Finding and evaluating community structure in networks.

Authors:  M E J Newman; M Girvan
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2004-02-26

3.  Modularity and community structure in networks.

Authors:  M E J Newman
Journal:  Proc Natl Acad Sci U S A       Date:  2006-05-24       Impact factor: 11.205

4.  Estimation of seroprevalence, rape, and homelessness in the United States using a social network approach.

Authors:  P D Killworth; C McCarty; H R Bernard; G A Shelley; E C Johnsen
Journal:  Eval Rev       Date:  1998-04

5.  Size Estimation of Groups at High Risk of HIV/AIDS using Network Scale Up in Kerman, Iran.

Authors:  Mostafa Shokoohi; Mohammad Reza Baneshi; Ali-Akbar Haghdoost
Journal:  Int J Prev Med       Date:  2012-07

6.  Assessing network scale-up estimates for groups most at risk of HIV/AIDS: evidence from a multiple-method study of heavy drug users in Curitiba, Brazil.

Authors:  Matthew J Salganik; Dimitri Fazito; Neilane Bertoni; Alexandre H Abdo; Maeve B Mello; Francisco I Bastos
Journal:  Am J Epidemiol       Date:  2011-10-14       Impact factor: 4.897

7.  Population size estimation of men who have sex with men through the network scale-up method in Japan.

Authors:  Satoshi Ezoe; Takeo Morooka; Tatsuya Noda; Miriam Lewis Sabin; Soichi Koike
Journal:  PLoS One       Date:  2012-01-27       Impact factor: 3.240

8.  Estimating the size of HIV key affected populations in Chongqing, China, using the network scale-up method.

Authors:  Wei Guo; Shuilian Bao; Wen Lin; Guohui Wu; Wei Zhang; Wolfgang Hladik; Abu Abdul-Quader; Marc Bulterys; Serena Fuller; Lu Wang
Journal:  PLoS One       Date:  2013-08-13       Impact factor: 3.240

  8 in total
  1 in total

1.  Public communication can facilitate low-risk coordination under surveillance.

Authors:  Amos Korman; Pierluigi Crescenzi
Journal:  Sci Rep       Date:  2022-03-02       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.