Literature DB >> 12381792

Growing and navigating the small world Web by local content.

Filippo Menczer1.   

Abstract

Can we model the scale-free distribution of Web hypertext degree under realistic assumptions about the behavior of page authors? Can a Web crawler efficiently locate an unknown relevant page? These questions are receiving much attention due to their potential impact for understanding the structure of the Web and for building better search engines. Here I investigate the connection between the linkage and content topology of Web pages. The relationship between a text-induced distance metric and a link-based neighborhood probability distribution displays a phase transition between a region where linkage is not determined by content and one where linkage decays according to a power law. This relationship is used to propose a Web growth model that is shown to accurately predict the distribution of Web page degree, based on textual content and assuming only local knowledge of degree for existing pages. A qualitatively similar phase transition is found between linkage and semantic distance, with an exponential decay tail. Both relationships suggest that efficient paths can be discovered by decentralized Web navigation algorithms based on textual and/or categorical cues.

Year:  2002        PMID: 12381792      PMCID: PMC137828          DOI: 10.1073/pnas.212348399

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  10 in total

1.  Genome evolution. Global methylation in eutherian hybrids.

Authors:  I Roemer; F Grützner; H Winking; T Haaf; A Orth; L Skidmore; D Antczak; R Fundele
Journal:  Nature       Date:  1999-09-09       Impact factor: 49.962

2.  Emergence of scaling in random networks

Authors: 
Journal:  Science       Date:  1999-10-15       Impact factor: 47.728

3.  Structure of growing networks with preferential linking.

Authors:  S N Dorogovtsev; J F Mendes; A N Samukhin
Journal:  Phys Rev Lett       Date:  2000-11-20       Impact factor: 9.161

4.  Network analysis. The structure of the Web.

Authors:  J Kleinberg; S Lawrence
Journal:  Science       Date:  2001-11-30       Impact factor: 47.728

5.  Bose-Einstein condensation in complex networks.

Authors:  G Bianconi; A L Barabási
Journal:  Phys Rev Lett       Date:  2001-06-11       Impact factor: 9.161

6.  Path finding strategies in scale-free networks.

Authors:  Beom Jun Kim; Chang No Yoon; Seung Kee Han; Hawoong Jeong
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2002-01-23

7.  Search in power-law networks.

Authors:  L A Adamic; R M Lukose; A R Puniyani; B A Huberman
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2001-09-26

8.  Navigation in a small world

Authors: 
Journal:  Nature       Date:  2000-08-24       Impact factor: 49.962

9.  Winners don't take all: Characterizing the competition for links on the web.

Authors:  David M Pennock; Gary W Flake; Steve Lawrence; Eric J Glover; C Lee Giles
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-16       Impact factor: 11.205

10.  Identity and search in social networks.

Authors:  Duncan J Watts; Peter Sheridan Dodds; M E J Newman
Journal:  Science       Date:  2002-05-17       Impact factor: 47.728

  10 in total
  9 in total

1.  Evolution of document networks.

Authors:  Filippo Menczer
Journal:  Proc Natl Acad Sci U S A       Date:  2004-01-27       Impact factor: 11.205

2.  Topical interests and the mitigation of search engine bias.

Authors:  S Fortunato; A Flammini; F Menczer; A Vespignani
Journal:  Proc Natl Acad Sci U S A       Date:  2006-08-10       Impact factor: 11.205

3.  Distribution of node characteristics in complex networks.

Authors:  Juyong Park; Albert-László Barabási
Journal:  Proc Natl Acad Sci U S A       Date:  2007-11-07       Impact factor: 11.205

4.  Popularity versus similarity in growing networks.

Authors:  Fragkiskos Papadopoulos; Maksim Kitsak; M Ángeles Serrano; Marián Boguñá; Dmitri Krioukov
Journal:  Nature       Date:  2012-09-12       Impact factor: 49.962

5.  Sublinear domination and core-periphery networks.

Authors:  Marios Papachristou
Journal:  Sci Rep       Date:  2021-07-30       Impact factor: 4.379

6.  The spread of scientific information: insights from the web usage statistics in PLoS article-level metrics.

Authors:  Koon-Kiu Yan; Mark Gerstein
Journal:  PLoS One       Date:  2011-05-16       Impact factor: 3.240

7.  Wikipedia information flow analysis reveals the scale-free architecture of the semantic space.

Authors:  Adolfo Paolo Masucci; Alkiviadis Kalampokis; Victor Martínez Eguíluz; Emilio Hernández-García
Journal:  PLoS One       Date:  2011-02-28       Impact factor: 3.240

8.  Modeling statistical properties of written text.

Authors:  M Angeles Serrano; Alessandro Flammini; Filippo Menczer
Journal:  PLoS One       Date:  2009-04-29       Impact factor: 3.240

9.  Influence of reciprocal links in social networks.

Authors:  Yu-Xiao Zhu; Xiao-Guang Zhang; Gui-Quan Sun; Ming Tang; Tao Zhou; Zi-Ke Zhang
Journal:  PLoS One       Date:  2014-07-29       Impact factor: 3.240

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.