
Evolution of document networks.

Filippo Menczer

Abstract

How does a network of documents grow without centralized control? This question is becoming crucial as we try to explain the emergent scale-free topology of the World Wide Web and use link analysis to identify important information resources. Existing models of growing information networks have focused on the structure of links but neglected the content of nodes. Here I show that the current models fail to reproduce a critical characteristic of information networks, namely the distribution of textual similarity among linked documents. I propose a more realistic model that generates links by using both popularity and content. This model yields remarkably accurate predictions of both degree and similarity distributions in networks of web pages and scientific literature.
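The idea of generating links from both popularity and content can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the model from the paper: the function names, the mixing parameter `alpha`, the use of in-degree plus one as the popularity weight, and cosine similarity over non-negative term-weight vectors are all hypothetical choices made for illustration.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity between two equal-length term-weight vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def grow_network(vectors, links_per_node=2, alpha=0.5, seed=0):
    """Grow a directed network in which node t links only to nodes 0..t-1.

    alpha (hypothetical parameter) is the probability that a link target is
    drawn by popularity (in-degree + 1); otherwise the target is drawn in
    proportion to its content similarity with the new node.
    Assumes non-negative term weights, so similarities are never negative.
    """
    rng = random.Random(seed)
    in_degree = [0] * len(vectors)
    edges = []
    for t in range(1, len(vectors)):
        chosen = set()
        for _ in range(min(links_per_node, t)):
            remaining = [i for i in range(t) if i not in chosen]
            if rng.random() < alpha:
                # Popularity: favor documents that already attract links.
                weights = [in_degree[i] + 1 for i in remaining]
            else:
                # Content: favor documents textually similar to the new one.
                # The small constant keeps the total weight positive.
                weights = [cosine(vectors[t], vectors[i]) + 1e-9 for i in remaining]
            chosen.add(rng.choices(remaining, weights=weights, k=1)[0])
        # Degrees are updated only after all of node t's links are chosen.
        for target in sorted(chosen):
            in_degree[target] += 1
            edges.append((t, target))
    return edges, in_degree


if __name__ == "__main__":
    # Toy documents as two-term weight vectors (illustrative data only).
    docs = [[1, 0], [1, 0], [0, 1], [0, 1], [1, 1]]
    edges, in_degree = grow_network(docs, links_per_node=2, alpha=0.5, seed=1)
    print(edges)
    print(in_degree)
```

Setting `alpha=1.0` reduces the sketch to purely popularity-driven linking, while `alpha=0.0` makes link choice depend on content alone; comparing the resulting degree and similarity distributions is one way to explore the contrast the abstract draws.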

Year: 2004    PMID: 14747653    PMCID: PMC387305    DOI: 10.1073/pnas.0307554100

Source DB: PubMed    Journal: Proc Natl Acad Sci U S A    ISSN: 0027-8424    Impact factor: 11.205


