Literature DB >> 14745041

Extracting knowledge from the World Wide Web.

Monika Henzinger1, Steve Lawrence.   

Abstract

The World Wide Web provides a unprecedented opportunity to automatically analyze a large sample of interests and activity in the world. We discuss methods for extracting knowledge from the web by randomly sampling and analyzing hosts and pages, and by analyzing the link structure of the web and how links accumulate over time. A variety of interesting and valuable information can be extracted, such as the distribution of web pages over domains, the distribution of interest in different areas, communities related to different topics, the nature of competition in different categories of sites, and the degree of communication between different communities or countries.

Mesh:

Year:  2004        PMID: 14745041      PMCID: PMC387294          DOI: 10.1073/pnas.0307528100

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  8 in total

1.  Emergence of scaling in random networks

Authors: 
Journal:  Science       Date:  1999-10-15       Impact factor: 47.728

2.  Accessibility of information on the web.

Authors:  S Lawrence; C L Giles
Journal:  Nature       Date:  1999-07-08       Impact factor: 49.962

3.  The large-scale organization of metabolic networks.

Authors:  H Jeong; B Tombor; R Albert; Z N Oltvai; A L Barabási
Journal:  Nature       Date:  2000-10-05       Impact factor: 49.962

4.  Structure of growing networks with preferential linking.

Authors:  S N Dorogovtsev; J F Mendes; A N Samukhin
Journal:  Phys Rev Lett       Date:  2000-11-20       Impact factor: 9.161

5.  Topology of evolving networks: local events and universality

Authors: 
Journal:  Phys Rev Lett       Date:  2000-12-11       Impact factor: 9.161

6.  Winners don't take all: Characterizing the competition for links on the web.

Authors:  David M Pennock; Gary W Flake; Steve Lawrence; Eric J Glover; C Lee Giles
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-16       Impact factor: 11.205

7.  Collective dynamics of 'small-world' networks.

Authors:  D J Watts; S H Strogatz
Journal:  Nature       Date:  1998-06-04       Impact factor: 49.962

8.  Strong regularities in world wide web surfing

Authors: 
Journal:  Science       Date:  1998-04-03       Impact factor: 47.728

  8 in total
  3 in total

1.  Evolution of document networks.

Authors:  Filippo Menczer
Journal:  Proc Natl Acad Sci U S A       Date:  2004-01-27       Impact factor: 11.205

2.  Googling social interactions: web search engine based social network construction.

Authors:  Sang Hoon Lee; Pan-Jun Kim; Yong-Yeol Ahn; Hawoong Jeong
Journal:  PLoS One       Date:  2010-07-21       Impact factor: 3.240

3.  Cataloging the biomedical world of pain through semi-automated curation of molecular interactions.

Authors:  Daniel G Jamieson; Phoebe M Roberts; David L Robertson; Ben Sidders; Goran Nenadic
Journal:  Database (Oxford)       Date:  2013-05-23       Impact factor: 3.451

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.