Affiliation:
1. IBM Almaden Research Center, San Jose, CA
Abstract
Algorithmic tools for searching and mining the Web are becoming increasingly sophisticated and vital. In this context, algorithms that use and exploit structural information about the Web perform better than generic methods in both efficiency and reliability.We present an extensive characterization of the graph structure of the Web, with a view to enabling high-performance applications that make use of this structure. In particular, we show that the Web emerges as the outcome of a number of essentially independent stochastic processes that evolve at various scales. A striking consequence of this scale invariance is that the structure of the Web is "fractal"---cohesive subregions display the same characteristics as the Web at large. An understanding of this underlying fractal nature is therefore applicable to designing data services across multiple domains and scales.We describe potential applications of this line of research to optimized algorithm design for Web-scale data analysis.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications
Reference46 articles.
1. The Lorel query language for semistructured data;Abiteboul S.;Int. J. Digital Libr.,1997
2. The nature of markets on the world wide web;Adamic L.;Q. J. Econ. Commerce,2000
3. Adamic L. and Huberman B. 1999. Scaling behavior on the world wide web. Technical comment on Barabasi and Albert {1999}. Adamic L. and Huberman B. 1999. Scaling behavior on the world wide web. Technical comment on Barabasi and Albert {1999}.
Cited by
56 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献