The web as a graph: measurements, models, and methods

doi:10.1007/3-540-48686-0_1

Book ChapterDOI

The web as a graph: measurements, models, and methods

- pp 1-17

TLDR

This paper describes two algorithms that operate on the Web graph, addressing problems from Web search and automatic community discovery, and proposes a new family of random graph models that point to a rich new sub-field of the study of random graphs, and raises questions about the analysis of graph algorithms on the Internet.

Abstract:

The pages and hyperlinks of the World-Wide Web may be viewed as nodes and edges in a directed graph. This graph is a fascinating object of study: it has several hundred million nodes today, over a billion links, and appears to grow exponentially with time. There are many reasons -- mathematical, sociological, and commercial -- for studying the evolution of this graph. In this paper we begin by describing two algorithms that operate on the Web graph, addressing problems from Web search and automatic community discovery. We then report a number of measurements and properties of this graph that manifested themselves as we ran these algorithms on the Web. Finally, we observe that traditional random graph models do not explain these observations, and we propose a new family of random graph models. These models point to a rich new sub-field of the study of random graphs, and raise questions about the analysis of graph algorithms on the Web.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

The Structure and Function of Complex Networks

Mark Newman

- 01 Jan 2003 -

Siam Review

TL;DR: Developments in this field are reviewed, including such concepts as the small-world effect, degree distributions, clustering, network correlations, random graph models, models of network growth and preferential attachment, and dynamical processes taking place on networks.

...read moreread less

Journal ArticleDOI

Finding community structure in very large networks.

Aaron Clauset, +2 more

- 06 Dec 2004 -

Physical Review E

TL;DR: A hierarchical agglomeration algorithm for detecting community structure which is faster than many competing algorithms: its running time on a network with n vertices and m edges is O (md log n) where d is the depth of the dendrogram describing the community structure.

...read moreread less

Journal ArticleDOI

A faster algorithm for betweenness centrality

Ulrik Brandes

- 01 Jun 2001 -

Journal of Mathematical Sociology

TL;DR: New algorithms for betweenness are introduced in this paper and require O(n + m) space and run in O(nm) and O( nm + n2 log n) time on unweighted and weighted networks, respectively, where m is the number of links.

...read moreread less

Journal ArticleDOI

Random graphs with arbitrary degree distributions and their applications.

Mark Newman, +4 more

- 24 Jul 2001 -

Physical Review E

TL;DR: It is demonstrated that in some cases random graphs with appropriate distributions of vertex degree predict with surprising accuracy the behavior of the real world, while in others there is a measurable discrepancy between theory and reality, perhaps indicating the presence of additional social structure in the network that is not captured by the random graph.

...read moreread less

Journal ArticleDOI

Evolution of networks

Sergey N. Dorogovtsev, +1 more

- 01 Jun 2002 -

Advances in Physics

TL;DR: The recent rapid progress in the statistical physics of evolving networks is reviewed, and how growing networks self-organize into scale-free structures is discussed, and the role of the mechanism of preferential linking is investigated.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Matrix computations

Gene H. Golub

Journal ArticleDOI

The anatomy of a large-scale hypertextual Web search engine

Sergey Brin, +1 more

TL;DR: This paper provides an in-depth description of Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and looks at the problem of how to effectively deal with uncontrolled hypertext collections where anyone can publish anything they want.

...read moreread less

Journal Article

The Anatomy of a Large-Scale Hypertextual Web Search Engine.

Sergey Brin, +1 more

- 01 Jan 1998 -

Computer Networks

TL;DR: Google as discussed by the authors is a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems.

...read moreread less

Proceedings Article

Fast algorithms for mining association rules

Rakesh Agrawal, +1 more

TL;DR: Two new algorithms for solving thii problem that are fundamentally different from the known algorithms are presented and empirical evaluation shows that these algorithms outperform theknown algorithms by factors ranging from three for small problems to more than an order of magnitude for large problems.

...read moreread less

Proceedings Article

Fast Algorithms for Mining Association Rules in Large Databases

Rakesh Agrawal, +1 more

Collapse

The web as a graph: measurements, models, and methods

Citations

The Structure and Function of Complex Networks

Finding community structure in very large networks.

A faster algorithm for betweenness centrality

Random graphs with arbitrary degree distributions and their applications.

Evolution of networks

References

Matrix computations

The anatomy of a large-scale hypertextual Web search engine

The Anatomy of a Large-Scale Hypertextual Web Search Engine.

Fast algorithms for mining association rules

Fast Algorithms for Mining Association Rules in Large Databases

Related Papers (5)

Emergence of Scaling in Random Networks

Collective dynamics of small-world networks

The anatomy of a large-scale hypertextual Web search engine

Statistical mechanics of complex networks

Authoritative sources in a hyperlinked environment