The PageRank Citation Ranking: Bringing Order to the Web

Open Access Proceedings Article

- Vol. 98, pp 161-172

TL;DR

This paper describes PageRank, a method for rating Web pages objectively and mechanically, effectively measuring the human interest and attention devoted to them, and shows how to efficiently compute PageRank for large numbers of pages.

Abstract:

The importance of a Web page is an inherently subjective matter, which depends on the reader's interests, knowledge, and attitudes. But there is still much that can be said objectively about the relative importance of Web pages. This paper describes PageRank, a method for rating Web pages objectively and mechanically, effectively measuring the human interest and attention devoted to them. We compare PageRank to an idealized random Web surfer. We show how to efficiently compute PageRank for large numbers of pages. Finally, we show how to apply PageRank to search and to user navigation.
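The random-surfer comparison in the abstract can be illustrated with a small power-iteration sketch. This is only an illustration under stated assumptions, not the authors' implementation: the damping factor of 0.85, the uniform random jump, the uniform redistribution of rank from pages without out-links, the function name `pagerank`, and the four-page toy graph are all choices made here and do not come from this listing.

```python
def pagerank(links, damping=0.85, tol=1e-10, max_iter=200):
    """Approximate PageRank by power iteration.

    links: dict mapping each page to the list of pages it links to.
    damping: assumed probability that the surfer follows a link
             instead of jumping to a uniformly random page.
    """
    # Collect every page that appears as a source or as a link target.
    pages = sorted(set(links) | {q for targets in links.values() for q in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}  # start from the uniform distribution

    for _ in range(max_iter):
        # Rank held by pages with no out-links is spread uniformly (an assumption).
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}

        # Each page splits its damped rank evenly among the pages it links to.
        for p in pages:
            targets = links.get(p, [])
            for q in targets:
                new_rank[q] += damping * rank[p] / len(targets)

        # Stop once the total change between iterations is negligible.
        delta = sum(abs(new_rank[p] - rank[p]) for p in pages)
        rank = new_rank
        if delta < tol:
            break
    return rank


# Hypothetical toy graph: C is linked to by A, B, and D.
toy_web = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(toy_web)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 4))
```

In this toy graph the ranks sum to 1, and the page with the most in-links (C) ends up with the highest score, while D, which nothing links to, receives only the random-jump share.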

Citations

Book Chapter

Using PageRank to Characterize Web Structure

Gopal Pandurangan, +2 more

TL;DR: This work studies the distribution of PageRank values (used in the Google search engine) on the Web and develops detailed models of the Web graph that explain the observed distribution while remaining faithful to previously studied degree distributions.

Journal Article

Business Intelligence and Analytics: Research Directions

Ee-Peng Lim, +2 more

TL;DR: The article reviews state-of-the-art techniques and models, summarizes their use in business intelligence and analytics (BIA) applications, and categorizes BIA research activities into three broad research directions: (a) big data analytics, (b) text analytics, and (c) network analytics.

Monograph

Mathematics for Machine Learning

Marc Peter Deisenroth, +2 more

TL;DR: This self-contained textbook bridges the gap between mathematical and machine learning texts, introducing the mathematical concepts with a minimum of prerequisites to derive four central machine learning methods.

Recognizing Nepotistic Links on the Web

Brian D. Davison

TL;DR: Initial experiments report high accuracy, showing the potential of a machine learning tool to automatically recognize and eliminate nepotistic links, that is, links between pages that are present for reasons other than merit.

Journal Article

Graph-based term weighting for information retrieval

Roi Blanco, +1 more

- 01 Feb 2012 -

Information Retrieval

TL;DR: This work proposes a principled graph-theoretic approach of computing term weights and integrating discourse aspects into retrieval, and experimentally shows that this type of ranking performs comparably to BM25, and can even outperform it, across different TREC datasets and evaluation measures.

References

Journal Article

The Anatomy of a Large-Scale Hypertextual Web Search Engine.

Sergey Brin, +1 more

- 01 Jan 1998 -

Computer Networks

TL;DR: Google is a prototype of a large-scale search engine that makes heavy use of the structure present in hypertext and is designed to crawl and index the Web efficiently and to produce much more satisfying search results than existing systems.

Journal Article

Efficient crawling through URL ordering

Junghoo Cho, +2 more

TL;DR: The authors study in what order a crawler should visit the URLs it has seen so as to obtain more "important" pages first, and they show that a crawler with a good ordering scheme can obtain important pages significantly faster than one without such a scheme.

Proceedings Article

Silk from a sow's ear: extracting usable structures from the Web

Peter Pirolli, +2 more

TL;DR: This paper explores techniques that use both the topology of and the textual similarity between items, as well as usage data collected by servers and page meta-information such as title and size.

Proceedings Article

HyPursuit: a hierarchical network search engine that exploits content-link hypertext clustering

Ron Weiss, +2 more

TL;DR: Experience with HyPursuit suggests that abstraction functions based on hypertext clustering can be used to construct meaningful and scalable cluster hierarchies, and preliminary results on clustering based on both document contents and hyperlink structures are encouraging.

Journal Article

The quest for correct information on the Web: hyper search engines

Massimo Marchiori

TL;DR: This paper presents a novel method to extract from a web object its “hyper” informative content, in contrast with current search engines, which only deal with the “textual” informative content.

Related Papers (5)

Authoritative sources in a hyperlinked environment

The anatomy of a large-scale hypertextual Web search engine

Emergence of Scaling in Random Networks

Collective dynamics of small-world networks

Latent dirichlet allocation