SALSA: the stochastic approach for link-structure analysis

doi:10.1145/382979.383041

Journal ArticleDOI

SALSA: the stochastic approach for link-structure analysis

Ronny Lempel, +1 more

- 01 Apr 2001 -

ACM Transactions on Information Systems

- Vol. 19, Iss: 2, pp 131-160

Chats0

TLDR

It is proved that SALSA is quivalent to a weighted in degree analysis of the link-sturcutre of WWW subgraphs, making it computationally more efficient than the Mutual reinforcement approach, and comparisions reveal a topological Phenomenon called the TKC effect which prevents the Mutual Reinforcement approach from identifying meaningful authorities.

Abstract:

Today, when searching for information on the WWW, one usually performs a query through a term-based search engine. These engines return, as the query's result, a list of Web pages whose contents matches the query. For broad-topic queries, such searches often result in a huge set of retrieved documents, many of which are irrelevant to the user. However, much information is contained in the link-structure of the WWW. Information such as which pages are linked to others can be used to augment search algorithms. In this context, Jon Kleinberg introduced the notion of two distinct types of Web pages: hubs and authorities. Kleinberg argued that hubs and authorities exhibit a mutually reinforcing relationship: a good hub will point to many authorities, and a good authority will be pointed at by many hubs. In light of this, he dervised an algoirthm aimed at finding authoritative pages. We present SALSA, a new stochastic approach for link-structure analysis, which examines random walks on graphs derived from the link-structure. We show that both SALSA and Kleinberg's Mutual Reinforcement approach employ the same metaalgorithm. We then prove that SALSA is quivalent to a weighted in degree analysis of the link-sturcutre of WWW subgraphs, making it computationally more efficient than the Mutual reinforcement approach. We compare that results of applying SALSA to the results derived through Kleinberg's approach. These comparisions reveal a topological Phenomenon called the TKC effectwhich, in certain cases, prevents the Mutual reinforcement approach from identifying meaningful authorities.

SALSA: the stochastic approach for link-structure analysis

Citations

Large-scale Graph Computation on Just a PC

Mining the Web: Discovering Knowledge from Hypertext Data

Data-Intensive Text Processing with MapReduce

WTF: the who to follow service at Twitter

A Survey on PageRank Computing

References

The anatomy of a large-scale hypertextual Web search engine

The Anatomy of a Large-Scale Hypertextual Web Search Engine.

Authoritative sources in a hyperlinked environment

Co-citation in the scientific literature: A new measure of the relationship between two documents

Citation analysis as a tool in journal evaluation.

Related Papers (5)

Authoritative sources in a hyperlinked environment

The PageRank Citation Ranking : Bringing Order to the Web

The anatomy of a large-scale hypertextual Web search engine

Topic-sensitive PageRank

Combating web spam with trustrank