scispace - formally typeset
A

Andrei Z. Broder

Researcher at Google

Publications -  241
Citations -  28441

Andrei Z. Broder is an academic researcher from Google. The author has contributed to research in topics: Web search query & Web page. The author has an hindex of 67, co-authored 241 publications receiving 27310 citations. Previous affiliations of Andrei Z. Broder include AmeriCorps VISTA & IBM.

Papers
More filters
Journal ArticleDOI

Trading Space for Time in Undirected $s-t$ Connectivity

TL;DR: This question is answered in the affirmative for sparse graphs by presentation of an algorithm that is faster than the random walk by a factor essentially proportional to the size of its workspace.
Patent

Method and system for using email receipts for targeted advertising

TL;DR: In this article, a technique for performing user classification based on email is presented, where information included in the stored emails may be extracted, and users may be classified into categories according to the extracted information.
Proceedings ArticleDOI

Scalable K-Means by ranked retrieval

TL;DR: This paper shows how to reduce the cost of the k-means algorithm by large factors by adapting ranked retrieval techniques, and proposes a variant of the WAND algorithm that uses the results of the intermediate results of nearest neighbor computations to improve performance.
Book ChapterDOI

Indexing shared content in information retrieval systems

TL;DR: This paper describes a new document representation model where related documents are organized as a tree, allowing shared content to be indexed just once, and shows how this representation model can be encoded in an inverted index.
Patent

Method and apparatus for finding mirrored hosts by analyzing urls

TL;DR: In this paper, a method and apparatus that detects mirrored host pairs using information about a large set of pages, including URLs, is described, and the identities of the detected mirrored hosts are then saved so that browsers, crawlers, proxy servers, or the like can correctly identify mirrored web sites.