A
Andrei Z. Broder
Researcher at Google
Publications - 241
Citations - 28441
Andrei Z. Broder is an academic researcher from Google. The author has contributed to research in topics: Web search query & Web page. The author has an hindex of 67, co-authored 241 publications receiving 27310 citations. Previous affiliations of Andrei Z. Broder include AmeriCorps VISTA & IBM.
Papers
More filters
Journal ArticleDOI
Trading Space for Time in Undirected $s-t$ Connectivity
TL;DR: This question is answered in the affirmative for sparse graphs by presentation of an algorithm that is faster than the random walk by a factor essentially proportional to the size of its workspace.
Patent
Method and system for using email receipts for targeted advertising
TL;DR: In this article, a technique for performing user classification based on email is presented, where information included in the stored emails may be extracted, and users may be classified into categories according to the extracted information.
Proceedings ArticleDOI
Scalable K-Means by ranked retrieval
Andrei Z. Broder,Lluis Garcia-Pueyo,Vanja Josifovski,Sergei Vassilvitskii,Srihari Venkatesan +4 more
TL;DR: This paper shows how to reduce the cost of the k-means algorithm by large factors by adapting ranked retrieval techniques, and proposes a variant of the WAND algorithm that uses the results of the intermediate results of nearest neighbor computations to improve performance.
Book ChapterDOI
Indexing shared content in information retrieval systems
Andrei Z. Broder,Nadav Eiron,Marcus Fontoura,Michael Herscovici,Ronny Lempel,John McPherson,Runping Qi,Eugene J. Shekita +7 more
TL;DR: This paper describes a new document representation model where related documents are organized as a tree, allowing shared content to be indexed just once, and shows how this representation model can be encoded in an inverted index.
Patent
Method and apparatus for finding mirrored hosts by analyzing urls
TL;DR: In this paper, a method and apparatus that detects mirrored host pairs using information about a large set of pages, including URLs, is described, and the identities of the detected mirrored hosts are then saved so that browsers, crawlers, proxy servers, or the like can correctly identify mirrored web sites.