Gurmeet Singh Manku

Researcher at Google

Publications - 34
Citations - 6403

Gurmeet Singh Manku is an academic researcher from Google. The author has contributed to research in topics: Data stream mining & Cache. The author has an h-index of 23 and has co-authored 34 publications receiving 6209 citations. Previous affiliations of Gurmeet Singh Manku include Stanford University & University of California, Berkeley.

Papers
Journal Article

Approximate frequency counts over data streams

TL;DR: This talk will trace the history of the Approximate Frequency Counts paper, how it was conceptualized and how it influenced data stream research.
Book Chapter

Chapter 31 – Approximate Frequency Counts over Data Streams

TL;DR: This chapter presents simple algorithms, with provably small memory footprints, for computing frequency counts that exceed a user-specified threshold over data streams.
Proceedings Article

Detecting near-duplicates for web crawling

TL;DR: This work demonstrates that Charikar's fingerprinting technique is appropriate for near-duplicate detection and presents an algorithmic technique for identifying existing f-bit fingerprints that differ from a given fingerprint in at most k bit-positions, for small k.
Proceedings Article

Symphony: distributed hashing in a small world

TL;DR: Symphony is a novel protocol for maintaining distributed hash tables in a wide area network. It is scalable, flexible, and stable in the presence of frequent updates, and it offers small average latency with only a handful of long-distance links per node.

Query Processing, Resource Management, and Approximation in a Data Stream Management System

TL;DR: This paper describes the ongoing work developing the Stanford Stream Data Manager (STREAM), a system for executing continuous queries over multiple continuous data streams that supports a declarative query language.