scispace - formally typeset
L

Luis Gravano

Researcher at Columbia University

Publications -  142
Citations -  14374

Luis Gravano is an academic researcher from Columbia University. The author has contributed to research in topics: Web query classification & Web search query. The author has an hindex of 55, co-authored 142 publications receiving 13852 citations. Previous affiliations of Luis Gravano include New York University & Google.

Papers
More filters
Proceedings ArticleDOI

Snowball: extracting relations from large plain-text collections

TL;DR: This paper develops a scalable evaluation methodology and metrics for the task, and presents a thorough experimental evaluation of Snowball and comparable techniques over a collection of more than 300,000 newspaper documents.
Proceedings ArticleDOI

Beyond Trending Topics: Real-World Event Identification on Twitter

TL;DR: This paper explores approaches for analyzing the stream of Twitter messages to distinguish between messages about real-world events and non-event messages, and relies on a rich family of aggregatestatistics of topically similar message clusters.
Book ChapterDOI

Efficient IR-style keyword search over relational databases

TL;DR: This paper adapts IR-style document-relevance ranking strategies to the problem of processing free-form keyword queries over RDBMSs, and develops query-processing strategies that build on a crucial characteristic of IR- style keyword search: only the few most relevant matches are generally of interest.
Proceedings Article

Approximate String Joins in a Database (Almost) for Free

TL;DR: In this article, the authors propose a technique for building approximate string join capabilities on top of commercial databases by exploiting facilities already available in them. But this technique relies on matching short substrings of length, called -grams, and taking into account both positions of individual matches and the total number of such matches.

Approximate String Joins in a Database (Almost) for Free -- Erratum

TL;DR: This paper develops a technique for building approximate string join capabilities on top of commercial databases by exploiting facilities already available in them, and demonstrates experimentally the benefits of the technique over the direct use of UDFs.