Optimizing top-k queries for middleware access: A unified cost-based approach

doi:10.1145/1206049.1206054

Journal Article


Seung-won Hwang, +1 more

01 Mar 2007

ACM Transactions on Database Systems

Vol. 32, Iss. 1, pp 5

TLDR

This article identifies and addresses the barriers to realizing a unified framework for optimizing top-k queries in middleware, and develops efficient schemes for searching the resulting space of algorithms to identify the optimal one.

Abstract:

This article studies the optimization of top-k queries in middleware. While many assorted algorithms have been proposed, none is generally applicable to a wide range of possible scenarios. Existing algorithms lack both the “generality” to support a wide range of access scenarios and the systematic “adaptivity” to account for runtime specifics. To fill this critical gap, we take a cost-based optimization approach: by searching over a space of algorithms at runtime, cost-based optimization is general across a wide range of access scenarios, yet adaptive to the specific access costs at runtime. While such optimization has been taken for granted for relational queries from early on, it has been clearly lacking for ranked queries. In this article, we thus identify and address the barriers to realizing such a unified framework. As the first barrier, we need to define a “comprehensive” space encompassing all possibly optimal algorithms to search over. As the second barrier and a conflicting goal, such a space should also be “focused” enough to enable efficient search. For SQL queries that are explicitly composed of relational operators, such a space, by definition, consists of schedules of relational operators (or “query plans”). In contrast, top-k queries do not have logical tasks, such as relational operators. We thus define the logical tasks of top-k queries as building blocks to identify a comprehensive and focused space for top-k queries. We then develop efficient search schemes over this space for identifying the optimal algorithm. Our study indicates that our framework not only unifies, but also outperforms existing algorithms specifically designed for their scenarios.
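To make the idea of adapting to access costs concrete, the following is a minimal Python sketch and not the framework developed in the article. It assumes a simplified setting in which every source scores the same set of objects, each sorted access costs `cost_sorted`, and each random access costs `cost_random`. The two candidates are the well-known Threshold Algorithm (TA) of Fagin et al. and a naive sorted-only scan. All function and parameter names are hypothetical, and the "optimizer" here simply runs both candidates and counts accesses, whereas a real optimizer would estimate costs without executing the plans.

```python
import heapq
from typing import Callable, Dict, List, Tuple

Scores = Dict[str, float]           # object id -> score under one predicate
Result = Tuple[List[Tuple[str, float]], int, int]  # (top-k, #sorted, #random)


def threshold_algorithm(sources: List[Scores], k: int,
                        agg: Callable[[List[float]], float] = min) -> Result:
    """Threshold Algorithm with access counting; agg must be monotone."""
    # Sorted access: each source yields objects in descending score order.
    lists = [sorted(s.items(), key=lambda kv: (-kv[1], kv[0])) for s in sources]
    seen: Dict[str, float] = {}
    n_sorted = n_random = 0
    top: List[Tuple[str, float]] = []
    for depth in range(len(lists[0])):
        last_seen = []
        for lst in lists:
            obj, score = lst[depth]
            n_sorted += 1
            last_seen.append(score)
            if obj not in seen:
                # Random access: probe every other source for this object.
                n_random += len(sources) - 1
                seen[obj] = agg([s[obj] for s in sources])
        # No unseen object can score above the aggregate of the last-seen scores.
        threshold = agg(last_seen)
        top = heapq.nlargest(k, seen.items(), key=lambda kv: kv[1])
        if len(top) >= k and top[-1][1] >= threshold:
            break
    return top, n_sorted, n_random


def sorted_only_scan(sources: List[Scores], k: int,
                     agg: Callable[[List[float]], float] = min) -> Result:
    """Naive baseline: read every list completely, never use random access."""
    n_sorted = sum(len(s) for s in sources)
    totals = {obj: agg([s[obj] for s in sources]) for obj in sources[0]}
    top = heapq.nlargest(k, totals.items(), key=lambda kv: kv[1])
    return top, n_sorted, 0


def pick_cheapest(sources: List[Scores], k: int,
                  cost_sorted: float, cost_random: float):
    """Toy cost-based choice: run each candidate and keep the cheapest one."""
    candidates = {"TA": threshold_algorithm, "sorted-only scan": sorted_only_scan}
    best = None
    for name, algo in candidates.items():
        top, n_sorted, n_random = algo(sources, k)
        cost = n_sorted * cost_sorted + n_random * cost_random
        if best is None or cost < best[1]:
            best = (name, cost, top)
    return best  # (algorithm name, total cost, top-k result)


if __name__ == "__main__":
    # Made-up scores for illustration only.
    p1 = {"a": 0.9, "b": 0.8, "c": 0.3, "d": 0.1}
    p2 = {"a": 0.85, "b": 0.4, "c": 0.7, "d": 0.2}
    print(pick_cheapest([p1, p2], k=1, cost_sorted=1.0, cost_random=1.0))
    print(pick_cheapest([p1, p2], k=1, cost_sorted=1.0, cost_random=10.0))
```

In this toy setting both candidates return the same top-k answer, but which one is cheaper depends on the ratio of the two access costs. That dependence is the kind of runtime specific the abstract argues an optimizer should account for.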



