Cardinality estimation of joins

Patent

Cardinality estimation of joins

TLDR

In this article, a method of estimating cardinality of a join of tables using multi-column density values and additionally using coarser density values of a subset of the multiscale density attributes is presented.

Abstract:

A method of estimating cardinality of a join of tables using multi-column density values and additionally using coarser density values of a subset of the multi-column density attributes In one embodiment, the subset of attributes for the coarser densities is a prefix of the set of multi-column density attributes A number of tuples from each table that participate in the join may be estimated using densities of the subsets The cardinality of the join can be estimated using the multi-column density for each table and the estimated number of tuples that participate in the join from each table

Citations

PDF

Open Access

More filters

Patent

Application portability and extensibility through database schema and query abstraction

Richard D. Dettinger, +4 more

TL;DR: In this paper, a data repository abstraction layer provides a logical view of the underlying data repository that is independent of the particular manner of data representation, and a query abstraction layer is also provided.

...read moreread less

Patent

Annotation structure type determination

Brian John Cragun, +2 more

TL;DR: In this paper, the authors present methods, systems and articles of manufacture for organizing and selecting structures used to generate forms for capturing information as annotations made for a variety of different type data objects are provided.

...read moreread less

Patent

Method and apparatus for XML query evaluation using early-outs and multiple passes

Fatma Ozcan, +1 more

TL;DR: In this article, a method and apparatus for XML query evaluation using early-outs and multiple passes to evaluate an XML query is described, which allows the evaluation to exit early when the veracity of the query is determined, not necessarily when every component has been evaluated.

...read moreread less

Patent

Well organized query result sets

Richard D. Dettinger, +2 more

TL;DR: In this paper, the authors present an abstract database that separates a user from the physical representation of data stored in a data repository by providing a plurality of logical fields based on the substantive information stored in the underlying data repository.

...read moreread less

Patent

Automated creation of join graphs for unrelated data sets among relational databases

Jonathan Young, +5 more

TL;DR: In this paper, a computer system processes a plurality of unrelated database tables to create a join graph where each node in the join graph represents a table, and the nodes are connected by weighted, directed edges, where each directed edge represents a join from a first column in a first table to a second column in another table and where the weight of the directed edge represented a predicted level of success in performing the join.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Access path selection in a relational database management system

P. Griffiths Selinger, +4 more

TL;DR: System R as mentioned in this paper is an experimental database management system developed to carry out research on the relational model of data, which chooses access paths for both simple (single relation) and complex queries (such as joins), given a user specification of desired data as a boolean expression of predicates.

...read moreread less

Proceedings ArticleDOI

STHoles: a multidimensional workload-aware histogram

Nicolas Bruno, +2 more

TL;DR: STHoles is introduced, a “workload-aware” histogram that allows bucket nesting to capture data regions with reasonably uniform tuple density and outperform the best multidimensional histogram techniques that require access to and processing of the full data sets during histogram construction.

...read moreread less

Patent

A method of generating attribute cardinality maps

Basantkumar John Oommen, +1 more

TL;DR: In this paper, the authors proposed a method for creating a histogram for use in minimizing response time and resource consumption when optimizing a query in a database, and other like structures, the histogram being created by placing ordered elements into specific range until the next element to be considered for inclusion in the range is a predetermined distance from the (generalized) mean value associated with the elements within the range, whereupon that next element is placed in the following range.

...read moreread less

Proceedings ArticleDOI

Exploiting statistics on query expressions for optimization

Nicolas Bruno, +1 more

TL;DR: This paper shows how to extend a generic optimizer so that it also exploits statistics built on expressions corresponding to intermediate nodes of query plans, and shows that in some cases, the quality of the resulting plans is significantly better than when only base-table statistics are available.

...read moreread less

Cardinality estimation of joins

Citations

Application portability and extensibility through database schema and query abstraction

Annotation structure type determination

Method and apparatus for XML query evaluation using early-outs and multiple passes

Well organized query result sets

Automated creation of join graphs for unrelated data sets among relational databases

References

Access path selection in a relational database management system

STHoles: a multidimensional workload-aware histogram

A method of generating attribute cardinality maps

Exploiting statistics on query expressions for optimization

Related Papers (5)

Method and apparatus for exploiting statistics on query expressions for optimization

System and methodology for join enumeration in a memory-constrained environment

System, method, and computer program product for progressive query processing

Relational database access system using semantically dynamic objects

Extending the semantics of the outer join operator for un-nesting queries to a data base