scispace - formally typeset
Patent

Method and apparatus for implementing parallel operations in a database management system

TLDR
In this paper, the authors propose a data flow operator (DFO) to dynamically partition row sources for parallel processing, based on the ability to parallelize a row source, the partitioning requirements of consecutive row sources and the entire row source tree.
Abstract
The present invention implements parallel processing in a Database Management System. The present invention provides the ability to locate transaction and recovery information at one location and eliminates the need for read locks and two-phased commits. The present invention provides the ability to dynamically partition row sources for parallel processing. Parallelism is based on the ability to parallelize a row source, the partitioning requirements of consecutive row sources and the entire row source tree, and any specification in the SQL statement. A Query Coordinator assumes control of the processing of a entire query and can execute serial row sources. Additional threads of control, Query Server, execute a parallel operators. Parallel operators are called data flow operators (DFOs). A DFO is represented as structured query language (SQL) statements and can be executed concurrently by multiple processes, or query slaves. A central scheduling mechanism, a data flow scheduler, controls a parallelized portion of an execution plan, and can become invisible for serial execution. Table queues are used to partition and transport rows between sets of processes. Node linkages provide the ability to divide the plan into independent lists that can each be executed by a set of query slaves. The present invention maintains a bit vector that is used by a subsequent producer to determine whether any rows need to be produced to its consumers. The present uses states and a count of the slaves that have reached these states to perform its scheduling tasks.

read more

Citations
More filters
Patent

System and Methodology for Parallel Query Optimization Using Semantic-Based Partitioning

TL;DR: In this paper, a system and methodology for parallel query optimization using semantic-based partitioning is described, in which each subplan includes one or more operators for performing relational operations and adding operators for partitioning data and performing a given relational operation in parallel.
Patent

System and method for performing database operations on a continuous stream of tuples

TL;DR: In this article, the SQL compiler and SQL executor are extended to process queries requiring streaming mode processing of data stored in a table, and a scan operator performs table access functions during execution of the streaming mode query.
Patent

Method and system for linking and delinking data records

David Bayliss
TL;DR: In this paper, a method for linking entity references to entities and identifying associations between entities is presented, where the one or more entity references have at least one common data field and a comparison of the difference between the difference and a defined threshold is made.
Patent

System and method for configuring a parallel-processing database system

TL;DR: In this paper, a system and method for configuring a plurality of processing nodes into a parallel processing database system is described, where each node or node is configured to convert query-based programming code to intermediary source code in a common programming language and then compile the intermediate source code into a dynamic link library (DLL) or other type of executable.
Patent

Method and apparatus for operating on data with a conceptual data manipulation language

TL;DR: In this article, a data services layer is disclosed which maintains a dictionary of conceptual information and physical information about the data, and requests are written in a conceptual query language (CQL) which substantially uses terms belonging to or derived from a natural language.
References
More filters
Patent

Apparatus and method for decomposing database queries for database management system including multiprocessor digital data processing system

TL;DR: In this paper, an improved system for database query processing by means of "query decomposition" intercepts database queries prior to processing by a database management system ("DBMS") is presented.
Patent

SQL-based database system with improved indexing methodology

TL;DR: In this paper, a client/server database system with improved methods for performing database queries, particularly DSS-type queries, is described, which includes one or more Clients (e.g., Terminals or PCs) connected via a Network to a Server.
Proceedings ArticleDOI

Encapsulation of parallelism in the Volcano query processing system

TL;DR: The reasons for not choosing the bracket model, the novel operator model, and details of Volcano's exchange operator that parallelizes all other operators are described, which makes implementation of parallel database algorithms significantly easier and more robust.
Journal ArticleDOI

Data placement in Bubba

TL;DR: It is argued that data placement, especially declustering, in a highly-parallel system must be considered early in the design, so that mechanisms can be included for supporting variable declustered, for minimizing the most significant overheads associated with large-scale declusting, and for gathering the required statistics.
Patent

Multi-tiered indexing method for partitioned data

TL;DR: In this paper, a multi-tiered indexing method is disclosed for a partitioned table in a parallel or distributed database system, where a local index is created and maintained for each partition of the table and a coarse global index is maintained.
Related Papers (5)