International Business Machines Corporation
Pre-filtering of join execution over multi-column range summaries and other synopses
Abstract:
Techniques are provided for pre-filtering of join execution over multi-column range summaries and other synopses. An exemplary method comprises maintaining a synopsis for each of a plurality of data tables, wherein a given synopsis summarizes a set of records in a corresponding data table; and, in response to a request for a join operation for a set of the data tables: joining the synopses associated with the set of data tables to generate a joined synopsis; for joined records in the joined synopsis, obtaining corresponding records from the set of data tables as candidate records; and joining the candidate records. Two or more of the set of data tables can be distributed across a plurality of nodes, and the synopses can be replicated and/or broadcast across the plurality of nodes. Incremental updates to broadcast and/or replicated synopses are optionally provided to at least one node.
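The three steps named in the abstract (join the synopses, fetch candidate records, join the candidates) can be illustrated with a minimal single-node sketch. It is not the patented implementation. It assumes each table is stored as a list of blocks, that the synopsis of a block is the minimum and maximum of one join-key column (a single-column range summary, chosen for brevity), and that the join is an equi-join. All names (`BlockSynopsis`, `build_synopses`, `join_synopses`, `prefiltered_join`, `cust_id`) are hypothetical.

```python
from collections import defaultdict
from typing import NamedTuple


class BlockSynopsis(NamedTuple):
    """Hypothetical range summary: min/max of the join key in one block."""
    block_id: int
    lo: int
    hi: int


def build_synopses(blocks, key):
    # One synopsis entry per non-empty block of records.
    return [
        BlockSynopsis(i, min(r[key] for r in b), max(r[key] for r in b))
        for i, b in enumerate(blocks)
        if b
    ]


def join_synopses(left, right):
    # "Joined synopsis": block pairs whose key ranges overlap. Blocks whose
    # ranges never overlap cannot contribute rows to an equi-join.
    return [
        (l.block_id, r.block_id)
        for l in left
        for r in right
        if l.lo <= r.hi and r.lo <= l.hi
    ]


def prefiltered_join(left_blocks, right_blocks, key):
    pairs = join_synopses(
        build_synopses(left_blocks, key), build_synopses(right_blocks, key)
    )
    left_ids = sorted({l for l, _ in pairs})
    right_ids = sorted({r for _, r in pairs})

    # Fetch candidate records only from the surviving blocks, then hash-join.
    index = defaultdict(list)
    for i in right_ids:
        for rec in right_blocks[i]:
            index[rec[key]].append(rec)

    rows = []
    for i in left_ids:
        for rec in left_blocks[i]:
            for match in index.get(rec[key], []):
                rows.append((rec, match))
    return rows, left_ids, right_ids


# Illustrative data (made up for this sketch): two blocks per table.
orders = [
    [{"cust_id": 1, "order": "a"}, {"cust_id": 3, "order": "b"}],
    [{"cust_id": 50, "order": "c"}, {"cust_id": 60, "order": "d"}],
]
customers = [
    [{"cust_id": 2, "name": "x"}, {"cust_id": 3, "name": "y"}],
    [{"cust_id": 100, "name": "z"}, {"cust_id": 120, "name": "w"}],
]

rows, left_ids, right_ids = prefiltered_join(orders, customers, "cust_id")
print(rows, left_ids, right_ids)
```

In this sketch only the first block of each table survives the synopsis join, so the second block of each table is never read. In a distributed setting, the synopses are small enough that replicating or broadcasting them to every node is far cheaper than shipping the underlying records, which is the motivation the abstract gives for doing so.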
Utility
10 Aug 2016
21 Sep 2021