Microsoft Corporation
Data lake workload optimization through explaining and optimizing index recommendations

Last updated:

Abstract:

Methods, systems and computer program products are described herein that enable data workload optimization through "what-if" modeling of indexes and index recommendation. In an example aspect, a system is configured to accept a workload comprising a plurality of queries directed at data having a first physical data layout, generate a set of candidate indexes based on the plurality of queries, enumerate index configurations based of the set of candidate indexes, each index configuration comprising a subset on the set of candidate indexes, generate a hierarchical graph of the index configurations, search the hierarchical graph for a recommended index configuration comprising an index configuration with the lowest estimated cost while pruning index configurations not considered from the graph of index configurations to generate a pruned graph, execute a graph query against the pruned graph generating a graph query result and perform an optimization operation based on the graph query result.

Status:
Grant
Type:

Utility

Filling date:

28 May 2020

Issue date:

29 Mar 2022