Cloudera, Inc.
BACKGROUND FORMAT OPTIMIZATION FOR ENHANCED QUERIES IN A DISTRIBUTED COMPUTING CLUSTER
Last updated:
Abstract:
A format conversion engine for Apache Hadoop that converts data from its original format to a database-like format at certain time points for use by a low latency (LL) query engine. The format conversion engine comprises a daemon that is installed on each data node in a Hadoop cluster. The daemon comprises a scheduler and a converter. The scheduler determines when to perform the format conversion and notifies the converter when the time comes. The converter converts data on the data node from its original format to a database-like format for use by the low latency (LL) query engine.
Status:
Application
Type:
Utility
Filling date:
6 Jul 2020
Issue date:
22 Oct 2020