Cloudera, Inc.
MUTATIONS IN A COLUMN STORE

Last updated:

Abstract:

Columnar storage provides many performance and space saving benefits for analytic workloads, but previous mechanisms for handling single row update transactions in column stores suffer from poor performance. A columnar data layout facilitates both low-latency random access capabilities together with high-throughput analytical access capabilities, simplifying Hadoop architectures for use cases involving real-time data. In disclosed embodiments, mutations within a single row are executed atomically across columns and do not necessarily include the entirety of a row. This allows for faster updates without the overhead of reading or rewriting larger columns.

Status:
Application
Type:

Utility

Filling date:

7 May 2021

Issue date:

26 Aug 2021