Cloudera, Inc.
MUTATIONS IN A COLUMN STORE
Last updated:
Abstract:
Columnar storage provides many performance and space saving benefits for analytic workloads, but previous mechanisms for handling single row update transactions in column stores suffer from poor performance. A columnar data layout facilitates both low-latency random access capabilities together with high-throughput analytical access capabilities, simplifying Hadoop architectures for use cases involving real-time data. In disclosed embodiments, mutations within a single row are executed atomically across columns and do not necessarily include the entirety of a row. This allows for faster updates without the overhead of reading or rewriting larger columns.
Status:
Application
Type:
Utility
Filling date:
7 May 2021
Issue date:
2 Sep 2021