Alibaba Group Holding Limited
Method and apparatus for data processing
Last updated:
Abstract:
This application generally relates to data processing methods and apparatus. One data processing method disclosed herein comprises: creating a Writable Snapshot based on data modification; creating a plurality of Read-Only ("RO") Snapshots by cloning the Writable Snapshot at distinct predetermined creation-times; receiving a data inquiry request; and conducting the data inquiry through indexing, in a RO Snapshot with a latest creation-time. This approach achieves separation of data modification and data inquiry, enabling efficient real-time updating. Further, by fast indexing and invert indexing, inquiry efficiency is further improved. Additionally, data is stored in data columns, wherein each column may be divided into multiple data blocks according to a fixed block size, and each data block has a same length. When modifying data, effect of the modification may be limited to the data blocks being modified, without affecting the other data blocks, which reduces resource consumption incurred by data modification.
Utility
4 Nov 2016
29 Sep 2020