VMware, Inc.
Organize chunk store to preserve locality of hash values and reference counts for deduplication

Last updated:

Abstract:

The present disclosure provides techniques for deduplicating files. The techniques include creating a data structure that organizes metadata about chunks of files, the organization of the metadata preserving order and locality of the chunks within files. The organization of the metadata within storage blocks of storage devices matches the order of chunks within files. Upon a read or write operation to a metadata, the preservation of locality of metadata results in the likely fetching, from storage into a memory cache, metadata of subsequent and contiguous chunks. The preserved locality results in faster subsequent read and write operations of metadata, because the read and write operations are likely to be executed from memory rather than from storage.

Status:
Grant
Type:

Utility

Filling date:

27 Aug 2019

Issue date:

28 Jun 2022