International Business Machines Corporation
Method and apparatus for data deduplication
Abstract:
A current file is obtained from the data to be deduplicated. It is determined whether a similar historical file exists based on a sampled data block from at least one predetermined location in the current file. In response to non-existence of the similar historical file, the current file and corresponding metadata are stored on a file basis. In response to existence of the similar historical file, a deduplication operation is applied on the current file on a block basis.
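The two-path flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The class `DedupStore`, the fixed 4096-byte block size, the three sample offsets, the use of SHA-256 digests, and the step that re-indexes a historical file into blocks on its first similarity match are all assumptions made for illustration; the abstract specifies none of them.

```python
import hashlib

BLOCK_SIZE = 4096                                   # assumed fixed block size
SAMPLE_OFFSETS = (0, BLOCK_SIZE, 2 * BLOCK_SIZE)    # hypothetical "predetermined locations"


def _digest(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def _blocks(data: bytes) -> list:
    return [data[i:i + BLOCK_SIZE] for i in range(0, len(data), BLOCK_SIZE)]


class DedupStore:
    """Hypothetical in-memory store; every name here is illustrative."""

    def __init__(self):
        self.sample_index = {}   # sample digest -> name of a historical file
        self.block_store = {}    # block digest -> block bytes
        self.files = {}          # file name -> metadata

    def _samples(self, data: bytes) -> list:
        # Digest one block at each predetermined offset that lies inside the file.
        return [_digest(data[off:off + BLOCK_SIZE])
                for off in SAMPLE_OFFSETS if off < len(data)]

    def _find_similar(self, data: bytes):
        # A historical file counts as similar if any sampled block matches.
        for sample in self._samples(data):
            if sample in self.sample_index:
                return self.sample_index[sample]
        return None

    def _store_blocks(self, data: bytes) -> list:
        # Store only blocks not seen before; return the list of digests (recipe).
        recipe = []
        for block in _blocks(data):
            d = _digest(block)
            self.block_store.setdefault(d, block)
            recipe.append(d)
        return recipe

    def _to_blocks(self, name: str) -> None:
        # Assumption: a whole-file entry is re-indexed as blocks the first
        # time a similar file arrives, so later blocks can reference it.
        meta = self.files[name]
        if meta["mode"] == "file":
            recipe = self._store_blocks(meta["data"])
            self.files[name] = {"mode": "block", "size": meta["size"],
                                "recipe": recipe}

    def put(self, name: str, data: bytes) -> str:
        similar = self._find_similar(data)
        if similar is None:
            # No similar historical file: store file and metadata on a file basis.
            self.files[name] = {"mode": "file", "size": len(data), "data": data}
        else:
            # Similar historical file exists: deduplicate on a block basis.
            self._to_blocks(similar)
            recipe = self._store_blocks(data)
            self.files[name] = {"mode": "block", "size": len(data),
                                "similar_to": similar, "recipe": recipe}
        for sample in self._samples(data):
            self.sample_index.setdefault(sample, name)
        return self.files[name]["mode"]

    def get(self, name: str) -> bytes:
        meta = self.files[name]
        if meta["mode"] == "file":
            return meta["data"]
        return b"".join(self.block_store[d] for d in meta["recipe"])
```

In this sketch, a file whose sampled blocks match nothing is stored whole at the cost of a few digests, and the per-block work is done only once a sampled block suggests that a similar historical file exists.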
Status:
Grant
Type:
Utility
Filing date:
28 Aug 2018
Issue date:
8 Feb 2022