International Business Machines Corporation
Context aware delta algorithm for genomic files
Last updated:
Abstract:
A method, computer system, and a computer program product for compressing at least one delta file for a plurality of genomic data files is provided. The present invention may include receiving the plurality of genomic data files as input. The present invention may also include determining a plurality of rows by traversing the received plurality of genomic data files. The present invention may then include comparing the plurality of rows associated with the traversed plurality of genomic data files. The present invention may further include generating a plurality of resulting delta files based on the compared plurality of rows. The present invention may also include compressing the generated plurality of resulting delta files by utilizing a general purpose file compressor.
Utility
31 Aug 2017
2 Nov 2021