Illumina, Inc.
Flexible Seed Extension for Hash Table Genomic Mapping

Last updated:

Abstract:

Methods, systems, and apparatuses, including computer programs for generating and using a hash table configured to improve mapping of reads are disclosed that include obtaining a first seed of K nucleotides from a reference sequence, generating a seed extension tree having a nodes, wherein each node of the nodes corresponds to (i) an extended seed that is an extension of the first seed and has a nucleotide length of K* and (ii) one or more locations, in a seed extension table, that include data describing reference sequence locations that match the extended seed, and for each node: storing interval information at a location of the hash table that corresponds to an index key for the extended seed, wherein the interval information references one or more locations in the seed extension table that include reference sequence locations that match the extended seed associated with the node.

Status:
Application
Type:

Utility

Filling date:

22 May 2020

Issue date:

26 Nov 2020