Illumina, Inc.
SYSTEMS AND METHODS FOR GROUPING AND COLLAPSING SEQUENCING READS

Last updated:

Abstract:

Disclosed herein are systems and methods for collapsing sequencing reads and identifying similar sequencing reads. In one example, a method includes generating a plurality of first identifier subsequences from a first identifier sequence of each nucleotide sequencing read and generating a first signature for the nucleotide sequencing read by applying hashing to the plurality of first identifier subsequences. The method may include assigning the nucleotide sequencing read to a first particular bin of a first data structure based on the first signature and determining a nucleotide sequence for each first particular bin of the first data structure with one or more nucleotide sequencing reads assigned.

Status:
Application
Type:

Utility

Filling date:

29 Oct 2019

Issue date:

30 Apr 2020