Merging similar UMIs? 

I was wondering if you had considered merging UMIs that might be erroneous copies, eg as outlined in this [blog](https://cgatoxford.wordpress.com/2015/08/14/unique-molecular-identifiers-the-problem-the-solution-and-the-proof/) post.

> The use of UMIs [...] would work perfectly if it were not for base-calling errors, which erroneously create sequencing reads with the same genomic coordinates and UMIs and that are identical for the base at which the error occurred.

If I understand the current code correctly, it considers two barcodes as separte UMIs if they differ even by one base. Would it be useful to merge reads into the same UMI if they are 'nearly' identical, e.g. based on Hamming distance?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merging similar UMIs? #5

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Merging similar UMIs? #5

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions