Skip to content

Re-evaluate impact of error correction #273

@standage

Description

@standage

Performing error correction drastically reduces the sequence content (specifically the number of distinct k-mers) in each data set, and accordingly the amount of memory required to track k-mer counts accurately. At one point we were pretty enthusiastic about this improvement, but abandoned it at one point since it led to some false negatives.

I think this decision was based on a small number of manually inspected variants (perhaps even 1), and not on overall statistics. And in any case all of the variants involved were SNVs, where our superiority is already marginal. We should re-investigate kevlar's performance on the latest simulations using error corrected data.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions