Skip to content
This repository was archived by the owner on Mar 23, 2023. It is now read-only.
This repository was archived by the owner on Mar 23, 2023. It is now read-only.

mnd file #44

@taskina-alena

Description

@taskina-alena

Thanks a lot for the amazing software! But I've noticed a few inconsistencies using it.
First of all, while you're doing mnd file, you assign pos1 and pos2 to center of restrictions fragments. Although original juicer software assign pos1 and pos2 to alignment coordinates (from SAM file). Secondly, you drop duplicates on [pos1, pos1] subset, which enormously reduce the number of contacts. What are your reasons for doing these two actions?

https://github.com/nanoporetech/pore-c/blob/a865b7ea64d7a972fe878e8f9ea4aa7553def363/pore_c/analyses/contacts.py (line 410)
df = (
contact_df.get_partition(partition)
.compute()
.astype({"align1_strand": "uint8", "align2_strand": "uint8"})
.assign(
pos1=lambda x: np.rint(x.eval("0.5 * (align1_fragment_start + align1_fragment_end)")).astype(int),
pos2=lambda x: np.rint(x.eval("0.5 * (align2_fragment_start + align2_fragment_end)")).astype(int),
)
.drop_duplicates(subset=["pos1", "pos1"])

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions