Skip to content

Question about pairtools parse strand information #167

@jiangshan529

Description

@jiangshan529

Hello, I am trying to understand the details about how pairtools works. In my pair.gz file, I choose a read pair to do the test, the read pair looks like this:

SRR710074.10273713      chr1    1054204 chr1    1054648 +       -       UU 

Then I searched the line number in .bam file and found:

SRR710074.10273713      81      chr1    1054613 60      36M     =       1054204 -445    ACTCCACCCCCCAGCGCCCACCCTTGAGTCAGGGTG
SRR710074.10273713      161     chr1    1054204 60      36M     =       1054613 445     GTCGCTCCAGTCTGAGCCTGGCCGTCGCCTCCAGCA

I use the blat tool in IGV genome browser and found actually the two reads are both on + strand, not as shown in the pair.gz file that one is on + strand and another on - strand. So how should I understand this?

I also searched 'UU' in the pair.gz file and do blat for some 'UU' pairs, and some of them can be blasted to several sites on the hg38 genome. And it is not what defined by 'UU'. So how should I interpret this? Thanks for your help!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions