Hi @felicityallen,
There seems to be an issue with indelgen in that the potential indel profiles that are generated are not consistent with one between a sequence and it's reverse complement.
Consider the following example:
>Oligo2_GGTTGAAAGTCTATAGTGGT 49 REVERSE
AAATGCGTAACAAAAACAAGCTCGGTATCTGGCCTATTCCACGCCACCCACCACTATAGACTTTCAACATTGTATTGTC
When we run indelgen on this sequence, we get the following possible inserts:
I1_L-1C2R2
I1_L0R1
I2_L-1C1R1
I2_L-1C2R2
I2_L-3C3R1
I2_L0C1R2
I2_L0C3R4
I2_L0R1
If we take the reverse complement of Oligo_2, and take the PAM location as 79 (sequence length) - 49 = 30, as below
>Oligo2_GGTTGAAAGTCTATAGTGGT_FORWARD 30 FORWARD
GACAATACAATGTTGAAAGTCTATAGTGGTGGGTGGCGTGGAATAGGCCAGATACCGAGCTTGTTTTTGTTACGCATTT
we get a different set of indels:
I1_L-1C2R2
I1_L-1R0 2
I1_L-2C1R0
I2_L-1C1R1
I2_L-1C2R2
I2_L-1R0 9
I2_L-2C1R0
I2_L-3C3R1
I have tried this for multiple examples. I would expect the set of returned indels to be the same. Maybe you know what the issue is here? I am not familiar with C++ myself, so it will take me a while to familiarise myself enough to be able to debug this problem
Hi @felicityallen,
There seems to be an issue with
indelgenin that the potential indel profiles that are generated are not consistent with one between a sequence and it's reverse complement.Consider the following example:
When we run
indelgenon this sequence, we get the following possible inserts:If we take the reverse complement of Oligo_2, and take the PAM location as 79 (sequence length) - 49 = 30, as below
we get a different set of indels:
I have tried this for multiple examples. I would expect the set of returned indels to be the same. Maybe you know what the issue is here? I am not familiar with C++ myself, so it will take me a while to familiarise myself enough to be able to debug this problem