Hi,
Thanks for sharing this cool work!
I'm having trouble using gemma_los.py to remove mitochondrial contigs from my HiFiasm assembly. I adapted the provided bash script for my animal genome, but the gemma_los.py part doesn't seem to be working.
Could you clarify what gemma_los.py does with the PAF file? My understanding is that mitochondrion.txt should list coordinates (like a BED file) so seqkit grep -v -f can remove the mitochondrial sequences, leaving only nuclear contigs. Is this correct?
Here's the command I'm using:
#!/bin/bash . /home/abdul/mambaforge/etc/profile.d/conda.sh conda activate notebooklab mt=$1 ref=$2 threads=58 minimap2 -t $threads -x asm5 $mt $ref> mitochondrion.paf python3 gemma_los.py mitochondrion.paf > mitochondrion.txt conda deactivate conda activate seqkit_env seqkit grep -v -f mitochondrion.txt $ref > Hifiasm_remove_mt.fasta
Hi,
Thanks for sharing this cool work!
I'm having trouble using gemma_los.py to remove mitochondrial contigs from my HiFiasm assembly. I adapted the provided bash script for my animal genome, but the
gemma_los.pypart doesn't seem to be working.Could you clarify what gemma_los.py does with the PAF file? My understanding is that mitochondrion.txt should list coordinates (like a BED file) so
seqkit grep -v -fcan remove the mitochondrial sequences, leaving only nuclear contigs. Is this correct?Here's the command I'm using:
#!/bin/bash . /home/abdul/mambaforge/etc/profile.d/conda.sh conda activate notebooklab mt=$1 ref=$2 threads=58 minimap2 -t $threads -x asm5 $mt $ref> mitochondrion.paf python3 gemma_los.py mitochondrion.paf > mitochondrion.txt conda deactivate conda activate seqkit_env seqkit grep -v -f mitochondrion.txt $ref > Hifiasm_remove_mt.fasta