Skip to content

Removing the Mitochondrial genome from contigs #1

@BenAawf

Description

@BenAawf

Hi,

Thanks for sharing this cool work!

I'm having trouble using gemma_los.py to remove mitochondrial contigs from my HiFiasm assembly. I adapted the provided bash script for my animal genome, but the gemma_los.py part doesn't seem to be working.

Could you clarify what gemma_los.py does with the PAF file? My understanding is that mitochondrion.txt should list coordinates (like a BED file) so seqkit grep -v -f can remove the mitochondrial sequences, leaving only nuclear contigs. Is this correct?

Here's the command I'm using:

#!/bin/bash . /home/abdul/mambaforge/etc/profile.d/conda.sh conda activate notebooklab mt=$1 ref=$2 threads=58 minimap2 -t $threads -x asm5 $mt $ref> mitochondrion.paf python3 gemma_los.py mitochondrion.paf > mitochondrion.txt conda deactivate conda activate seqkit_env seqkit grep -v -f mitochondrion.txt $ref > Hifiasm_remove_mt.fasta

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions