I'm running the complete LongStitch pipeline on a mammalian genome (~2.5Gb) using standard parameters. However, my final contig count jumped dramatically from a few hundred (pre-LongStitch) to 4000 contigs, despite maintaining excellent assembly metrics:
N50: 73Mb (chromosome-scale)
Issues:
Is this expected behavior for mammalian genomes with moderate repeats (~40-48% LINEs)?
How can I reduce contig count while preserving N50 73Mb?
Parameter tweaks for LongStitch to favor scaffolding over fragmentation:
I'm running the complete LongStitch pipeline on a mammalian genome (~2.5Gb) using standard parameters. However, my final contig count jumped dramatically from a few hundred (pre-LongStitch) to 4000 contigs, despite maintaining excellent assembly metrics:
N50: 73Mb (chromosome-scale)
Issues:
Is this expected behavior for mammalian genomes with moderate repeats (~40-48% LINEs)?
How can I reduce contig count while preserving N50 73Mb?
Parameter tweaks for LongStitch to favor scaffolding over fragmentation: