Description
DeepTrio crashes during the candidate_sweep phase when run with --model_type PACBIO on PacBio HiFi trio data.
Every shard fails with:
AttributeError: '_GeneratorContextManager' object has no attribute 'write'
The bug was reproduced on two independent HPC clusters, with both CPU and GPU SIF images, and with both v1.9.0 and v1.10.0.
DeepTrio with --model_type=WGS (Illumina) works correctly using the same v1.9.0 SIF, so the bug appears to be specific to the PACBIO model path, which triggers candidate_sweep.
Setup:
Operating system: RHEL 8 (HPC compute nodes)
DeepVariant version: DeepTrio v1.9.0 and v1.10.0
Installation method: Singularity 3.10.0 (singularity pull docker://google/deepvariant:deeptrio-1.9.0 and docker://google/deepvariant:deeptrio-1.10.0; also tested GPU SIFs deeptrio-1.9.0-gpu and deeptrio-1.10.0-gpu)
Type of data: PacBio HiFi WGS trio (child + mother + father), haplotagged BAMs aligned to GRCh38 (hg38)
Command:
singularity exec \
  --bind /usr/lib/locale/ \
  --bind /path/to/data:/path/to/data \
  deepvariant_deeptrio-1.9.0.sif \
  /opt/deepvariant/bin/deeptrio/run_deeptrio \
  --model_type PACBIO \
  --ref /path/to/hg38.fa \
  --reads_child /path/to/child.GRCh38.haplotagged.bam \
  --reads_parent1 /path/to/mother.GRCh38.haplotagged.bam \
  --reads_parent2 /path/to/father.GRCh38.haplotagged.bam \
  --output_vcf_child /output/child.vcf.gz \
  --output_vcf_parent1 /output/mother.vcf.gz \
  --output_vcf_parent2 /output/father.vcf.gz \
  --output_gvcf_child /output/child.g.vcf.gz \
  --output_gvcf_parent1 /output/mother.g.vcf.gz \
  --output_gvcf_parent2 /output/father.g.vcf.gz \
  --sample_name_child "CHILD" \
  --sample_name_parent1 "MOTHER" \
  --sample_name_parent2 "FATHER" \
  --num_shards 32 \
  --intermediate_results_dir /output/tmp/intermediate_results
v1.9.0 traceback:
Traceback (most recent call last):
  File ".../deeptrio/make_examples.py", line 447, in <module>
    app.run(main)
  File ".../absl/app.py", line 312, in run
    _run_main(main, args)
  File ".../absl/app.py", line 258, in _run_main
    sys.exit(main(argv))
  File ".../deeptrio/make_examples.py", line 437, in main
    make_examples_core.make_examples_runner(options)
  File ".../deepvariant/make_examples_core.py", line 3306, in make_examples_runner
    candidates_writer.write(
AttributeError: '_GeneratorContextManager' object has no attribute 'write'
v1.10.0 traceback:
  File ".../deepvariant/make_examples_core.py", line 3596, in make_examples_runner
    candidates_writer.write(
AttributeError: '_GeneratorContextManager' object has no attribute 'write'
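For context, this is the message Python reports when a function decorated with contextlib.contextmanager is called and the returned object is used directly, rather than being entered with a `with` statement. The sketch below is a minimal standalone reproduction of that class of error only. open_writer is a hypothetical helper, not DeepVariant code, and I have not confirmed that this is what happens in make_examples_core.py.

```python
import contextlib
import io


@contextlib.contextmanager
def open_writer(buffer):
    """Hypothetical helper (not DeepVariant code): yields a writer, flushes on exit."""
    try:
        yield buffer
    finally:
        buffer.flush()


def write_without_with(buffer):
    # Calling the decorated function returns a context manager object,
    # not the writer it would yield, so .write does not exist on it.
    writer = open_writer(buffer)
    writer.write("candidate\n")  # raises AttributeError


def write_with_with(buffer):
    # Entering the context manager binds the yielded writer to `writer`.
    with open_writer(buffer) as writer:
        writer.write("candidate\n")


def demo():
    """Returns (error message from the misuse, text written by the correct use)."""
    try:
        write_without_with(io.StringIO())
    except AttributeError as err:
        message = str(err)
    else:
        message = ""
    buf = io.StringIO()
    write_with_with(buf)
    return message, buf.getvalue()


if __name__ == "__main__":
    print(demo())
```

If the writer in the PACBIO path is created in a similar way, that could explain why only the candidate_sweep code path is affected, but this is a guess from the error message alone.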
As a control, we confirmed that DeepTrio with --model_type=WGS (Illumina) completes successfully using the same v1.9.0 CPU SIF on the same cluster, which indicates that Singularity, the SIF, and the general DeepTrio pipeline work correctly.
Please let me know if I am doing something incorrectly. I'm happy to share the .err and .out files to help debug.