DeepTrio v1.9.0/v1.10.0 PACBIO candidate_sweep crashes: candidates_writer.write AttributeError #1064

@codespacebyaditya

DeepTrio crashes during the candidate_sweep phase when run with --model_type PACBIO on PacBio HiFi trio data.
Every shard fails with:
AttributeError: '_GeneratorContextManager' object has no attribute 'write'
The bug was reproduced on two independent HPC clusters, with both CPU and GPU SIF images, and with both v1.9.0 and v1.10.0.
DeepTrio with --model_type=WGS (Illumina) works correctly using the same v1.9.0 SIF, so the bug appears to be specific to the PACBIO model path, which triggers candidate_sweep.

Setup:
Operating system: RHEL 8 (HPC compute nodes)
DeepVariant version: DeepTrio v1.9.0 and v1.10.0
Installation method: Singularity 3.10.0 (singularity pull docker://google/deepvariant:deeptrio-1.9.0 and docker://google/deepvariant:deeptrio-1.10.0; also tested GPU SIFs deeptrio-1.9.0-gpu and deeptrio-1.10.0-gpu)
Type of data: PacBio HiFi WGS trio (child + mother + father), haplotagged BAMs aligned to GRCh38 (hg38)

Command:
singularity exec \
  --bind /usr/lib/locale/ \
  --bind /path/to/data:/path/to/data \
  deepvariant_deeptrio-1.9.0.sif \
  /opt/deepvariant/bin/deeptrio/run_deeptrio \
  --model_type PACBIO \
  --ref /path/to/hg38.fa \
  --reads_child /path/to/child.GRCh38.haplotagged.bam \
  --reads_parent1 /path/to/mother.GRCh38.haplotagged.bam \
  --reads_parent2 /path/to/father.GRCh38.haplotagged.bam \
  --output_vcf_child /output/child.vcf.gz \
  --output_vcf_parent1 /output/mother.vcf.gz \
  --output_vcf_parent2 /output/father.vcf.gz \
  --output_gvcf_child /output/child.g.vcf.gz \
  --output_gvcf_parent1 /output/mother.g.vcf.gz \
  --output_gvcf_parent2 /output/father.g.vcf.gz \
  --sample_name_child "CHILD" \
  --sample_name_parent1 "MOTHER" \
  --sample_name_parent2 "FATHER" \
  --num_shards 32 \
  --intermediate_results_dir /output/tmp/intermediate_results

v1.9.0 traceback:
Traceback (most recent call last):
  File ".../deeptrio/make_examples.py", line 447, in <module>
    app.run(main)
  File ".../absl/app.py", line 312, in run
    _run_main(main, args)
  File ".../absl/app.py", line 258, in _run_main
    sys.exit(main(argv))
  File ".../deeptrio/make_examples.py", line 437, in main
    make_examples_core.make_examples_runner(options)
  File ".../deepvariant/make_examples_core.py", line 3306, in make_examples_runner
    candidates_writer.write(
AttributeError: '_GeneratorContextManager' object has no attribute 'write'

v1.10.0 traceback:
  File ".../deepvariant/make_examples_core.py", line 3596, in make_examples_runner
    candidates_writer.write(
AttributeError: '_GeneratorContextManager' object has no attribute 'write'
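
For context on the error message: in Python, _GeneratorContextManager is the object returned by calling a function decorated with @contextlib.contextmanager. Calling a method such as write on that object directly, without entering it through a with statement, produces this kind of AttributeError. The sketch below reproduces that general pattern only. It is not the DeepVariant source, and HypotheticalWriter and open_writer are made-up names for illustration; whether make_examples_core.py follows this pattern is an assumption I have not verified.

```python
import contextlib
import io


class HypotheticalWriter:
    """Made-up stand-in for a record writer; not a DeepVariant class."""

    def __init__(self, sink):
        self.sink = sink

    def write(self, record):
        self.sink.write(record + "\n")


@contextlib.contextmanager
def open_writer(sink):
    """Hypothetical factory that yields a writer and flushes the sink on exit."""
    writer = HypotheticalWriter(sink)
    try:
        yield writer
    finally:
        sink.flush()


def reproduce_error():
    """Use the context manager object as if it were the writer itself."""
    sink = io.StringIO()
    # open_writer() returns a _GeneratorContextManager, not a HypotheticalWriter,
    # because the context has not been entered with `with`.
    candidates_writer = open_writer(sink)
    try:
        candidates_writer.write("candidate")
    except AttributeError as exc:
        return str(exc)
    return None


def correct_usage():
    """Enter the context first, so the name is bound to the yielded writer."""
    sink = io.StringIO()
    with open_writer(sink) as candidates_writer:
        candidates_writer.write("candidate")
    return sink.getvalue()


if __name__ == "__main__":
    print(reproduce_error())
    print(repr(correct_usage()))
```

If the real code path has this shape, it would be consistent with the failure appearing only where candidate_sweep is enabled, since the WGS run never reaches the candidates_writer.write call.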

As a control, we confirmed that DeepTrio with --model_type=WGS (Illumina) completes successfully using the same v1.9.0 CPU SIF on the same cluster, which indicates that Singularity, the SIF, and the general DeepTrio pipeline work correctly.

Please let me know if I am doing something incorrectly on my end. I am happy to share the .err and .out files to help debug.
