This bundle contains an optimized phASER build focused on speeding up the default SNP-only workflow (--include_indels 0).
- Create the environment:
conda env create -f environment.yml
conda activate phaser-fast-beta- Download the official NA06986 test case (large: BAM is multiple GB):
./scripts/download_test_case.sh- Run the test case (single thread, default MAPQ=255):
./scripts/run_test_case.shOutputs go to out/.
If you already have external tools installed (samtools, bcftools, bgzip, tabix, bedtools) and working on your PATH:
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txtFrom the bundle root:
python3 phaser/phaser.py \
--threads 1 --io_threads 0 \
--vcf /path/to/sample.vcf.gz \
--bam /path/to/sample.bam \
--paired_end 1 \
--mapq 255 --baseq 10 \
--sample SAMPLE_ID_IN_VCF \
--o out/sampleNotes:
--mapq 255is the default used in the upstream NA06986 example.--include_indels 1is supported but not part of the optimized path in this beta.- This beta build is safe to run from paths containing spaces/parentheses (e.g.
Dropbox (Personal)).
This bundle includes:
phaser_gene_ae/(gene-level haplotypic expression)phaser_pop/(phASER-pop utilities)
You can run:
./scripts/smoke_test.shPlease include:
- Command used
*.logoutput (phASER writes<prefix>.log)- Platform (
uname -a) and python version (python3 --version) - Whether you used conda or pip
Known non-issue:
*.haplotypic_counts.txtmay show a different ordering of read-index lists (aReads/bReads) while counts remain the same.