### [seqc paper](https://genomebiology.biomedcentral.com/articles/10.1186/s13059-022-02816-6#Fig1) - raw data https://ftp-trace.ncbi.nlm.nih.gov/ReferenceSamples/seqc/Somatic_Mutation_WG/data/ - high confidence variant calls https://ftp-trace.ncbi.nlm.nih.gov/ReferenceSamples/seqc/Somatic_Mutation_WG/release/latest/ we could subsample to 1 chromosome for a small test dataset for development nf-core/sarek uses [seqc2 for benchmarking](https://www.biorxiv.org/content/10.1101/2023.07.19.549462v3.full) ### hap.py use to benchmark against high confidence data https://hpc.nih.gov/apps/hap.py.html