New Branch in GIT repn
f1000_dev
on image
/home/ubuntu/scratch/ngseasy
Openstack VM
Images
Get Genomes
17.05.2016
ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA_000001405.22_GRCh38.p7/GCA_000001405.22_GRCh38.p7_genomic.fna.gz
Get test data
Index Genomes
bwa
├── hs37d5.fasta
├── hs37d5.fasta.amb
├── hs37d5.fasta.ann
├── hs37d5.fasta.bwt
├── hs37d5.fasta.pac
├── hs37d5.fasta.sa
PLAN BY MONDAY 23rd
giab_data_indexes
https://github.com/genome-in-a-bottle/giab_data_indexes
Test Data
GATK Gold Standard Run
This is the "Gold Standard". This will a week if no bugs.
The Glue
Open :-
- BASH done better than before
- logging
- read a user supplied config file (spreadsheet like)
- user specifies the pipeline
- SJN TO ADD CONFIG PARAMETER LIST
- consider converting to .yaml behind the scenes
- self checks : does input exist move on
RECON BY MONDAY NEXT WEEK
New Branch in GIT repn
f1000_devon image
/home/ubuntu/scratch/ngseasyOpenstack VM
Images
Get Genomes
Get test data
Index Genomes
bwa
PLAN BY MONDAY 23rd
giab_data_indexes
https://github.com/genome-in-a-bottle/giab_data_indexes
Test Data
GATK Gold Standard Run
This is the "Gold Standard". This will a week if no bugs.
The Glue
Open :-
RECON BY MONDAY NEXT WEEK