help interpreting output #7

@RichardCorbett

Hi Kevin,
I was looking at some GIAB data this morning and found the link to your tool. I gave it a whirl with this command:

vgraph repmatch --include-regions GIAB/HG001_GRCh37_GIAB_highconf_CG-IllFB-IllGATKHC-Ion-10X-SOLID_CHROM1-X_v.3.3.2_highconf_nosomaticdel.bed --reference /home/pubseq/genomes/Homo_sapiens/GRCh37/1000genomes/bwa_ind/genome/GRCh37-lite.fa GIAB/HG001_GRCh37_GIAB_highconf_CG-IllFB-IllGATKHC-Ion-10X-SOLID_CHROM1-X_v.3.3.2_highconf_PGandRTGphasetransfer.vcf.gz gsc/GSC.vcf.gz > out.txt

The output file contained these match lines:

107     MATCH== TYPE=H
 2     MATCH=. TYPE=N

3429981 MATCH== TYPE=T
176428 MATCH=X TYPE=H

I think I can guess what the bottom two lines represent, but I was wondering if you could explain all 4 lines? If there is a better way to quantify a match I'd be happy to know that as well.

Thanks,
Richard
