
Known issues with metadata

Dmytro Titov edited this page May 31, 2018 · 3 revisions

IHEC

Multiple "big_data_url" records

Most "browser" entries contain a single "big_data_url" per track type, e.g.:

        "browser": {
          "signal": {
            "big_data_url": "http://epigenomesportal.ca/tracks/Roadmap/hg19/28402.Roadmap.SRS309499_DNA_Lib_1282_543.H3K9me3.signal.bigWig",
            "md5sum": "fdd9e47803d78ba0bb770735a7413d52",
            "primary": "true"
          }
        }

Some entries, however, contain a list of URLs, with a matching list of "md5sum" values:

        "browser": {
          "signal_unstranded": {
            "big_data_url": [
              "http://epigenomesportal.ca/tracks/Blueprint/hg38/59577.Blueprint.ERS150368.H3K27me3.signal_unstranded.bigWig",
              "http://epigenomesportal.ca/tracks/Blueprint/hg38/61306.Blueprint.ERS150368.H3K27me3.signal_unstranded.bigWig"
            ],
            "md5sum": [
              "dc3a32a8709e86f537a99002f88138c5",
              "0e1d7439e3ff059c7757817f6c887c3f"
            ],
            "primary": "true"
          },
          "peak_calls": {
            "big_data_url": [
              "http://epigenomesportal.ca/tracks/Blueprint/hg38/59578.Blueprint.ERS150368.H3K27me3.peak_calls.bigBed",
              "http://epigenomesportal.ca/tracks/Blueprint/hg38/61307.Blueprint.ERS150368.H3K27me3.peak_calls.bigBed"
            ],
            "md5sum": [
              "23c264e73f87a75614959aaedaf3abfc",
              "533a28b51d5777eebc0733829790ee40"
            ],
            "primary": "true"
          }
        }

This complicates the conversion between JSON and GSuite, because "big_data_url" and "md5sum" can each be either a single string or a list of strings.
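One possible way to deal with both shapes is to normalize every value to a list before converting. The following is only a sketch, not the converter's actual code: the helper names `as_list` and `iter_track_files` are hypothetical, and pairing each URL with the checksum at the same position is an assumption based on the samples above.

```python
def as_list(value):
    """Wrap a single value in a list; leave lists as they are."""
    if value is None:
        return []
    return list(value) if isinstance(value, list) else [value]


def iter_track_files(browser):
    """Yield (track_type, url, md5sum) for every file in a "browser" section.

    Works for both the single-string and the list form of
    "big_data_url" / "md5sum".
    """
    for track_type, track in browser.items():
        urls = as_list(track.get("big_data_url"))
        md5s = as_list(track.get("md5sum"))
        for i, url in enumerate(urls):
            # Assumption: the i-th md5sum belongs to the i-th URL.
            # If a checksum is missing, None is yielded instead.
            md5 = md5s[i] if i < len(md5s) else None
            yield track_type, url, md5
```

With this, an entry with one URL produces one tuple and an entry with two URLs produces two, so the rest of the conversion does not need to care which form the source used.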

Attribute duplication

Some attributes are duplicated under names that differ only in letter case, e.g. "donor_age" and "DONOR_AGE" both appear under the "sample_data" section.
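A simple way to detect and collapse such duplicates is to compare attribute names case-insensitively. The sketch below is illustrative only: the function name `merge_case_duplicates` is hypothetical, and choosing the lowercase name as the canonical one and keeping the first value seen are assumptions, not documented IHEC rules.

```python
def merge_case_duplicates(sample_data):
    """Collapse keys that differ only in letter case.

    Returns (merged, conflicts):
      merged    - dict keyed by lowercase attribute name (first value wins)
      conflicts - dict of lowercase name -> list of differing values,
                  for duplicates whose values do not agree
    """
    merged = {}
    conflicts = {}
    for key, value in sample_data.items():
        norm = key.lower()  # assumption: lowercase is the canonical form
        if norm not in merged:
            merged[norm] = value
        elif merged[norm] != value:
            # Same attribute, different values: record it for manual review.
            conflicts.setdefault(norm, [merged[norm]]).append(value)
    return merged, conflicts
```

Duplicates with identical values are merged silently, while duplicates with different values are reported in `conflicts` so they can be inspected rather than dropped.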
