Data Quality Tolerance? #95

@FStephenQuaratiello

Hi,

I've been noticing a slight (~1-2%) discrepancy between the number of records this tool imports into BigQuery and the number of requests the Cloudflare GraphQL API reports for the same time period. For example, the GraphQL API reports 46,532 requests in a given hour, but BigQuery holds only 45,736 records with an EdgeStartTimestamp in that hour, a shortfall of 796 records (about 1.7%). A small difference, to be sure, but a noticeable one.

Is this within expectations? And is there a better way to measure the health/quality of data imported by this tool?
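
To make the comparison concrete, the gap for the hour above can be computed as in the sketch below. This is only an illustration: `relative_discrepancy` is a hypothetical helper, and the 2% tolerance is an arbitrary placeholder, not a threshold documented for this tool or for the Cloudflare API.

```python
def relative_discrepancy(expected: int, actual: int) -> float:
    """Return the fraction of expected records that are missing from actual."""
    if expected <= 0:
        raise ValueError("expected must be a positive count")
    return (expected - actual) / expected


# Counts quoted above for a single hour.
graphql_requests = 46_532   # requests reported by the Cloudflare GraphQL API
bigquery_records = 45_736   # rows in BigQuery with EdgeStartTimestamp in that hour

missing = graphql_requests - bigquery_records
ratio = relative_discrepancy(graphql_requests, bigquery_records)

# Assumption: 2% is a placeholder tolerance chosen only for this example.
TOLERANCE = 0.02
within_tolerance = ratio <= TOLERANCE

print(f"missing={missing} ratio={ratio:.2%} within_tolerance={within_tolerance}")
```

Running the same check over several hours would show whether the shortfall is steady or concentrated in particular windows.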
