Skip to content

Implement CpcSketch #37

@tisonkun

Description

@tisonkun

https://datasketches.apache.org/docs/CPC/CpcSketches.html

I noticed my current sketch for estimating unique values is more like the CpcSketch than an HllSketch #3 (comment)

So I track this issue for the task and discuss the difference between Hll and Cpc. @leerho is one always better than the other? Or we can use either to get similar results.

Upon the implementation part, the last time I found so many flavors and I don't quite understand what PairTable is for. I'd appreciate it if one can share the design ideas and where to start :P

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions