Skip to content

Allow user-specified functional transforms to be applied during deserialization. #398

@j-tyler

Description

@j-tyler

As an extension of issue 397, allow users to specify functional transforms that, ideally, will be applied during deserialization. This allows users to specify what is most-optimal for their use case. Why is this valuable?

  1. Records may be retrieved from data storage full of low cardinality duplicate strings. Interning these straight into the record can save memory usage.
  2. Records that end up cached should ideally use immutable collections. Allowing user to do this on deserialization can save memory allocation overhead.
  3. As specified in issue 397, utilization of specific libraries like fastutil could be a simple deserialization transform that the user configures.
    ...

I'm sure there are other reasons users would have. Currently my project takes Avro records and re-processes them after deserialization leading to wasted memory allocation overhead.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions