Description
There is currently an inconsistency in how our system handles null values depending on the execution mode. To ensure predictable behavior and data integrity, we should align these strategies.
Current Behavior
- Batch Mode: Null values are automatically coerced/set to
0.
- Streaming Mode: Null values trigger a runtime error.
Proposed Change
We need to decide on a unified strategy for handling null values across both modes. Potential directions include:
- Strict Error Handling: Both modes return an error to prevent silent data corruption.
- Default Coercion: Both modes default to
0 (or a type-appropriate default).
- Configurable Policy: Allow users to define the
on_null behavior (e.g., ignore, fill_zero, or raise_error).
Context
#753 (comment)
To-Do / Discussion Points
Description
There is currently an inconsistency in how our system handles
nullvalues depending on the execution mode. To ensure predictable behavior and data integrity, we should align these strategies.Current Behavior
0.Proposed Change
We need to decide on a unified strategy for handling
nullvalues across both modes. Potential directions include:0(or a type-appropriate default).on_nullbehavior (e.g.,ignore,fill_zero, orraise_error).Context
#753 (comment)
To-Do / Discussion Points