fix(agg): use f64 sum for integer AVG to avoid overflow by ser-vasilich · Pull Request #221 · RayforceDB/rayforce

ser-vasilich · 2026-06-02T17:39:45Z

Summary

The scalar reduction path computed AVG on integer columns as
(double)acc.sum_i / cnt where sum_i is accumulated through
unsigned wrap (to dodge UBSan). For columns whose true sum exceeds
2^63 — e.g. an i64 user-id column with signed values spread across
±9e18 over millions of rows, true sum ~7.6e24 — the wrap leaves
garbage in sum_i and the mean is wrong by orders of magnitude.

Add a parallel f64 sum_d field to reduce_acc_t, populate it in
every integer reduction kernel (REDUCE_LOOP_I, BOOL/U8 path) and in
reduce_merge. OP_AVG (and VAR/STDDEV mean) now read sum_d
instead of sum_i. F64 path is unchanged.

Reduction kernels gain one f64 add per element; vectorises with the
existing integer SIMD store.

The scalar reduction path computed AVG on integer columns as (double)acc.sum_i / cnt where sum_i is accumulated via uint64 wrap to dodge UBSan. For columns whose true sum exceeds 2^63 (e.g. ClickBench UserID, signed values around ±9e18 × 10M rows, true sum ~7.6e24) the wrap leaves garbage and the mean is wrong by orders of magnitude. Add a parallel f64 sum_d field to reduce_acc_t, populate it in every integer reduction kernel (REDUCE_LOOP_I, BOOL/U8 path) and in reduce_merge. AVG (and VAR/STDDEV mean) now read sum_d instead of sum_i. F64 path is unchanged.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agg): use f64 sum for integer AVG to avoid overflow#221

fix(agg): use f64 sum for integer AVG to avoid overflow#221
ser-vasilich wants to merge 1 commit into
masterfrom
fix/avg-i64-overflow

ser-vasilich commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ser-vasilich commented Jun 2, 2026

Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant