@lga-zurich
Contributor

This creates Parquet files with fewer row groups, which results in better performance: Parquet decoding is more efficient when it does not have to process as many row groups individually.

@zoltan

zoltan commented Jan 16, 2026

We see a really big difference in runtime with this change.

@paul-aiyedun
Contributor

@lga-zurich An update (35c5611) was made to add row group size (in bytes) as a parameter when generating TPC-H datasets. Can we use that instead?
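
For illustration only, here is a rough sketch of how row group size relates to the number of row groups a reader has to process. It is not code from this PR or from 35c5611. The helper names, the 128 MiB target, and the 100-byte average row size are all assumptions chosen for the example; the only fixed figure is the TPC-H `lineitem` row count at scale factor 1 (6,001,215 rows).

```python
# Hypothetical helpers for illustration; not part of this PR or of 35c5611.

LINEITEM_ROWS_SF1 = 6_001_215  # TPC-H lineitem row count at scale factor 1


def row_group_count(total_rows: int, rows_per_group: int) -> int:
    """Return how many row groups are needed to hold total_rows rows."""
    if total_rows < 0:
        raise ValueError("total_rows must be non-negative")
    if rows_per_group <= 0:
        raise ValueError("rows_per_group must be positive")
    # Integer ceiling division: a partially filled last group still counts.
    return (total_rows + rows_per_group - 1) // rows_per_group


def rows_per_group_for_bytes(target_bytes: int, avg_row_bytes: int) -> int:
    """Estimate rows per group for a byte-based row group size.

    avg_row_bytes is an assumed average; real writers measure actual
    buffered data, so the true value varies by column types and encoding.
    """
    if target_bytes <= 0 or avg_row_bytes <= 0:
        raise ValueError("target_bytes and avg_row_bytes must be positive")
    # Always allow at least one row per group.
    return max(1, target_bytes // avg_row_bytes)


if __name__ == "__main__":
    # Row-count-based sizing: larger groups mean fewer groups to decode.
    for rows_per_group in (100_000, 1_000_000):
        groups = row_group_count(LINEITEM_ROWS_SF1, rows_per_group)
        print(f"{rows_per_group} rows/group: {groups} row groups")

    # Byte-based sizing, using assumed figures (128 MiB, 100 bytes/row).
    rows = rows_per_group_for_bytes(128 * 1024 * 1024, 100)
    groups = row_group_count(LINEITEM_ROWS_SF1, rows)
    print(f"~{rows} rows/group: {groups} row groups")
```

Under these assumptions, either a row-count setting or a byte-based setting can bring the row group count down, so which parameter to use is mostly a question of which knob the generator already exposes.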
