Fetch tuples in small batches in adaptive executor where possible by marcocitus · Pull Request #5195 · citusdata/citus

marcocitus · 2021-08-20T09:15:59Z

DESCRIPTION: Fetch tuples in small batches in adaptive executor where possible

This PR makes RunDistributedExecution reentrant such that we avoid creating an excessively large tuple store for queries that do not require materialization. This is beneficial to restrict memory usage, avoid unnecessary disk I/O and thereby improve performance of queries with large result sets.

It does not cover local execution (seems somewhat important) and multi-row inserts (seems unimportant) yet.

codecov · 2021-08-20T09:19:33Z

Codecov Report

❌ Patch coverage is 85.45455% with 16 lines in your changes missing coverage. Please review.
✅ Project coverage is 88.90%. Comparing base (4d6fb1d) to head (eb1bc1d).

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #5195   +/-   ##
=======================================
  Coverage   88.90%   88.90%           
=======================================
  Files         286      286           
  Lines       63227    63304   +77     
  Branches     7937     7950   +13     
=======================================
+ Hits        56214    56283   +69     
- Misses       4736     4744    +8     
  Partials     2277     2277

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

marcocitus · 2021-08-20T09:23:42Z

src/backend/distributed/executor/adaptive_executor.c

-	ResetExplainAnalyzeData(taskList);
+	MemoryContext memoryContext = AllocSetContextCreate(executorState->es_query_cxt,
+														"AdaptiveExecutor",
+														ALLOCSET_DEFAULT_SIZES);


TODO: maybe reset this context at the very end since it uses es_query_ctx

marcocitus · 2021-08-20T09:25:47Z

src/backend/distributed/executor/adaptive_executor.c

+		execution->rowsReceivedInCurrentRun = 0;
+
+		/* TODO: GUC? be smart? */
+		int maxBatchSize = 10000;


TODO: pick batch size

DS-AdamMilazzo · 2023-05-08T16:28:23Z

I hope this feature makes it in. :-)

marcocitus · 2026-04-07T17:19:09Z

@microsoft-github-policy-service agree [company="Snowflake"]

Though this work was done during my Microsoft/Citus Data tenure, so already belongs to Microsoft.

marcocitus · 2026-04-07T17:20:01Z

@microsoft-github-policy-service agree

…d execution, tests. - Resource clean-up: AdaptiveExecutorEnd() releases sessions/connections when an error occurs between AdaptiveExecutorRun calls. Also handle early termination (cursor close, LIMIT satisfied) with proper clean-up of in-flight worker queries. - ShouldRunTasksSequentially() check in FinishDistributedExecution() replaced with explicit sessionsCleanedUp flag on DistributedExecution struct. Fixes double CleanUpSessions on sequential path. - Adaptive batch sizing via citus.executor_batch_size (default 0 => auto). Auto mode calculates batch size from work_mem and TupleDesc (attlen + typmod for varlena, 128B default for unbounded). Floor 100, ceiling 1M rows. - Remote execution uses LibPQ's chunked mode (PG17+), GUC configurable for now. - Local execution is eager; it runs to completion. - Regress test suite: 11 test cases covering batch sizes 1/10/100K/auto, empty results, LIMIT, aggregation, DML RETURNING, GUC behavior, between-batch error cleanup, cursor close mid-batch and cross-batch-size result consistency.

marcocitus added the performance label Aug 20, 2021

marcocitus commented Aug 20, 2021

View reviewed changes

Fetch tuples in small batches in adaptive executor where possible

51734b0

colm-mchugh force-pushed the marcocitus/reentrant-executor branch from f33de24 to a8d14ee Compare April 7, 2026 17:05

colm-mchugh force-pushed the marcocitus/reentrant-executor branch 3 times, most recently from c140524 to 4200722 Compare April 9, 2026 09:31

colm-mchugh marked this pull request as ready for review April 9, 2026 10:05

colm-mchugh requested a review from tejeswarm April 9, 2026 10:05

colm-mchugh force-pushed the marcocitus/reentrant-executor branch from 4200722 to eb1bc1d Compare April 9, 2026 12:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fetch tuples in small batches in adaptive executor where possible#5195

Fetch tuples in small batches in adaptive executor where possible#5195
marcocitus wants to merge 2 commits intomainfrom
marcocitus/reentrant-executor

marcocitus commented Aug 20, 2021 •

edited

Loading

Uh oh!

codecov bot commented Aug 20, 2021 •

edited

Loading

Uh oh!

marcocitus Aug 20, 2021

Uh oh!

marcocitus Aug 20, 2021

Uh oh!

DS-AdamMilazzo commented May 8, 2023

Uh oh!

marcocitus commented Apr 7, 2026

Uh oh!

marcocitus commented Apr 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

marcocitus commented Aug 20, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Aug 20, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

marcocitus Aug 20, 2021

Choose a reason for hiding this comment

Uh oh!

marcocitus Aug 20, 2021

Choose a reason for hiding this comment

Uh oh!

DS-AdamMilazzo commented May 8, 2023

Uh oh!

marcocitus commented Apr 7, 2026

Uh oh!

marcocitus commented Apr 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

marcocitus commented Aug 20, 2021 •

edited

Loading

codecov bot commented Aug 20, 2021 •

edited

Loading