Feature: Run audits concurrently using concurrent_tasks setting by airhorns · Pull Request #5718 · TobikoData/sqlmesh

airhorns · 2026-03-04T01:54:43Z

I'm porting a really test-heavy dbt project over to sqlmesh and noticing that overall runtimes are WAY slower. That's because sqlmesh runs the audits serially for each model, and that serial audit run blocks downstream models from getting started. This PR adjusts the concurrent tasks approach to apply to both models and audits run during an apply, or to all the audits when running an audit_only run.

WIP

Summary

- Adds per-model audit concurrency in SnapshotEvaluator.audit() using concurrent_apply_to_values, controlled by the existing concurrent_tasks connection setting
- Adds cross-model audit concurrency in Scheduler for audit_only=True runs: all audit tasks are flattened into a single thread pool, since audits are read-only SELECT queries and don't need DAG ordering
- Renames ddl_concurrent_tasks to concurrent_tasks on SnapshotEvaluator to reflect its broader scope
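To illustrate the per-model part of the summary, here is a minimal sketch of running one model's audits on a bounded thread pool. It does not use sqlmesh's concurrent_apply_to_values; the standard library's ThreadPoolExecutor stands in for it, and run_audits, the Audit type, and the sample audit names are all hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Tuple

# Hypothetical stand-in for a single audit: a name plus a callable that
# returns the number of rows violating the audit (0 means it passed).
Audit = Tuple[str, Callable[[], int]]


def run_audits(audits: List[Audit], concurrent_tasks: int) -> List[Tuple[str, int]]:
    """Run all audits for one model, using up to `concurrent_tasks` threads."""
    if concurrent_tasks <= 1 or len(audits) <= 1:
        # Serial path: one audit after another.
        return [(name, query()) for name, query in audits]
    with ThreadPoolExecutor(max_workers=concurrent_tasks) as pool:
        # map() preserves input order, so counts line up with `audits`.
        counts = list(pool.map(lambda audit: audit[1](), audits))
    return [(name, count) for (name, _), count in zip(audits, counts)]


# Sample audits (assumed for illustration); real audits would issue SELECTs.
audits = [
    ("not_null_id", lambda: 0),
    ("unique_id", lambda: 3),
    ("accepted_values_status", lambda: 0),
]
results = run_audits(audits, concurrent_tasks=4)
failed = [name for name, count in results if count > 0]
```

With concurrent_tasks set to 1 the sketch falls back to the serial loop, which is a convenient way to compare the two behaviors.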

Closes #5468

🤖 Generated with Claude Code

Adds two levels of audit concurrency:

1. Per-model (SnapshotEvaluator): audits within a single snapshot now run concurrently via concurrent_apply_to_values, controlled by concurrent_tasks. This benefits both plan/apply and audit-only runs.
2. Cross-model (Scheduler): when audit_only=True, all audit tasks across all snapshots are flattened into a single thread pool instead of following DAG ordering. Since audits are read-only SELECT queries with no side effects, DAG dependencies are irrelevant and all concurrent_tasks slots stay filled.

The SnapshotEvaluator parameter ddl_concurrent_tasks is renamed to concurrent_tasks to reflect its broader scope.

Closes TobikoData#5468

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
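The cross-model idea in this commit message can be sketched roughly as follows: every (model, audit) pair becomes one task in a single pool, with no dependency ordering between models. This is an assumption-laden illustration, not the Scheduler code from the PR; flatten_audit_tasks, run_audit_only, and the sample models are hypothetical names.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from typing import Callable, Dict, List, Tuple

# Hypothetical shape: model name -> list of (audit name, query callable).
AuditsByModel = Dict[str, List[Tuple[str, Callable[[], int]]]]


def flatten_audit_tasks(audits_by_model: AuditsByModel) -> List[Tuple[str, str, Callable[[], int]]]:
    """Turn the per-model grouping into one flat list of tasks."""
    return [
        (model, name, query)
        for model, audits in audits_by_model.items()
        for name, query in audits
    ]


def run_audit_only(audits_by_model: AuditsByModel, concurrent_tasks: int) -> Dict[str, List[str]]:
    """Run every audit in one pool and return failed audit names per model."""
    failures: Dict[str, List[str]] = {model: [] for model in audits_by_model}
    with ThreadPoolExecutor(max_workers=max(1, concurrent_tasks)) as pool:
        futures = {
            pool.submit(query): (model, name)
            for model, name, query in flatten_audit_tasks(audits_by_model)
        }
        # No DAG ordering: results are handled in whatever order they finish.
        for future in as_completed(futures):
            model, name = futures[future]
            if future.result() > 0:
                failures[model].append(name)
    return {model: sorted(names) for model, names in failures.items()}


# Sample data (assumed): "orders" depends on "customers" in the DAG, but the
# audits are read-only, so that dependency plays no role here.
audits_by_model: AuditsByModel = {
    "customers": [("not_null_id", lambda: 0), ("unique_email", lambda: 2)],
    "orders": [("not_null_customer_id", lambda: 0)],
}
failures = run_audit_only(audits_by_model, concurrent_tasks=4)
```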

CLAassistant · 2026-03-04T01:54:53Z

All committers have signed the CLA.

- Circuit breaker: Use a shared threading.Event to cancel remaining audit tasks when the circuit breaker fires. Previously, CircuitBreakerError was collected like any other error and all tasks ran to completion.
- Nested concurrency: Pass audit_concurrent_tasks=1 from the scheduler's flat pool to the evaluator, preventing max_workers * concurrent_tasks threads from hitting the DB simultaneously. Add audit_concurrent_tasks parameter to SnapshotEvaluator.audit() for this override.
- Add tests for circuit breaker short-circuiting, blocking audit error collection (NodeAuditsErrors), and nested concurrency prevention.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
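The circuit breaker item can be illustrated with a small sketch: a shared threading.Event is checked before each task starts and set when one task raises the breaker error, so queued tasks are skipped rather than run. The CircuitBreakerError class below is a local stand-in, and run_with_cancellation is a hypothetical helper, not the PR's implementation.

```python
import threading
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Optional


class CircuitBreakerError(Exception):
    """Local stand-in for the error named in the commit message."""


def run_with_cancellation(tasks: List[Callable[[], str]], max_workers: int) -> List[Optional[str]]:
    """Run tasks in a pool; once the breaker fires, skip tasks not yet started."""
    cancelled = threading.Event()

    def guarded(task: Callable[[], str]) -> Optional[str]:
        if cancelled.is_set():
            return None  # Skipped: the breaker fired before this task started.
        try:
            return task()
        except CircuitBreakerError:
            cancelled.set()  # Tell every task still in the queue to skip.
            raise

    results: List[Optional[str]] = []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(guarded, task) for task in tasks]
        for future in futures:
            try:
                results.append(future.result())
            except CircuitBreakerError:
                results.append("tripped")
    return results


def ok() -> str:
    return "ok"


def trip() -> str:
    raise CircuitBreakerError("breaker fired")


# A single worker makes the order deterministic: ['ok', 'tripped', None, None]
results = run_with_cancellation([ok, trip, ok, ok], max_workers=1)
```

With more than one worker, tasks that were already running when the event was set still finish; only tasks that have not started yet are skipped.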

- Apply ruff formatting to new/modified lines
- Fix mypy error in test_audit_only_no_nested_concurrency: use fully mocked evaluator instead of real evaluator with replaced method, avoiding type mismatch on call_count/call_args_list

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
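A test along the lines of the one mentioned here might look like the sketch below: a fully mocked evaluator records its calls, and the assertion checks that the flat pool always passes audit_concurrent_tasks=1. The audit_all function and the snapshot names are hypothetical, and the real test in the PR may be structured differently. Assuming concurrent_tasks=4, the override keeps the worst case at 4 * 1 = 4 threads rather than 4 * 4 = 16.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, List
from unittest.mock import MagicMock


def audit_all(evaluator: Any, snapshots: List[str], concurrent_tasks: int) -> None:
    """Hypothetical flat-pool scheduler: one task per snapshot, inner concurrency pinned to 1."""
    with ThreadPoolExecutor(max_workers=concurrent_tasks) as pool:
        futures = [
            pool.submit(evaluator.audit, snapshot, audit_concurrent_tasks=1)
            for snapshot in snapshots
        ]
        for future in futures:
            future.result()  # Re-raise any error from the worker thread.


# A fully mocked evaluator: `audit` is itself a mock, so call_args_list is
# available without replacing a method on a real object.
evaluator = MagicMock()
evaluator.audit.return_value = []

audit_all(evaluator, ["orders", "customers", "payments"], concurrent_tasks=4)

calls = evaluator.audit.call_args_list
overrides = [call.kwargs["audit_concurrent_tasks"] for call in calls]
audited = sorted(call.args[0] for call in calls)
```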

airhorns and others added 2 commits March 3, 2026 21:05

airhorns wants to merge 3 commits into TobikoData:main from airhorns:feat/concurrent-audits-clean


2 participants
