fix(cli): make _verbose_console silent-aware and gate replay prints (#209) by jrob5756 · Pull Request #223 · microsoft/conductor

jrob5756 · 2026-05-21T18:37:27Z

Follow-up to #211, closes #209.

Problem

PR #211 gated the three foreground dashboard URL prints behind is_verbose() but left several other stderr leaks in place. Under --silent ("No progress output. Only JSON result on stdout."):

app.py:_run_replay still printed Press Ctrl+C to exit and Replay stopped unconditionally.
run.py had ~12 _verbose_console.print call sites that bypassed is_verbose() — warnings (dashboard failed to start, log-file open failure, workflow-hash mismatch), Press Esc to interrupt, Event log written to ..., Log written to ..., and _print_resume_instructions (printed on failure).

Fix

Fix at the source. Subclass Console for _verbose_console so every .print(...) is a no-op when is_verbose() is False. This:

Aligns the implementation with the long-misleading name (_verbose_console is now actually verbose-aware).
Removes the per-call-site audit burden — adding new _verbose_console.print(...) calls in run.py will now respect --silent automatically.
Is a no-op for the already-gated helpers (verbose_log, verbose_log_agent_*, the four PR #211 dashboard prints); their internal is_verbose() checks become belt-and-suspenders.

The app-wide console in app.py is intentionally NOT made silent-aware because it carries real error messages (print_error etc.). The two remaining replay prints in _run_replay are gated per-call instead.
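The subclassing approach described above can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: the `Console` class here is a hypothetical stdlib-only stand-in for the project's real console class, `set_verbose` and the `_verbose` flag are assumed names, and `SilentAwareConsole` only mirrors the behaviour the description attributes to the real subclass (gate `.print()`, force stderr).

```python
import io
import sys


# Hypothetical stand-in for the real console class, so the sketch runs with
# only the standard library. The PR subclasses the project's actual Console.
class Console:
    def __init__(self, *, stderr: bool = False, file=None):
        self.stderr = stderr
        self._file = file

    @property
    def file(self):
        if self._file is not None:
            return self._file
        return sys.stderr if self.stderr else sys.stdout

    def print(self, *objects, sep=" ", end="\n"):
        self.file.write(sep.join(str(o) for o in objects) + end)


_verbose = True  # assumed module-level flag behind is_verbose()


def is_verbose() -> bool:
    return _verbose


def set_verbose(value: bool) -> None:  # hypothetical helper for the sketch
    global _verbose
    _verbose = value


class SilentAwareConsole(Console):
    """Console whose print() is a no-op when is_verbose() is False.

    Only print() is gated here; any other output method on the base class
    would bypass the gate.
    """

    def __init__(self, **kwargs):
        kwargs["stderr"] = True  # force stderr so stdout stays JSON-only
        super().__init__(**kwargs)

    def print(self, *args, **kwargs):
        if not is_verbose():
            return
        super().print(*args, **kwargs)
```

Because the gate lives in the subclass, a call site only needs `console.print(...)`; it does not have to remember to check `is_verbose()` itself.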

Acceptance criterion

$ conductor --silent replay /tmp/log.jsonl 2>/tmp/stderr  &
$ wc -c /tmp/stderr
0 /tmp/stderr

Verified end-to-end. Pre-fix, the same invocation leaked Press Ctrl+C to exit to stderr.

Tests

TestSilentAwareConsole (3 tests) — verifies the subclass mechanism and that the module-level _verbose_console instance uses it. The test_module_verbose_console_is_silent_aware test guards against a future refactor accidentally swapping back to a plain Console (the original regression).
TestReplaySilentCompliance (2 tests) — verifies conductor --silent replay produces no stderr, and that without --silent the messages still appear (guard against an over-eager gate).

The replay tests mock ReplayDashboard.start/stop and patch asyncio.Event to raise CancelledError; otherwise they would hang on the await asyncio.Event().wait() that the replay command uses to keep the dashboard alive.
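To illustrate why patching asyncio.Event keeps such a test from hanging, here is a small self-contained sketch. `keep_alive` is a hypothetical stand-in for the replay wait loop (the real `_run_replay` is not shown in this PR text), and the way it handles `CancelledError` is an assumption made for the example.

```python
import asyncio
from unittest.mock import MagicMock, patch


async def keep_alive(messages: list) -> None:
    """Hypothetical stand-in for a command that waits until cancelled."""
    messages.append("Press Ctrl+C to exit")
    try:
        await asyncio.Event().wait()  # would block forever if unpatched
    except asyncio.CancelledError:
        pass  # assumed handling, for the sketch only
    messages.append("stopped")


def run_without_hanging() -> list:
    messages: list = []
    fake_event = MagicMock()
    # Calling wait() raises immediately instead of returning an awaitable.
    fake_event.wait.side_effect = asyncio.CancelledError()
    with patch("asyncio.Event", return_value=fake_event):
        asyncio.run(keep_alive(messages))
    return messages
```

The patch replaces the `asyncio.Event` attribute that the coroutine looks up at call time, so the wait ends at once and the test can inspect whatever was printed before and after it.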

All 397 existing tests/test_cli tests pass.

Rubber-duck pass

Consulted before implementation. Key findings adopted:

Test plan would have hung on infinite dashboard waits — solved via the asyncio.Event patch above.
Press Esc only fires for interactive runs without --web, so a --silent run --web test couldn't exercise it. The unit-level tests of the silent-aware subclass cover all 12 run.py call sites at once instead.
Confirmed no existing tests assert these messages appear in non-silent mode (no tests broken).

…209)

PR #211 (follow-up to #203) gated three foreground dashboard URL prints behind `is_verbose()` but left the remaining stderr leaks in place, violating the `--silent` contract ("No progress output. Only JSON result on stdout."). Specifically:

* `app.py:_run_replay` still printed 'Press Ctrl+C to exit' and 'Replay stopped' unconditionally.
* `run.py` had ~12 `_verbose_console.print` call sites that bypassed `is_verbose()`: warnings (dashboard failed to start, log-file open failure, workflow-hash mismatch), 'Press Esc to interrupt', 'Event log written to...', 'Log written to...', and `_print_resume_instructions` (printed on failure).

Fix at the source: subclass `Console` for `_verbose_console` so every `.print(...)` no-ops when `is_verbose()` is False. This aligns the implementation with the long-misleading name, removes the per-call-site audit burden, and is a no-op for the already-gated helpers.

The app-wide `console` in app.py is intentionally NOT made silent-aware because it carries real error messages; the two remaining replay prints are gated per-call instead.

Acceptance criterion verified: `conductor --silent replay <log>` produces 0 bytes on stderr (was leaking 'Press Ctrl+C to exit').

Regression tests:

* `TestSilentAwareConsole` (3 tests): verifies the subclass mechanism and that the module-level instance uses it.
* `TestReplaySilentCompliance` (2 tests): verifies the replay command produces no stderr under `--silent` and still prints when verbose.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Comprehensive PR review surfaced one critical coverage gap and a few defensive improvements. All addressed in this commit:

* **Critical (pr-test-analyzer)**: The `KeyboardInterrupt` branch at `app.py:1007-1009` (`Replay stopped`) had zero behavioral test coverage; the previous `CancelledError`-via-`asyncio.Event` path never reaches the outer `except KeyboardInterrupt`. Added `test_silent_replay_suppresses_keyboardinterrupt_message` and its verbose counterpart that drive that branch via `patch("asyncio.run", side_effect=KeyboardInterrupt())`. This is also more robust than the inner-path mocking: a future refactor of `_run_replay`'s wait primitive cannot silently bypass it.
* **Important (type-design-analyzer)**: `_SilentAwareConsole.__init__` now locks `stderr=True`. A future caller constructing an instance with `stderr=False` would have silently routed gated output to stdout and corrupted the `--silent` JSON contract.
* **Important (comment-analyzer)**: Tests now assert on `result.stderr` (not the combined `result.output`), matching the test names and catching a hypothetical regression that moved the prints to stdout.
* **Suggestion (pr-test-analyzer)**: Added `test_quiet_replay_prints_dashboard_messages` to lock in the contract at `app.py:204` that `--quiet` (MINIMAL) keeps `verbose_mode=True`. MINIMAL means "limited progress", not zero.
* **Suggestion (multiple agents)**: Class-level comment on `_SilentAwareConsole` warns that only `.print()` is gated; `.rule()`/`.log()`/`.status()`/`.print_json()` would silently bypass `--silent`. All current call sites use only `.print`.

Verified: `conductor --silent replay <log>` still produces 0 bytes on stderr; `conductor --quiet replay <log>` still prints the dashboard URL and Ctrl+C hint as before. 400 `tests/test_cli` tests pass (was 397; +3 net new). Lint/format/typecheck clean.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

jrob5756 · 2026-05-21T18:46:32Z

Addressed PR review feedback (commit bcc29a5):

Critical

✅ Covered the KeyboardInterrupt branch at app.py:1007-1009 — the previous tests only exercised the inner CancelledError path. Added test_silent_replay_suppresses_keyboardinterrupt_message + verbose counterpart via patch("asyncio.run", side_effect=KeyboardInterrupt()). Also more refactor-robust than the inner-path mock.
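As a rough illustration of driving the outer KeyboardInterrupt branch, the sketch below patches `asyncio.run` so it raises before any coroutine runs. `run_replay` and `_wait_forever` are hypothetical, simplified stand-ins for the real entry point, and the `verbose` parameter stands in for an `is_verbose()` check.

```python
import asyncio
import io
from unittest.mock import patch


async def _wait_forever() -> None:
    # Stand-in for the replay keep-alive wait.
    await asyncio.Event().wait()


def run_replay(verbose: bool, err: io.StringIO) -> None:
    """Hypothetical, simplified shape of a replay entry point."""
    coro = _wait_forever()
    try:
        asyncio.run(coro)
    except KeyboardInterrupt:
        if verbose:  # per-call gate, as described for the replay prints
            err.write("Replay stopped\n")
    finally:
        coro.close()  # avoids a "never awaited" warning when run() is mocked


def drive_keyboard_interrupt(verbose: bool) -> str:
    err = io.StringIO()
    with patch("asyncio.run", side_effect=KeyboardInterrupt()):
        run_replay(verbose, err)
    return err.getvalue()
```

Raising from `asyncio.run` itself means the test does not depend on which wait primitive the coroutine uses internally.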

Important

✅ Locked _SilentAwareConsole.__init__ to force stderr=True (prevents a future second instance from corrupting the stdout JSON contract).
✅ Tests now assert on result.stderr (not the combined result.output) — matches the test names and catches a hypothetical stdout-leak regression.
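The value of asserting on stderr alone can be seen in a small stdlib-only sketch. `replay_hint` is a hypothetical function that writes the Ctrl+C hint to a chosen stream, and `capture` is an assumed helper that records the two streams separately. The real tests use the CLI test runner's `result.stderr` instead.

```python
import contextlib
import io
import sys


def replay_hint(stream_name: str = "stderr") -> None:
    """Hypothetical print site; stream_name simulates a regression to stdout."""
    stream = sys.stderr if stream_name == "stderr" else sys.stdout
    print("Press Ctrl+C to exit", file=stream)


def capture(func, *args):
    """Run func and return (stdout_text, stderr_text) captured separately."""
    out, err = io.StringIO(), io.StringIO()
    with contextlib.redirect_stdout(out), contextlib.redirect_stderr(err):
        func(*args)
    return out.getvalue(), err.getvalue()


def hint_on_stderr(stream_name: str) -> bool:
    _, err = capture(replay_hint, stream_name)
    return "Press Ctrl+C to exit" in err


def hint_in_combined(stream_name: str) -> bool:
    out, err = capture(replay_hint, stream_name)
    return "Press Ctrl+C to exit" in out + err
```

If the hint moved to stdout, `hint_in_combined` would still return True while `hint_on_stderr` would return False, so only the stderr-specific check notices the move.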

Suggestions

✅ Added test_quiet_replay_prints_dashboard_messages to lock in the app.py:204 contract that --quiet (MINIMAL) does NOT suppress.
✅ Class-level comment on _SilentAwareConsole warns that only .print() is gated (.rule/.log/.status/.print_json would silently bypass — latent footgun for future contributors).
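The --quiet contract mentioned in the first suggestion can be pictured with a tiny mapping from console mode to the verbose flag. Only the name MINIMAL comes from the PR text; the `ConsoleMode` enum, its other members, and `verbose_mode_for` are hypothetical names used for illustration.

```python
from enum import Enum


class ConsoleMode(Enum):
    """Hypothetical enum; only MINIMAL is named in the PR discussion."""

    FULL = "full"        # assumed default mode
    MINIMAL = "minimal"  # --quiet: limited progress, not zero
    SILENT = "silent"    # --silent: no progress output at all


def verbose_mode_for(mode: ConsoleMode) -> bool:
    # Only the silent mode turns the verbose flag off in this sketch.
    return mode is not ConsoleMode.SILENT
```

Under this reading, a test for --quiet should expect the dashboard messages to appear, while a test for --silent should expect none.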

Verification

conductor --silent replay <log>: 0 bytes on stderr ✓
conductor --quiet replay <log>: dashboard URL + Ctrl+C hint visible ✓
400 tests/test_cli tests pass (was 397; +3 net new). Lint/format/typecheck clean.

Not adopted (with reason)

Inline # silent: comments at app.py:996/:1008 — comment-noise vs. marginal future-proofing.
Removing the redundant outer is_verbose() guards in run.py helpers — out of scope; 16+ existing tests in test_logging.py patch _verbose_console with a plain Console and rely on the outer guards.
Overriding .log/.rule/etc. on _SilentAwareConsole — YAGNI; the warning comment is sufficient defense (and a future test failure would surface the gap immediately).
"Tighten 'in this module' phrasing" — cosmetic.

jrob5756 and others added 2 commits May 21, 2026 14:37

jrob5756 merged commit efa520f into main May 21, 2026
9 checks passed

jrob5756 deleted the fix/209-silent-replay-press-ctrlc branch May 21, 2026 18:55
