feat(recording): add AudioOnly capture target and recording pipeline … by ManthanNimodiya · Pull Request #1881 · CapSoftware/Cap

ManthanNimodiya · 2026-06-01T17:40:04Z

Summary

Adds AudioOnly variant to ScreenCaptureTarget, the source of truth for what gets captured, propagated through every pipeline match
Wires AudioOnly through instant and studio recording pipelines, skipping screen/camera capture entirely and building audio-only output
Makes display: Option in SingleSegment and MultipleSegments, backwards-compatible via serde default, existing recordings unaffected
Makes screen: Option in studio Pipeline, audio-only recordings have no display track
Adds audio_only: bool to RecordingMeta so the editor and share page can read it from recording-meta.json
Tauri command layer: skips screen capture permission check and shareable content acquisition for AudioOnly

What's not in this PR

(Phase 6-7)
Desktop UI (mode selector), desktop editor (waveform view), and share page (AudioPlayer fallback) are follow-up PRs that build on this foundation.

Test plan

cargo clippy --workspace --all-targets -- -D warnings passes clean
Old recording-meta.json files without display or audio_only deserialize correctly via serde defaults
Existing screen/camera/camera-only recordings behave unchanged (manual verification on desktop app)

Greptile Summary

This PR adds the AudioOnly variant to ScreenCaptureTarget and wires it through the instant and studio recording pipelines, skipping screen/camera capture and building audio-only output. It also makes display: Option<VideoMeta> in segment structs with serde defaults for backward compatibility, adds audio_only: bool to RecordingMeta, and adds CurrentRecordingTarget::Audio for the Tauri state layer.

ScreenCaptureTarget::AudioOnly is added and propagated through all match sites (telemetry, screenshot, capture pipeline, shareable content acquisition).
display is made Option<VideoMeta> in SingleSegment and MultipleSegments with backward-compatible serde defaults; all callers updated with as_ref().map(...).unwrap_or_default() fallbacks.
Pipeline::screen is made Option<OutputPipeline> in the studio pipeline, but the cancel-guard task in spawn_watcher immediately cancels audio tracks when screen is absent.

Confidence Score: 3/5

Not safe to merge without addressing the cancel-guard regression in the studio pipeline and the several audio-only finalization failures flagged across review rounds.

The cancel-guard task in Pipeline::spawn_watcher immediately cancels the microphone pipeline for every audio-only studio recording, making studio audio-only recordings non-functional. Combined with still-unresolved issues from earlier rounds — ProjectRecordingsMeta::new returning Err for audio-only (blocking finalization), audio_only: false hardcoded in persist_final_recording_meta, and AudioOnly incorrectly opening the camera window — the audio-only path is not end-to-end functional.

crates/recording/src/studio_recording.rs (cancel-guard in spawn_watcher), crates/rendering/src/project_recordings.rs (SegmentRecordings.display still non-optional), apps/desktop/src-tauri/src/recording.rs (camera window triggered for AudioOnly, audio_only: false hardcoded in meta writers)

Important Files Changed

Filename	Overview
crates/recording/src/studio_recording.rs	Makes `screen` optional throughout the studio pipeline to support audio-only mode; introduces a P1 bug where the cancel-guard task in `spawn_watcher` immediately cancels mic/audio pipelines when `screen` is `None`.
crates/rendering/src/project_recordings.rs	Converts display panics to `Err` returns for audio-only, but `SegmentRecordings.display` is still `Video` (non-optional), so `ProjectRecordingsMeta::new` propagated errors still prevent audio-only finalization.
apps/desktop/src-tauri/src/recording.rs	Propagates `AudioOnly` through recording start/finish/finalize paths; `unwrap_or_default()` on missing display paths is safe but fires `create_screenshot` against the recording directory for audio-only.
crates/recording/src/sources/screen_capture/mod.rs	Adds `AudioOnly` variant to `ScreenCaptureTarget` with correct `None` returns for display, area, rect, and name lookups.
crates/recording/src/instant_recording.rs	Adds AudioOnly pipeline branch using `DashSegmentedAudioMuxer`; `video_info` made optional and handled correctly at stop time.
crates/project/src/meta.rs	Makes `display` optional in `SingleSegment` and `MultipleSegment` with backward-compatible serde defaults; `min_fps`/`max_fps` updated with `unwrap_or(0)` fallbacks.
apps/desktop/src-tauri/src/lib.rs	Adds `CurrentRecordingTarget::Audio` variant and maps `ScreenCaptureTarget::AudioOnly` to it correctly.
apps/desktop/src-tauri/src/import.rs	Updates all `display` field accesses to use `Option`; adds `audio_only: false` to all imported recording metas.
crates/recording/src/recovery.rs	Updates recovery to use Option display fields; correctly wraps display VideoMeta in Some for recovered segments.
crates/rendering/src/lib.rs	Updates `screen_fps` and `display_start_offset` accesses to use `Option`; falls back to 0 for audio-only.

Comments Outside Diff (4)

crates/recording/src/studio_recording.rs, line 1615-1636 (link)

audio_only flag always written as false by the studio pipeline

persist_final_recording_meta hardcodes audio_only: false in the RecordingMeta it writes to disk. For audio-only studio recordings, start_recording correctly writes audio_only: true initially, but this function overwrites the file at the end of the recording, so downstream consumers (editor, share page) will always read audio_only: false. The same problem exists in write_in_progress_meta (line 1656), which runs before recording even begins and overwrites the initial value. Both functions need to either accept the capture target as a parameter or read and preserve the existing audio_only value from disk.

Prompt To Fix With AI

This is a comment left during a code review.
Path: crates/recording/src/studio_recording.rs
Line: 1615-1636

Comment:
**`audio_only` flag always written as `false` by the studio pipeline**

`persist_final_recording_meta` hardcodes `audio_only: false` in the `RecordingMeta` it writes to disk. For audio-only studio recordings, `start_recording` correctly writes `audio_only: true` initially, but this function overwrites the file at the end of the recording, so downstream consumers (editor, share page) will always read `audio_only: false`. The same problem exists in `write_in_progress_meta` (line 1656), which runs before recording even begins and overwrites the initial value. Both functions need to either accept the capture target as a parameter or read and preserve the existing `audio_only` value from disk.

How can I resolve this? If you propose a fix, please make it concise.

apps/desktop/src-tauri/src/recording.rs, line 853-869 (link)

Audio-only mode incorrectly triggers the camera window

The AudioOnly target enters the same branch as CameraOnly here, which calls ShowCapWindow::Camera { centered: true } and sets was_camera_only_recording = true. For an audio-only recording there is no camera feed, so this opens a camera preview window with nothing to show and attaches incorrect state metadata. If the camera permission has not been granted, this could also produce an unexpected permission prompt. The AudioOnly case should likely skip this block entirely.

Prompt To Fix With AI

This is a comment left during a code review.
Path: apps/desktop/src-tauri/src/recording.rs
Line: 853-869

Comment:
**Audio-only mode incorrectly triggers the camera window**

The `AudioOnly` target enters the same branch as `CameraOnly` here, which calls `ShowCapWindow::Camera { centered: true }` and sets `was_camera_only_recording = true`. For an audio-only recording there is no camera feed, so this opens a camera preview window with nothing to show and attaches incorrect state metadata. If the camera permission has not been granted, this could also produce an unexpected permission prompt. The `AudioOnly` case should likely skip this block entirely.

How can I resolve this? If you propose a fix, please make it concise.

apps/desktop/src-tauri/src/recording.rs, line 2749-2750 (link)

Audio-only studio recordings cannot complete finalization

SegmentRecordings.display is a non-optional Video, so ProjectRecordingsMeta::new returns Err("SingleSegment/MultipleSegment missing display") whenever display is None. At both this call site (line 2749) and handle_recording_finish (line 2506), the result is propagated with ?. For audio-only studio recordings this means neither config.write nor any downstream steps run — the recording's project config is never written and the recording appears incomplete from the editor's perspective.

Prompt To Fix With AI

This is a comment left during a code review.
Path: apps/desktop/src-tauri/src/recording.rs
Line: 2749-2750

Comment:
**Audio-only studio recordings cannot complete finalization**

`SegmentRecordings.display` is a non-optional `Video`, so `ProjectRecordingsMeta::new` returns `Err("SingleSegment/MultipleSegment missing display")` whenever display is `None`. At both this call site (line 2749) and `handle_recording_finish` (line 2506), the result is propagated with `?`. For audio-only studio recordings this means neither `config.write` nor any downstream steps run — the recording's project config is never written and the recording appears incomplete from the editor's perspective.

How can I resolve this? If you propose a fix, please make it concise.

crates/recording/src/studio_recording.rs, line 595-609 (link)

Cancel-guard immediately terminates audio pipelines for audio-only recordings

When screen is None (audio-only), screen_done is None, so the if let Some(done) = screen_done guard is skipped and the spawned task proceeds directly to calling mic_cancel.cancel() (and cam_cancel, sys_cancel). Because the task is spawned but not immediately polled, it fires at the very next async yield point after recording starts — effectively cancelling the microphone pipeline before meaningful audio is captured. Every audio-only studio recording would produce an empty or near-empty output.

The cancellation task should only be spawned when there is an actual screen pipeline to act as the trigger. When screen is absent, the audio pipelines should run until Pipeline::stop is called explicitly.

Prompt To Fix With AI

This is a comment left during a code review.
Path: crates/recording/src/studio_recording.rs
Line: 595-609

Comment:
**Cancel-guard immediately terminates audio pipelines for audio-only recordings**

When `screen` is `None` (audio-only), `screen_done` is `None`, so the `if let Some(done) = screen_done` guard is skipped and the spawned task proceeds directly to calling `mic_cancel.cancel()` (and `cam_cancel`, `sys_cancel`). Because the task is spawned but not immediately polled, it fires at the very next async yield point after recording starts — effectively cancelling the microphone pipeline before meaningful audio is captured. Every audio-only studio recording would produce an empty or near-empty output.

The cancellation task should only be spawned when there is an actual screen pipeline to act as the trigger. When `screen` is absent, the audio pipelines should run until `Pipeline::stop` is called explicitly.

How can I resolve this? If you propose a fix, please make it concise.

Prompt To Fix All With AI

Fix the following 1 code review issue. Work through them one at a time, proposing concise fixes.

---

### Issue 1 of 1
crates/recording/src/studio_recording.rs:595-609
**Cancel-guard immediately terminates audio pipelines for audio-only recordings**

When `screen` is `None` (audio-only), `screen_done` is `None`, so the `if let Some(done) = screen_done` guard is skipped and the spawned task proceeds directly to calling `mic_cancel.cancel()` (and `cam_cancel`, `sys_cancel`). Because the task is spawned but not immediately polled, it fires at the very next async yield point after recording starts — effectively cancelling the microphone pipeline before meaningful audio is captured. Every audio-only studio recording would produce an empty or near-empty output.

The cancellation task should only be spawned when there is an actual screen pipeline to act as the trigger. When `screen` is absent, the audio pipelines should run until `Pipeline::stop` is called explicitly.

_{Reviews (4): Last reviewed commit: "fix(recording): skip audio-only segments..." | Re-trigger Greptile}

…foundation

…dd Audio target variant

ManthanNimodiya · 2026-06-07T02:58:45Z

@greptileai please re-review

…rdings

ManthanNimodiya · 2026-06-07T04:27:58Z

@greptileai please re-review

ManthanNimodiya · 2026-06-07T05:06:20Z

@greptileai please re-review

ManthanNimodiya · 2026-06-07T16:59:59Z

@richiemcilroy, can you have a quick look and lmk if I can go for the phase 6 and 7, no hurry just in case this gets buried

superagent-security · 2026-06-16T02:32:49Z

Superagent didn't find any vulnerabilities or security issues in this PR.

tembo · 2026-06-16T02:38:04Z

    let needs_remux = if fragmented {
        segment_metas.iter().any(|seg| {
-            let display_path = seg.display.path.to_path(&recording_dir);
+            let display_path = seg
+                .display
+                .as_ref()
+                .map(|d| d.path.to_path(&recording_dir))
+                .unwrap_or_default();
            display_path.is_dir()
        })


unwrap_or_default() here will treat display: None like an empty path; is_dir() can easily become true (e.g. current dir), which would incorrectly mark audio-only recordings as needing remux.

Suggested change

let needs_remux = if fragmented {

segment_metas.iter().any(|seg| {

let display_path = seg.display.path.to_path(&recording_dir);

let display_path = seg

.display

.as_ref()

.map(|d| d.path.to_path(&recording_dir))

.unwrap_or_default();

display_path.is_dir()

})

let needs_remux = if fragmented {

segment_metas.iter().any(|seg| {

seg.display

.as_ref()

.is_some_and(|d| d.path.to_path(&recording_dir).is_dir())

})

} else {

false

};

tembo · 2026-06-16T02:38:05Z

        sharing: None,
        inner: RecordingMetaInner::Studio(Box::new(studio_meta.clone())),
        upload: None,
+        audio_only: false,


This hardcodes audio_only: false when persisting the final meta. If recording-meta.json was initially written with audio_only: true, this will overwrite it at the end (same issue in write_in_progress_meta).

Suggested change

audio_only: false,

audio_only: RecordingMeta::load_for_project(recording_dir)

.ok()

.map(|m| m.audio_only)

.unwrap_or(false),

tembo · 2026-06-16T02:38:06Z

        .enumerate()
        .map(|(index, segment)| {
-            let duration = get_video_duration_secs(&segment.display.path.to_path(project_path))?;
+            let duration = get_video_duration_secs(&segment.display.as_ref().map(|d| d.path.clone()).unwrap_or_default().to_path(project_path))?;


Using unwrap_or_default() here means a missing display falls back to project_path and you’ll try to read a directory as a video. I think this should stay an explicit error (like the full_timeline_for_source_segments path).

Suggested change

let duration = get_video_duration_secs(&segment.display.as_ref().map(|d| d.path.clone()).unwrap_or_default().to_path(project_path))?;

let display = segment.display.as_ref().ok_or("Missing display video")?;

let duration = get_video_duration_secs(&display.path.to_path(project_path))?;

tembo · 2026-06-16T02:38:07Z

+                let display = s
+                    .display
+                    .as_ref()
+                    .ok_or_else(|| "SingleSegment missing display".to_string())


Now that display is optional, this returns an error whenever it’s None. If studio audio-only recordings are expected to finalize, callers need to skip ProjectRecordingsMeta::new (or this type needs to represent display as optional too) so audio-only paths don’t fail here.

tembo · 2026-06-16T02:38:44Z

            tokio::spawn(async move {
-                // When screen (video) finishes, cancel the other pipelines
-                let _ = screen_done.await;
+                if let Some(done) = screen_done {
+                    let _ = done.await;
+                }


This still cancels the other pipelines immediately for audio-only (screen_done = None). If the intent is “cancel others when screen finishes”, the task should exit early when there’s no screen pipeline.

Suggested change

tokio::spawn(async move {

// When screen (video) finishes, cancel the other pipelines

let _ = screen_done.await;

if let Some(done) = screen_done {

let _ = done.await;

}

tokio::spawn(async move {

let Some(done) = screen_done else {

return;

};

let _ = done.await;

tembo · 2026-06-16T02:38:48Z

            let display_output_path = match &updated_studio_meta {
                StudioRecordingMeta::SingleSegment { segment } => {
-                    segment.display.path.to_path(&recording_dir)
+                    segment.display.as_ref().map(|d| d.path.clone()).unwrap_or_default().to_path(&recording_dir)


Minor footgun with the unwrap_or_default() change: for audio-only, this resolves to recording_dir and then gets passed into create_screenshot a few lines later. Might be worth guarding the screenshot generation behind if segment.display.is_some() (and same for the multiple-segment case) so audio-only doesn’t try to thumbnail a directory.

feat(recording): add AudioOnly capture target and recording pipeline …

f8cd75a

…foundation

greptile-apps Bot reviewed Jun 1, 2026

View reviewed changes

Comment thread crates/rendering/src/project_recordings.rs Outdated

Comment thread apps/desktop/src-tauri/src/lib.rs Outdated

fix(recording): return Err for audio-only in ProjectRecordingsMeta, a…

ab6bfe3

…dd Audio target variant

greptile-apps Bot reviewed Jun 7, 2026

View reviewed changes

Comment thread crates/recording/src/studio_recording.rs Outdated

fix(recording): restore CROSS_TRACK_SNAP_SECS AV sync for screen reco…

07506b3

…rdings

greptile-apps Bot reviewed Jun 7, 2026

View reviewed changes

Comment thread apps/desktop/src-tauri/src/recording.rs

fix(recording): skip audio-only segments in needs_fragment_remux check

64507df

Merge branch 'main' into feat/audio-only-recording

a6f0661

Merge branch 'main' into feat/audio-only-recording

146958b

tembo Bot reviewed Jun 16, 2026

View reviewed changes

-        audio_only: false,
+        audio_only: RecordingMeta::load_for_project(recording_dir)
+            .ok()
+            .map(|m| m.audio_only)
+            .unwrap_or(false),

	let duration = get_video_duration_secs(&segment.display.as_ref().map(\|d\| d.path.clone()).unwrap_or_default().to_path(project_path))?;
	let display = segment.display.as_ref().ok_or("Missing display video")?;
	let duration = get_video_duration_secs(&display.path.to_path(project_path))?;

Conversation

ManthanNimodiya commented Jun 1, 2026 • edited by greptile-apps Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What's not in this PR

Test plan

Greptile Summary

Confidence Score: 3/5

Important Files Changed

Comments Outside Diff (4)

Uh oh!

Uh oh!

Uh oh!

ManthanNimodiya commented Jun 7, 2026

Uh oh!

Uh oh!

ManthanNimodiya commented Jun 7, 2026

Uh oh!

Uh oh!

ManthanNimodiya commented Jun 7, 2026

Uh oh!

ManthanNimodiya commented Jun 7, 2026

Uh oh!

superagent-security Bot commented Jun 16, 2026

Uh oh!

tembo Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

tembo Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

tembo Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

tembo Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

tembo Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

tembo Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ManthanNimodiya commented Jun 1, 2026 •

edited by greptile-apps Bot

Loading