Krea VACE 14b Pipeline - Mixin #311
base: main
Conversation
ryanontheinside left a comment
Some comments and questions - this does not include any consideration around the cache recomp interactions with VACE that we've all been discussing offline.
| kv_cache["k"][:, :local_end_index] = roped_key | ||
| kv_cache["v"][:, :local_end_index] = v | ||
| # Only update kv_cache if it exists (VACE forward passes kv_cache=None) | ||
| if kv_cache is not None: |
We skip this with `is_tf = False` anyway, but this check is separately unreachable, since `kv_cache` is always None here.
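For reference, a minimal sketch of the guarded write being discussed, assuming `kv_cache` is a dict of preallocated tensors on the streaming path and `None` on the VACE forward pass; the function name and signature are illustrative, not the pipeline's actual API:

```python
def attn_cache_write_sketch(kv_cache, roped_key, v, local_end_index):
    """Sketch only: the surrounding attention math is omitted."""
    # Only update kv_cache if it exists (VACE forward passes kv_cache=None)
    if kv_cache is not None:
        kv_cache["k"][:, :local_end_index] = roped_key
        kv_cache["v"][:, :local_end_index] = v
```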
```python
# Initialize optional LoRA adapters on the underlying model BEFORE quantization.
# Load text encoder before VACE initialization (may be offloaded to CPU)
start = time.time()
text_encoder = WanTextEncoderWrapper(
```
I wonder if we should be more discriminating with the CPU offloading of the text encoder, or exclude it entirely. I think you had mentioned that the Mixin approach already precludes 5090s with VACE + Krea, in which case we should probably not offload the text encoder at all. However, if this does enable 5090 + VACE + Krea, that will surely only be for development purposes, so maybe we gate it behind a flag and document it. As of now, the text encoder is always offloaded for Krea, even when there is sufficient VRAM.
Great point. I added it anyway, but now I'm realizing that it adds quite a bit of latency when changing prompts. I think supporting 32 GB is a stretch, and it's wiser to skip offloading altogether.
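For illustration, a hedged sketch of the flag-gated alternative floated above: keep the text encoder resident on the GPU by default and offload only when explicitly requested for low-VRAM development. The `offload_text_encoder` flag and `build_text_encoder` factory are hypothetical names, not the pipeline's actual API:

```python
import torch

def load_text_encoder_sketch(build_text_encoder, device, offload_text_encoder=False):
    """Sketch only: offloading trades VRAM for extra latency on every prompt
    change, so it is opt-in here rather than the default."""
    text_encoder = build_text_encoder()
    if offload_text_encoder:
        # Low-VRAM/development path: park weights on CPU between encodes.
        return text_encoder.to(device="cpu")
    # Default path: keep the encoder on the target device to avoid
    # re-transfer latency when the prompt changes.
    return text_encoder.to(device=torch.device(device))
```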
```python
new_block.block_id = saved_block_id

# Move new block to target device/dtype
new_block = new_block.to(device=orig_device, dtype=orig_dtype)
```
IIUC, I think the memory optimization order is reversed: currently, `new_block` is moved to the GPU before `orig_block` is moved to CPU, so both blocks are on the GPU simultaneously.
Hmm I made the change but didn't see any VRAM saving.
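For illustration, a hedged sketch of the ordering the review describes: offload the original block before materializing its replacement, so the two are never resident on the GPU at the same time. Whether this changes the observed peak depends on the allocator and how usage is measured; `orig_block`/`new_block` mirror the diff above, the rest is assumed:

```python
import torch

def swap_block_sketch(orig_block, new_block, orig_device, orig_dtype):
    # Free the original block's GPU memory first.
    orig_block.to(device="cpu")
    # Optionally hand the freed pages back to the driver so tools like
    # nvidia-smi reflect the drop (the caching allocator otherwise keeps them).
    torch.cuda.empty_cache()
    # Only then move the replacement to the target device/dtype.
    new_block = new_block.to(device=orig_device, dtype=orig_dtype)
    return new_block
```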
```python
)
print(f"Loaded text encoder in {time.time() - start:3f}s")
# Move text encoder to target device but use dtype of weights
text_encoder = text_encoder.to(device=device)
```
If `device` is a CUDA device, this allocates on the GPU, but `_init_vace` in mixin.py immediately moves it to CPU.
```diff
  # Use assign=True to preserve original tensor dtype (important for FP8 weights)
  missing_keys, unexpected_keys = actual_model.load_state_dict(
-     vace_state_dict, strict=False
+     vace_state_dict, strict=False, assign=True
```
Does `assign=True` interact correctly with the `.to(device, dtype)` calls in mixin.py (lines 157-158)? Those move VACE components to the GPU before this load happens - wondering if `assign=True` might replace them with CPU tensors from the checkpoint, making that earlier allocation unnecessary.
I think you're right; I suppose this makes the GPU/CPU assignment duplicated. This is now fixed, great find.
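A hedged sketch of the resulting order of operations: with `assign=True`, `load_state_dict` adopts the checkpoint tensors themselves, preserving their FP8 dtype but also their CPU placement, so the device move belongs after the load rather than before it. The wrapper function is illustrative; only the `load_state_dict` call mirrors the diff:

```python
def load_vace_weights_sketch(actual_model, vace_state_dict, device):
    # assign=True swaps the module's parameters for the checkpoint tensors,
    # keeping their original dtype (important for FP8 weights) and device.
    missing_keys, unexpected_keys = actual_model.load_state_dict(
        vace_state_dict, strict=False, assign=True
    )
    # Any earlier .to(device) on these submodules is overwritten by the
    # assignment, so move to the target device only now; dtype is left
    # untouched to keep the FP8 storage.
    actual_model.to(device=device)
    return missing_keys, unexpected_keys
```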
VACE architecture unification
- `KreaRealtimeVideoPipeline` moved from lazy loading to the unified `VACEEnabledPipeline` mixin
- `vace_layers` support, FP8 quantization, and text encoder CPU offloading

Krea V2V prompt reset bug fix

Code quality improvements
This PR is a derivative of #297.
Note: this branch does not support Krea + VACE on 32 GB VRAM.