[LTX-2] Run Gemma-3 Text Encoder natively in JAX via TorchAX by mbohlool · Pull Request #398 · AI-Hypercomputer/maxdiffusion

mbohlool · 2026-05-04T20:08:36Z

Description

This PR transitions the LTX-2 pipeline's text encoding process to utilize TorchAX, bridging the Gemma-3 model natively into JAX and significantly optimizing memory usage to prevent TPU out-of-memory errors. Minor PyLint warnings across the pipeline were also resolved during the refactor.

Key changes include:

TorchAX Integration: Replaced the eager PyTorch-based text encoder execution with the JAX-native TorchaxGemma3TextEncoder. TPU sharding is now manually distributed across the batch dimension via jax.device_put to prevent Softmax OOM crashes.
VAE Memory Optimization: Updated the VAE decoding loop to conditionally apply sharding constraints. By disabling sequential slicing and dynamically adjusting batch sharding for batch_size > 2, HBM crashes during decoding are avoided.
Lint Cleanup: Addressed minor PyLint warnings in the pipeline and encoder wrapper to maintain code health.

Benchmarks

Performance comparison demonstrating latency improvements from TorchAX integration.

Configuration	Text Encoding (CPU)	Text Encoding (TorchAX)	Text Encoding Impr.	Total Time (TE on CPU)	Total Time (TE on TorchAX)	Generation Impr.
Batch Size 1 (Latency Optimized)	3.75s	2.52s	32.93%	13.19s	11.67s	11.47%
Batch Size 1 (w/ Upsampler)	3.57s	2.47s	30.72%	16.65s	15.61s	6.28%
Batch Size 8 (Throughput Optimized)	23.23s	5.86s	74.77%	80.14s	60.40s	24.64%
Batch Size 8 (w/ Upsampler)	23.36s	6.10s	73.87%	114.98s	86.74s	24.56%

github-actions · 2026-05-04T20:08:45Z

e2e testgrid: https://8bcf50593faf4ea38060e236169827e5-dot-us-central1.composer.googleusercontent.com/dags/maxdiffusion_tpu_e2e/grid

mbohlool requested a review from entrpn as a code owner May 4, 2026 20:08

mbohlool force-pushed the text_encoder_tpu3 branch 2 times, most recently from 13e195e to 7707c3d Compare May 4, 2026 20:42

Offload LTX-2 text encoder to TorchAX and resolve lint issues

252c34e

mbohlool force-pushed the text_encoder_tpu3 branch from 7707c3d to 252c34e Compare May 5, 2026 23:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LTX-2] Run Gemma-3 Text Encoder natively in JAX via TorchAX#398

[LTX-2] Run Gemma-3 Text Encoder natively in JAX via TorchAX#398
mbohlool wants to merge 1 commit intomainfrom
text_encoder_tpu3

mbohlool commented May 4, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

mbohlool commented May 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Benchmarks

Uh oh!

github-actions Bot commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mbohlool commented May 4, 2026 •

edited

Loading