merge changes from upstream by lhphanto · Pull Request #550 · intrinsic-dev/aic

lhphanto · 2026-05-14T20:15:03Z

google-cla · 2026-05-14T20:15:33Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

- Takes c = (B, d) conditioning → projects to 6×d parameters via SiLU + Linear
- Zero-initialized output linear → all gates start at 0, so the model is a pure identity at init (training stability)
- Reuses the same self.norm (no learnable affine) for both SA and FFN pre-norms

TransformerLayer:
- Removed norm1, norm2; replaced by self.adaLN
- forward now takes cond and uses the adaptive shift/scale/gate for both sub-layers
- SA residual: x = x + gate_sa * attn_out (gate is learned per-feature, per-layer)
- FFN pre-norm: adaLN.norm(x) * (1 + scale_ff) + shift_ff (same shared norm instance)

VectorFieldTransformer:
- Removed norm_task from __init__ (task no longer a sequence token)
- forward computes cond = timestep_emb + task_token.squeeze(1); both are (B, d) and summed
- Prefix is now just [img_tokens, state_tokens]: shorter sequence, cheaper attention
- cond passed to every layer so each layer independently adapts its normalization
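The adaptive layer norm described above could look roughly like the following PyTorch sketch. It is not the code from this PR: the class name `AdaLNZero`, the order of the six chunks, the head count, the 4× FFN width, the GELU activation, and the use of `nn.MultiheadAttention` are all assumptions made for illustration. Only the names `self.norm`, `self.adaLN`, `cond`, `gate_sa`, `scale_ff`, `shift_ff`, `timestep_emb`, and `task_token` come from the description.

```python
import torch
import torch.nn as nn


class AdaLNZero(nn.Module):
    """Maps a (B, d) conditioning vector to six (B, 1, d) modulation tensors."""

    def __init__(self, d: int):
        super().__init__()
        # Shared pre-norm with no learnable affine; shift/scale come from cond.
        self.norm = nn.LayerNorm(d, elementwise_affine=False)
        self.proj = nn.Sequential(nn.SiLU(), nn.Linear(d, 6 * d))
        # Zero-init the output linear so every shift, scale and gate starts at 0.
        nn.init.zeros_(self.proj[1].weight)
        nn.init.zeros_(self.proj[1].bias)

    def forward(self, cond: torch.Tensor):
        # (B, d) -> (B, 1, 6d) -> six chunks that broadcast over the sequence.
        # The chunk order is an assumption of this sketch.
        return self.proj(cond).unsqueeze(1).chunk(6, dim=-1)


class TransformerLayer(nn.Module):
    """Pre-norm layer whose norms are modulated by cond (hypothetical sizes)."""

    def __init__(self, d: int, n_heads: int = 4):
        super().__init__()
        self.adaLN = AdaLNZero(d)
        self.attn = nn.MultiheadAttention(d, n_heads, batch_first=True)
        self.ffn = nn.Sequential(
            nn.Linear(d, 4 * d), nn.GELU(), nn.Linear(4 * d, d)
        )

    def forward(self, x: torch.Tensor, cond: torch.Tensor) -> torch.Tensor:
        shift_sa, scale_sa, gate_sa, shift_ff, scale_ff, gate_ff = self.adaLN(cond)
        # Self-attention sub-layer with adaptive pre-norm and gated residual.
        h = self.adaLN.norm(x) * (1 + scale_sa) + shift_sa
        attn_out, _ = self.attn(h, h, h, need_weights=False)
        x = x + gate_sa * attn_out
        # Feed-forward sub-layer reuses the same norm instance.
        h = self.adaLN.norm(x) * (1 + scale_ff) + shift_ff
        return x + gate_ff * self.ffn(h)


# Conditioning as described: both terms are (B, d) and are summed.
torch.manual_seed(0)
B, T, d = 2, 5, 16
timestep_emb = torch.randn(B, d)
task_token = torch.randn(B, 1, d)
cond = timestep_emb + task_token.squeeze(1)

layer = TransformerLayer(d)
x = torch.randn(B, T, d)
out = layer(x, cond)
```

Because the projection is zero-initialized, both gates are 0 at construction time, so `out` equals `x` before any training step. This is the identity-at-init property the description mentions.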

lhphanto added 29 commits May 14, 2026 20:30

first commit

7946382

add more logging

a8ea154

add max magnitude

51dd4b2

add logging of fts_tare_offset

b630169

v1 add support to reset env

3641df5

fix env reset

692bb51

random reset v1

eb9aef0

random reset v2

1deba42

env_reset3

7bc28dd

reset during teleop v1

29dfb5a

recording v1 mostly working except sc_port

303be82

policy implementation v1

87344ae

policy v2

d464ea7

policy v3

83317ae

add sample and layernorm for conditions

1fefbae

CFG and add plotting for training

e2b632b

add some profiling code, improve latency, output 6d instead of quaternion

6b12211

rot6d change, flow matching v1 and qsm script v1

2df96a1

fix image encoder, action 6d conversion etc

f501f9e

misc fixes for RL training, and a minor change for crash in RunFlowMatching

3e6b0fa

reduce image buffer memory usage 1

7cd2d81

add support for approach only diffusion. and some changes in qsm training too

b2a23fc

use joint attention instead of separate cross attention

04ade84

misc changes for latency improvement

51504a4

misc

30b41f8

rlpd v1

f671197

refined rlpd code, mostly reward part

9738139

better logging

65860de

lhphanto added 6 commits May 14, 2026 20:30

more logging for rlpd

7277dda

refactor keypoint model code

32d9c97

support resume

82f1337

misc

d241f06

add docker

8faa29b

dep fix

796b471

lhphanto force-pushed the main branch from 03ffcf4 to 796b471 Compare May 14, 2026 20:32

log new data

4ac1cec

lhphanto wants to merge 36 commits into intrinsic-dev:main from lhphanto:main
