Fix/verification by huseyincavusbi · Pull Request #1441 · TransformerLensOrg/TransformerLens

huseyincavusbi · 2026-06-24T18:12:10Z

Description

Fixes three verification bugs found during Gemma4 adapter testing:

Component benchmark false failures — Models using DelegatedAttentionBlockBridge (Gemma4) reported 81-96 component failures per model, dragging P1 from 100% to 50%. These are benchmark infrastructure failures. The component comparison can't call delegated attention/rotary/PLE modules in isolation because they require model-specific kwargs the benchmark doesn't provide. Added skip logic in component_outputs.py that detects DelegatedAttentionBlockBridge (via missing hook_q_input in hook aliases) and skips untestable components (attn, rotary_emb, per_layer projections). The gold-standard forward_pass_logits test was already passing. This just stops the benchmark from reporting false negatives.
Phase 2 empty header — main_benchmark.py printed a "PHASE 2:" header with zero tests on every run regardless of whether Phase 2 was selected. Moved the header inside the should_run_phase(2) guard.
Misleading "encoder-decoder" warning — The HF model loading log printed "for encoder-decoder model" for multimodal architectures using AutoModelForImageTextToText. Removed the incorrect label.

Verification (tiny-random/gemma-4-e):

Dev branch: 15/64 component failures
Fix branch: 0/31 component failures (all untestable components skipped)
Unit tests: 167/167 passed,

Type of change

Bug fix (non-breaking change which fixes an issue)

Checklist:

My changes generate no new warnings
New and existing unit tests pass locally with my changes
I have not rewritten tests relating to key interfaces which would affect backward compatibility

…an also be multimodal (AutoModelForImageTextToText)

…ed in every run

…attn/PLE/rotary_emb can't be tested in isolation

…ng instead of bridge_model.blocks

…locks

…ry is top-level, only _test_component handles it)

jlarson4 · 2026-06-24T19:25:20Z

Looks good! One small suggestion: _is_delegated_block() detects delegation by checking that hook_q_input is missing from the block's hook aliases, which is a bit indirect. The benchmark already skips attention via an explicit flag (maintain_native_attention / requires_position_embeddings), you could set maintain_native_attention=True on DelegatedAttentionBlockBridge, which would then reuse the existing skip for that case, rather than adding the hook-alias check?

huseyincavusbi added 6 commits June 24, 2026 18:31

fix: remove misleading 'encoder-decoder' label from AutoModel log — c…

bc50003

…an also be multimodal (AutoModelForImageTextToText)

fix: skip Phase 2 header when phase not selected — empty header print…

1b25029

…ed in every run

fix: skip component benchmarking for DelegatedAttentionBlockBridge — …

f41eaed

…attn/PLE/rotary_emb can't be tested in isolation

fix: detect DelegatedAttentionBlockBridge via adapter.component_mappi…

50dc4a2

…ng instead of bridge_model.blocks

fix: add rotary_emb skip in _test_component for delegated attention b…

8ebb729

…locks

fix: remove dead rotary_emb skip from _test_component_recursive (rota…

4d64b5d

…ry is top-level, only _test_component handles it)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix/verification#1441

Fix/verification#1441
huseyincavusbi wants to merge 6 commits into
TransformerLensOrg:devfrom
huseyincavusbi:fix/verification

huseyincavusbi commented Jun 24, 2026

Uh oh!

jlarson4 commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

huseyincavusbi commented Jun 24, 2026

Description

Type of change

Checklist:

Uh oh!

jlarson4 commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants