Title: Weight slicing bypasses LoRA layers
Problem:
# Current - bypasses LoRA: slicing the raw weight matrix never invokes
# the module's forward(), so the adapter update is silently dropped
gate_proj = self.gate_proj.weight[:m]
Fix: Use output slicing instead:
# Fixed - LoRA applied (requires torch.nn.functional imported as F;
# dff is the full intermediate width, m the active width)
gate_out = self.gate_proj(x)[:, :, :m]  # module forward runs, so the low-rank delta is included
up_out = self.up_proj(x)[:, :, :m]
hidden = gate_out * up_out  # if the MLP is gated (e.g. SwiGLU), apply the activation to gate_out first
output = self.down_proj(F.pad(hidden, (0, dff - m)))  # zero-pad back to dff; padded positions contribute nothing
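Note: this trades compute for correctness. Each projection runs at the full dff width and columns past m are discarded, and because the padding is zeros, the padded positions contribute nothing through down_proj - the result is equivalent to using only its first m input columns.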
Why: Direct .weight access reads only the frozen base matrix W. LoRA adds its low-rank delta BA inside the module's forward(), so a matmul built from the sliced weight uses W alone and the W + BA computation never happens.
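To make the failure concrete, here is a minimal sketch of a LoRA-wrapped linear layer. LoRALinear, lora_A, lora_B, and scaling are illustrative names for this note, not any particular library's API; PEFT-style wrappers differ in detail but share this structure.

import torch.nn as nn

class LoRALinear(nn.Module):
    # Sketch of a LoRA wrapper: y = x W^T + scaling * (x A^T) B^T
    def __init__(self, base: nn.Linear, r: int = 8, scaling: float = 1.0):
        super().__init__()
        self.base = base  # frozen W lives in base.weight
        self.lora_A = nn.Linear(base.in_features, r, bias=False)
        self.lora_B = nn.Linear(r, base.out_features, bias=False)
        self.scaling = scaling

    @property
    def weight(self):
        # Mirrors wrappers that expose the base weight directly:
        # this is W alone, so .weight[:m] silently drops the adapter.
        return self.base.weight

    def forward(self, x):
        # The low-rank delta BA is applied here, at call time; it is
        # never materialized into base.weight (unless explicitly merged).
        return self.base(x) + self.scaling * self.lora_B(self.lora_A(x))

Against this wrapper, self.gate_proj.weight[:m] reproduces only the frozen-W term, while self.gate_proj(x)[:, :, :m], as in the fix above, goes through forward() and picks up the adapter.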