inference-time-intervention

Here are 3 public repositories matching this topic...

peterdresslar / hybrid-signal-lab

Research codebase for intervention, benchmarking, and signal analysis in language models

benchmarking attention hybrid interpretability hybrid-models llm mechanistic-interpretability llm-inference inference-time-intervention

Updated Apr 21, 2026
Python

Pomilon-Intelligence-Lab / ALSI

Early baby steps towards a long-term vision regarding Mamba-2's state interpretability.

Updated Feb 4, 2026
Python

metaSATOKEN / Recync_framework

Runtime detection and control of LLM coherence failures (looping, hallucination, context loss). No fine-tuning. Zero iatrogenic harm. 69 experiments across 5 architectures.

reproducible-research transformers pytorch open-science pythia checkpoint-restart ai-safety runtime-monitoring control-barrier-functions mechanistic-interpretability llm-safety inference-time-intervention coherence-control

Updated Mar 30, 2026
Python

