Skip to content

fix(detector): detect partial self matmul output#254

Open
prasannakotyal wants to merge 7 commits intoSinatrasC:mainfrom
prasannakotyal:kg-blue-partial-self-matmul-red-87
Open

fix(detector): detect partial self matmul output#254
prasannakotyal wants to merge 7 commits intoSinatrasC:mainfrom
prasannakotyal:kg-blue-partial-self-matmul-red-87

Conversation

@prasannakotyal
Copy link
Copy Markdown

Summary

  • add SELF_MATMUL_OUTPUT for custom_kernel = functools.partial(helper) where the helper returns data @ data.t()
  • require the partial to wrap exactly one helper with no bound args or kwargs
  • keep the helper body compute-free except for the self-matmul return or one local alias

Target

KernelGuard-Red-Submission: 87

Validation

  • UV_CACHE_DIR=/tmp/uvcache uv run python -m py_compile kernelguard.py
  • functools.partial(_c) sample where _c(data) returns data @ data.t(): classification=hacked, should_filter=true, pattern SELF_MATMUL_OUTPUT
  • UV_CACHE_DIR=/tmp/uvcache uv run python ../../kernelguard_bypasses/eval_blue_patch.py kernelguard.py clean fixtures remain should_filter=False
  • Input reduction detector passed official eval on PR fix(detector): detect input reduction fake output #253 with TP 20/20, FP 20/20, surgicalness 1.0

@prasannakotyal prasannakotyal temporarily deployed to kernelguard-api-control-plane May 5, 2026 14:49 — with GitHub Actions Inactive
@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 5, 2026

KernelGuard Blue Evaluation

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant