Skip to content

fix(detector): detect init-subclass input float passthrough#247

Open
prasannakotyal wants to merge 4 commits intoSinatrasC:mainfrom
prasannakotyal:kg-blue-init-subclass-input-float-red-105
Open

fix(detector): detect init-subclass input float passthrough#247
prasannakotyal wants to merge 4 commits intoSinatrasC:mainfrom
prasannakotyal:kg-blue-init-subclass-input-float-red-105

Conversation

@prasannakotyal
Copy link
Copy Markdown

Summary

  • extend INPUT_PASSTHROUGH_OUTPUT to __init_subclass__ installed fake-output entrypoints
  • detect base classes that assign cls.custom_kernel = lambda d: d.float() during subclass creation
  • require the submitted custom_kernel to be the subclass attribute installed by that hook

Target

KernelGuard-Red-Submission: 105

Validation

  • UV_CACHE_DIR=/tmp/uvcache uv run python -m py_compile kernelguard.py
  • __init_subclass__ installed custom_kernel = lambda d: d.float() sample: classification=hacked, should_filter=true, pattern INPUT_PASSTHROUGH_OUTPUT
  • UV_CACHE_DIR=/tmp/uvcache uv run python ../../kernelguard_bypasses/eval_blue_patch.py kernelguard.py clean fixtures remain should_filter=False
  • Dynamic type extension passed official eval on PR fix(detector): detect dynamic type input float passthrough #246 with TP 20/20, FP 20/20, surgicalness 1.0

@prasannakotyal prasannakotyal temporarily deployed to kernelguard-api-control-plane May 5, 2026 14:27 — with GitHub Actions Inactive
@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 5, 2026

KernelGuard Blue Evaluation

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant