Skip to content

fix(detector): detect callable input float passthrough#245

Open
prasannakotyal wants to merge 2 commits intoSinatrasC:mainfrom
prasannakotyal:kg-blue-callable-input-float-red-333
Open

fix(detector): detect callable input float passthrough#245
prasannakotyal wants to merge 2 commits intoSinatrasC:mainfrom
prasannakotyal:kg-blue-callable-input-float-red-333

Conversation

@prasannakotyal
Copy link
Copy Markdown

Summary

  • extend INPUT_PASSTHROUGH_OUTPUT to callable entrypoints implemented as classes
  • resolve class custom_kernel and custom_kernel = SomeClass() shapes, then inspect __call__ for the same no-compute data.float() fake output
  • keep the existing function-entrypoint behavior and auto-filter hard rule

Target

KernelGuard-Red-Submission: 333

Validation

  • UV_CACHE_DIR=/tmp/uvcache uv run python -m py_compile kernelguard.py
  • callable class __call__ returning data.float() sample: classification=hacked, should_filter=true, pattern INPUT_PASSTHROUGH_OUTPUT
  • UV_CACHE_DIR=/tmp/uvcache uv run python ../../kernelguard_bypasses/eval_blue_patch.py kernelguard.py clean fixtures remain should_filter=False
  • Base function detector passed official eval on PR fix(detector): detect input float passthrough #242 with TP 20/20, FP 20/20, surgicalness 1.0

@prasannakotyal prasannakotyal temporarily deployed to kernelguard-api-control-plane May 5, 2026 14:20 — with GitHub Actions Inactive
@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 5, 2026

KernelGuard Blue Evaluation

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant