Merged
    except Exception:
        pass

    return 0.0
Bug: Unhandled Exceptions Propagate, Crashing Evaluation
The call to `_ep_eval` on line 152 sits outside the try-except block, so exceptions raised by the user's evaluation function are not caught. Only normalization errors are handled, allowing crashes from the evaluation logic to propagate instead of gracefully returning 0.0 as intended.
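A minimal, self-contained sketch of the suggested fix. `grade`'s body and the row-building details here are hypothetical stand-ins, not the actual generated code; the point is only that the `_ep_eval` call must sit inside the try block:

```python
def _ep_eval(row):
    # Stand-in for the user's embedded evaluation function; it may raise.
    raise ValueError("evaluation failed")

def grade(sample, item):
    # Fix: keep the _ep_eval call INSIDE the try block so exceptions from
    # the evaluation logic are caught, not just normalization errors.
    try:
        result = _ep_eval({"sample": sample, "item": item})
        return float(result)
    except Exception:
        return 0.0

print(grade({}, {}))  # prints 0.0: the raised ValueError is caught
```

With the call inside the try block, any failure in the evaluation logic degrades to a 0.0 score instead of crashing the grader run.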
dphuang2
reviewed
Nov 18, 2025
Note
Adds OpenAI RFT integration to convert Eval Protocol evaluation functions into Python grader specs, plus example usage and tests.

Adds `eval_protocol/integrations/openai_rft.py` with `build_python_grader_from_evaluation_test` to convert evaluation-style functions into OpenAI Python grader specs by:

- embedding the evaluation function as `_ep_eval`, along with helper types (`EvaluationRow`, `EvaluateResult`, `Message`);
- generating `grade(sample, item)`, which maps `item`/`sample` to a duck-typed row and normalizes outputs to a float score;
- exporting from `eval_protocol/integrations/__init__.py`.

Also adds:

- `examples/openai_rft/example_rapidfuzz.py`: demo `@evaluation_test` using RapidFuzz and conversion to a grader.
- `examples/openai_rft/test_openai_grader.py`: script to validate/run the grader via the OpenAI API.
- `tests/test_openai_rft_integration.py`: verifies grader generation from plain and wrapped functions and correct scoring behavior.

Written by Cursor Bugbot for commit 6fc1c36. This will update automatically on new commits.
eval_protocol/integrations/openai_rft.pywithbuild_python_grader_from_evaluation_testto convert evaluation-style functions into OpenAI Python grader specs by:_ep_eval, and embedding helper types (EvaluationRow,EvaluateResult,Message).grade(sample, item)that mapsitem/sampleto a duck-typed row and normalizes outputs to a float score.eval_protocol/integrations/__init__.py.examples/openai_rft/example_rapidfuzz.py: demo@evaluation_testusing RapidFuzz and conversion to grader.examples/openai_rft/test_openai_grader.py: script to validate/run the grader via OpenAI API.tests/test_openai_rft_integration.py: verifies grader generation from plain and wrapped functions and correct scoring behavior.Written by Cursor Bugbot for commit 6fc1c36. This will update automatically on new commits. Configure here.