eval-protocol · SandyYuan · Jan 16, 2026 · Jan 16, 2026 · Jan 16, 2026 · Jan 16, 2026
diff --git a/eval_protocol/rewards/ifeval/README.md b/eval_protocol/rewards/ifeval/README.md
@@ -0,0 +1,50 @@
+# IFEval Reward Function
+
+Evaluates how well model responses follow instruction constraints. Returns a partial credit score (0.0 to 1.0).
+
+## Quick Start
+
+```python
+import sys
+sys.path.insert(0, '/path/to/eval_protocol/rewards/ifeval')
+from reward import ifeval_partial_credit_reward
+
+response = "Hello world! This is my response."
+ground_truth = {
+    "instruction_id": ["keywords:existence"],
+    "kwargs": [{"keywords": ["hello", "world"]}]
+}
+
+score = ifeval_partial_credit_reward(response, ground_truth)
+# Score: 1.0 (all constraints satisfied)
+```
+
+## Dependencies
+
+```bash
+pip install nltk langdetect emoji syllapy immutabledict absl-py
+```
+
+NLTK resources are downloaded automatically on first use.
+
+## Notes
+
+- Automatically strips `<think>...</think>` tags before evaluation
+- Ground truth can be a dict, list, or JSON string
+- 112 total constraints (54 IFEval/IFTrain + 58 IFBench OOD)
+
+## File Sources
+
+**Copied from `open-instruct/open_instruct/IFEvalG/`:**
+- `ifeval_instructions.py` (from `instructions.py`)
+- `ifeval_registry.py` (from `instructions_registry.py`)
+- `ifeval_util.py` (from `instructions_util.py`)
+
+**Copied from `IFBench/` (commit 8e6a9be, 2025-01):**
+- `ifbench_instructions.py` (from `instructions.py`)
+- `ifbench_registry.py` (from `instructions_registry.py`)
+- `ifbench_util.py` (from `instructions_util.py`)
+
+**New code:**
+- `reward.py` - main reward function
+- `__init__.py` - package exports
diff --git a/eval_protocol/rewards/ifeval/__init__.py b/eval_protocol/rewards/ifeval/__init__.py
@@ -0,0 +1,13 @@
+"""IFEval reward function for evaluating instruction-following capabilities.
+
+Usage:
+    import sys
+    sys.path.insert(0, '/path/to/eval_protocol/rewards/ifeval')
+    from reward import ifeval_partial_credit_reward
+
+    score = ifeval_partial_credit_reward(response, ground_truth)
+"""
+
+from .reward import ifeval_partial_credit_reward
+
+__all__ = ["ifeval_partial_credit_reward"]