fix(evals): differentiate false positive vs false negative in escalation scorer by Ajayvardhanreddy · Pull Request #13 · Ajayvardhanreddy/agent-execution-engine

Ajayvardhanreddy · 2026-05-25T00:20:59Z

fix(evals): differentiate false positive vs false negative in escalation scorer

False negative (missed required escalation) → 0.0, hard_fail gate. False positive (unnecessary escalation) → 0.3 partial credit.

Rationale: a fraud case silently resolved is a security failure. An agent that escalates unnecessarily errs on the side of caution — bad UX but not a safety risk. Treating them identically obscured which failure mode appeared in reports.

…ion scorer False negative (missed required escalation) → 0.0, hard_fail gate. False positive (unnecessary escalation) → 0.3 partial credit. Rationale: a fraud case silently resolved is a security failure. An agent that escalates unnecessarily errs on the side of caution — bad UX but not a safety risk. Treating them identically obscured which failure mode appeared in reports.

Ajayvardhanreddy merged commit dcfba47 into dev May 25, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(evals): differentiate false positive vs false negative in escalation scorer#13

fix(evals): differentiate false positive vs false negative in escalation scorer#13
Ajayvardhanreddy merged 1 commit into
devfrom
feat/phase-4-eval-harness

Ajayvardhanreddy commented May 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Ajayvardhanreddy commented May 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant