Skip to content

Conversation

@CDBiddulph
Copy link

Previously, bootstrap_trace_data would capture execution traces in the prediction parameter of wrapped_metric, but then discarded them. Metrics would always receive trace=None instead of the actual trace. This would break metrics that use the trace.

Also fixed attempted division of two lists, dividing their lengths instead.

Previously, bootstrap_trace_data would capture execution traces in the `prediction` parameter of wrapped_metric, but then discarded them. Metrics would always receive trace=None instead of the actual trace. This would break metrics that use the trace.

Also fixed attempted division of two lists, dividing their lengths instead.
CDBiddulph added a commit to CDBiddulph/gepa-legibility that referenced this pull request Dec 9, 2025
@CDBiddulph
Copy link
Author

Ping - could somebody please take a look?

CDBiddulph and others added 2 commits December 22, 2025 09:46
When raise_on_error=False and unpacking fails, the previous code used
`continue` which skipped adding the failed example to the output list.
This broke the 1:1 correspondence between inputs and outputs, causing
issues in GEPA which zips outputs with validation IDs.

The fix adds a placeholder entry with an empty trace, allowing downstream
code (like make_reflective_dataset) to skip it gracefully while
maintaining correct output ordering.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant