Fix completion info extraction for offline best-of-n and self-consistency by smirnovlad · Pull Request #223 · IINemo/thinkbooster

smirnovlad · 2026-03-03T17:49:11Z

Summary

In the offline best-of-n and self-consistency strategies, get_completion_info() was receiving List[str] (from detect_steps()) instead of List[StepCandidate], so its isinstance guard always returned the defaults (None, False, False).
As a result, context_limit_hit_rate and max_steps_hit_rate were always 0 for these two strategies.
Fix: capture completion info eagerly while the StepCandidate objects are still available, store it in the trajectory dicts, and propagate it through to the final results.
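
For readers unfamiliar with the code, the failure mode and the fix can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the fields on StepCandidate, the "completion_reason" key, the reason strings, and the build_trajectory() helper are all assumptions made for the example. Only the names get_completion_info and StepCandidate, and the default tuple (None, False, False), come from the description above.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional, Tuple


@dataclass
class StepCandidate:
    # Hypothetical stand-in; the real class in the repository may differ.
    text: str
    other_data: Dict[str, Any] = field(default_factory=dict)


CompletionInfo = Tuple[Optional[str], bool, bool]


def get_completion_info(steps: List[Any]) -> CompletionInfo:
    """Return (completion_reason, context_limit_hit, max_steps_hit)."""
    # The isinstance guard: anything other than StepCandidate yields defaults.
    if not steps or not isinstance(steps[-1], StepCandidate):
        return (None, False, False)
    reason = steps[-1].other_data.get("completion_reason")  # assumed key
    return (reason, reason == "context_limit", reason == "max_steps")


def build_trajectory(candidates: List[StepCandidate]) -> Dict[str, Any]:
    """Hypothetical helper: capture completion info before flattening to text."""
    reason, context_hit, steps_hit = get_completion_info(candidates)
    return {
        "steps": [c.text for c in candidates],  # plain strings from here on
        "completion_reason": reason,
        "context_limit_hit": context_hit,
        "max_steps_hit": steps_hit,
    }


candidates = [
    StepCandidate("step 1"),
    StepCandidate("step 2", {"completion_reason": "context_limit"}),
]

# Buggy path: the guard sees strings and silently returns the defaults.
info_from_strings = get_completion_info([c.text for c in candidates])

# Fixed path: the info is read eagerly and travels with the trajectory dict.
trajectory = build_trajectory(candidates)
```

Storing the flags in the trajectory dict means later stages never need the original StepCandidate objects, which is the property the fix relies on.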

Test plan

Syntax check passes for both modified files
Run eval with offline_best_of_n on a small dataset, verify metrics.json reports correct context_limit_hit_count / max_steps_hit_count
Run eval with self_consistency on a small dataset, same check
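
The metrics check in the last two steps could be scripted along these lines. This is only a sketch: it assumes metrics.json is a flat JSON object with context_limit_hit_count and max_steps_hit_count as top-level keys, which the description does not confirm, and read_hit_counts() is a hypothetical helper name.

```python
import json
from pathlib import Path
from typing import Dict, Union

# Key names taken from the test plan; their location in the file is assumed.
REQUIRED_KEYS = ("context_limit_hit_count", "max_steps_hit_count")


def read_hit_counts(metrics_path: Union[str, Path]) -> Dict[str, int]:
    """Load a metrics file and return the two hit counts, failing loudly if absent."""
    metrics = json.loads(Path(metrics_path).read_text())
    missing = [key for key in REQUIRED_KEYS if key not in metrics]
    if missing:
        raise KeyError(f"metrics file lacks keys: {missing}")
    return {key: metrics[key] for key in REQUIRED_KEYS}
```

On a dataset where some samples are known to hit the context limit or the step cap, a non-zero count from this helper would suggest the fix is taking effect.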

…ency get_completion_info() was receiving List[str] (from detect_steps()) instead of List[StepCandidate], causing the isinstance guard to always return defaults. This meant context_limit_hit_rate and max_steps_hit_rate were always 0. Capture completion info eagerly while StepCandidate objects are still available, store it in trajectory dicts, and propagate through to final results.

smirnovlad-test assigned smirnovlad Mar 20, 2026

smirnovlad wants to merge 1 commit into main from fix/completion-info-extraction
