Hello — submitting DevonOS for the SciCode leaderboard.
Test split scored:
Main Problem Resolve Rate: 10.8 (7 / 65)
Subproblem: 29.2 (85 / 291)
Mains fully resolved: 11, 14, 21, 35, 65, 71, 74.
Run on AMD 24-core + NVIDIA RTX 3070 · 199 s wall.
Date: 2026-05-15.
Happy to share additional details (witness-deficit audit covering
the 185 unsolved subproblems) if useful for review.
Thank you.
Hello — submitting DevonOS for the SciCode leaderboard.
Test split scored:
Main Problem Resolve Rate: 10.8 (7 / 65)
Subproblem: 29.2 (85 / 291)
Mains fully resolved: 11, 14, 21, 35, 65, 71, 74.
Run on AMD 24-core + NVIDIA RTX 3070 · 199 s wall.
Date: 2026-05-15.
Happy to share additional details (witness-deficit audit covering
the 185 unsolved subproblems) if useful for review.
Thank you.