We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent d7cf357 commit edf6b61Copy full SHA for edf6b61
1 file changed
leaderboard.html
@@ -200,7 +200,7 @@ <h3 class="text-sm font-medium text-gray-900 mb-2">Methodology</h3>
200
<strong>Cooperative:</strong> Two agents work on separate features that may conflict. Success requires both features to pass tests after merging.
201
</p>
202
<p class="text-sm text-gray-600 leading-relaxed mt-2">
203
- <strong>Solo:</strong> A single agent implements both features sequentially. This serves as the upper bound for what agents can achieve.
+ <strong>Solo:</strong> A single agent implements both features. This serves as the upper bound for what agents can achieve.
204
205
<p class="text-sm text-gray-500 mt-3">
206
All models evaluated on the same 652 tasks with identical prompts and tool access.
0 commit comments