terminal-bench

Multi-agent reasoning MCP server for Claude Code. Spawns parallel research agents to find knowledge LLMs don't have. +23.1% on Terminal Bench 2.0 SWE tasks.

research mcp multi-agent developer-tools ai-agents claude training-data fine-tuning nia llm model-context-protocol mcp-server claude-code terminal-bench

Updated Apr 13, 2026
TypeScript

Improve this page

Add a description, image, and links to the terminal-bench topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the terminal-bench topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

terminal-bench

Here are 6 public repositories matching this topic...

harbor-framework / harbor

LiberCoders / CLI-Gym

plaume8 / spoox

li-boxuan / Terminal-bench-OpenHands-trajectories

ayush0824 / parse-log-stats

sam-siavoshian / Symposium

Improve this page

Add this topic to your repo