Evaluation framework for testing LLM agents' ability to use MCP tools
-
Updated
May 12, 2026 - Python
Evaluation framework for testing LLM agents' ability to use MCP tools
YAML Based Eval Specification Language and AI generation pipeline for LLMs and Developers.
A pytest plugin integrating pydantic-evals
Add a description, image, and links to the pydantic-evals topic page so that developers can more easily learn about it.
To associate your repository with the pydantic-evals topic, visit your repo's landing page and select "manage topics."