Framework for evaluating and giving structured feedback to LLM-based agents (like OpenAI's GPT models). It provides a feedback loop mechanism to analyze, critique, and improve the performance of LLMs through real-time or post-hoc feedback using structured evaluation metrics or natural language.
fintools-ai/llm-agent-evaluator
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|