A small runnable simulator for controlled multi-agent workflows.
This repository turns governance, orchestration, and evaluation concepts into a working reference implementation.
It demonstrates a small agent system with:
- explicit roles,
- bounded retries,
- fallback behavior,
- escalation triggers,
- structured logs,
- and a lightweight evaluation summary.
- Planner: decides how the task should be handled
- Executor: performs the main task action
- Evaluator: checks whether the result is acceptable
- Supervisor: decides whether to accept, retry, fallback, or escalate
normal_successretry_then_successfallback_after_failure
python run_demo.py --scenario normal_successEach run produces:
- a decision log,
- retry and escalation events,
- final outcome,
- latency,
- cost estimate,
- and a small evaluation summary.
run_demo.pysrc/agents.pysrc/controller.pysrc/evaluation.pysrc/scenarios.pyexamples/sample-output.md
A multi-agent system should not only produce outputs. It should also expose control logic clearly enough to be debugged, evaluated, and improved.