goeval

RAGAS for Go. Faithfulness, context recall, answer relevance, hallucination — measure your RAG pipeline without leaving the Go stack.

⚠️ Pre-MVP. API is exploration-grade until v0.1.0. Star/watch to follow development. First metric (faithfulness) lands in the next commit batch.

Why goeval

Python owns the RAG-eval ecosystem: RAGAS (8k+ stars), DeepEval (3k+ stars), TruLens, Phoenix from Arize. All Python.

In Go — nothing comparable. Teams building RAG pipelines in Go (on langchaingo, cloudwego/eino, or hand-rolled OpenAI/Anthropic clients) currently have two options:

Export to Python — run RAGAS in a separate process, plumb the data back. Slow, breaks the single-binary deployment story.
Roll your own metrics — every team reinvents faithfulness, context-recall, hallucination detection.

Both are wasteful. goeval is the Go-native alternative.

Design principles

Streaming-first. Eval runs are pipelines: dataset → evaluator → metric. Channels are first-class, so a 10k-sample dataset can be evaluated without holding every sample in memory on a CI runner.
LLM-as-judge done right. Faithfulness, context relevance, and answer correctness rely on a strong LLM. goeval supports a Judge abstraction so you can swap GPT-4, Claude, or a local Llama without changing metric code.
Deterministic metrics too. Overlap-based context recall and BLEU/ROUGE-style scores need no LLM, so they are fast and reproducible.
No framework lock-in. Adapters for langchaingo, eino, raw OpenAI/Anthropic Go SDKs, but the core is dependency-light.
CI-friendly. Exit code on regression, JSON/Markdown reports, GitHub Action template.
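
To make the first three principles concrete, here is a minimal sketch of how a channel-based pipeline, a swappable judge, and a deterministic metric could fit together. It is not goeval's API, which is still unsettled: Sample, Result, Metric, Judge, FixedJudge, Faithfulness, ContextRecall, and Evaluate are all hypothetical names, and the recall formula (the share of distinct ground-truth tokens found in the retrieved contexts) is a deliberately simple stand-in for whatever overlap measure the library ends up using.

```go
package main

import (
	"fmt"
	"strings"
)

// Sample is a hypothetical record of one RAG interaction.
type Sample struct {
	Question, Answer, GroundTruth string
	Contexts                      []string // retrieved passages
}

// Result is what the pipeline emits for each sample.
type Result struct {
	Metric string
	Score  float64
	Err    error
}

// Metric is an assumed interface; a real one would likely also take a context.Context.
type Metric interface {
	Name() string
	Score(s Sample) (float64, error)
}

// Judge is an assumed LLM abstraction: it rates in [0, 1] how well contexts support answer.
type Judge interface {
	Rate(answer string, contexts []string) (float64, error)
}

// FixedJudge is a stub standing in for a real GPT-4, Claude, or Llama client.
type FixedJudge float64

func (j FixedJudge) Rate(string, []string) (float64, error) { return float64(j), nil }

// Faithfulness delegates scoring to whichever Judge it holds.
type Faithfulness struct{ Judge Judge }

func (Faithfulness) Name() string { return "faithfulness" }

func (f Faithfulness) Score(s Sample) (float64, error) {
	return f.Judge.Rate(s.Answer, s.Contexts)
}

// ContextRecall is deterministic: distinct ground-truth tokens found in contexts / total.
type ContextRecall struct{}

func (ContextRecall) Name() string { return "context_recall" }

func (ContextRecall) Score(s Sample) (float64, error) {
	want := tokens(s.GroundTruth)
	if len(want) == 0 {
		return 0, fmt.Errorf("sample %q has no ground truth", s.Question)
	}
	have := tokens(strings.Join(s.Contexts, " "))
	hits := 0
	for tok := range want {
		if have[tok] {
			hits++
		}
	}
	return float64(hits) / float64(len(want)), nil
}

// tokens lowercases text and returns its set of punctuation-trimmed words.
func tokens(text string) map[string]bool {
	set := make(map[string]bool)
	for _, w := range strings.Fields(strings.ToLower(text)) {
		if w = strings.Trim(w, ".,;:!?\"'()"); w != "" {
			set[w] = true
		}
	}
	return set
}

// Evaluate scores samples as they arrive; the unbuffered output applies backpressure.
func Evaluate(in <-chan Sample, m Metric) <-chan Result {
	out := make(chan Result)
	go func() {
		defer close(out)
		for s := range in {
			score, err := m.Score(s)
			out <- Result{Metric: m.Name(), Score: score, Err: err}
		}
	}()
	return out
}

func main() {
	in := make(chan Sample, 1)
	in <- Sample{Contexts: []string{"The capital of France is Paris."}, GroundTruth: "Paris is the capital of France"}
	close(in)
	for r := range Evaluate(in, ContextRecall{}) {
		fmt.Printf("%s=%.2f err=%v\n", r.Metric, r.Score, r.Err)
	}
}
```

Because Evaluate only depends on the Metric interface, passing Faithfulness{Judge: FixedJudge(0.8)} instead of ContextRecall{} changes what is measured without touching the pipeline, and replacing the stub with a real client would change the judge without touching the metric.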

Roadmap to v0.1.0

Estimated time to v0.1.0: 6 weeks (solo maintainer, evenings only).

Install (when v0.1.0 ships)

go get github.com/goncharovart/goeval@v0.1.0

Inspiration

explodinggradients/ragas — Python progenitor of RAG-eval
confident-ai/deepeval — 50+ metrics, LLM-as-judge
truera/trulens — observability + eval

Status & maintenance

Pre-MVP, developed solo. Issues and PRs are welcome; expect slow review (evenings only) until v0.1.0.

Built in the open: every architectural decision goes into docs/design.md once it stabilises.

License

MIT — see LICENSE.
