Evalica, your favourite evaluation toolkit
-
Updated
Apr 22, 2026 - Python
Evalica, your favourite evaluation toolkit
Source code and data for the EDM 2022 paper
Package to do Bradley-Terry Model pairwise compairsons
Moody Lenses internal campaign decision engine — LLM-powered multi-dimensional evaluation for marketing campaign selection
public ranking system for ai agents
Extract archery recurve and compound event scores from Ianseo and builds a website containing the resulting ranks of all archers. The statistical analysis was written in R, using the PlackettLuce package.
Concept-Guided Chain-of-Thought (CGCoT) pairwise annotation tool for systematic text evaluation using LLMs. Generate breakdowns, compare items, compute scores, and validate against human judgments. Supports Ollama, Hugging Face, Google Gemini, OpenAI, and Anthropic models.
R Package: Pairwise Comparison Tools for LLM-Based Writing Evaluation
UI for straightforward Bradley-Terry feedback loop
AI-powered personal recommendation engine that learns preferences through pairwise comparison
Oscars predictions using Bradley-Terry model and stan
This is the online appendix for the paper "Bayesian Paired-Comparison with the bpcs package"
Bayesian Spatial Bradley--Terry
Bradley Terry Modelini hesaplanmak için küçük bir fonksiyon
Local Bradley-Terry paired comparison tool for ranking images. Upload JSON or SQLite, rate pairs, export rankings. Everything runs in your browser.
Fast inference for multi-body rankings using Newman's efficient Plackett-Luce algorithm. Achieves 5-70x speedup with Numba JIT compilation.
Sorting made better, powered by science.
Add a description, image, and links to the bradley-terry topic page so that developers can more easily learn about it.
To associate your repository with the bradley-terry topic, visit your repo's landing page and select "manage topics."