
Add Model Compare Tab #509

Merged
KartikP merged 6 commits into master from kp/model-compare on Mar 5, 2026

Conversation

@KartikP (Contributor) commented Feb 11, 2026

compare.models.mov (video demo attachment)

@mike-ferguson mike-ferguson self-requested a review on February 11, 2026 21:20
@KartikP KartikP requested review from mike-ferguson and removed request for mike-ferguson on February 17, 2026 18:52
@mschrimpf (Member)

This is great!

Some nitpicks:

Per-Benchmark Score Correlation

  1. Can we use the same visualization as for compare-benchmarks? For instance, Pearson R etc. are shown differently in the two plots. In fact, they're even shown differently within the compare-models plot: there is a box at the top detailing the stats, and then another one with the same information in the top left. I don't care much which one we use, as long as we show the information once and uniformly. It would also be nice to show the Brain-Score logo somewhere on the plot so that it's visible when people re-post the plot.
  2. The search is great! Can we just make it a bit more forgiving? For instance, searching for "resnet 50 sin" does not yield resnet50-SIN. (Going forward, it would be great to include our tags here too, so that you can e.g. search for "transformer".)
  3. Could we include more information about each model that is selected? For instance the rank, who contributed it, a link to the model page (like a mini model card). I would put this as two boxes below the search instead of the current stats.
  4. At least according to the legend, scores are repeated: average vision includes everything, neural includes {V1, V2, V4, IT}. I would say we either let the user choose the level, or we keep it at {V1, V2, V4, IT, Behavior} -- i.e. gray out engineering by default.
  5. For the individual benchmark dots, is it possible to make them clickable? I.e. link to the benchmark page? (Tooltip could even be some of the benchmark stimuli, but just the link would already be great.)
  6. (When clicking on e.g. V1, I would have expected that only those scores remain and everything else goes away. But this is only a minor inconvenience and I guess otherwise it wouldn't be possible to filter individually.)
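The forgiving-search request in point 2 boils down to normalizing both the query and the model name before matching, so that spacing, case, and separators don't matter. A minimal sketch (the function names are illustrative, not the site's actual search code):

```python
import re


def normalize(text: str) -> str:
    # Lowercase and drop all non-alphanumeric characters so that
    # "resnet 50 sin" and "resnet50-SIN" reduce to the same string.
    return re.sub(r"[^a-z0-9]+", "", text.lower())


def matches(query: str, model_name: str) -> bool:
    # A query matches when its normalized form is a substring of the
    # normalized model name.
    return normalize(query) in normalize(model_name)
```

With this, `matches("resnet 50 sin", "resnet50-SIN")` holds, while unrelated queries like `"transformer"` still miss; tag-based search would then be a separate lookup layered on top.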

Top Benchmark Differences

Move the title text a tiny bit higher; it overlaps with the bars.
(screenshot: title text overlapping the bars)

Mean Score by Domain

Let's show the individual benchmark dots? Or do more of a boxplot/violin plot?
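If the boxplot route is taken, each domain's box is just a five-number summary over that domain's benchmark scores. A minimal sketch in plain Python of the data prep (a hypothetical helper, not the site's actual plotting code):

```python
from statistics import median


def five_number_summary(scores):
    # Returns (min, Q1, median, Q3, max) for one domain's benchmark
    # scores, using the exclusive-median quartile convention.
    s = sorted(scores)
    n = len(s)
    lower = s[: n // 2]        # values below the median
    upper = s[(n + 1) // 2:]   # values above the median
    return (s[0], median(lower), median(s), median(upper), s[-1])
```

The individual benchmark dots could then be overlaid on each box, which would address both suggestions at once.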

@mschrimpf (Member)

I want to be clear that these are all nitpicks that we can address in an update in the near future. Having this live as-is would already be an improvement over not having it :)

@KartikP KartikP merged commit 12cbc98 into master Mar 5, 2026
1 check passed