Skip to content

[FEATURE] Every Eval Ever #47

@borgr

Description

@borgr

Problem

Is your proposal tackling an exisitng problem or limitation?

  • No, it's an addition

Proposal

Describe what you would like to see happen.

  • Type:
    • New Ontology (data source for multiple tasks)
    • New Task(s)
    • New Model(s)
    • New Metric(s)
    • [ x] Other
  • Area(s) of code: somehwere at the output level

Alternatives

Have you considered other approaches? Briefly list them and why you did not choose them.

Additional Context

Links to related issues, PRs, docs, or external references.
Screenshots or small examples if useful.

Implementation

  • I plan to implement this in a PR
  • I am proposing the idea and would like someone else to pick it up
    Hey folks, cool evaluation tool. I was wondering if you thought of integrating your outputs to the standard format set by EveryEvalEver, and making an easy way to share the results of the evaluation in a way evaluation folks would be able to study them?

This will bring some more people to look at what you are doing and for the long run allow other comparisons to be made as it will be in the place we (eval folks) will be looking.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions