Short Description
Define quality dimensions for evaluating workflow steps (e.g., data credibility, interpretability, completeness), inspired by Karenina benchmarking.
Deliverables
- Benchmark rubric and example annotations
- Schema for benchmark storage and FAIR sharing
- Example evaluation of one workflow step
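
To make the deliverables above more concrete, here is a minimal sketch of what a rubric, an annotation, and a storable benchmark record could look like. Everything in it is an assumption for discussion: the class names (`Dimension`, `Annotation`, `BenchmarkRecord`), the 1 to 5 scoring scale, the `identifier` and `license` fields standing in for FAIR metadata, and the sample values are all hypothetical and not taken from Karenina or from #95 / #99.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import List


@dataclass
class Dimension:
    """One quality dimension in the rubric (hypothetical structure)."""
    name: str
    description: str
    scale_min: int = 1  # assumed scoring scale, not an agreed one
    scale_max: int = 5


@dataclass
class Annotation:
    """One annotator's score for one workflow step on one dimension."""
    step_id: str
    dimension: str
    score: int
    rationale: str
    annotator: str


@dataclass
class BenchmarkRecord:
    """A shareable bundle of rubric plus annotations (hypothetical schema)."""
    identifier: str  # placeholder for a persistent identifier (findability)
    license: str     # reuse terms (reusability)
    rubric: List[Dimension] = field(default_factory=list)
    annotations: List[Annotation] = field(default_factory=list)

    def validate(self) -> List[str]:
        """Return a list of problems; an empty list means the record is consistent."""
        errors = []
        dims = {d.name: d for d in self.rubric}
        for a in self.annotations:
            d = dims.get(a.dimension)
            if d is None:
                errors.append(f"unknown dimension: {a.dimension}")
            elif not d.scale_min <= a.score <= d.scale_max:
                errors.append(f"score {a.score} out of range for {a.dimension}")
        return errors

    def to_json(self) -> str:
        """Serialize to plain JSON so the record can be stored and exchanged."""
        return json.dumps(asdict(self), indent=2, sort_keys=True)


# Illustrative values only; the three dimensions are the examples named above.
record = BenchmarkRecord(
    identifier="example-benchmark-0",
    license="CC-BY-4.0",
    rubric=[
        Dimension("data credibility", "Are the data sources trustworthy?"),
        Dimension("interpretability", "Can a reader follow what the step did?"),
        Dimension("completeness", "Are all required inputs and outputs present?"),
    ],
    annotations=[
        Annotation("step-1", "completeness", 4, "One optional input missing.", "annotator-a"),
    ],
)

print(record.validate())
print(record.to_json())
```

Keeping the rubric inside the same record as the annotations means a shared file is self-describing: a consumer can check every score against the dimension it claims to measure without needing a separate lookup.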
Dependencies
#95 and #99