Pull requests: huggingface/lighteval

Pull requests list

Korean completed and Basque fixed
#1179 opened Mar 6, 2026 by inakiLakunza
Fix: pass through custom_tasks and enable multilingual in eval command
#1172 opened Feb 19, 2026 by dzautner
2 tasks done
Add jfinqa: Japanese Financial Numerical Reasoning QA
#1169 opened Feb 17, 2026 by ajtgjmdjp
2 of 3 tasks
fix: restore task list display logic
#1166 opened Feb 10, 2026 by s1eeping-king
Fix TypeError in aa_omniscience_prompt
#1161 opened Jan 22, 2026 by pjavanrood
Fix split loading error in bigbench
#1159 opened Jan 22, 2026 by pjavanrood
Fix RecursionError in imdb_contrastset_prompt
#1155 opened Jan 22, 2026 by pjavanrood
Fix non-existent evaluation splits in lextreme
#1151 opened Jan 22, 2026 by pjavanrood
Fix evaluation split config in lsat_qa
#1149 opened Jan 22, 2026 by pjavanrood
Improve NarrativeQA metrics and prompt structure
#1147 opened Jan 22, 2026 by pjavanrood
Fix key mismatch and context access in PubMedQA
#1143 opened Jan 22, 2026 by pjavanrood
Fix TypeError in real_toxicity_prompts
#1141 opened Jan 22, 2026 by pjavanrood
Fix column mismatch and metric in SimpleQA
#1139 opened Jan 22, 2026 by pjavanrood
Fix subset names in StoryCloze
#1137 opened Jan 22, 2026 by pjavanrood