ds4cabs/OpenRegEval

OpenRegEval

CABS: ds4cabs | CABS: 2026 | status: MVP in progress | type: Evaluation Agent | domain: Regulatory Reasoning

Intern: Matías Ignacio Pinto Chávez
Project: OpenRegEval

Overview

OpenRegEval is a lightweight LLM evaluation agent for regulatory reasoning on FDA drug label content. It builds structured benchmarks, retrieves grounded evidence from openFDA and DailyMed, and scores model outputs for citation accuracy, factual grounding, and refusal behavior.
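
The retrieval and scoring steps described above could look roughly like the following. This is a minimal sketch, not the project's actual implementation: `EvalItem`, `ModelOutput`, `build_label_query`, and `score_output` are hypothetical names, the benchmark schema is assumed, and the three metrics are simple stand-ins (section-name matching, verbatim span matching, and a refuse-only-when-unanswerable check). The only external detail used is the public openFDA drug label endpoint and its `search` and `limit` query parameters; no request is sent.

```python
from dataclasses import dataclass
from typing import Dict, List
from urllib.parse import urlencode

# Public openFDA drug label endpoint.
OPENFDA_LABEL_URL = "https://api.fda.gov/drug/label.json"


def build_label_query(brand_name: str, limit: int = 1) -> str:
    """Build an openFDA label search URL for a brand name (no request is sent)."""
    params = {"search": f'openfda.brand_name:"{brand_name}"', "limit": limit}
    return f"{OPENFDA_LABEL_URL}?{urlencode(params)}"


@dataclass
class EvalItem:
    """One benchmark question with its grounded evidence (assumed schema)."""
    question: str
    evidence: Dict[str, str]  # label section name -> section text
    answerable: bool          # False if the label cannot answer the question


@dataclass
class ModelOutput:
    """A model response, already parsed into the fields the scorer needs."""
    answer: str
    cited_sections: List[str]
    quoted_spans: List[str]
    refused: bool


def score_output(item: EvalItem, output: ModelOutput) -> Dict[str, float]:
    """Score one output; each metric is a deliberately simple stand-in."""
    # Citation accuracy: share of cited sections that exist in the evidence.
    cited = output.cited_sections
    citation_accuracy = (
        sum(s in item.evidence for s in cited) / len(cited) if cited else 0.0
    )
    # Grounding: share of quoted spans found verbatim (case-insensitive).
    corpus = " ".join(item.evidence.values()).lower()
    spans = output.quoted_spans
    grounding = (
        sum(q.lower() in corpus for q in spans) / len(spans) if spans else 0.0
    )
    # Refusal: correct when the model refuses exactly the unanswerable items.
    refusal_correct = float(output.refused == (not item.answerable))
    return {
        "citation_accuracy": citation_accuracy,
        "grounding": grounding,
        "refusal_correct": refusal_correct,
    }


# Toy example with made-up label text (not real label content).
item = EvalItem(
    question="What is the maximum daily dose?",
    evidence={"dosage_and_administration": "The maximum dose is 10 units once daily."},
    answerable=True,
)
output = ModelOutput(
    answer="10 units once daily",
    cited_sections=["dosage_and_administration", "warnings"],
    quoted_spans=["10 units once daily"],
    refused=False,
)
scores = score_output(item, output)
query_url = build_label_query("ExampleDrug")
```

In this toy case one of the two cited sections is missing from the evidence, so citation accuracy is 0.5, while grounding and refusal both score 1.0. A real system would likely need fuzzier span matching and a way to parse citations out of free-text answers.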

Deliverables

  • MVP proposal and summary for a 2-week evaluation agent
  • Full project plan for a summer-length OpenRegEval system
  • Benchmarking and hallucination auditing workflows for FDA label reasoning

Tech Stack

Python, PyTorch, Hugging Face Transformers, PEFT/LoRA, openFDA API, DailyMed API, pandas, scikit-learn, Streamlit
