@nick-galluzzo
Owner

No description provided.

…tion

Adds stability testing based on real commit data and curated examples. Includes a CLI for running
single, batch, and comparative stability tests with configurable parameters, and provides detailed
reports on evaluation consistency and model performance metrics.

Adds a full validation suite covering obvious cases, ranking consistency, score distribution, and edge cases,
so the commit message evaluator can be verified before benchmarks are run. Includes rich CLI output and
detailed test reporting.
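
For readers unfamiliar with the terms, the sketch below shows one way a stability test and a ranking-consistency check could work. It is illustrative only and is not taken from this pull request: the function names `stability_report` and `ranking_is_consistent`, the `evaluate` callable, and the choice of Python are all assumptions.

```python
import statistics
from typing import Callable, Dict


def stability_report(
    evaluate: Callable[[str], float], message: str, runs: int = 5
) -> Dict[str, float]:
    """Score the same commit message several times and summarize the spread.

    `evaluate` is a hypothetical stand-in for the commit message evaluator.
    """
    if runs < 2:
        raise ValueError("runs must be at least 2 to measure spread")
    scores = [evaluate(message) for _ in range(runs)]
    return {
        "mean": statistics.mean(scores),
        "stdev": statistics.stdev(scores),  # sample standard deviation
        "range": max(scores) - min(scores),
    }


def ranking_is_consistent(
    evaluate: Callable[[str], float], better: str, worse: str
) -> bool:
    """Check that a clearly better message scores above a clearly worse one."""
    return evaluate(better) > evaluate(worse)


if __name__ == "__main__":
    # Toy deterministic evaluator (an assumption): longer messages score higher.
    def toy_evaluate(message: str) -> float:
        return min(len(message) / 10.0, 5.0)

    print(stability_report(toy_evaluate, "fix: handle empty diff input"))
    print(ranking_is_consistent(toy_evaluate, "fix: handle empty diff input", "wip"))
```

A small standard deviation and range across repeated runs would indicate a stable evaluator. With a real, non-deterministic model behind `evaluate`, the spread would typically be nonzero.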
@nick-galluzzo nick-galluzzo merged commit 362b66e into main Aug 5, 2025
3 checks passed
@nick-galluzzo nick-galluzzo deleted the feat/benchmarking-scripts branch August 5, 2025 07:16