plugin-testing

Here are 8 public repositories matching this topic...

sjnims / cc-plugin-eval

4-stage evaluation framework for testing Claude Code plugin component triggering. Validates skills, agents, and commands activate correctly via programmatic detection and LLM judgment.

cli typescript test-automation developer-tools evaluation-framework testing-framework claude llm ai-testing anthropic claude-code claude-agent-sdk plugin-testing

Updated May 12, 2026
TypeScript

Benchmark harness for A/B testing Claude Code plugins against OOLONG long-context reasoning tasks. Compare truncation vs RLM-RS recursive chunking strategies. Features Claude Code hooks integration, SQLite persistence, and comprehensive scoring aligned with the OOLONG paper methodology.

python nlp cli benchmark developer-tools chunking claude rlm oolong ai-evaluation llm long-context anthropic context-window claude-code plugin-testing recursive-language-model

Updated Apr 13, 2026
Python

imbflool / cc-plugin-eval

Star

🚀 Automate the evaluation of Claude Code plugin components to ensure accurate triggering of skills, agents, commands, and hooks.

cli typescript test-automation developer-tools evaluation-framework claude llm ai-testing anthropic claude-code claude-agent-sdk plugin-testing

Updated May 14, 2026
TypeScript

JSLEEKR / skilltest

Star

Testing harness for AI agent skills and plugins — contract-driven YAML specs with mocked environments

cli typescript testing-framework ai-agent claude-code agent-testing plugin-testing skill-testing

Updated Apr 4, 2026
TypeScript

codifycli / codify-plugin-test

Star

Testing framework for Codify plugins with complete lifecycle testing, IPC communication, and cross-platform support

plugin testing framework test ipc integration-testing codify plugin-testing

Updated Apr 23, 2026
TypeScript

ashak-odree / WordPress-Plugin-QA-Test-Report---easy.jobs

Star

Performed end-to-end manual testing and source code analysis on the easy.jobs WordPress Plugin (v2.7.1), identifying 8 bugs across 22 test cases covering authentication, job management, candidate management, security, and edge cases.

test-cases wordpress-plugin qa functional-testing software-testing bug-report manual-testing plugin-testing

Updated Apr 27, 2026

jeremylongshore / j-rig-skill-binary-eval

Sponsor

Star

Binary-criteria evaluation harness for Claude skills with planned extension to plugins, agents, and MCP servers. Score every change yes/no across 7 layers — package integrity, trigger quality, functional quality, regression protection, baseline value, model variance, rollout safety. Never gradients.

mcp regression-testing skill-evaluation ai-evaluation llm-eval claude-code plugin-testing eval-harness agent-eval binary-criteria

Updated May 13, 2026
TypeScript

jeremylongshore / intent-eval-lab

Sponsor

Star

Vendor-neutral research umbrella for measuring AI plugin, agent, and MCP server quality across CLI runtimes (Claude Code, Gemini CLI, Copilot CLI, Codex CLI).

mcp skill-discovery opentelemetry ai-evaluation gemini-cli claude-code plugin-testing cross-cli agent-eval invocation-rate

Updated May 15, 2026
Shell

Improve this page

Add a description, image, and links to the plugin-testing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the plugin-testing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

plugin-testing

Here are 8 public repositories matching this topic...

sjnims / cc-plugin-eval

zircote / oolong-pairs

imbflool / cc-plugin-eval

JSLEEKR / skilltest

codifycli / codify-plugin-test

ashak-odree / WordPress-Plugin-QA-Test-Report---easy.jobs

jeremylongshore / j-rig-skill-binary-eval

jeremylongshore / intent-eval-lab

Improve this page

Add this topic to your repo