Skip to content
#

oolong

Here is 1 public repository matching this topic...

oolong-pairs

Benchmark harness for A/B testing Claude Code plugins against OOLONG long-context reasoning tasks. Compare truncation vs RLM-RS recursive chunking strategies. Features Claude Code hooks integration, SQLite persistence, and comprehensive scoring aligned with the OOLONG paper methodology.

  • Updated Jan 20, 2026
  • Python

Improve this page

Add a description, image, and links to the oolong topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the oolong topic, visit your repo's landing page and select "manage topics."

Learn more