14-stage Fusion Pipeline for LLM token compression — reversible compression, AST-aware code analysis, intelligent content routing. Zero LLM inference cost. MIT licensed.
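The stage names above are the project's own. As a rough illustration of what AST-aware compression of code can look like in general, the sketch below uses Python's standard ast module to drop docstrings and re-emit a terser, token-cheaper version of a source file; it is an assumption-level example, not this pipeline's implementation.

```python
# Illustrative AST-aware code compression (not this project's pipeline):
# parse source, drop docstrings, and unparse a terser equivalent.
import ast

def strip_docstrings(source: str) -> str:
    """Drop docstrings from modules, classes, and functions, then re-emit the code."""
    tree = ast.parse(source)
    for node in ast.walk(tree):
        if not isinstance(node, (ast.Module, ast.ClassDef, ast.FunctionDef, ast.AsyncFunctionDef)):
            continue
        body = node.body
        if (body and isinstance(body[0], ast.Expr)
                and isinstance(body[0].value, ast.Constant)
                and isinstance(body[0].value.value, str)):
            node.body = body[1:] or [ast.Pass()]  # keep bodies non-empty
    return ast.unparse(tree)  # ast.unparse requires Python 3.9+
```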
JavaScript/TypeScript implementation of LLMLingua-2 (Experimental)
Python command-line tool for interacting with AI models through the OpenRouter API, Cloudflare AI Gateway, or a local self-hosted Ollama instance. Optionally supports Microsoft LLMLingua prompt token compression.
Lossless-first prompt compression for JSON, YAML, CSV, and Markdown. Library, CLI, MCP server, desktop app, and browser extension.
A self-improving knowledge base about LLM agent infrastructure
Rolling context compression for Claude Code — never hit the context wall. Auto-compresses old messages while keeping recent context verbatim. Zero config, zero latency. Works as a Claude Code plugin.
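A minimal sketch of the rolling idea that entry describes: compress everything older than a fixed window and pass recent messages through untouched. The summarize helper and the window size are assumptions for illustration, not this plugin's API.

```python
# Illustrative rolling context compression (not this plugin's code).
# Assumption: `summarize` stands in for any cheap, non-LLM summarizer/truncator.

def summarize(text: str, max_chars: int = 200) -> str:
    """Placeholder compressor: keep the head of each old message."""
    return text if len(text) <= max_chars else text[:max_chars] + " ..."

def roll_context(messages: list[dict], keep_verbatim: int = 6) -> list[dict]:
    """Compress everything except the most recent `keep_verbatim` messages."""
    old, recent = messages[:-keep_verbatim], messages[-keep_verbatim:]
    compressed = [{**m, "content": summarize(m["content"])} for m in old]
    return compressed + recent
```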
CUTIA: compress prompts while preserving quality
A curated list of strategies, tools, papers, and resources for reducing LLM token costs and improving efficiency in production.
This repository is the official implementation of Generative Context Distillation.
TOON for TYPO3 — a compact, human-readable, and token-efficient data format for AI prompts & LLM contexts. Perfect for ChatGPT, Gemini, Claude, Mistral, and OpenAI integrations (JSON ⇄ TOON).
LLMLingua-2 prompt compression hook for Claude Code — cut token usage by ~55%
Compress LLM Prompts and save 80%+ on GPT-4 in Python
API gateway for LLM prompt compression with policy enforcement built on LLMLingua. Demonstrates cost control, prompt safety, and LLM execution boundaries.
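Several entries here (the Claude Code hook, the GPT-4 cost saver, and this gateway) build on Microsoft's llmlingua package. A hedged usage sketch follows: argument names such as rate and use_llmlingua2 match recent llmlingua releases but may differ across versions, and the model name is the checkpoint commonly used with LLMLingua-2.

```python
# Hedged sketch of LLMLingua-2 style compression via the llmlingua package.
# Exact argument names can vary between llmlingua versions.
from llmlingua import PromptCompressor

compressor = PromptCompressor(
    model_name="microsoft/llmlingua-2-xlm-roberta-large-meetingbank",
    use_llmlingua2=True,
)

long_prompt = "..."  # the prompt or retrieved context to shrink
result = compressor.compress_prompt(long_prompt, rate=0.33)
print(result["compressed_prompt"])
```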
Enhance the performance and cost-efficiency of large-scale Retrieval Augmented Generation (RAG) applications. Learn to integrate vector search with traditional database operations and apply techniques like prefiltering, postfiltering, projection, and prompt compression.
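The retrieval techniques named above compose in a fixed order. The sketch below spells that order out with hypothetical helpers and field names (store, cosine, year, title, text) rather than the repository's actual database API: prefilter on metadata, rank by vector similarity, postfilter on score, then project only the fields the prompt needs.

```python
# Illustrative retrieval flow (hypothetical helpers, not this repository's API):
# prefilter -> vector search -> postfilter -> projection.
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query_embedding, store, year_min=2024, top_k=5, min_score=0.7):
    # Prefilter: cut the candidate set with a cheap metadata predicate before ranking.
    candidates = [d for d in store if d["year"] >= year_min]
    # Vector search: rank remaining documents by embedding similarity.
    scored = sorted(candidates,
                    key=lambda d: cosine(d["embedding"], query_embedding),
                    reverse=True)[:top_k]
    # Postfilter: drop weak matches after ranking.
    hits = [d for d in scored if cosine(d["embedding"], query_embedding) >= min_score]
    # Projection: keep only the fields that will enter the prompt, saving tokens.
    return [{"title": d["title"], "text": d["text"]} for d in hits]
```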
End-to-End Python implementation of CompactPrompt (Choi et al., 2025): a unified pipeline for LLM prompt and data compression. Features modular compression pipeline with dependency-driven phrase pruning, reversible n-gram encoding, K-means quantization, and embedding-based exemplar selection. Achieves 2-4x token reduction while preserving accuracy.
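Of the CompactPrompt stages listed, reversible n-gram encoding is the easiest to illustrate: frequent phrases are swapped for short placeholder codes plus a codebook, so the original text can be restored exactly. The sketch below shows the general idea under an assumed "§N" placeholder syntax; it is not the paper's implementation.

```python
# Sketch of reversible n-gram encoding (illustrative, not CompactPrompt's code).
# Frequent word bigrams are swapped for short codes; the codebook makes it reversible.
from collections import Counter

MARK = "\u00a7"  # '§': assumed placeholder marker, must not occur in the input text

def encode(text: str, max_codes: int = 8):
    """Replace the most frequent word bigrams with short codes; return text + codebook."""
    words = text.split()
    bigrams = Counter(zip(words, words[1:]))
    codebook = {}
    for i, (bigram, count) in enumerate(bigrams.most_common(max_codes)):
        if count < 2:
            break
        codebook[f"{MARK}{i}"] = " ".join(bigram)
    encoded = text
    for code, phrase in codebook.items():
        encoded = encoded.replace(phrase, code)
    return encoded, codebook

def decode(encoded: str, codebook: dict) -> str:
    """Expand codes back to phrases; longer codes first so '§1' never clobbers '§10'."""
    for code, phrase in sorted(codebook.items(), key=lambda kv: -len(kv[0])):
        encoded = encoded.replace(code, phrase)
    return encoded
```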
A fast, Unix-style CLI tool for semantic prompt compression. Cuts LLM prompt tokens by 10-20x with >90% fidelity, saving costs and latency.
This repository contains the code and data of the paper titled "FrugalPrompt: Reducing Contextual Overhead in Large Language Models via Token Attribution."
LLM context compression proxy — 40-70% token savings, zero code changes
PirateBao is a TypeScript/Bun agent-skill package for terse, pirate-speak AI coding replies that preserve technical detail while cutting filler. It ships with hooks, a compressor CLI, OpenCode/Codex/Claude/Gemini cargo, .bao validation, npmjs gates, and token eval checks.
Prompt Cloud Intent Compression