CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies
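The core trick behind tools like this is simple enough to sketch: run the command, then compress its output before it reaches the model. A minimal Python illustration, where the dedup/truncation heuristics and the 40-line budget are assumptions rather than this project's actual rules:

```python
# Toy output-compacting command proxy: collapse repeated lines,
# then keep only the head and tail of long output.
import subprocess
import sys

MAX_LINES = 40  # assumed budget; real tools tune this per command

def compact(output: str, max_lines: int = MAX_LINES) -> str:
    """Collapse consecutive duplicate lines, then keep head and tail."""
    deduped, prev, run = [], None, 0
    for line in output.splitlines():
        if line == prev:
            run += 1
            continue
        if run > 1:
            deduped.append(f"  ... ({run - 1} repeated lines omitted)")
        deduped.append(line)
        prev, run = line, 1
    if run > 1:
        deduped.append(f"  ... ({run - 1} repeated lines omitted)")
    if len(deduped) <= max_lines:
        return "\n".join(deduped)
    head = deduped[: max_lines // 2]
    tail = deduped[-(max_lines // 2):]
    omitted = len(deduped) - len(head) - len(tail)
    return "\n".join(head + [f"  ... ({omitted} lines omitted)"] + tail)

if __name__ == "__main__":
    # Usage: python proxy.py <command> [args...]
    result = subprocess.run(sys.argv[1:], capture_output=True, text=True)
    print(compact(result.stdout + result.stderr))
    sys.exit(result.returncode)
```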
A smart context filter that removes noise, improves responses, and reduces token usage by up to 90%
Automatic prompt caching for Claude Code. Cuts token costs by up to 90% on repeated file reads, bug fix sessions, and long coding conversations - zero config.
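The underlying mechanism is Anthropic's prompt caching. A minimal manual sketch of what such a tool automates: marking a large, stable prefix as cacheable so repeated reads bill at the cached-input rate. The model ID and file path are placeholders:

```python
# Mark a stable prefix (here, a file's contents) as cacheable so
# subsequent calls with the same prefix reuse the server-side cache.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the env

with open("src/big_module.py") as f:  # hypothetical file
    file_context = f.read()

response = client.messages.create(
    model="claude-sonnet-4-20250514",  # placeholder model ID
    max_tokens=1024,
    system=[
        {
            "type": "text",
            "text": "You are reviewing this file:\n\n" + file_context,
            # Everything up to and including this block is cached;
            # repeated reads of the same file bill at the cached rate.
            "cache_control": {"type": "ephemeral"},
        }
    ],
    messages=[{"role": "user", "content": "Any obvious bugs in this file?"}],
)
print(response.content[0].text)
```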
The context intelligence layer for AI coding agents: it compresses noise, routes content to the right strategy, preserves session state across compactions, and surfaces the files that actually matter.
💰 Save money on AI API costs! 76% token reduction, Auto-Fix token limits, Universal AI compatibility. Cline • Copilot • Claude • Cursor
Stop overpaying to run your agents. Kalibr routes every request to lower-cost model and tool paths without degrading performance.
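Kalibr's actual routing policy isn't described here, but the general pattern is easy to sketch: a router that sends cheap-to-answer requests to a cheaper model and reserves the expensive one for hard prompts. Model names, prices, and the length heuristic below are all assumptions:

```python
# Toy cost-aware model router: long or "hard" prompts go premium.
from dataclasses import dataclass

@dataclass
class Route:
    model: str
    usd_per_1m_input_tokens: float

CHEAP = Route("small-model", 0.25)       # hypothetical
PREMIUM = Route("frontier-model", 3.00)  # hypothetical

HARD_MARKERS = ("prove", "refactor", "architecture", "multi-step")

def pick_route(prompt: str) -> Route:
    """Crude heuristic: escalate only when the prompt looks expensive."""
    if len(prompt) > 2000 or any(m in prompt.lower() for m in HARD_MARKERS):
        return PREMIUM
    return CHEAP

if __name__ == "__main__":
    for p in ("What does HTTP 304 mean?", "Refactor this module into layers..."):
        r = pick_route(p)
        print(f"{p[:40]!r} -> {r.model} (${r.usd_per_1m_input_tokens}/1M input)")
```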
Hook it in front of your public S3 bucket and enjoy reduced bandwidth costs
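A toy version of the pattern: a caching reverse proxy that serves repeat downloads from local disk instead of hitting the bucket. The bucket URL is a placeholder, and a real proxy would also honor Cache-Control and ETag headers:

```python
# Minimal caching reverse proxy for a public bucket: cache hits never
# touch the origin, so they never bill bucket bandwidth.
import hashlib
import pathlib
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

ORIGIN = "https://my-public-bucket.s3.amazonaws.com"  # hypothetical
CACHE = pathlib.Path("cache")
CACHE.mkdir(exist_ok=True)

class CachingProxy(BaseHTTPRequestHandler):
    def do_GET(self):
        cached = CACHE / hashlib.sha256(self.path.encode()).hexdigest()
        if cached.exists():
            body = cached.read_bytes()              # cache hit: no origin traffic
        else:
            with urllib.request.urlopen(ORIGIN + self.path) as resp:
                body = resp.read()                  # cache miss: fetch once
            cached.write_bytes(body)
        self.send_response(200)
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

if __name__ == "__main__":
    HTTPServer(("127.0.0.1", 8080), CachingProxy).serve_forever()
```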
CLI proxy for coding agents that cuts noisy terminal output while preserving command behavior
Save 30-60% on Claude Code costs: proven strategies, real benchmarks, copy-paste configs, and interactive tools
Minimize LLM tokens from Python objects, code, logs, diffs, and more. Zero deps, ultra-lightweight.
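One plausible way such a library works: recursively summarize objects, truncating long strings and collections so the serialized form stays small. The limits below are arbitrary, not the library's defaults:

```python
# Token-frugal summaries of arbitrary Python data.
def squeeze(obj, max_str=80, max_items=5):
    """Return a compact summary of obj, noting what was trimmed."""
    if isinstance(obj, str):
        if len(obj) <= max_str:
            return obj
        return obj[:max_str] + f"...(+{len(obj) - max_str} chars)"
    if isinstance(obj, dict):
        out = {k: squeeze(v, max_str, max_items)
               for k, v in list(obj.items())[:max_items]}
        if len(obj) > max_items:
            out["..."] = f"+{len(obj) - max_items} more keys"
        return out
    if isinstance(obj, (list, tuple)):
        out = [squeeze(v, max_str, max_items) for v in obj[:max_items]]
        if len(obj) > max_items:
            out.append(f"...(+{len(obj) - max_items} more items)")
        return out
    return obj

if __name__ == "__main__":
    log = {"level": "ERROR", "trace": "x" * 500, "frames": list(range(100))}
    print(squeeze(log))
```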
Biological code organization system with 1,029+ production-ready snippets - 95% token reduction for Claude/GPT with AI-powered discovery & offline packs
A Kubernetes resource recommender that extends the API server to provide native suggestions.
Small utility that polls RPC endpoints for Base / Optimism / Arbitrum, writes timestamped JSON reports into `reports/`, and can post to a webhook.
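A minimal version of what's described: query each chain's JSON-RPC endpoint with `eth_blockNumber`, write a timestamped report into `reports/`, and optionally POST it to a webhook. The endpoint URLs are the commonly published public RPCs and may be rate-limited:

```python
# Poll L2 RPC endpoints and write a timestamped JSON report.
import json
import pathlib
import urllib.request
from datetime import datetime, timezone

ENDPOINTS = {
    "base": "https://mainnet.base.org",
    "optimism": "https://mainnet.optimism.io",
    "arbitrum": "https://arb1.arbitrum.io/rpc",
}
WEBHOOK_URL = None  # set to a URL to enable posting

def rpc_block_number(url: str) -> int:
    payload = json.dumps(
        {"jsonrpc": "2.0", "id": 1, "method": "eth_blockNumber", "params": []}
    ).encode()
    req = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req, timeout=10) as resp:
        return int(json.load(resp)["result"], 16)  # hex -> int

if __name__ == "__main__":
    stamp = datetime.now(timezone.utc).strftime("%Y%m%dT%H%M%SZ")
    report = {"timestamp": stamp}
    for name, url in ENDPOINTS.items():
        try:
            report[name] = {"block": rpc_block_number(url)}
        except Exception as exc:  # keep polling the other chains
            report[name] = {"error": str(exc)}
    out = pathlib.Path("reports") / f"{stamp}.json"
    out.parent.mkdir(exist_ok=True)
    out.write_text(json.dumps(report, indent=2))
    if WEBHOOK_URL:
        urllib.request.urlopen(urllib.request.Request(
            WEBHOOK_URL, data=json.dumps(report).encode(),
            headers={"Content-Type": "application/json"}))
```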
Pi extension that turns noisy CLI output into compact structured results - fewer tokens, full logs preserved.
Claude Code settings.json auto-config tool for quickly switching API_KEY, AUTH_TOKEN, and model configs across multi-model setups. Secure backups and redacted previews. 🐙
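The gist of such a switcher, sketched with a hypothetical profiles table: back up settings.json first, merge a profile's credentials and model into the `env` block (the layout Claude Code's settings file uses), and print only masked values:

```python
# Profile switcher for ~/.claude/settings.json with backup and
# redacted preview. Profile contents and model IDs are placeholders.
import json
import pathlib
import shutil
from datetime import datetime

SETTINGS = pathlib.Path.home() / ".claude" / "settings.json"

PROFILES = {  # hypothetical profiles
    "work": {"ANTHROPIC_AUTH_TOKEN": "sk-work-...", "ANTHROPIC_MODEL": "model-a"},
    "personal": {"ANTHROPIC_AUTH_TOKEN": "sk-home-...", "ANTHROPIC_MODEL": "model-b"},
}

def mask(value: str) -> str:
    return value[:6] + "****" if len(value) > 6 else "****"

def switch(profile: str) -> None:
    backup = SETTINGS.with_name(f"settings.{datetime.now():%Y%m%d%H%M%S}.bak")
    shutil.copy2(SETTINGS, backup)                 # secure backup first
    settings = json.loads(SETTINGS.read_text())
    settings.setdefault("env", {}).update(PROFILES[profile])
    SETTINGS.write_text(json.dumps(settings, indent=2))
    for k, v in PROFILES[profile].items():         # redacted preview only
        print(f"{k} = {mask(v)}")

if __name__ == "__main__":
    switch("work")
```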
🎯 Optimize LLM token usage by 70-90% with smart context ranking, reducing costs while maintaining quality and performance.
Build a machine-learning model that predicts the probability of device failure, minimizing both false positives and false negatives. The target column, failure, is binary: 0 for non-failure, 1 for failure.
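A reasonable baseline for the stated task: a classifier with class weights to handle the rare failure class, evaluated on both error types. The CSV name and feature set are placeholders for whatever the dataset provides:

```python
# Baseline failure classifier with imbalance-aware weighting.
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report, confusion_matrix
from sklearn.model_selection import train_test_split

df = pd.read_csv("device_telemetry.csv")  # hypothetical file
X = df.drop(columns=["failure"])
y = df["failure"]                         # 0 = non-failure, 1 = failure

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# class_weight="balanced" penalizes missing the rare failure class,
# trading some false positives for fewer false negatives.
model = RandomForestClassifier(
    n_estimators=300, class_weight="balanced", random_state=42
)
model.fit(X_train, y_train)

pred = model.predict(X_test)
tn, fp, fn, tp = confusion_matrix(y_test, pred).ravel()
print(f"false positives: {fp}, false negatives: {fn}")
print(classification_report(y_test, pred, digits=3))
```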
Cut your OpenClaw / ZeroClaw token bill. Find which model earns its cost. Prove whether optimizations actually work. Local, no upload.
Nyquest — Semantic Compression Proxy for LLMs. 350+ rules, local LLM stage, 15-75% token savings. Full Rust stack.
Smart Context Optimization for LLMs - Reduce tokens by 66%, save 40% on API costs. Intelligent ranking and selection of relevant context using embeddings, keywords, and semantic analysis.
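The description names embeddings, keywords, and semantic analysis; this sketch covers only the keyword half: score each context chunk against the query by term overlap, then greedily fill a token budget (tokens approximated here as whitespace words):

```python
# Keyword-overlap context ranking under a token budget.
import re
from collections import Counter

def tokens(text: str) -> list[str]:
    return re.findall(r"[a-z0-9_]+", text.lower())

def rank_context(query: str, chunks: list[str], budget: int = 200) -> list[str]:
    """Keep the highest-overlap chunks that fit within the budget."""
    q = Counter(tokens(query))
    scored = sorted(
        chunks,
        key=lambda c: sum(min(q[t], n) for t, n in Counter(tokens(c)).items()),
        reverse=True,
    )
    kept, used = [], 0
    for chunk in scored:
        cost = len(chunk.split())
        if used + cost <= budget:   # greedy fill of the token budget
            kept.append(chunk)
            used += cost
    return kept

if __name__ == "__main__":
    docs = ["retry logic lives in http_client.py",
            "team lunch is at noon",
            "http retries use exponential backoff"]
    print(rank_context("why do http retries back off?", docs))
```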