LLMint

Token economics library for Go. Provider abstractions, composable middleware for caching, cascading, dedup, batching, and distillation, with built-in cost tracking. Pure library -- no binaries.

Architecture

flowchart LR
    App[Application] --> Chain

    subgraph "Middleware Stack (composable)"
        Chain[llmint.Chain] --> Account[account]
        Account --> Dedup[dedup]
        Dedup --> Batch[batch]
        Batch --> PromptCache[promptcache]
        PromptCache --> Distill[distill]
        Distill --> Cascade[cascade]
    end

    subgraph Providers
        Cascade --> Anthropic[provider/anthropic]
        Cascade --> OpenAI[provider/openai]
        Cascade --> Mock[provider/mock]
    end

    subgraph "Bindings"
        CABI[cabi/ — C FFI] --> Chain
        Python[python/ — analytics] -.->|cost data| App
    end

Install

go get github.com/chitinhq/llmint

Requires Go 1.18+. Zero external dependencies -- the library is pure Go.

Core Types

Type	Purpose
`Provider`	Interface every LLM backend implements (`Complete`, `Name`, `Models`)
`Middleware`	`func(Provider) Provider` -- wraps providers with cross-cutting concerns
`Request` / `Response`	Canonical provider-agnostic input/output
`ModelInfo`	Per-model pricing: input, output, cache read/write per million tokens
`Usage`	Raw token counts + `ComputeCost(ModelInfo)` for USD calculation
`Savings`	Per-technique savings record; `TotalSavings()` aggregates a slice
`CacheStatus`	`CacheMiss` / `CacheHit` / `CachePartial`

Quick Start

import (
    "context"
    "github.com/chitinhq/llmint"
    "github.com/chitinhq/llmint/provider/mock"
    "github.com/chitinhq/llmint/middleware/dedup"
    "github.com/chitinhq/llmint/middleware/cascade"
)

// Basic completion
p := mock.New("claude-3-5-sonnet-20241022", "Hello!")
resp, err := p.Complete(context.Background(), &llmint.Request{
    Model:    "claude-3-5-sonnet-20241022",
    Messages: []llmint.Message{{Role: "user", Content: "Hi"}},
})

// Middleware composition (applied left-to-right, first = outermost)
wrapped := llmint.Chain(logging, rateLimit, cache)(baseProvider)

Middleware

Package	Purpose
`middleware/account`	Records usage entries (tokens, cost, duration) to a pluggable `Sink`
`middleware/dedup`	Caches responses by request hash; returns `CacheHit` on duplicates
`middleware/batch`	Queues requests, flushes on size threshold or time window
`middleware/promptcache`	Annotates requests with `cache_control: ephemeral` for provider-side prompt caching
`middleware/distill`	Replaces system prompts with shorter distilled equivalents from a `Library`
`middleware/cascade`	Escalates through model tiers (cheap to expensive) based on confidence scoring

Cascade Example

models := []cascade.Model{
    {Provider: haiku, Name: "haiku", Threshold: 0.8},
    {Provider: sonnet, Name: "sonnet", Threshold: 0.6},
    {Provider: opus, Name: "opus", Threshold: 0},  // always accept
}
p := cascade.New(models, cascade.WithMaxEscalations(2))(nil)

Providers

Package	Backend
`provider/anthropic`	Anthropic Messages API (Claude)
`provider/openai`	OpenAI Chat Completions API (GPT-4o, etc.)
`provider/mock`	Deterministic responses for testing

C Bindings

The cabi/ directory exposes LLMint as a shared library via cgo for use from C, Python, or any FFI-capable language:

cd cabi && make

Python Analytics

The python/ directory contains a separate Python package for cost analytics and reporting.

Development

go build ./...
go test ./...
golangci-lint run

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github		.github
cabi		cabi
examples		examples
middleware		middleware
provider		provider
python		python
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
chain.go		chain.go
chain_test.go		chain_test.go
go.mod		go.mod
integration_test.go		integration_test.go
llmint.go		llmint.go
llmint_test.go		llmint_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLMint

Architecture

Install

Core Types

Quick Start

Middleware

Cascade Example

Providers

C Bindings

Python Analytics

Development

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LLMint

Architecture

Install

Core Types

Quick Start

Middleware

Cascade Example

Providers

C Bindings

Python Analytics

Development

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages