feat: add Avian as a cloud LLM inference provider by avianion · Pull Request #8666 · mudler/LocalAI

avianion · 2026-02-27T04:26:23Z

Summary

Adds Avian as a new Go backend for LocalAI, enabling users to access cloud-hosted LLMs through Avian's OpenAI-compatible API at https://api.avian.io/v1.

Backend

New Go gRPC backend at backend/go/avian/ following the existing huggingface backend pattern
Proxies chat completion requests (both streaming and non-streaming) to the Avian API
Supports structured messages, temperature, top_p, max_tokens, and stop sequences
Authentication via AVIAN_API_KEY environment variable
Configurable base URL via AVIAN_API_BASE environment variable (defaults to https://api.avian.io/v1)

Gallery Models

Four models available out of the box:

Model	Context	Output	Input Price	Output Price
`deepseek/deepseek-v3.2`	164K	65K	$0.26/1M	$0.38/1M
`moonshotai/kimi-k2.5`	131K	8K	$0.45/1M	$2.20/1M
`z-ai/glm-5`	131K	16K	$0.30/1M	$2.55/1M
`minimax/minimax-m2.5`	1M	1M	$0.30/1M	$1.10/1M

Usage

# Set API key
export AVIAN_API_KEY=your-api-key

# Install the model from the gallery
local-ai models install avian-deepseek-v3.2

# Or use a model config YAML
cat > models/avian-deepseek.yaml << EOF
name: avian-deepseek
backend: avian
parameters:
  model: deepseek/deepseek-v3.2
  max_tokens: 8192
context_size: 164000
EOF

# Query the model
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model":"avian-deepseek-v3.2","messages":[{"role":"user","content":"Hello!"}]}'

Build Infrastructure

Backend definition in Makefile (BACKEND_AVIAN = avian|golang|.|false|true)
CI workflow entries for Linux (amd64/arm64) and macOS (metal/darwin)
Backend index.yaml entries with OCI image references
Added to .NOTPARALLEL and docker-build-backends targets

Files Changed

backend/go/avian/ - Go gRPC backend implementation (main.go, avian.go, Makefile, run.sh, package.sh)
gallery/avian.yaml - Base model configuration template
gallery/index.yaml - Four Avian model entries
backend/index.yaml - Backend metadata and OCI image entries
Makefile - Backend build targets
.github/workflows/backend.yml - CI build matrix entries

Test plan

Backend compiles: make -C backend/go/avian
Docker build succeeds: BACKEND=avian make docker-build-avian
Model loads with valid AVIAN_API_KEY
Chat completions work (non-streaming)
Chat completions work (streaming)
Gallery model install works: local-ai models install avian-deepseek-v3.2
Error handling: missing API key returns clear error message

cc @mudler

netlify · 2026-02-27T04:26:28Z

✅ Deploy Preview for localai ready!

Name	Link
🔨 Latest commit	`023abe2`
🔍 Latest deploy log	https://app.netlify.com/projects/localai/deploys/69ab42c401003b0008f6c3d6
😎 Deploy Preview	https://deploy-preview-8666--localai.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

avianion · 2026-02-27T21:48:28Z

Hey @mudler, would love your review on this when you get a chance. Happy to address any feedback!

avianion · 2026-03-05T06:13:50Z

Friendly follow-up — this PR is still active and ready for review. Would appreciate a look when you get a chance! cc @mudler

avianion · 2026-03-05T06:18:44Z

Friendly follow-up — this PR is still active and ready for review. All feedback has been addressed. Would appreciate a look when you get a chance! cc @mudler

avianion · 2026-03-05T08:37:15Z

Hey @mudler — friendly follow-up on this PR. Avian is an OpenAI-compatible inference provider that's already live and powering apps like ISEKAI ZERO. This is a lightweight integration (standard OpenAI-compatible endpoint) and we're happy to address any feedback or make adjustments. Would love to get this merged if you have a moment to review. Thanks!

Add Avian (https://avian.io) as a Go backend that proxies requests to the Avian OpenAI-compatible API at https://api.avian.io/v1. Backend implementation: - Go gRPC backend at backend/go/avian/ following the huggingface backend pattern - Supports chat completions with structured messages and streaming (SSE) - Authentication via AVIAN_API_KEY environment variable - Configurable base URL via AVIAN_API_BASE environment variable Gallery models: - deepseek/deepseek-v3.2: 164K context, $0.26/$0.38 per 1M tokens - moonshotai/kimi-k2.5: 131K context, $0.45/$2.20 per 1M tokens - z-ai/glm-5: 131K context, $0.30/$2.55 per 1M tokens - minimax/minimax-m2.5: 1M context, $0.30/$1.10 per 1M tokens Build infrastructure: - Backend definition in Makefile (golang backend) - CI workflow entries for Linux (amd64/arm64) and macOS (metal) - Backend index.yaml entries with OCI image references Signed-off-by: Kyle D <deximia@hotmail.com>

github-actions bot added area/ai-model dependencies labels Feb 27, 2026

avianion force-pushed the feat/add-avian-provider branch 2 times, most recently from c76099f to c64030a Compare February 27, 2026 21:08

avianion force-pushed the feat/add-avian-provider branch from c64030a to 023abe2 Compare March 6, 2026 21:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add Avian as a cloud LLM inference provider#8666

feat: add Avian as a cloud LLM inference provider#8666
avianion wants to merge 1 commit intomudler:masterfrom
avianion:feat/add-avian-provider

avianion commented Feb 27, 2026

Uh oh!

netlify bot commented Feb 27, 2026 •

edited

Loading

Uh oh!

avianion commented Feb 27, 2026

Uh oh!

avianion commented Mar 5, 2026

Uh oh!

avianion commented Mar 5, 2026

Uh oh!

avianion commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

avianion commented Feb 27, 2026

Summary

Backend

Gallery Models

Usage

Build Infrastructure

Files Changed

Test plan

Uh oh!

netlify bot commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for localai ready!

Uh oh!

avianion commented Feb 27, 2026

Uh oh!

avianion commented Mar 5, 2026

Uh oh!

avianion commented Mar 5, 2026

Uh oh!

avianion commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

netlify bot commented Feb 27, 2026 •

edited

Loading