The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Updated Mar 16, 2026 - Python
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
A conceptual framework for a high-scale Agentic AI orchestrator, inspired by enterprise-grade inference platforms.
AI inference platform architecture lab demonstrating admission control, fairness scheduling, bounded queues, and graceful degradation under burst traffic.
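As a rough illustration of the admission-control and bounded-queue ideas named above, here is a minimal sketch in Python. The `AdmissionController` class, its `capacity` parameter, and the `submit` method are hypothetical names chosen for this example, not APIs from any of the repositories listed here; real platforms layer fairness scheduling and graceful degradation on top of this basic fail-fast pattern.

```python
import queue

class AdmissionController:
    """Hypothetical sketch: reject requests beyond a fixed capacity
    (fail fast) instead of letting an unbounded backlog degrade
    latency for every caller."""

    def __init__(self, capacity: int):
        # Bounded queue: holds at most `capacity` pending requests.
        self.pending = queue.Queue(maxsize=capacity)

    def submit(self, request) -> bool:
        """Admit the request if there is room; otherwise shed load."""
        try:
            self.pending.put_nowait(request)
            return True   # admitted into the bounded queue
        except queue.Full:
            return False  # rejected: caller should back off or retry

controller = AdmissionController(capacity=2)
print(controller.submit("req-1"))  # True
print(controller.submit("req-2"))  # True
print(controller.submit("req-3"))  # False: queue full, request shed
```

Under burst traffic, the bounded queue caps tail latency at roughly `capacity` queued requests' worth of work, and rejected callers get an immediate signal to retry later rather than timing out.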