Agent OS: Stop prompting. Start specifying.
-
Updated
Jun 9, 2026 - Python
Agent OS: Stop prompting. Start specifying.
Forge is the open-source runtime for Anthropic's Agent Skills standard — built for the agent that runs next to a service, in your environment, on infrastructure you already operate. Write a SKILL.md. Compile to a portable, hardened agent. Deploy it anywhere containers run: Kubernetes, on-prem, air-gapped, embedded in CI, or as an A2A endpoint.
Production-grade TypeScript AI runtime focused on reliability, governance, and reproducible LLM systems. Multi-provider gateway, agents, RAG, workflows, policy engine, audit trails, and deterministic testing — built for teams shipping AI in production.
Browser-based agent platform with a user workspace, admin console, product gateway, and per-user runtime isolation.
Unified execution runtime for LLM and ML programs.
L0: The Missing Reliability Substrate for AI. Streaming-first. Reliable. Replayable. Deterministic. Multimodal. Retries. Continuation. Fallbacks (provider & model). Consensus. Parallelization. Guardrails. Atomic event logs. Byte-for-byte replays.
Inference-aware runtime for AI coding agents that reuses execution state to reduce repeated reasoning, repo rereads, tool calls, and failure loops.
Unified Local AI Interface & LLM Runtime (Support GGUF, Ollama, OpenAI, Gemini, etc.). Insearch of building sovereign AI system ✨
L0: The Missing Reliability Substrate for AI. Streaming-first. Reliable. Replayable. Deterministic. Multimodal. Retries. Continuation. Fallbacks (provider & model). Consensus. Parallelization. Guardrails. Atomic event logs. Byte-for-byte replays.
LayerRun is a Rust-based local LLM runtime for memory-aware model execution, layer-wise loading, model inspection, and flexible inference serving.
An LLM agent runtime that treats context like virtual memory: swim-lane paging, semantic retrieval, and an inline stream-marker DSL for runtime control. Multi-provider (Anthropic, OpenAI, Google, xAI, Groq, local). MCP-integrated. Self-modifying agent mode (PLASTIC). Containerized deployment.
Run any LLM locally. Use it from any language. Deploy anywhere.
Multi-provider LLM runtime core: routing, key management, and resilient fallback execution for agent orchestration.
Agent memory runtime: short/long-term context, vector persistence, compression, and personalization primitives.
mindscript-runtime is the minimal, reference implementation of the MindScript engine. It provides: - a CLI for running .ms / .ms.md files - a parser that converts MindScript into an internal AST - a stage-based runtime that executes each stage sequentially - adapters for different LLM backends (OpenAI, Gemini, local stub)
Bootable Ubuntu image pre-loaded with LLMs, agent frameworks, and tooling - boot, SSH in, run an agent, zero setup.
A modular LLM inference runtime written in Rust.
Add a description, image, and links to the llm-runtime topic page so that developers can more easily learn about it.
To associate your repository with the llm-runtime topic, visit your repo's landing page and select "manage topics."