doublewordai

Popular repositories

  1. control-layer Public

    The world’s fastest AI model gateway (450x less overhead than LiteLLM). Unified access to LLMs across endpoints (OpenAI, self-hosted, etc.) behind a single authentication layer - with API key gener…

    Rust 48 5

  2. deepseek-reddit-agent Public

    An example notebook that shows how to build an LLM agent that scrapes information from Reddit and summarizes key bullets using a self-hosted DeepSeek-R1-Distill-Llama-8B deployed with Titan Tak…

    Jupyter Notebook 11 2

  3. autobatcher Public

    Drop-in AsyncOpenAI replacement that transparently batches requests

    Python 8 1

  4. zerodp Public

    ZeroDP implements an efficient zero-copy data parallel approach for serving Mixture-of-Experts (MoE) models, where expert weights are shared across data parallel ranks via CUDA IPC (Inter-Process C…

    Python 3 1

  5. inference-stack Public

    The Doubleword Inference Stack is the easiest and most performant way to run GenAI infrastructure in your private environment.

    Go Template 2

  6. outlet Public

    A high-performance Axum middleware for capturing and correlating HTTP requests and responses with full streaming support.

    Rust 2

Repositories

Showing 10 of 38 repositories
