# System architecture

The Control Plane is the administration layer between client applications and the InferaDB Engine, handling authentication, multi-tenant organization management, vault lifecycle, and access control.

Without a centralized control plane, every service consuming InferaDB would need to independently manage credentials, enforce RBAC, and coordinate vault provisioning. Control consolidates these concerns into a single API surface with consistent security guarantees.

## Component layers

Control follows a layered architecture. Each layer has a single responsibility and communicates only with adjacent layers.

```mermaid
graph TB
    subgraph "Client layer"
        Dashboard[Web Dashboard]
        CLI[CLI Tools]
        SDK[SDKs]
    end

    subgraph "API layer"
        REST[HTTP REST API<br/>Port 9090]
    end

    subgraph "Handler layer (api crate)"
        Auth[Auth handlers]
        Org[Organization handlers]
        Vault[Vault handlers]
        Client[Client handlers]
        Token[Token handlers]
        Schema[Schema handlers]
        Team[Team handlers]
        Audit[Audit log handlers]
    end

    subgraph "Core layer (core crate)"
        Crypto[Cryptography]
        JWT[JWT service]
        Email[Email service]
        WebAuthn[WebAuthn]
        RateLimit[Rate limiting]
    end

    subgraph "Storage layer"
        Ledger["Ledger SDK<br/>(Production)"]
        Memory["In-memory<br/>(Development)"]
    end

    subgraph "External services"
        SMTP[SMTP server]
        Metrics[Prometheus]
    end

    Dashboard --> REST
    CLI --> REST
    SDK --> REST

    REST --> Auth
    REST --> Org
    REST --> Vault
    REST --> Client
    REST --> Token

    Auth --> Email
    Token --> JWT
    JWT --> Crypto

    Vault --> Ledger
    Vault --> Memory

    Email --> SMTP
    REST --> Metrics
```

## Crate dependency chain

```
inferadb-control (bin) --> api --> core --> storage --> inferadb-common-storage --> Ledger
```

| Crate | Purpose |
| --- | --- |
| control | Binary entrypoint |
| api | HTTP handlers, middleware, route definitions |
| core | Auth, crypto, JWT, email, rate limiting |
| config | CLI configuration (clap::Parser) |
| storage | Storage factory, backend abstraction |
| types | Error enum, Result alias |
| const | Compile-time constants (limits, durations, auth) |
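
For illustration, the error surface exposed by the types crate might look roughly like the following snafu-based sketch; the variant names are hypothetical, not taken from the codebase.

```rust
use snafu::Snafu;

// Hypothetical error enum in the spirit of the `types` crate; the real
// variants and messages will differ.
#[derive(Debug, Snafu)]
#[snafu(visibility(pub))]
pub enum Error {
    #[snafu(display("entity not found: {entity}"))]
    NotFound { entity: String },

    #[snafu(display("storage backend error: {message}"))]
    Storage { message: String },

    #[snafu(display("authentication failed"))]
    Unauthorized,
}

/// Crate-wide Result alias consumed by the api, core, and storage crates.
pub type Result<T, E = Error> = std::result::Result<T, E>;
```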

## Deployment topologies

### Single instance

Suitable for development and small deployments. Uses in-memory storage (data lost on restart).

```mermaid
graph LR
    subgraph "Load balancer (optional)"
        LB[Load balancer<br/>:443 TLS]
    end

    subgraph "Control"
        API[inferadb-control<br/>HTTP: 9090]
    end

    subgraph "Services"
        SMTP[SMTP server]
        Prom[Prometheus]
    end

    LB --> API
    API --> SMTP
    API --> Prom
```

### Multi-instance (production)

Requires the Ledger backend. Each instance needs a unique worker ID (0–1023) for Snowflake ID generation.

```mermaid
graph TB
    subgraph "Load balancer"
        LB[Load balancer<br/>TLS termination]
    end

    subgraph "Control instances"
        API1[Instance 1<br/>Worker ID: 0]
        API2[Instance 2<br/>Worker ID: 1]
        API3[Instance 3<br/>Worker ID: 2]
    end

    subgraph "Ledger cluster"
        Ledger1[(Node 1)]
        Ledger2[(Node 2)]
        Ledger3[(Node 3)]
    end

    LB --> API1
    LB --> API2
    LB --> API3

    API1 --> Ledger1
    API1 --> Ledger2
    API1 --> Ledger3

    API2 --> Ledger1
    API2 --> Ledger2
    API2 --> Ledger3

    API3 --> Ledger1
    API3 --> Ledger2
    API3 --> Ledger3
```

Worker IDs are assigned statically via `--worker-id` or `INFERADB__CONTROL__WORKER_ID`. In Kubernetes, derive the ID from the StatefulSet pod ordinal.
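
As an illustration, a hypothetical startup helper could honor the explicit setting first and then fall back to the hostname ordinal; the function and its parsing rules are assumptions, not the actual implementation.

```rust
// Illustrative only: resolve the Snowflake worker ID from configuration or,
// failing that, from a StatefulSet pod name such as "control-2".
fn resolve_worker_id() -> Result<u16, String> {
    // Explicit configuration wins (the flag maps to this env var).
    if let Ok(id) = std::env::var("INFERADB__CONTROL__WORKER_ID") {
        return id.parse::<u16>().map_err(|e| e.to_string());
    }
    // Fall back to the trailing ordinal of the pod hostname.
    let hostname = std::env::var("HOSTNAME").map_err(|e| e.to_string())?;
    let ordinal = hostname
        .rsplit('-')
        .next()
        .and_then(|s| s.parse::<u16>().ok())
        .ok_or_else(|| format!("no ordinal in hostname {hostname}"))?;
    if ordinal > 1023 {
        return Err(format!("worker ID {ordinal} is outside 0-1023"));
    }
    Ok(ordinal)
}
```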

## ID generation

All entities use 64-bit Snowflake IDs: timestamp (41 bits) | worker_id (10 bits) | sequence (12 bits).

- Up to 4096 IDs per millisecond per worker
- Custom epoch: 2024-01-01T00:00:00Z
- Serialized as strings in JSON responses to avoid JavaScript integer-precision loss
- Collision detection: worker ID registration in Ledger with a 30-second TTL and a 10-second heartbeat
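
A minimal sketch of that bit layout follows; the constant and function names are illustrative.

```rust
// 2024-01-01T00:00:00Z as Unix milliseconds (the custom epoch listed above).
const EPOCH_MS: u64 = 1_704_067_200_000;

// Pack millisecond timestamp (41 bits), worker ID (10 bits), and sequence
// (12 bits) into a single 64-bit ID.
fn compose_id(now_ms: u64, worker_id: u64, sequence: u64) -> u64 {
    let timestamp = now_ms - EPOCH_MS;
    (timestamp << 22) | (worker_id << 12) | sequence
}

// Recover the three components from an existing ID.
fn decompose_id(id: u64) -> (u64, u64, u64) {
    let timestamp_ms = (id >> 22) + EPOCH_MS; // upper 41 bits
    let worker_id = (id >> 12) & 0x3FF;       // next 10 bits (0-1023)
    let sequence = id & 0xFFF;                // lowest 12 bits (0-4095)
    (timestamp_ms, worker_id, sequence)
}
```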

## Storage backends

### Ledger (production)

Data organized in Ledger's key-value keyspace, scoped by entity type:

```mermaid
graph TB
    subgraph "Ledger keyspace"
        subgraph "Users"
            U1["users/{id}"]
            U2["users_by_name/{name}"]
            U3["user_emails/{id}"]
            U4["user_emails_by_email/{email}"]
        end

        subgraph "Organizations"
            O1["organizations/{id}"]
            O2["organizations_by_name/{name}"]
            O3["org_members/{org_id}/{user_id}"]
            O4["org_members_by_user/{user_id}"]
        end

        subgraph "Vaults"
            V1["vaults/{id}"]
            V2["vaults_by_org/{org_id}/{vault_id}"]
        end

        subgraph "Clients"
            C1["clients/{id}"]
            C2["clients_by_org/{org_id}/{client_id}"]
            C3["certificates/{id}"]
            C4["certificates_by_client/{client_id}"]
        end

        subgraph "Sessions and tokens"
            S1["sessions/{id}"]
            S2["sessions_by_user/{user_id}/{session_id}"]
            S3["refresh_tokens/{id}"]
        end

        subgraph "System"
            SYS1["workers/active/{worker_id}"]
            SYS2["jti_replay/{jti}"]
        end
    end
```
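
For illustration, keys for this layout can be built with simple helpers like the ones below; the key formats come from the diagram, while the helper functions themselves are hypothetical.

```rust
// Primary record for a vault.
fn vault_key(vault_id: u64) -> String {
    format!("vaults/{vault_id}")
}

// Secondary index: prefix-scanning "vaults_by_org/{org_id}/" lists an
// organization's vaults without reading the primary records.
fn vault_by_org_key(org_id: u64, vault_id: u64) -> String {
    format!("vaults_by_org/{org_id}/{vault_id}")
}

// Membership record keyed by organization; a per-user reverse index
// ("org_members_by_user/{user_id}") is maintained alongside it.
fn org_member_key(org_id: u64, user_id: u64) -> String {
    format!("org_members/{org_id}/{user_id}")
}
```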

Ledger's TTL garbage collector runs every 60 seconds on the Raft leader and filters expired entities at read time. No application-level cleanup jobs are required.

### In-memory (development)

HashMap-based storage with the same logical keyspace structure as Ledger. Data is lost on restart. Activated with `--dev-mode` or `--storage memory`.

## Request lifecycle

```mermaid
sequenceDiagram
    participant Client
    participant LB as Load balancer
    participant API as Control
    participant Auth as Auth middleware
    participant Handler as Handler
    participant DB as Ledger

    Client->>LB: HTTPS request
    LB->>API: Forward
    API->>API: Request ID + logging middleware
    API->>API: Security headers (HSTS, nosniff, DENY)
    API->>Auth: JWT validation

    alt Invalid auth
        Auth-->>Client: 401 Unauthorized
    end

    Auth->>Handler: Authorized request
    Handler->>DB: Query / mutation
    DB-->>Handler: Result
    Handler-->>API: JSON response
    API-->>Client: HTTPS response
```

### Middleware stack (outermost to innermost)

  1. Request ID -- assigns unique ID to each request
  2. Logging -- structured request/response logging
  3. Security headers -- X-Content-Type-Options: nosniff, X-Frame-Options: DENY, Cache-Control: no-store, Strict-Transport-Security
  4. CORS -- configured for frontend_url origin
  5. Concurrency limit -- 10,000 concurrent requests max
  6. Body size limit -- 256 KiB default, 1 MiB for schema deployment
  7. Rate limiting -- per-IP limits on auth (100/hour) and registration (5/day) endpoints
  8. JWT validation -- local validation for reads, Ledger-validated for writes
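
As a rough sketch, a stack in this order could be assembled with Axum and Tower roughly as follows (axum 0.7-style APIs assumed; the routes, the JWT middleware body, and the layers omitted here are placeholders, not the real implementation).

```rust
use axum::{extract::DefaultBodyLimit, middleware, Router};
use tower::ServiceBuilder;
use tower_http::{cors::CorsLayer, trace::TraceLayer};

fn router() -> Router {
    Router::new()
        // ...route definitions from the api crate would be registered here...
        .layer(
            // ServiceBuilder applies layers top-down, so the first layer added
            // is the outermost middleware, matching the ordering above.
            // Request-ID, security-header, and rate-limiting layers are omitted.
            ServiceBuilder::new()
                .layer(TraceLayer::new_for_http()) // structured request/response logging
                .layer(CorsLayer::permissive()) // CORS; restricted to frontend_url in practice
                .layer(tower::limit::ConcurrencyLimitLayer::new(10_000)) // concurrency cap
                .layer(DefaultBodyLimit::max(256 * 1024)) // 256 KiB default body limit
                .layer(middleware::from_fn(validate_jwt)), // JWT validation (innermost)
        )
}

// Placeholder for the JWT-validation middleware; the real logic lives in core.
async fn validate_jwt(req: axum::extract::Request, next: middleware::Next) -> axum::response::Response {
    next.run(req).await
}
```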

## Security layers

| Layer | Mechanism |
| --- | --- |
| Transport | TLS 1.3 (terminated at load balancer) |
| Rate limiting | Per-IP on auth endpoints, distributed via Ledger |
| Authentication | JWT (Ed25519), cookie-based sessions, client assertions |
| Authorization | Organization RBAC (Member/Admin/Owner), vault roles (Reader/Writer/Manager/Admin) |
| Data protection | AES-256-GCM encryption at rest for private keys, Argon2id password hashing |
| Audit | Immutable audit log per organization |
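
To illustrate the Ed25519 JWT row, a minimal signing sketch with the jsonwebtoken crate might look like the following; the claim fields and key handling are illustrative, not the actual service.

```rust
use jsonwebtoken::{encode, Algorithm, EncodingKey, Header};
use serde::Serialize;

#[derive(Serialize)]
struct Claims {
    sub: String, // subject (IDs are serialized as strings)
    exp: usize,  // expiry as a Unix timestamp
    jti: String, // token ID, checked against the jti_replay/ keyspace
}

// Sign claims with an Ed25519 private key supplied as PEM.
fn sign_token(ed25519_pem: &[u8], claims: &Claims) -> Result<String, jsonwebtoken::errors::Error> {
    let key = EncodingKey::from_ed_pem(ed25519_pem)?;
    encode(&Header::new(Algorithm::EdDSA), claims, &key)
}
```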

## Technology stack

| Component | Technology |
| --- | --- |
| Language | Rust 1.92 (2024 edition) |
| Async runtime | Tokio |
| HTTP framework | Axum + Tower middleware |
| Storage | InferaDB Ledger (Raft-based) / In-memory HashMap |
| JWT signing | Ed25519 via jsonwebtoken |
| Password hashing | Argon2id |
| Encryption | AES-256-GCM |
| WebAuthn | webauthn-rs |
| Observability | tracing (structured logs), metrics (Prometheus) |
| Configuration | clap::Parser with env var fallbacks |
| Builder pattern | bon |
| Error handling | snafu |
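
As a sketch of the clap-based configuration with env var fallbacks, a hypothetical struct could look like the one below. The `--worker-id` and `--dev-mode` flags and `INFERADB__CONTROL__WORKER_ID` appear elsewhere on this page; the `INFERADB__CONTROL__STORAGE` variable and the defaults are assumptions.

```rust
use clap::Parser;

// Hypothetical subset of the config crate's CLI surface.
#[derive(Parser, Debug)]
struct ControlConfig {
    /// Snowflake worker ID (0-1023); must be unique per instance.
    #[arg(long, env = "INFERADB__CONTROL__WORKER_ID", default_value_t = 0)]
    worker_id: u16,

    /// Storage backend: "ledger" for production, "memory" for development.
    #[arg(long, env = "INFERADB__CONTROL__STORAGE", default_value = "ledger")]
    storage: String,

    /// Enable development mode (in-memory storage, data lost on restart).
    #[arg(long)]
    dev_mode: bool,
}
```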