Caching

Overview

The application uses ASP.NET Core Output Cache with an optional DragonFly (Redis-compatible) backing store to cache HTTP GET responses. DragonFly supports a master/replica topology with HAProxy for high availability in Docker Compose and a Kubernetes operator for automated failover.

With DragonFly configured — all application instances share the same cache, ensuring consistency behind a load balancer.
Without DragonFly — falls back to in-memory cache with a warning log. Suitable for local development and single-instance deployments.

Architecture

Single Instance (Development)

Client Request
       │
       ▼
  Authentication
       │
       ▼
  Authorization
       │
       ▼
  Rate Limiting
       │
       ▼
  Output Cache Middleware ──── DragonFly (shared store)
       │                         ▲
       ▼                         │
  Controller ────────────────────┘
  (EvictByTagAsync on mutations)

Multi-Instance (Docker Compose HA)

Client Request
       │
       ▼
  ASP.NET Core API
       │
       ▼
  HAProxy (dragonfly-proxy:6379)
       │
       ├── dragonfly-master:6379  (read/write)
       │
       └── dragonfly-replica:6379 (backup, read-only)

HAProxy routes all traffic to the master. If the master fails, HAProxy falls back to the replica (read-only — reads work, writes fail). For automatic master promotion, use the Kubernetes operator.

Kubernetes (Operator-managed HA)

Client Request
       │
       ▼
  ASP.NET Core API
       │
       ▼
  dragonfly.apitemplate.svc.cluster.local:6379
       │
       ▼
  DragonFly Operator
       ├── master pod  (auto-promoted on failure)
       └── replica pod

The Output Cache middleware runs after authentication and authorization, so unauthenticated or unauthorized requests are rejected before reaching the cache layer.

Rate Limiting

The application uses ASP.NET Core Rate Limiting with a per-client fixed window policy.

How it works

Each client gets its own independent request counter (bucket). The partition key is resolved per request:

JWT username (httpContext.User.Identity.Name) — for authenticated users
Remote IP address — for anonymous users
"anonymous" — shared fallback bucket when neither is available

This means a single misbehaving client cannot exhaust the limit for all other clients.

Configuration

{
  "RateLimiting": {
    "Fixed": {
      "PermitLimit": 100,
      "WindowMinutes": 1
    }
  }
}

Setting	Default	Description
`PermitLimit`	`100`	Max requests per client per window
`WindowMinutes`	`1`	Window duration in minutes

Requests exceeding the limit receive HTTP 429 Too Many Requests. The counter resets at the end of each window.

Options are registered as IOptions<RateLimitingOptions> and validated on startup — invalid values (e.g. PermitLimit: 0) will prevent the application from starting.

Configuration

DragonFly Connection (Optional)

Configured via appsettings.json or environment variables:

{
  "Dragonfly": {
    "ConnectionString": "localhost:6379",
    "ConnectTimeoutMs": 5000,
    "SyncTimeoutMs": 3000
  }
}

Environment variable override: Dragonfly__ConnectionString

When the Dragonfly:ConnectionString setting is missing or empty, the application logs a warning and uses the built-in in-memory output cache. No DragonFly instance is required for development.

Cache Policies

Defined in ApiServiceCollectionExtensions.cs:

Policy	Expiration	Tag	Used By
(base)	No cache	—	All endpoints by default
`Products`	30 seconds	`Products`	`ProductsController` GET endpoints
`Categories`	60 seconds	`Categories`	`CategoriesController` GET endpoints
`Reviews`	30 seconds	`Reviews`	`ProductReviewsController` GET endpoints

The base policy disables caching for all endpoints. Only endpoints explicitly decorated with [OutputCache(PolicyName = "...")] are cached.

Important: By default ASP.NET Core Output Cache does not cache responses that carry an Authorization header. All named policies include TenantAwareOutputCachePolicy (see Tenant Isolation) which overrides this behaviour and segments the cache per tenant.

Tenant Isolation

All named cache policies apply TenantAwareOutputCachePolicy, which does two things:

Enables caching for authenticated requests — overrides the ASP.NET Core default that skips caching when an Authorization header is present.
Varies the cache key by tenant_id claim — each tenant gets its own isolated cache partition. A request from tenant A will never be served a cached response generated for tenant B.

This means adding a new cache policy must include .AddPolicy<TenantAwareOutputCachePolicy>() to remain safe in a multi-tenant deployment.

Instance Name

All DragonFly keys are prefixed with ApiTemplate:OutputCache: to avoid collisions with other applications sharing the same DragonFly instance.

How It Works

Caching a Response

A GET request arrives and passes through authentication, authorization, and rate limiting.
The Output Cache middleware checks DragonFly for a cached response matching the request.
Cache hit — the cached response is returned immediately; the controller is not invoked.
Cache miss — the request continues to the controller, the response is generated, stored in DragonFly with the configured expiration and tag, and returned to the client.

Cache Invalidation

Controllers invalidate cache after mutations (Create, Update, Delete) using tag-based eviction:

await _outputCacheStore.EvictByTagAsync("Categories", ct);

This removes all cached responses tagged with the specified tag. For example, creating a new category evicts both the category list and all individual category responses.

Invalidation Map

Action	Tags Evicted
Create/Update/Delete Product	`Products`
Delete Product	`Products`, `Reviews`
Create/Update/Delete Category	`Categories`
Create/Delete Review	`Reviews`

Deleting a product also evicts reviews because product reviews become orphaned.

Infrastructure

Docker Compose (HA Topology)

The Docker Compose setup provides a master/replica topology with HAProxy for high availability:

dragonfly-master:
  image: docker.dragonflydb.io/dragonflydb/dragonfly:v1.27.1
  command: dragonfly --maxmemory 256mb --proactor_threads 2

dragonfly-replica:
  image: docker.dragonflydb.io/dragonflydb/dragonfly:v1.27.1
  command: dragonfly --maxmemory 256mb --proactor_threads 2 --replicaof dragonfly-master 6379

dragonfly-proxy:
  image: haproxy:3.1-alpine
  ports:
    - "6379:6379"
  volumes:
    - ./infrastructure/dragonfly/haproxy.cfg:/usr/local/etc/haproxy/haproxy.cfg:ro

The API connects to dragonfly-proxy:6379.

Kubernetes

See infrastructure/kubernetes/dragonfly/README.md for deployment instructions using the DragonFly Kubernetes operator.

Health Check

DragonFly health is monitored at /health alongside PostgreSQL, MongoDB, and Keycloak. The health check is tagged as cache.

Adding a New Cache Policy

Add the policy in ApiServiceCollectionExtensions.cs:

options.AddPolicy("MyEntity", builder => builder
    .Expire(TimeSpan.FromSeconds(30))
    .Tag("MyEntity"));

Decorate GET endpoints with the policy:

[HttpGet]
[OutputCache(PolicyName = "MyEntity")]
public async Task<ActionResult<...>> GetAll(CancellationToken ct) { ... }

Invalidate after mutations:

await _outputCacheStore.EvictByTagAsync("MyEntity", ct);

DragonFly vs Redis vs Valkey

DragonFly is a modern, multi-threaded in-memory datastore that is wire-compatible with Redis. It uses the same protocol, commands, and client libraries (StackExchange.Redis) as Redis and Valkey. The AddStackExchangeRedisOutputCache method works identically with DragonFly.

Key advantages over Redis/Valkey:

Multi-threaded architecture — better utilization of modern multi-core hardware
Lower memory overhead — uses less memory for the same dataset
Kubernetes operator — native operator with automatic failover and replica promotion

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Caching

Overview

Architecture

Single Instance (Development)

Multi-Instance (Docker Compose HA)

Kubernetes (Operator-managed HA)

Rate Limiting

How it works

Configuration

Configuration

DragonFly Connection (Optional)

Cache Policies

Tenant Isolation

Instance Name

How It Works

Caching a Response

Cache Invalidation

Invalidation Map

Tags

Infrastructure

Docker Compose (HA Topology)

Kubernetes

Health Check

Adding a New Cache Policy

DragonFly vs Redis vs Valkey

FilesExpand file tree

CACHING.md

Latest commit

History

CACHING.md

File metadata and controls

Caching

Overview

Architecture

Single Instance (Development)

Multi-Instance (Docker Compose HA)

Kubernetes (Operator-managed HA)

Rate Limiting

How it works

Configuration

Configuration

DragonFly Connection (Optional)

Cache Policies

Tenant Isolation

Instance Name

How It Works

Caching a Response

Cache Invalidation

Invalidation Map

Tags

Infrastructure

Docker Compose (HA Topology)

Kubernetes

Health Check

Adding a New Cache Policy

DragonFly vs Redis vs Valkey