diff --git a/docs.json b/docs.json index 53b7e698..3f1724d7 100644 --- a/docs.json +++ b/docs.json @@ -218,6 +218,7 @@ "group": "Storage", "pages": [ "storage/network-volumes", + "storage/high-performance-storage", "storage/s3-api" ] }, diff --git a/get-started/concepts.mdx b/get-started/concepts.mdx index df78c9d3..a1a11b02 100644 --- a/get-started/concepts.mdx +++ b/get-started/concepts.mdx @@ -29,7 +29,7 @@ A managed compute cluster with high-speed networking for multi-node distributed ## [Network volume](/storage/network-volumes) -Persistent storage that exists independently of your other compute resources and can be attached to multiple Pods or Serverless endpoints to share data between machines. +Persistent storage that exists independently of your other compute resources and can be attached to multiple Pods or Serverless endpoints to share data between machines. Available in standard and [high-performance](/storage/high-performance-storage) tiers. ## [S3-compatible API](/storage/s3-api) diff --git a/pods/storage/types.mdx b/pods/storage/types.mdx index c5405b54..055f005c 100644 --- a/pods/storage/types.mdx +++ b/pods/storage/types.mdx @@ -46,6 +46,10 @@ Encryption applies only to volume disk. Container disk and network volumes canno Network volumes provide permanent storage that exists independently from any Pod. You can attach a network volume to multiple Pods, transfer it between machines, and retain your data even after deleting a Pod. This makes network volumes ideal for shared datasets, collaborative workflows, and portable storage. +Network volumes are available in two tiers: +- **Standard storage**: Cost-effective for general-purpose work. +- **[High-performance storage](/storage/high-performance-storage)**: Premium tier with up to 3x throughput and 4x IOPS for demanding workloads. + [Learn more about network volumes](/storage/network-volumes). 
diff --git a/serverless/storage/overview.mdx b/serverless/storage/overview.mdx index afb735ce..879a90de 100644 --- a/serverless/storage/overview.mdx +++ b/serverless/storage/overview.mdx @@ -15,7 +15,7 @@ All data saved by a worker's [handler function](/serverless/workers/handler-func ### Network volume -Persistent storage that can be attached to multiple workers. Ideal for sharing datasets, storing large models, and preserving data beyond individual worker sessions. +Persistent storage that can be attached to multiple workers. Ideal for sharing datasets, storing large models, and preserving data beyond individual worker sessions. Available in standard and [high-performance](/storage/high-performance-storage) tiers. See [Network volumes for Serverless](/storage/network-volumes#network-volumes-for-serverless). diff --git a/storage/high-performance-storage.mdx b/storage/high-performance-storage.mdx new file mode 100644 index 00000000..77dce21e --- /dev/null +++ b/storage/high-performance-storage.mdx @@ -0,0 +1,84 @@ +--- +title: "High-performance storage" +description: "Premium storage tier for demanding AI workloads with up to 3x throughput and 4x IOPS." +--- + +import { PodsTooltip, ServerlessTooltip, WorkersTooltip, InstantClusterTooltip } from "/snippets/tooltips.jsx"; + +High-performance storage is a network volume tier designed for data-intensive AI workloads. Its parallel storage architecture delivers high throughput and low latency under heavy concurrent load, even when thousands of processes access it simultaneously. + +## Performance comparison + +In internal benchmarks, high-performance storage delivers up to 3x the throughput and 4x the IOPS of standard storage on parallel operations. 
+ +| | Standard storage | High-performance storage | +|------------------|-------------------------------------------------------|---------------------------------------------------| +| Best for | General-purpose work, development, cost-sensitive jobs | Training, fine-tuning, latency-sensitive inference | +| Throughput | High | Up to 3x higher | +| IOPS | High | Up to 4x higher | +| Concurrency | Moderate | Optimized for massive parallelism | +| Relative cost | Lower | Premium | + +Both tiers are fully supported. Choose based on whether storage performance sits on the critical path of your workload. + +## When to choose high-performance storage + +If your GPUs ever sit idle waiting on data (slow dataloaders, long checkpoint writes, sluggish model loads), you're a candidate for high-performance storage. If storage isn't your bottleneck, standard storage is the cost-effective choice. + +| Use case | Why it helps | +|------------------------------|-------------------------------------------------------------------------------------------------------| +| Training | Feeds GPUs at the rate they can consume data, preventing I/O bottlenecks and reducing training time. | +| Fine-tuning and checkpointing | Speeds up write-heavy operations like checkpoints, so jobs spend more time computing. | +| Inference | Cuts model load time from a central volume, lowering endpoint cold-start latency. | + +## Create a high-performance volume + +High-performance storage is available as a network volume type in select data centers. + +1. Navigate to the [Storage page](https://console.runpod.io/user/storage) in the Runpod console. +2. Click **Create network volume**. +3. Select a data center that supports high-performance storage. Either select the **High-performance** tab to show only supporting data centers, or look for the purple ⚡ icon on individual data centers. +4. Enter a name and set the volume size in GB. +5. 
The **High-performance storage** toggle appears automatically and is enabled by default. Leave it on to use high-performance storage, or turn it off to use standard storage instead. +6. Click **Create network volume**. + +## Attach to your workloads + +High-performance volumes work across the platform the same way standard network volumes do. + +- **<PodsTooltip />**: Attach during Pod creation, or from the volume manager for existing setups. +- **<ServerlessTooltip />**: Attach to an endpoint to speed up model loading and reduce cold-start latency. See [Network volumes for Serverless](/storage/network-volumes#network-volumes-for-serverless). +- **<InstantClusterTooltip />s**: New clusters can be provisioned with high-performance storage by default. + +## Pricing + +High-performance storage is priced per GB at a premium to standard storage. The console displays per-GB and total monthly cost as you configure a volume. + + +Exact pricing varies by data center. Check the volume creation flow in the console for current rates. + + +## FAQ + +**Is standard storage going away?** + +No. Standard storage remains fully supported and is the recommended choice for general-purpose work. High-performance storage is an additional tier for performance-sensitive workloads. + +**Which data centers support high-performance storage?** + +Multiple US and European data centers support high-performance storage, with more rolling out over time. Supported data centers are marked with a purple ⚡ icon in the network volume creation flow. + +**Can I migrate an existing standard volume to high-performance?** + +Direct migration is not currently supported. To move your data, create a new high-performance volume and transfer the data using the [S3-compatible API](/storage/s3-api) or [runpodctl](/storage/network-volumes#using-runpodctl). + +## Next steps + + + + Learn more about network volume management and features. + + + Manage files without launching compute resources. 
+ + diff --git a/storage/network-volumes.mdx b/storage/network-volumes.mdx index 6d1d1e37..7efc902b 100644 --- a/storage/network-volumes.mdx +++ b/storage/network-volumes.mdx @@ -9,11 +9,23 @@ Network volumes provide persistent storage that exists independently of your com Network volumes are backed by high-performance NVMe SSDs with transfer speeds of 200-400 MB/s (up to 10 GB/s peak). +## Storage tiers + +Network volumes are available in two tiers: + +- **Standard storage**: Cost-effective storage for general-purpose work, development, and workloads where storage isn't the bottleneck. +- **[High-performance storage](/storage/high-performance-storage)**: Premium tier with up to 3x throughput and 4x IOPS, optimized for training, fine-tuning, and latency-sensitive inference. + +Select the storage tier when creating a network volume. For details on choosing between tiers, see [High-performance storage](/storage/high-performance-storage). + ## Pricing +**Standard storage:** - **First 1 TB**: \$0.07/GB/month - **Beyond 1 TB**: \$0.05/GB/month +**High-performance storage** is priced at a premium. See the [High-performance storage](/storage/high-performance-storage#pricing) page for details. + If your account lacks funds to cover storage costs, your network volume may be terminated, after which data cannot be recovered. @@ -28,8 +40,9 @@ Volume size can be increased later but cannot be decreased. For volumes beyond 4 1. Navigate to the [Storage page](https://www.console.runpod.io/user/storage). 2. Click **New Network Volume**. - 3. Select a datacenter, enter a name, and specify size in GB. - 4. Click **Create Network Volume**. + 3. Select a data center, enter a name, and specify size in GB. + 4. Select a storage tier: **Standard** or **High-Performance** (available in select data centers). + 5. Click **Create Network Volume**. ```bash