Skip to content

Update HPC documentation page#663

Draft
jayavenkatesh19 wants to merge 4 commits intorapidsai:mainfrom
jayavenkatesh19:hpc-documentation
Draft

Update HPC documentation page#663
jayavenkatesh19 wants to merge 4 commits intorapidsai:mainfrom
jayavenkatesh19:hpc-documentation

Conversation

@jayavenkatesh19
Copy link
Contributor

Replaces the legacy HPC page (outdated Dask with CUDA 11.0.3 content) with a modern guide covering single-GPU RAPIDS deployment on SLURM clusters.

Consists of the following sections:

  • SLURM basics: Partitions, interactive vs bash jobs, srun/sinfo/sbatch flags, session persistence with tmux
  • Lmod RAPIDS module creation
  • Apptainer/Enroot+Pyxis RAPIDS containers
  • Single GPU cudf.pandas example.

Multi-GPU workloads, cloud SLURM (Coreweave) will be added to this page in further iterations.

Signed-off-by: Jaya Venkatesh <jjayabaskar@nvidia.com>
Signed-off-by: Jaya Venkatesh <jjayabaskar@nvidia.com>
@jayavenkatesh19 jayavenkatesh19 self-assigned this Feb 11, 2026
@jayavenkatesh19 jayavenkatesh19 added improvement Improves an existing functionality hpc General HPC related labels Feb 11, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

hpc General HPC related improvement Improves an existing functionality

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant