uv in NeMo RL

We use the uv Python package installer for managing dependencies in NeMo RL.

Overview

uv is an incredible tool that simplifies our workflow and is blazingly fast because it's written in Rust. This document explains why we've adopted uv for package management in our repository, particularly for NeMo RL, and how it helps us manage dependencies across Ray clusters.

Why `uv`?

uv brings the following key advantages to our Python development workflow:

Speed and Efficiency

Written in Rust, making it significantly faster than traditional Python package managers.
Optimized caching mechanisms that reduce redundant downloads and installations.
Quick environment creation and switching, enabling rapid development cycles.

Isolated Environments

Creates fully isolated Python environments, preventing dependency conflicts between system packages and project-specific packages.
Avoids nuanced dependency situations where a Python script might accidentally use both virtualenv dependencies and system dependencies.
Ensures consistent behavior across different machines and deployment environments.

Dependency Management in Ray Clusters

Enables management of heterogeneous Python environments across a Ray cluster.
Provides flexibility for each actor (worker) to use the specific Python dependencies it requires.
Simplifies propagation of environments to worker nodes without manual setup on each node.

Container-Free Flexibility

Frees us from having to publish many containers for different dependency combinations.
Allows us to define different dependency groups and extras and select which ones we need dynamically.
Reduces infrastructure complexity and maintenance overhead.

Implementation in NeMo RL

This section outlines how workers define their required executables, details the available predefined configurations (like BASE or VLLM), and explains how to customize these setups for specific needs, ensuring consistency across actors.

Worker Configuration

In our codebase, workers (classes decorated with @ray.remote, e.g., PolicyWorker) are associated with a PY_EXECUTABLE which specifies what dependencies the worker needs. These are set in a global registry in ACTOR_ENVIRONMENT_REGISTRY. This allows different parts of our application to have their own tailored environments.

Supported Python Executables

We provide several predefined Python executable configurations in {py:class}PY_EXECUTABLES <nemo_rl.distributed.virtual_cluster.PY_EXECUTABLES>:

class PY_EXECUTABLES:
    SYSTEM = sys.executable

    # Use NeMo RL direct dependencies.
    BASE = "uv run --locked"

    # Use NeMo RL direct dependencies and vllm.
    VLLM = "uv run --locked --extra vllm"

To ensure consistent dependencies between actors, we run with --locked to make sure the dependencies are consistent with the contents of uv.lock.

Customization

If you need a different Python executable configuration, you can override the default one by passing your own in {py:class}RayWorkerBuilder.__call__ <nemo_rl.distributed.worker_groups.RayWorkerBuilder.__call__>. This provides flexibility for special use cases without modifying the core configurations.

How It Works

When a NeMo RL job is started:

The driver script creates several {py:class}RayWorkerGroup <nemo_rl.distributed.worker_groups.RayWorkerGroup>s.
Each worker group will create their workers which are wrapped in a {py:class}RayWorkerBuilder <nemo_rl.distributed.worker_groups.RayWorkerBuilder> where the fully qualified name (FQN) of the worker class is passed as a string.
{py:class}RayWorkerBuilder <nemo_rl.distributed.worker_groups.RayWorkerBuilder> launches the worker under {py:class}RayWorkerBuilder <nemo_rl.distributed.worker_groups.RayWorkerBuilder. IsolatedWorkerInitializer> which allows us to initialize the class without importing packages not available in the base environment.
Before the worker class is instantiated by the RayWorkerBuilder, the FQN is used to lookup -- in a global registry)) -- to determine which member of PY_EXECUTABLES should be used to launch that set of workers. If the chosen PY_EXECUTABLES.* starts with uv; a venv is created with all the dependencies it needs and the runtime_env["py_executable"] is replaced with the venv's python interpreter.

This approach allows a fast start-up and maintains dependency isolation. It also has the added benefit of having all the virtual environments local under ./venvs.

Conclusion

Using uv for dependency management in NeMo RL provides us with a fast, flexible, and reliable way to handle Python dependencies across distributed Ray clusters. It eliminates many of the traditional pain points of dependency management in distributed systems, while enabling heterogeneous environments that can be tailored to specific workloads.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

uv in NeMo RL

Overview

Why `uv`?

Speed and Efficiency

Isolated Environments

Dependency Management in Ray Clusters

Container-Free Flexibility

Implementation in NeMo RL

Worker Configuration

Supported Python Executables

Customization

How It Works

Conclusion

FilesExpand file tree

uv.md

Latest commit

History

uv.md

File metadata and controls

uv in NeMo RL

Overview

Why uv?

Speed and Efficiency

Isolated Environments

Dependency Management in Ray Clusters

Container-Free Flexibility

Implementation in NeMo RL

Worker Configuration

Supported Python Executables

Customization

How It Works

Conclusion

Why `uv`?