Skip to content

Conversation

@manveerxyz
Copy link
Member

@manveerxyz manveerxyz commented Dec 17, 2025

Note

Adds first-class hosted RL training support and wiring in the CLI.

  • New prime rl commands: run (create/start), models, list, stop, delete, init (generates rl.toml), with table/JSON output and W&B integration
  • RL API client: RLClient with list_models, list_runs, create_run (supports wandb secrets/monitoring and team_id/run_config), stop_run, delete_run; models RLModel and RLRun
  • Config utilities: BaseConfig.from_sources and load_toml for merging CLI args over TOML (exported via utils.__init__)
  • CLI main updates: adds rl to the “Lab” group; reorganizes help panels for Account/Lab/Compute
  • Minor: update evals run help text; pretty link output for eval results

Written by Cursor Bugbot for commit 084b563. This will update automatically on new commits. Configure here.

@manveerxyz manveerxyz changed the title WIP: Hosted RL Entrypoint Hosted RL Entrypoint Dec 18, 2025
manveerxyz and others added 11 commits December 28, 2025 19:20
 Usage: prime rl [OPTIONS] ENVIRONMENTS... | COMMAND [ARGS]...

 Manage RL training runs.

 By default, 'prime rl <environments>' runs 'prime rl run <environments>'.

╭─ Options ──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ --help  -h        Show this message and exit.                                                                                                                                                                      │
╰────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Commands ─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ run      Create an RL training run with specified environments and model.                                                                                                                                          │
│ models   List available models for RL training.                                                                                                                                                                    │
│ runs     List your RL training runs.                                                                                                                                                                               │
│ stop     Stop an RL training run.                                                                                                                                                                                  │
│ delete   Delete an RL training run.                                                                                                                                                                                │
╰────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯ to start a run
* quick fix for prime rl list when no name set

* remove truncation of id in prime rl list
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants