Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -64,6 +64,29 @@ Hyperparameters can be overridden with Hydra-style CLI arguments:
agent.max_iterations=20000 agent.save_interval=500 agent.algorithm.learning_rate=0.0005


Resuming from a Checkpoint
^^^^^^^^^^^^^^^^^^^^^^^^^^

To resume training from a previously saved checkpoint, use the ``--resume`` flag
together with ``--load_run`` (run folder name) and ``--checkpoint`` (model filename).
Both arguments are optional — when omitted, the most recent run and latest checkpoint
are used automatically.

.. code-block:: bash

python submodules/IsaacLab/scripts/reinforcement_learning/rsl_rl/train.py \
--task Isaac-Dexsuite-Kuka-Allegro-Lift-v0 \
--num_envs 512 \
--resume \
--load_run <timestamp> \
--checkpoint model_5000.pt \
presets=newton presets=cube

Replace ``<timestamp>`` with the run folder name under ``logs/rsl_rl/dexsuite_kuka_allegro/``.
If ``--load_run`` is omitted, the latest run is selected. If ``--checkpoint`` is omitted,
the latest checkpoint in that run is loaded.


Monitoring Training
^^^^^^^^^^^^^^^^^^^

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@ Once inside the container, set the models directory:

.. code-block:: bash

export MODELS_DIR=models/isaaclab_arena/dexsuite_lift
export MODELS_DIR=/models/isaaclab_arena/dexsuite_lift
mkdir -p $MODELS_DIR

This step evaluates a checkpoint using Arena's ``dexsuite_lift`` environment.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -72,7 +72,7 @@ You'll need to create folders for logs, checkpoints, and models:
export LOG_DIR=logs/rsl_rl
mkdir -p $LOG_DIR
export MODELS_DIR=models/isaaclab_arena/reinforcement_learning
export MODELS_DIR=/models/isaaclab_arena/reinforcement_learning
mkdir -p $MODELS_DIR
Workflow Steps
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -61,6 +61,31 @@ For example, to train with relu activation and a higher learning rate:
agent.algorithm.learning_rate=0.001


Resuming from a Checkpoint
^^^^^^^^^^^^^^^^^^^^^^^^^^

To resume training from a previously saved checkpoint, use the ``--resume`` flag
together with ``--load_run`` (run folder name) and ``--checkpoint`` (model filename).
Both arguments are optional — when omitted, the most recent run and latest checkpoint
are used automatically.

.. code-block:: bash

python submodules/IsaacLab/scripts/reinforcement_learning/rsl_rl/train.py \
--external_callback isaaclab_arena.environments.isaaclab_interop.environment_registration_callback \
--task lift_object \
--rl_training_mode \
--num_envs 4096 \
--max_iterations 4000 \
--resume \
--load_run <timestamp> \
--checkpoint model_1999.pt

Replace ``<timestamp>`` with the run folder name under ``logs/rsl_rl/generic_experiment/``.
If ``--load_run`` is omitted, the latest run is selected. If ``--checkpoint`` is omitted,
the latest checkpoint in that run is loaded.


Monitoring Training
^^^^^^^^^^^^^^^^^^^

Expand Down Expand Up @@ -101,6 +126,7 @@ During training, each iteration prints a summary to the console:
ETA: 00:00:49



Multi-GPU Training
^^^^^^^^^^^^^^^^^^

Expand All @@ -112,7 +138,7 @@ Add ``--distributed`` to spread environments across all available GPUs:
--external_callback isaaclab_arena.environments.isaaclab_interop.environment_registration_callback \
--task lift_object \
--rl_training_mode \
--num_envs 4096\
--num_envs 4096 \
--max_iterations 2000 \
--distributed

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@ Once inside the container, set the models directory if you plan to download pre-

.. code:: bash

export MODELS_DIR=models/isaaclab_arena/reinforcement_learning
export MODELS_DIR=/models/isaaclab_arena/reinforcement_learning
mkdir -p $MODELS_DIR

This tutorial assumes you've completed :doc:`step_2_policy_training` and have a trained checkpoint,
Expand Down
Loading