-
Notifications
You must be signed in to change notification settings - Fork 12
Open
Description
Hello! Thanks for sharing your great work! I am working on using your method to train the wbc policy for Unitree H1-2 and G1 robot. However, the training for H1-2 is not successful. The robot can learn to stand after around 200 iterations. However, as the training time increases, the robot tends to perform more unstable behaviors and never learns how to walk.
I implemented the environment for H1-2 following your code with some modifications due to the difference between H1 and H1-2. Could you please share any insight for the issue I met? Is the symmetry matrix might cause the issue?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels