Hi,
Is anyone able to reproduce the results claimed by @davidADSP ?
I followed the instructions and steps given in this documentation.
Apparently, the python command stated in the documentation are out of date. I used the command written on top of each python script to run the data generation and training.
However, the trained controller is unable to drive properly. I constantly getting bad result (e.g. -50 ~ 10 at the end for each round)