-
Notifications
You must be signed in to change notification settings - Fork 567
Open
Description
Hi,
The IQL results from this repo seem to differ from the original paper.
According to the README of the IQL example code here, IQL scores an average raw return of about 1500 on hopper-medium-expert with offline training:
https://github.com/rail-berkeley/rlkit/tree/master/examples/iql

However, the original paper notes that IQL scores 91.5 in normalized average return (which is about 2950 in raw return):
https://arxiv.org/pdf/2110.06169.pdf

Can you take a look at this and check what is causing the difference?
Thank you!
Metadata
Metadata
Assignees
Labels
No labels