Hi - Thanks for the great work! I'm experiencing training instability when running IQL on my custom dataset (~8000 samples). The loss decreases normally for the first 2-3 epochs, then suddenly increases and diverges. I've tried reducing the learning rate from 3e-4 to 3e-5, but that only delays the divergence. Has anyone else encountered similar instability? Any suggestions for hyperparameter tuning, or potential implementation issues I should check? Thanks!