when i traning by using the Learning Rate 0.16,the loss will increase to nan.
when i traning by using the Learning Rate 0.16,the loss will increase to nan.