Hello!
I´m trying to reproduce the results of your paper as a baseline for my thesis. However, I´m not able to reach the same results for pretraining on UCF101 as indicated in tables 1 & 3 (81.2% top-1 accuracy on UCF). Did you use the same hyperparameter setup for UCF and kinetics-pretraining? (i.e. 200 epochs, I3D architecture, crop size 224, batch size 64, learning rate 0.01, weight decay 0.0001)
Also, I´m a bit confused about 2 things:
- Which lr-scheduler did you use for pretraining? (The Repo readme says cos-scheduler, your paper supplementary materials however mention you used a step-scheduler with lr-decay at epochs 120 and 180)
- Are you using MLP projection heads to achieve the paper results? According to the paper, you use 2-layer MLP heads, but for I3D, the MLP-heads are disabled. Is there a specific reason to disable the MLP heads only for the I3D architecture?
Thanks in advance for your help!
Hello!
I´m trying to reproduce the results of your paper as a baseline for my thesis. However, I´m not able to reach the same results for pretraining on UCF101 as indicated in tables 1 & 3 (81.2% top-1 accuracy on UCF). Did you use the same hyperparameter setup for UCF and kinetics-pretraining? (i.e. 200 epochs, I3D architecture, crop size 224, batch size 64, learning rate 0.01, weight decay 0.0001)
Also, I´m a bit confused about 2 things:
Thanks in advance for your help!