Struggling to Train Downstream Classifier

Hi,

I'm working on training a downstream classification task from the ImageNet-22k checkpoint. When I use a TinyViT checkpoint, average over the first dimension of output and feed that into a linear classification head, the model trains appropriately. However, if I replace TinyViT with the target encoder of I-JEPA, once again averaging over the first dimension of the final layer and feeding into a linear classification head. However, the model fails to train at all in these conditions. Has anyone been able to successfully train on a downstream task?

Thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Struggling to Train Downstream Classifier #58

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Struggling to Train Downstream Classifier #58

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions