This repository was archived by the owner on Aug 1, 2024. It is now read-only.
Hello,
Conceptually, both DINOv2 and I-JEPA provide latent-space representations of images. DINOv2 relies heavily on augmentations and the generation of multiple data views, while I-JEPA does not. As far as I can see, the primary advantage of the pretrained DINOv2 weights is that they were trained on far more images. Why did Facebook choose to scale up the DINOv2 architecture rather than I-JEPA? Does the former have inherent advantages? Are there benchmarks comparing the two when trained on the same initial unsupervised dataset?
Thank you!
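For what a like-for-like benchmark could look like: a common protocol is to freeze each backbone, embed the same labeled image set with both, and run an identical cheap probe (e.g. cosine k-NN) on each set of embeddings. Below is a minimal, hypothetical sketch of that evaluation step only; the synthetic arrays stand in for embeddings you would obtain from the two frozen encoders, and all names are illustrative rather than part of either model's API.

```python
# Sketch of a like-for-like probe: given frozen embeddings from two
# self-supervised backbones (e.g. DINOv2 and I-JEPA) computed on the SAME
# images, fit the same cheap classifier on each and compare accuracy.
import numpy as np

def knn_probe_accuracy(train_emb, train_labels, test_emb, test_labels, k=5):
    """Cosine-similarity k-NN probe on frozen embeddings."""
    # L2-normalise so dot products are cosine similarities.
    tr = train_emb / np.linalg.norm(train_emb, axis=1, keepdims=True)
    te = test_emb / np.linalg.norm(test_emb, axis=1, keepdims=True)
    sims = te @ tr.T                        # (n_test, n_train) similarity matrix
    nn_idx = np.argsort(-sims, axis=1)[:, :k]  # indices of k nearest neighbours
    preds = []
    for row in nn_idx:
        votes = train_labels[row]
        preds.append(np.bincount(votes).argmax())  # majority vote over neighbours
    return float(np.mean(np.array(preds) == test_labels))

# Toy usage: synthetic "embeddings" standing in for the two models' outputs.
rng = np.random.default_rng(0)
labels = rng.integers(0, 3, size=200)
emb_a = rng.normal(size=(200, 64)) + labels[:, None]  # model A: class-separable
emb_b = rng.normal(size=(200, 64))                    # model B: uninformative
acc_a = knn_probe_accuracy(emb_a[:150], labels[:150], emb_a[150:], labels[150:])
acc_b = knn_probe_accuracy(emb_b[:150], labels[:150], emb_b[150:], labels[150:])
```

The point of holding the probe, dataset, and split fixed is that any accuracy gap then reflects the quality of the learned representations rather than differences in the downstream classifier.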