Description
Thank you for sharing the excellent code and checkpoints! I have run the code described in Readme.md and would like to confirm whether I have understood it correctly.
The current versions of distillation.py and validate_student.py use an ImageEncoder with the so-called "woatt" attention (window attention), not 3D sparse flash attention. validate_student.py loads the tiny image encoder (the first checkpoint uploaded on GitHub) as the image encoder, while the remaining parts come from the fine-tuned teacher model (the second uploaded checkpoint, "Finetuned-SAMMED3D"). Does the third checkpoint, "FASTSAM3D," combine the tiny encoder with the rest of the model?
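To make my question concrete, here is a minimal sketch of what I mean by "combine." The filenames and the "model_state_dict" / "image_encoder." key layout are my assumptions for illustration, not the repository's exact API:

```python
# Sketch of my understanding: start from the fine-tuned teacher weights
# (prompt encoder + mask decoder) and overwrite the image encoder with the
# tiny distilled encoder. Paths and key names below are hypothetical.
import torch

teacher_ckpt = torch.load("finetuned_sammed3d.pth", map_location="cpu")       # hypothetical path
tiny_enc_ckpt = torch.load("fastsam3d_tiny_encoder.pth", map_location="cpu")  # hypothetical path

# Some checkpoints wrap the weights in a "model_state_dict" entry; fall back to the dict itself.
teacher_sd = teacher_ckpt.get("model_state_dict", teacher_ckpt)
tiny_sd = tiny_enc_ckpt.get("model_state_dict", tiny_enc_ckpt)

# Copy teacher weights, then replace the image encoder parameters with the tiny encoder's.
combined = dict(teacher_sd)
for k, v in tiny_sd.items():
    key = k if k.startswith("image_encoder.") else f"image_encoder.{k}"
    combined[key] = v

torch.save({"model_state_dict": combined}, "fastsam3d_combined.pth")
```

Is this roughly what the "FASTSAM3D" checkpoint contains, i.e. the tiny encoder plus the teacher's prompt encoder and mask decoder in a single file?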
I think those checkpoints do not use build_sam3D_flash.py, build_sam3D_dilatedattention.py, or build_3D_decoder.py. Is that correct? And does this checkpoint perform best among all of the encoder and decoder variants? Thank you!
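For reference, this is the quick check I used to compare the checkpoints. It only uses generic PyTorch calls; the filenames are placeholders for the three uploaded checkpoints, and the key names I expect to see are guesses on my part:

```python
# Inspect the top-level parameter prefixes of each checkpoint. A flash- or
# dilated-attention encoder should expose different parameter names than the
# window-attention ("woatt") blocks, so the keys hint at which build_* script
# a checkpoint actually matches. Filenames below are hypothetical.
import torch

for path in ["fastsam3d_tiny_encoder.pth", "finetuned_sammed3d.pth", "fastsam3d.pth"]:
    ckpt = torch.load(path, map_location="cpu")
    sd = ckpt.get("model_state_dict", ckpt)
    prefixes = sorted({k.split(".")[0] for k in sd})
    print(path, "->", prefixes)
```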