Skip to content

SFI-Visual-Intelligence/scampi-benchmark

 
 

Repository files navigation

Paper: The Fossil Frontier: An answer to the 3-billion fossil question (Link)

Scampi Benchmark - compare feature-extraction methods for microfossil images

License SCM Compliance

As a joint project between UiT and Equinor as part of the SFI Visual Intelligence consortium, our paper "The 3-billion fossil question: How to automate classification of microfossils" introduced the use of self-supervised learning for creating a DNN which excels at image feature extraction for images of microfossils. Further work in our upcoming publication "The Fossil Frontier: Answering the 3-billion fossil question" builds on this, using a data-curation step which increases the performance on the downstream tasks of CBIR and image classification.

tsne_plots_repo Fig. 1 Images in benchmark dataset plotted by t-SNE embedding of their embeddings generated by our feature extractor.

In this repository:

  • Benchmarking data used in our publications:

  • Code for benchmarking the performance of our networks against dino-vits8 and dino-vits16 from Meta is also provided. You can choose to download only the weights of the pretrained backbone used for downstream tasks, or the full checkpoint which contains backbone and projection head weights for both student and teacher networks from the DINO training. We also provide the detailed arguments and training logs, in addition to a comparison to pretrained ViTs.

arch params nn f1 p@n_k download
SCAMPI ViT-S/16 21M 0.91 0.60 backbone only full ckpt args logs
SCAMPI ViT-B/16 85M 0.90 0.60 backbone only full ckpt args logs
DINO ViT-S/16 21M 0.78 0.44 backbone only full ckpt args logs
DINO ViT-S/8 21M 0.84 0.46 backbone only full ckpt args logs
DINO ViT-B/16 85M 0.81 0.47 backbone only full ckpt args logs
DINO ViT-B/8 85M 0.82 0.45 backbone only full ckpt args logs

Usage

The ViTs presented in this work are backbone models for various downstream tasks. See example_usage.py for a minimum working example of how to create image embeddings using the ViT-S model.

To install necessary dependencies using conda:

conda env create --name scampi-benchmark --file requirements.txt

To reproduce the evaluation from the paper:

python run_evaluation.py

Contributing

We are excited to see future development in this field. If you have an public model with and open license on HuggingFace which could challenge our metrics on this benchmark we would welcome a PR to include it in the evaluation.

Acknowledgements

We are grateful to Equinor for permission to release the labelled data, and to Martin Pearce for performing the labelling.

About

The Fossil Frontier: An answer to the 3-billion fossil question

Topics

Resources

License

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 100.0%