Image-classification

This repository implements several deep learning models for image classification. The current available models are: CNN and Vision Transformer (ViT).

The datasets for training the models consists of image and csv files where the csv file must contain at least two columns "filename" and "label"

Requirements

Usage

The main script takes the following parameters:

Optionally, the following parameters can be specified:

model: model to use ("cnn" or "vit")
image_path: path to be prepended to the path in "filename" column in the csv files (to convert relative paths in csv to absolute paths if needed)
out_dir: Output directory where the trained models will be saved

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
dataset.py		dataset.py
resnet.py		resnet.py
train.py		train.py
vision_transformer.py		vision_transformer.py

Provide feedback